object UnsafeRowUtils
- Alphabetic
- By Inheritance
- UnsafeRowUtils
- AnyRef
- Any
- Hide All
- Show All
- Public
- Protected
Value Members
- final def !=(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
- final def ##: Int
- Definition Classes
- AnyRef → Any
- final def ==(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
- final def asInstanceOf[T0]: T0
- Definition Classes
- Any
- def avoidSetNullAt(dt: DataType): Boolean
Returns a Boolean indicating whether one should avoid calling UnsafeRow.setNullAt for a field of the given data type.
Returns a Boolean indicating whether one should avoid calling UnsafeRow.setNullAt for a field of the given data type. Fields of type DecimalType (with precision greater than Decimal.MAX_LONG_DIGITS) and CalendarIntervalType use pointers into the variable length region, and those pointers should never get zeroed out (setNullAt will zero out those pointers) because UnsafeRow may do in-place update for these 2 types even though they are not primitive.
When avoidSetNullAt returns true, callers should not use UnsafeRow#setNullAt for fields of that data type, but instead pass a null value to the appropriate set method, e.g.:
row.setDecimal(ordinal, null, precision)
Even though only UnsafeRow has this limitation, it's safe to extend this rule to all subclasses of InternalRow, since you don't always know the concrete type of the row you are dealing with, and all subclasses of InternalRow will handle a null value appropriately.
- def clone(): AnyRef
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.CloneNotSupportedException]) @native()
- final def eq(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
- def equals(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef → Any
- def finalize(): Unit
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.Throwable])
- final def getClass(): Class[_ <: AnyRef]
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
- def getOffsetAndSize(row: UnsafeRow, index: Int): (Int, Int)
- def hashCode(): Int
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
- final def isInstanceOf[T0]: Boolean
- Definition Classes
- Any
- final def ne(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
- final def notify(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
- final def notifyAll(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
- final def synchronized[T0](arg0: => T0): T0
- Definition Classes
- AnyRef
- def toString(): String
- Definition Classes
- AnyRef → Any
- def validateStructuralIntegrity(row: UnsafeRow, expectedSchema: StructType): Boolean
Use the following rules to check the integrity of the UnsafeRow: - schema.fields.length == row.numFields should always be true - UnsafeRow.calculateBitSetWidthInBytes(row.numFields) < row.getSizeInBytes should always be true if the expectedSchema contains at least one field.
Use the following rules to check the integrity of the UnsafeRow: - schema.fields.length == row.numFields should always be true - UnsafeRow.calculateBitSetWidthInBytes(row.numFields) < row.getSizeInBytes should always be true if the expectedSchema contains at least one field. - For variable-length fields:
- if null bit says it's null, then
- in general the offset-and-size should be zero
- special case: variable-length DecimalType is considered mutable in UnsafeRow, and to support that, the offset is set to point to the variable-length part like a non-null value, while the size is set to zero to signal that it's a null value. The offset may also be set to zero, in which case this variable-length Decimal no longer supports being mutable in the UnsafeRow.
- otherwise the field is not null, then extract offset and size: 1) 0 <= size < row.getSizeInBytes should always be true. We can be even more precise than this, where the upper bound of size can only be as big as the variable length part of the row. 2) offset should be >= fixed sized part of the row. 3) offset + size should be within the row bounds. - For fixed-length fields that are narrower than 8 bytes (boolean/byte/short/int/float), if null bit says it's null then don't do anything, else: check if the unused bits in the field are all zeros. The UnsafeRowWriter's write() methods make this guarantee. - Check the total length of the row.
- if null bit says it's null, then
- final def wait(): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.InterruptedException])
- final def wait(arg0: Long, arg1: Int): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.InterruptedException])
- final def wait(arg0: Long): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.InterruptedException]) @native()