Package org.apache.drill.exec.vector
Class VarCharVector
java.lang.Object
org.apache.drill.exec.vector.BaseValueVector
org.apache.drill.exec.vector.BaseDataValueVector
org.apache.drill.exec.vector.VarCharVector
- All Implemented Interfaces:
Closeable,AutoCloseable,Iterable<ValueVector>,ValueVector,VariableWidthVector
VarCharVector implements a vector of variable width values. Elements in the vector
are accessed by position from the logical start of the vector. A fixed width offsetVector
is used to convert an element's position to it's offset from the start of the (0-based)
DrillBuf. Size is inferred from adjacent elements.
- The width of each element is 4 byte(s). Note that the actual width is variable, this width is used as a guess for certain calculations.
- The equivalent Java primitive is 'int'
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionfinal classstatic final classHelper class to buffer container mutation as a means to optimize native memory copy operations.final classOverviewNested classes/interfaces inherited from class org.apache.drill.exec.vector.BaseValueVector
BaseValueVector.BaseAccessor, BaseValueVector.BaseMutatorNested classes/interfaces inherited from interface org.apache.drill.exec.vector.VariableWidthVector
VariableWidthVector.VariableWidthAccessor, VariableWidthVector.VariableWidthMutator -
Field Summary
Fields inherited from class org.apache.drill.exec.vector.BaseDataValueVector
data, emptyByteArrayFields inherited from class org.apache.drill.exec.vector.BaseValueVector
allocator, field, INITIAL_VALUE_ALLOCATION, MAX_ALLOCATION_SIZEFields inherited from interface org.apache.drill.exec.vector.ValueVector
BITS_VECTOR_NAME, MAX_BUFFER_SIZE, MAX_ROW_COUNT, MIN_ROW_COUNT, OFFSETS_VECTOR_NAME, VALUES_VECTOR_NAMEFields inherited from interface org.apache.drill.exec.vector.VariableWidthVector
DEFAULT_RECORD_BYTE_COUNT, MIN_BYTE_COUNT, offsetsField -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionvoidAllocate new buffers.voidallocateNew(int totalBytes, int valueCount) Allocate a new memory space for this vector.booleanAllocates new buffers.voidclear()Release the underlying DrillBuf and reset the ValueVector to empty.voidcollectLedgers(Set<AllocationManager.BufferLedger> ledgers) Add the ledgers underlying the buffers underlying the components of the vector to the set provided.voidcopyEntry(int toIndex, ValueVector from, int fromIndex) protected voidcopyFrom(int fromIndex, int thisIndex, VarCharVector from) voidcopyFromSafe(int fromIndex, int thisIndex, VarCharVector from) voidvoidexchange(ValueVector other) Exchange state with another value vector of the same type.Returns anaccessorthat is used to read from this vector instance.intReturns the total size of buffers allocated by this vector.DrillBuf[]getBuffers(boolean clear) Return the underlying buffers associated with this vector.intReturns the number of bytes that is used by this vector instance.intgetBufferSizeFor(int valueCount) Returns the number of bytes that is used by this vector if it holds the given number of values.intProvide the maximum amount of variable width bytes that can be stored int his vector.intReturn the number of bytes contained in the current var len byte vector.Get the metadata for this field.Returns anmutatorthat is used to write to this vector instance.longintgetPayloadByteCount(int valueCount) Return the number of value bytes consumed by actual data.Returns afield readerthat supports reading values from this vector.getTransferPair(String ref, BufferAllocator allocator) getTransferPair(BufferAllocator allocator) Returns atransfer pair, creating a new target vector of the same type.intReturns the maximum number of values that can be stored in this vector instance.intReturn the number of bytes contained in the current var len byte vector.voidload(UserBitShared.SerializedField metadata, DrillBuf buffer) Load the data provided in the buffer.Returns a newtransfer pairthat is used to transfer underlying buffers into the target vector.voidreAlloc()reallocRaw(int newAllocationSize) Core of vector allocation.voidreset()This method has a similar effect of allocateNew() without actually clearing and reallocating the value vector.voidsetInitialCapacity(int valueCount) Set the initial record capacityvoidsplitAndTransferTo(int startIndex, int length, VarCharVector target) voidtoNullable(ValueVector nullableVector) Convert a non-nullable vector to nullable by shuffling the data from one to the other.voidtransferTo(VarCharVector target) Methods inherited from class org.apache.drill.exec.vector.BaseDataValueVector
close, getBufferMethods inherited from class org.apache.drill.exec.vector.BaseValueVector
checkBufRefs, fillBitsVector, getAllocator, getField, getField, getMetadataBuilder, iterator, toStringMethods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, waitMethods inherited from interface java.lang.Iterable
forEach, iterator, spliteratorMethods inherited from interface org.apache.drill.exec.vector.ValueVector
close, getAllocator, getField
-
Constructor Details
-
VarCharVector
-
-
Method Details
-
getReader
Description copied from interface:ValueVectorReturns afield readerthat supports reading values from this vector.- Specified by:
getReaderin interfaceValueVector
-
getBufferSize
public int getBufferSize()Description copied from interface:ValueVectorReturns the number of bytes that is used by this vector instance. This is a bit of a misnomer. Returns the number of bytes used by data in this instance.- Specified by:
getBufferSizein interfaceValueVector- Overrides:
getBufferSizein classBaseDataValueVector
-
getAllocatedSize
public int getAllocatedSize()Description copied from interface:ValueVectorReturns the total size of buffers allocated by this vector. Has meaning only when vectors are directly allocated and each vector has its own buffer. Does not have meaning for vectors deserialized from the network or disk in which multiple vectors share the same vector.- Specified by:
getAllocatedSizein interfaceValueVector- Overrides:
getAllocatedSizein classBaseDataValueVector- Returns:
- allocated buffer size, in bytes
-
getBufferSizeFor
public int getBufferSizeFor(int valueCount) Description copied from interface:ValueVectorReturns the number of bytes that is used by this vector if it holds the given number of values. The result will be the same as if Mutator.setValueCount() were called, followed by calling getBufferSize(), but without any of the closing side-effects that setValueCount() implies wrt finishing off the population of a vector. Some operations might wish to use this to determine how much memory has been used by a vector so far, even though it is not finished being populated.- Specified by:
getBufferSizeForin interfaceValueVector- Parameters:
valueCount- the number of values to assume this vector contains- Returns:
- the buffer size if this vector is holding valueCount values
-
getValueCapacity
public int getValueCapacity()Description copied from interface:ValueVectorReturns the maximum number of values that can be stored in this vector instance.- Specified by:
getValueCapacityin interfaceValueVector
-
getByteCapacity
public int getByteCapacity()Description copied from interface:VariableWidthVectorProvide the maximum amount of variable width bytes that can be stored int his vector.- Specified by:
getByteCapacityin interfaceVariableWidthVector- Returns:
-
getCurrentSizeInBytes
public int getCurrentSizeInBytes()Return the number of bytes contained in the current var len byte vector. TODO: Remove getVarByteLength with it's implementation after all client's are moved to using getCurrentSizeInBytes. It's kept as is to preserve backward compatibility- Specified by:
getCurrentSizeInBytesin interfaceVariableWidthVector- Returns:
-
getVarByteLength
public int getVarByteLength()Return the number of bytes contained in the current var len byte vector.- Returns:
-
getMetadata
Description copied from interface:ValueVectorGet the metadata for this field. Used in serialization- Specified by:
getMetadatain interfaceValueVector- Overrides:
getMetadatain classBaseValueVector- Returns:
- FieldMetadata for this field.
-
clear
public void clear()Description copied from interface:ValueVectorRelease the underlying DrillBuf and reset the ValueVector to empty.- Specified by:
clearin interfaceValueVector- Overrides:
clearin classBaseDataValueVector
-
getBuffers
Description copied from interface:ValueVectorReturn the underlying buffers associated with this vector. Note that this doesn't impact the reference counts for this buffer so it only should be used for in-context access. Also note that this buffer changes regularly thus external classes shouldn't hold a reference to it (unless they change it).- Specified by:
getBuffersin interfaceValueVector- Overrides:
getBuffersin classBaseDataValueVector- Parameters:
clear- Whether to clear vector before returning; the buffers will still be refcounted; but the returned array will be the only reference to them- Returns:
- The underlying
buffersthat is used by this vector instance.
-
getOffsetAddr
public long getOffsetAddr() -
getOffsetVector
- Specified by:
getOffsetVectorin interfaceVariableWidthVector
-
getTransferPair
Description copied from interface:ValueVectorReturns atransfer pair, creating a new target vector of the same type.- Specified by:
getTransferPairin interfaceValueVector- Overrides:
getTransferPairin classBaseValueVector
-
getTransferPair
- Specified by:
getTransferPairin interfaceValueVector
-
makeTransferPair
Description copied from interface:ValueVectorReturns a newtransfer pairthat is used to transfer underlying buffers into the target vector.- Specified by:
makeTransferPairin interfaceValueVector
-
transferTo
-
splitAndTransferTo
-
copyFrom
-
copyFromSafe
-
copyEntry
- Specified by:
copyEntryin interfaceValueVector
-
collectLedgers
Description copied from interface:ValueVectorAdd the ledgers underlying the buffers underlying the components of the vector to the set provided. Used to determine actual memory allocation.- Specified by:
collectLedgersin interfaceValueVector- Overrides:
collectLedgersin classBaseDataValueVector- Parameters:
ledgers- set of ledgers to which to add ledgers for this vector
-
getPayloadByteCount
public int getPayloadByteCount(int valueCount) Description copied from interface:ValueVectorReturn the number of value bytes consumed by actual data.- Specified by:
getPayloadByteCountin interfaceValueVector
-
setInitialCapacity
public void setInitialCapacity(int valueCount) Description copied from interface:ValueVectorSet the initial record capacity- Specified by:
setInitialCapacityin interfaceValueVector- Parameters:
valueCount-
-
allocateNew
public void allocateNew()Description copied from interface:ValueVectorAllocate new buffers. ValueVector implements logic to determine how much to allocate.- Specified by:
allocateNewin interfaceValueVector
-
allocateNewSafe
public boolean allocateNewSafe()Description copied from interface:ValueVectorAllocates new buffers. ValueVector implements logic to determine how much to allocate.- Specified by:
allocateNewSafein interfaceValueVector- Returns:
- Returns true if allocation was successful.
-
allocateNew
public void allocateNew(int totalBytes, int valueCount) Description copied from interface:VariableWidthVectorAllocate a new memory space for this vector. Must be called prior to using the ValueVector.- Specified by:
allocateNewin interfaceVariableWidthVector- Parameters:
totalBytes- Desired size of the underlying data buffer.valueCount- Number of values in the vector.
-
reset
public void reset()Description copied from class:BaseDataValueVectorThis method has a similar effect of allocateNew() without actually clearing and reallocating the value vector. The purpose is to move the value vector to a "mutate" state- Overrides:
resetin classBaseDataValueVector
-
reAlloc
public void reAlloc() -
reallocRaw
Description copied from class:BaseDataValueVectorCore of vector allocation. Given a new size (which must be a power of two), allocate the new buffer, copy the current values, and leave the unused parts garbage-filled.- Specified by:
reallocRawin classBaseDataValueVector- Parameters:
newAllocationSize- new buffer size as a power of two- Returns:
- the new buffer
-
decrementAllocationMonitor
public void decrementAllocationMonitor() -
getAccessor
Description copied from interface:ValueVectorReturns anaccessorthat is used to read from this vector instance.- Specified by:
getAccessorin interfaceValueVector- Specified by:
getAccessorin interfaceVariableWidthVector
-
getMutator
Description copied from interface:ValueVectorReturns anmutatorthat is used to write to this vector instance.- Specified by:
getMutatorin interfaceValueVector- Specified by:
getMutatorin interfaceVariableWidthVector
-
exchange
Description copied from interface:ValueVectorExchange state with another value vector of the same type. Used to implement look-ahead writers.- Specified by:
exchangein interfaceValueVector- Overrides:
exchangein classBaseDataValueVector
-
toNullable
Description copied from interface:ValueVectorConvert a non-nullable vector to nullable by shuffling the data from one to the other. Avoids the need to generate copy code just to change mode. If this vector is non-nullable, accepts a nullable dual (same minor type, different mode.) If the vector is non-nullable, or non-scalar, then throws an exception.- Specified by:
toNullablein interfaceValueVector- Overrides:
toNullablein classBaseValueVector- Parameters:
nullableVector- nullable vector of the same minor type as this vector
-