Class ParquetGroupScan
java.lang.Object
org.apache.drill.exec.physical.base.AbstractBase
org.apache.drill.exec.physical.base.AbstractGroupScan
org.apache.drill.exec.physical.base.AbstractFileGroupScan
org.apache.drill.exec.physical.base.AbstractGroupScanWithMetadata<ParquetMetadataProvider>
org.apache.drill.exec.store.parquet.AbstractParquetGroupScan
org.apache.drill.exec.store.parquet.ParquetGroupScan
- All Implemented Interfaces:
Iterable<PhysicalOperator>,GraphValue<PhysicalOperator>,FileGroupScan,FragmentLeaf,GroupScan,HasAffinity,Leaf,PhysicalOperator,Scan
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.drill.exec.store.parquet.AbstractParquetGroupScan
AbstractParquetGroupScan.RowGroupScanFilterer<B extends AbstractParquetGroupScan.RowGroupScanFilterer<B>>Nested classes/interfaces inherited from class org.apache.drill.exec.physical.base.AbstractGroupScanWithMetadata
AbstractGroupScanWithMetadata.GroupScanWithMetadataFilterer<B extends AbstractGroupScanWithMetadata.GroupScanWithMetadataFilterer<B>> -
Field Summary
Fields inherited from class org.apache.drill.exec.store.parquet.AbstractParquetGroupScan
entries, mappings, readerConfig, rowGroupsFields inherited from class org.apache.drill.exec.physical.base.AbstractGroupScanWithMetadata
columns, files, fileSet, filter, limit, matchAllMetadata, metadataProvider, nonInterestingColumnsMetadata, partitionColumns, partitions, segments, tableMetadata, usedMetastoreFields inherited from class org.apache.drill.exec.physical.base.AbstractBase
INIT_ALLOCATION, initialAllocation, MAX_ALLOCATION, maxAllocation, userNameFields inherited from interface org.apache.drill.exec.physical.base.GroupScan
ALL_COLUMNS -
Constructor Summary
ConstructorsConstructorDescriptionParquetGroupScan(String userName, FileSelection selection, ParquetFormatPlugin formatPlugin, List<SchemaPath> columns, ParquetReaderConfig readerConfig, LogicalExpression filter, MetadataProviderManager metadataProviderManager) ParquetGroupScan(String userName, FileSelection selection, ParquetFormatPlugin formatPlugin, List<SchemaPath> columns, ParquetReaderConfig readerConfig, MetadataProviderManager metadataProviderManager) ParquetGroupScan(StoragePluginRegistry engineRegistry, String userName, List<ReadEntryWithPath> entries, StoragePluginConfig storageConfig, FormatPluginConfig formatConfig, List<SchemaPath> columns, org.apache.hadoop.fs.Path selectionRoot, org.apache.hadoop.fs.Path cacheFileRoot, ParquetReaderConfig readerConfig, LogicalExpression filter, TupleMetadata schema) -
Method Summary
Modifier and TypeMethodDescriptionclone(List<SchemaPath> columns) Returns a clone of GroupScan instance, except that the new GroupScan will use the provided list of columns .clone(FileSelection selection) protected AbstractParquetGroupScancloneWithFileSelection(Collection<org.apache.hadoop.fs.Path> filePaths) protected ParquetTableMetadataProviderBuilder<?> ReturnsTableMetadataProviderBuilderinstance which may provide metadata without using Drill Metastore.ReturnsAnalyzeInfoProviderinstance which will be used when running ANALYZE statement.org.apache.hadoop.fs.Pathprotected Collection<CoordinationProtos.DrillbitEndpoint> protected AbstractParquetGroupScan.RowGroupScanFilterer<?> Returns holder for metadata values which provides API to filter metadata and build new group scan instance using filtered metadata.getNewWithChildren(List<PhysicalOperator> children) Regenerate with this node with a new set of children.getPartitionValues(LocationProvider locationProvider) org.apache.hadoop.fs.PathReturns path to the selection root.getSpecificScan(int minorFragmentId) protected booleanprotected ParquetTableMetadataProviderBuilder<?> ReturnsTableMetadataProviderBuilderinstance based on specifiedMetadataProviderManagersource.toString()Methods inherited from class org.apache.drill.exec.store.parquet.AbstractParquetGroupScan
applyAssignments, applyFilter, applyLimit, canPushdownProjects, getEntries, getFiles, getMaxParallelizationWidth, getOperatorAffinity, getReadEntries, getReaderConfig, getReaderConfigForSerialization, getRowGroupsMetadata, modifyFileSelection, pruneForPartitions, pruneRowGroupsForFiles, supportsFilterPushDownMethods inherited from class org.apache.drill.exec.physical.base.AbstractGroupScanWithMetadata
checkMetadataConsistency, getColumns, getColumnValueCount, getDigest, getFileSet, getFilesMetadata, getFilter, getFilterPredicate, getFilterPredicate, getFilterString, getLimit, getMetadataProvider, getNextOrEmpty, getNonInterestingColumnsMetadata, getPartitionColumns, getPartitionsMetadata, getPartitionValue, getScanStats, getSchema, getSegmentsMetadata, getTableMetadata, getTypeForColumn, hasFiles, init, isAllDataPruned, isGroupScanFullyMatchesFilter, isImplicitOrPartCol, isMatchAllMetadata, limitMetadata, pruneForPartitions, setFilter, setFilterForRuntime, supportsLimitPushdown, usedMetastoreMethods inherited from class org.apache.drill.exec.physical.base.AbstractFileGroupScan
supportsPartitionFilterPushdownMethods inherited from class org.apache.drill.exec.physical.base.AbstractGroupScan
accept, enforceWidth, getDistributionAffinity, getInitialAllocation, getMaxAllocation, getMinParallelizationWidth, getOperatorType, getScanStats, getScanStats, isDistributed, isExecutable, iteratorMethods inherited from class org.apache.drill.exec.physical.base.AbstractBase
accept, getCost, getOperatorId, getSVMode, getUserName, isBufferedOperator, setCost, setMaxAllocation, setOperatorIdMethods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, waitMethods inherited from interface org.apache.drill.common.graph.GraphValue
acceptMethods inherited from interface org.apache.drill.exec.physical.base.GroupScan
enforceWidth, getMinParallelizationWidth, getScanStats, getScanStats, isDistributedMethods inherited from interface org.apache.drill.exec.physical.base.HasAffinity
getDistributionAffinityMethods inherited from interface java.lang.Iterable
forEach, iterator, spliteratorMethods inherited from interface org.apache.drill.exec.physical.base.PhysicalOperator
accept, getCost, getInitialAllocation, getMaxAllocation, getOperatorId, getOperatorType, getSVMode, getUserName, isBufferedOperator, isExecutable, setCost, setMaxAllocation, setOperatorId
-
Constructor Details
-
ParquetGroupScan
public ParquetGroupScan(StoragePluginRegistry engineRegistry, String userName, List<ReadEntryWithPath> entries, StoragePluginConfig storageConfig, FormatPluginConfig formatConfig, List<SchemaPath> columns, org.apache.hadoop.fs.Path selectionRoot, org.apache.hadoop.fs.Path cacheFileRoot, ParquetReaderConfig readerConfig, LogicalExpression filter, TupleMetadata schema) throws IOException - Throws:
IOException
-
ParquetGroupScan
public ParquetGroupScan(String userName, FileSelection selection, ParquetFormatPlugin formatPlugin, List<SchemaPath> columns, ParquetReaderConfig readerConfig, MetadataProviderManager metadataProviderManager) throws IOException - Throws:
IOException
-
ParquetGroupScan
public ParquetGroupScan(String userName, FileSelection selection, ParquetFormatPlugin formatPlugin, List<SchemaPath> columns, ParquetReaderConfig readerConfig, LogicalExpression filter, MetadataProviderManager metadataProviderManager) throws IOException - Throws:
IOException
-
-
Method Details
-
getFormatConfig
-
getEngineConfig
-
getSelectionRoot
public org.apache.hadoop.fs.Path getSelectionRoot()Description copied from interface:GroupScanReturns path to the selection root. If this GroupScan cannot provide selection root, it returns null.- Specified by:
getSelectionRootin interfaceGroupScan- Overrides:
getSelectionRootin classAbstractGroupScan- Returns:
- path to the selection root
-
getCacheFileRoot
public org.apache.hadoop.fs.Path getCacheFileRoot() -
getSpecificScan
-
getNewWithChildren
Description copied from interface:PhysicalOperatorRegenerate with this node with a new set of children. This is used in the case of materialization or optimization.- Parameters:
children-
-
clone
Description copied from interface:GroupScanReturns a clone of GroupScan instance, except that the new GroupScan will use the provided list of columns .- Specified by:
clonein interfaceGroupScan- Overrides:
clonein classAbstractGroupScan
-
clone
- Specified by:
clonein interfaceFileGroupScan- Overrides:
clonein classAbstractFileGroupScan- Throws:
IOException
-
toString
-
cloneWithFileSelection
protected AbstractParquetGroupScan cloneWithFileSelection(Collection<org.apache.hadoop.fs.Path> filePaths) throws IOException - Specified by:
cloneWithFileSelectionin classAbstractParquetGroupScan- Throws:
IOException
-
getFilterer
Description copied from class:AbstractGroupScanWithMetadataReturns holder for metadata values which provides API to filter metadata and build new group scan instance using filtered metadata.- Specified by:
getFiltererin classAbstractParquetGroupScan
-
getDrillbits
- Specified by:
getDrillbitsin classAbstractParquetGroupScan
-
supportsFileImplicitColumns
protected boolean supportsFileImplicitColumns()- Specified by:
supportsFileImplicitColumnsin classAbstractGroupScanWithMetadata<ParquetMetadataProvider>
-
getPartitionValues
- Specified by:
getPartitionValuesin classAbstractGroupScanWithMetadata<ParquetMetadataProvider>
-
getAnalyzeInfoProvider
Description copied from interface:GroupScanReturnsAnalyzeInfoProviderinstance which will be used when running ANALYZE statement.- Specified by:
getAnalyzeInfoProviderin interfaceGroupScan- Overrides:
getAnalyzeInfoProviderin classAbstractGroupScan- Returns:
AnalyzeInfoProviderinstance
-
defaultTableMetadataProviderBuilder
protected ParquetTableMetadataProviderBuilder<?> defaultTableMetadataProviderBuilder(MetadataProviderManager source) Description copied from class:AbstractGroupScanWithMetadataReturnsTableMetadataProviderBuilderinstance which may provide metadata without using Drill Metastore.- Specified by:
defaultTableMetadataProviderBuilderin classAbstractParquetGroupScan- Parameters:
source- metadata provider manager- Returns:
TableMetadataProviderBuilderinstance
-
tableMetadataProviderBuilder
protected ParquetTableMetadataProviderBuilder<?> tableMetadataProviderBuilder(MetadataProviderManager source) Description copied from class:AbstractGroupScanWithMetadataReturnsTableMetadataProviderBuilderinstance based on specifiedMetadataProviderManagersource.- Specified by:
tableMetadataProviderBuilderin classAbstractGroupScanWithMetadata<ParquetMetadataProvider>- Parameters:
source- metadata provider manager- Returns:
TableMetadataProviderBuilderinstance
-