- AbstractCommand - Class in org.kitesdk.morphline.base
-
Base class for convenient implementation of Command classes.
- AbstractCommand(CommandBuilder, Config, Command, Command, MorphlineContext) - Constructor for class org.kitesdk.morphline.base.AbstractCommand
-
Using the given builder, constructs a command rooted at the given morphline JSON config.
- AbstractCommand(Config, Command, Command, MorphlineContext) - Constructor for class org.kitesdk.morphline.base.AbstractCommand
-
Deprecated.
- AbstractFieldTransformCommand - Class in org.kitesdk.morphline.stdlib
-
Base class for convenient implementation of commands that do in-place updates on a field.
- AbstractFieldTransformCommand(CommandBuilder, Config, Command, Command, MorphlineContext) - Constructor for class org.kitesdk.morphline.stdlib.AbstractFieldTransformCommand
-
- AbstractFieldTransformCommand(Config, Command, Command, MorphlineContext) - Constructor for class org.kitesdk.morphline.stdlib.AbstractFieldTransformCommand
-
Deprecated.
- AbstractParser - Class in org.kitesdk.morphline.stdio
-
Base class for convenient implementation of morphline parsers.
- AbstractParser(CommandBuilder, Config, Command, Command, MorphlineContext) - Constructor for class org.kitesdk.morphline.stdio.AbstractParser
-
- AbstractParser(Config, Command, Command, MorphlineContext) - Constructor for class org.kitesdk.morphline.stdio.AbstractParser
-
Deprecated.
- add(String, Object) - Method in class org.kitesdk.data.Key.Builder
-
Add a key value for the named field.
- AddCurrentTimeBuilder - Class in org.kitesdk.morphline.stdlib
-
- AddCurrentTimeBuilder() - Constructor for class org.kitesdk.morphline.stdlib.AddCurrentTimeBuilder
-
- addJarForClass(Class<?>) - Method in class org.kitesdk.tools.TaskUtil.ConfigBuilder
-
Finds the jar that contains the required class and adds it to the
distributed cache configuration.
- addJarPathForClass(Class<?>) - Method in class org.kitesdk.tools.TaskUtil.ConfigBuilder
-
Finds the jar that contains the required class and adds its containing
directory to the distributed cache configuration.
- addJars(List<String>) - Method in class org.kitesdk.tools.TaskUtil.ConfigBuilder
-
Adds a set of jar paths to the distributed cache.
- AddLocalHostBuilder - Class in org.kitesdk.morphline.stdlib
-
A command that adds the name or IP of the local host to a given output field.
- AddLocalHostBuilder() - Constructor for class org.kitesdk.morphline.stdlib.AddLocalHostBuilder
-
- addService(Class<? extends Service>) - Method in class org.kitesdk.minicluster.MiniCluster.Builder
-
- addSupportedMimeType(String) - Method in class org.kitesdk.morphline.stdio.AbstractParser
-
- AddValuesBuilder - Class in org.kitesdk.morphline.stdlib
-
For each input field value, add the value to the given record output field.
- AddValuesBuilder() - Constructor for class org.kitesdk.morphline.stdlib.AddValuesBuilder
-
- AddValuesIfAbsentBuilder - Class in org.kitesdk.morphline.stdlib
-
For each input field value, add the value to the given record output field if the value isn't
already contained in that field.
- AddValuesIfAbsentBuilder() - Constructor for class org.kitesdk.morphline.stdlib.AddValuesIfAbsentBuilder
-
- append(Path, int, Progressable) - Method in class org.kitesdk.morphline.hadoop.rcfile.SingleStreamFileSystem
-
- appendTo(URI) - Method in class org.kitesdk.data.mapreduce.DatasetKeyOutputFormat.ConfigBuilder
-
Adds configuration for DatasetKeyOutputFormat to append to the
given dataset or view URI, leaving any existing data intact.
- appendTo(View<?>) - Method in class org.kitesdk.data.mapreduce.DatasetKeyOutputFormat.ConfigBuilder
-
Adds configuration for DatasetKeyOutputFormat to append to the
given dataset or view URI, leaving any existing data intact.
- appendTo(String) - Method in class org.kitesdk.data.mapreduce.DatasetKeyOutputFormat.ConfigBuilder
-
Adds configuration for DatasetKeyOutputFormat to append to the
given dataset or view URI, leaving any existing data intact.
- asSource(View<E>) - Static method in class org.kitesdk.data.crunch.CrunchDatasets
-
- asSource(URI, Class<E>) - Static method in class org.kitesdk.data.crunch.CrunchDatasets
-
- asSource(String, Class<E>) - Static method in class org.kitesdk.data.crunch.CrunchDatasets
-
- asTarget(View<E>) - Static method in class org.kitesdk.data.crunch.CrunchDatasets
-
- asTarget(String) - Static method in class org.kitesdk.data.crunch.CrunchDatasets
-
- asTarget(URI) - Static method in class org.kitesdk.data.crunch.CrunchDatasets
-
- ATTACHMENT_BODY - Static variable in class org.kitesdk.morphline.base.Fields
-
- ATTACHMENT_CHARSET - Static variable in class org.kitesdk.morphline.base.Fields
-
- ATTACHMENT_MIME_TYPE - Static variable in class org.kitesdk.morphline.base.Fields
-
- ATTACHMENT_NAME - Static variable in class org.kitesdk.morphline.base.Fields
-
- AVRO - Static variable in class org.kitesdk.data.Formats
-
- AVRO_MEMORY_MIME_TYPE - Static variable in class org.kitesdk.morphline.avro.ReadAvroBuilder
-
The MIME type identifier that will be filled into output records
- AVRO_MEMORY_MIME_TYPE - Static variable in class org.kitesdk.morphline.hadoop.parquet.avro.ReadAvroParquetFileBuilder
-
The MIME type identifier that will be filled into output records
- CallParentPipeBuilder - Class in org.kitesdk.morphline.stdlib
-
Command that routes records to the enclosing pipe morphline object.
- CallParentPipeBuilder() - Constructor for class org.kitesdk.morphline.stdlib.CallParentPipeBuilder
-
- check(boolean, String, Object...) - Static method in exception org.kitesdk.data.DatasetRecordException
-
- check(boolean, String, Object...) - Static method in exception org.kitesdk.data.ValidationException
-
- checkOutputSpecs(JobContext) - Method in class org.kitesdk.data.mapreduce.DatasetKeyOutputFormat
-
- clean(boolean) - Method in class org.kitesdk.minicluster.MiniCluster.Builder
-
- CLEAN_KEY - Static variable in class org.kitesdk.minicluster.MiniCluster
-
- close() - Method in interface org.kitesdk.data.DatasetReader
-
Close the reader and release any system resources.
- close() - Method in interface org.kitesdk.data.DatasetWriter
-
Close the writer and release any system resources.
- column(String, String, String) - Method in class org.kitesdk.data.ColumnMapping.Builder
-
Adds a mapping to store the record field name in a column using
the family and qualifier.
- column(String, String, String) - Static method in class org.kitesdk.data.FieldMapping
-
- ColumnMapping - Class in org.kitesdk.data
-
A descriptor for an entity's column mappings, which defines how an entity
maps to a columnar store.
- columnMapping(ColumnMapping) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's column mapping descriptor (optional)
- columnMapping(File) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's column mapping descriptor from a File.
- columnMapping(InputStream) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's column mapping descriptor from an InputStream.
- ColumnMapping.Builder - Class in org.kitesdk.data
-
- ColumnMapping.Builder() - Constructor for class org.kitesdk.data.ColumnMapping.Builder
-
- columnMappingLiteral(String) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's column mappings from a String literal.
- columnMappingUri(URI) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's column mappings from a URI.
- columnMappingUri(String) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's column mappings from a String URI.
- Command - Interface in org.kitesdk.morphline.api
-
A command transforms a record into zero or more records.
- CommandBuilder - Interface in org.kitesdk.morphline.api
-
Interface to create instances of morphline commands.
- commitTransaction() - Method in interface org.kitesdk.morphline.solr.DocumentLoader
-
Sends any outstanding documents to the destination and waits for a positive
or negative ack (i.e.
- commitTransaction() - Method in class org.kitesdk.morphline.solr.SolrServerDocumentLoader
-
- compile(File, String, MorphlineContext, Command, Config...) - Method in class org.kitesdk.morphline.base.Compiler
-
Parses the given morphlineFile, then finds the morphline with the given morphlineId within,
then compiles the morphline and returns the corresponding morphline command.
- compile(Config, MorphlineContext, Command) - Method in class org.kitesdk.morphline.base.Compiler
-
Compiles the given morphline config using the given morphline context.
- Compiler - Class in org.kitesdk.morphline.base
-
Tool to parse and compile a morphline file or morphline config.
- Compiler() - Constructor for class org.kitesdk.morphline.base.Compiler
-
- CompressionType - Enum in org.kitesdk.data
-
Contains constant definitions for the standard compression types we support.
- compressionType(CompressionType) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's compression format (optional).
- compressionType(String) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's compression format (optional).
- ConcurrentSchemaModificationException - Exception in org.kitesdk.data
-
Exception thrown when a schema modification collides with
another client trying to modify the schema of the same
dataset.
- ConcurrentSchemaModificationException(String) - Constructor for exception org.kitesdk.data.ConcurrentSchemaModificationException
-
- ConcurrentSchemaModificationException(Throwable) - Constructor for exception org.kitesdk.data.ConcurrentSchemaModificationException
-
- ConcurrentSchemaModificationException(String, Throwable) - Constructor for exception org.kitesdk.data.ConcurrentSchemaModificationException
-
- CONFIG_KEY_FIELD - Static variable in class org.kitesdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
-
- CONFIG_VALUE_FIELD - Static variable in class org.kitesdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
-
- Configs - Class in org.kitesdk.morphline.base
-
Helpers to traverse and read parts of a HOCON data structure.
- Configs() - Constructor for class org.kitesdk.morphline.base.Configs
-
- configure(Job) - Static method in class org.kitesdk.data.mapreduce.DatasetKeyInputFormat
-
Configures the Job to use the DatasetKeyInputFormat and
returns a helper to add further configuration.
- configure(Configuration) - Static method in class org.kitesdk.data.mapreduce.DatasetKeyInputFormat
-
Adds settings to Configuration to use DatasetKeyInputFormat
and returns a helper to add further configuration.
- configure(Job) - Static method in class org.kitesdk.data.mapreduce.DatasetKeyOutputFormat
-
Configures the Job to use the DatasetKeyOutputFormat and
returns a helper to add further configuration.
- configure(Configuration) - Static method in class org.kitesdk.data.mapreduce.DatasetKeyOutputFormat
-
Returns a helper to add output options to the given Configuration.
- configure(Service.ServiceConfig) - Method in class org.kitesdk.minicluster.FlumeService
-
- configure(Service.ServiceConfig) - Method in class org.kitesdk.minicluster.HBaseService
-
- configure(Service.ServiceConfig) - Method in class org.kitesdk.minicluster.HdfsService
-
- configure(Service.ServiceConfig) - Method in class org.kitesdk.minicluster.HiveService
-
- configure(Service.ServiceConfig) - Method in interface org.kitesdk.minicluster.Service
-
Configure this service with the ServiceConfig
- configure(Service.ServiceConfig) - Method in class org.kitesdk.minicluster.ZookeeperService
-
- configure(Job) - Static method in class org.kitesdk.tools.TaskUtil
-
Returns a configuration builder for the given Job.
- configure(Configuration) - Static method in class org.kitesdk.tools.TaskUtil
-
- contains(String) - Method in class org.kitesdk.minicluster.Service.ServiceConfig
-
- ContainsBuilder - Class in org.kitesdk.morphline.stdlib
-
Command that succeeds if one of the field values of the given named field is equal to one of the
given values, and fails otherwise; multiple fields can be named, in which case the results
are ANDed.
- ContainsBuilder() - Constructor for class org.kitesdk.morphline.stdlib.ContainsBuilder
-
- containsLifecycleEvent(Record, Notifications.LifecycleEvent) - Static method in class org.kitesdk.morphline.base.Notifications
-
Determine whether or not the given notification contains the given lifecycle event.
- context - Variable in class org.kitesdk.morphline.api.MorphlineContext.Builder
-
- ConvertHTMLBuilder - Class in org.kitesdk.morphline.saxon
-
Command that converts HTML to XHTML using the
TagSoup library.
- ConvertHTMLBuilder() - Constructor for class org.kitesdk.morphline.saxon.ConvertHTMLBuilder
-
- ConvertTimestampBuilder - Class in org.kitesdk.morphline.stdlib
-
Command that converts the timestamps in a given field from one of a set of input date formats (in
an input timezone) to an output date format (in an output timezone), while respecting daylight
saving time rules.
- ConvertTimestampBuilder() - Constructor for class org.kitesdk.morphline.stdlib.ConvertTimestampBuilder
-
- copy() - Method in class org.kitesdk.morphline.api.Record
-
Returns a shallow copy of this record.
- CopyTask<E> - Class in org.kitesdk.tools
-
- CopyTask(View<E>, View<E>) - Constructor for class org.kitesdk.tools.CopyTask
-
- counter(String, String, String) - Method in class org.kitesdk.data.ColumnMapping.Builder
-
Adds a counter mapping to store record field name in a column
using the family and qualifier.
- counter(String, String, String) - Static method in class org.kitesdk.data.FieldMapping
-
- create(URI, DatasetDescriptor, Class<E>) - Static method in class org.kitesdk.data.Datasets
-
Create a Dataset for the given dataset or view URI.
- create(URI, DatasetDescriptor) - Static method in class org.kitesdk.data.Datasets
-
Create a Dataset for the given dataset or view URI.
- create(String, DatasetDescriptor, Class<E>) - Static method in class org.kitesdk.data.Datasets
-
Create a Dataset for the given dataset or view URI string.
- create(String, DatasetDescriptor) - Static method in class org.kitesdk.data.Datasets
-
Create a Dataset for the given dataset or view URI string.
- create() - Method in class org.kitesdk.morphline.api.MorphlineContext.Builder
-
- create(Path, FsPermission, boolean, int, short, long, Progressable) - Method in class org.kitesdk.morphline.hadoop.rcfile.SingleStreamFileSystem
-
- create() - Method in class org.kitesdk.morphline.solr.SolrMorphlineContext.Builder
-
- createRecordReader(InputSplit, TaskAttemptContext) - Method in class org.kitesdk.data.mapreduce.DatasetKeyInputFormat
-
- createSolrContentHandler(Metadata, SolrParams, IndexSchema) - Method in class org.kitesdk.morphline.solrcell.StripNonCharSolrContentHandlerFactory
-
- createSolrContentHandler(Metadata, SolrParams, IndexSchema) - Method in class org.kitesdk.morphline.solrcell.TrimSolrContentHandlerFactory
-
- CrunchDatasets - Class in org.kitesdk.data.crunch
-
- CrunchDatasets() - Constructor for class org.kitesdk.data.crunch.CrunchDatasets
-
- CSV - Static variable in class org.kitesdk.data.Formats
-
CSV: comma-separated values (read-only).
- DATANODE_HTTP_PORT - Static variable in class org.kitesdk.minicluster.HdfsService
-
- DATANODE_IPC_PORT - Static variable in class org.kitesdk.minicluster.HdfsService
-
- DATANODE_PORT - Static variable in class org.kitesdk.minicluster.HdfsService
-
- Dataset<E> - Interface in org.kitesdk.data
-
A logical representation of a set of data entities.
- dataset(String) - Method in class org.kitesdk.data.URIBuilder
-
Sets the dataset name that is used to construct the dataset URI.
- DATASET_NAME_OPTION - Static variable in class org.kitesdk.data.URIBuilder
-
- DATASET_SCHEME - Static variable in class org.kitesdk.data.URIBuilder
-
- DatasetDescriptor - Class in org.kitesdk.data
-
The structural definition of a Dataset.
- DatasetDescriptor(Schema, URL, Format, URI, Map<String, String>, PartitionStrategy) - Constructor for class org.kitesdk.data.DatasetDescriptor
-
- DatasetDescriptor(Schema, URL, Format, URI, Map<String, String>, PartitionStrategy, ColumnMapping) - Constructor for class org.kitesdk.data.DatasetDescriptor
-
- DatasetDescriptor(Schema, URI, Format, URI, Map<String, String>, PartitionStrategy, ColumnMapping, CompressionType) - Constructor for class org.kitesdk.data.DatasetDescriptor
-
- DatasetDescriptor.Builder - Class in org.kitesdk.data
-
- DatasetDescriptor.Builder() - Constructor for class org.kitesdk.data.DatasetDescriptor.Builder
-
- DatasetDescriptor.Builder(DatasetDescriptor) - Constructor for class org.kitesdk.data.DatasetDescriptor.Builder
-
Creates a Builder configured to copy descriptor, if it is not
modified.
- DatasetException - Exception in org.kitesdk.data
-
Exception thrown for dataset-related failures.
- DatasetException() - Constructor for exception org.kitesdk.data.DatasetException
-
- DatasetException(String) - Constructor for exception org.kitesdk.data.DatasetException
-
- DatasetException(String, Throwable) - Constructor for exception org.kitesdk.data.DatasetException
-
- DatasetException(Throwable) - Constructor for exception org.kitesdk.data.DatasetException
-
- DatasetExistsException - Exception in org.kitesdk.data
-
Exception thrown when trying to create a dataset that already exists.
- DatasetExistsException(String) - Constructor for exception org.kitesdk.data.DatasetExistsException
-
- DatasetExistsException(String, Throwable) - Constructor for exception org.kitesdk.data.DatasetExistsException
-
- DatasetExistsException(Throwable) - Constructor for exception org.kitesdk.data.DatasetExistsException
-
- DatasetIOException - Exception in org.kitesdk.data
-
Exception thrown for dataset IO-related failures.
- DatasetIOException(String, IOException) - Constructor for exception org.kitesdk.data.DatasetIOException
-
- DatasetKeyInputFormat<E> - Class in org.kitesdk.data.mapreduce
-
A MapReduce InputFormat for reading from a Dataset.
- DatasetKeyInputFormat() - Constructor for class org.kitesdk.data.mapreduce.DatasetKeyInputFormat
-
- DatasetKeyInputFormat.ConfigBuilder - Class in org.kitesdk.data.mapreduce
-
- DatasetKeyOutputFormat<E> - Class in org.kitesdk.data.mapreduce
-
A MapReduce OutputFormat for writing to a Dataset.
- DatasetKeyOutputFormat() - Constructor for class org.kitesdk.data.mapreduce.DatasetKeyOutputFormat
-
- DatasetKeyOutputFormat.ConfigBuilder - Class in org.kitesdk.data.mapreduce
-
- DatasetNotFoundException - Exception in org.kitesdk.data
-
Exception thrown to indicate that a dataset was not found.
- DatasetNotFoundException(String) - Constructor for exception org.kitesdk.data.DatasetNotFoundException
-
- DatasetNotFoundException(String, Throwable) - Constructor for exception org.kitesdk.data.DatasetNotFoundException
-
- DatasetNotFoundException(Throwable) - Constructor for exception org.kitesdk.data.DatasetNotFoundException
-
- DatasetOperationException - Exception in org.kitesdk.data
-
DatasetException thrown when an implementation-specific step fails
and prevents a dataset operation from completing successfully.
- DatasetOperationException(String, Object...) - Constructor for exception org.kitesdk.data.DatasetOperationException
-
- DatasetOperationException(Throwable, String, Object...) - Constructor for exception org.kitesdk.data.DatasetOperationException
-
- DatasetOperationException(String, Throwable) - Constructor for exception org.kitesdk.data.DatasetOperationException
-
- DatasetReader<E> - Interface in org.kitesdk.data
-
A stream-oriented dataset reader.
- DatasetRecordException - Exception in org.kitesdk.data
-
A DatasetException that signals that an individual record has failed.
- DatasetRecordException(String) - Constructor for exception org.kitesdk.data.DatasetRecordException
-
- DatasetRecordException(String, Throwable) - Constructor for exception org.kitesdk.data.DatasetRecordException
-
- Datasets - Class in org.kitesdk.data
-
Methods for working with Dataset instances.
- Datasets() - Constructor for class org.kitesdk.data.Datasets
-
- DatasetWriter<E> - Interface in org.kitesdk.data
-
A stream-oriented dataset writer.
- dateFormat(String, String, String) - Method in class org.kitesdk.data.PartitionStrategy.Builder
-
Configure a partitioner that applies a custom date format to a timestamp
field.
- day(String, String) - Method in class org.kitesdk.data.PartitionStrategy.Builder
-
Configure a partitioner for extracting the day from a timestamp field.
- day(String) - Method in class org.kitesdk.data.PartitionStrategy.Builder
-
Configure a partitioner for extracting the day from a timestamp field.
- DecodeBase64Builder - Class in org.kitesdk.morphline.stdlib
-
Command that converts a Base64 encoded String to a byte[] per Section 6.8.
- DecodeBase64Builder() - Constructor for class org.kitesdk.morphline.stdlib.DecodeBase64Builder
-
- DecompressBuilder - Class in org.kitesdk.morphline.tika.decompress
-
Command that decompresses the first attachment.
- DecompressBuilder() - Constructor for class org.kitesdk.morphline.tika.decompress.DecompressBuilder
-
- delete(URI) - Static method in class org.kitesdk.data.Datasets
-
Delete a Dataset identified by the given dataset URI.
- delete(String) - Static method in class org.kitesdk.data.Datasets
-
Delete a Dataset identified by the given dataset URI string.
- delete(Key) - Method in interface org.kitesdk.data.RandomAccessDataset
-
Deletes the entity in the dataset with Key key.
- delete(E) - Method in interface org.kitesdk.data.RandomAccessDataset
-
Deletes the entity passed to this method in the dataset.
- delete(Path, boolean) - Method in class org.kitesdk.morphline.hadoop.rcfile.SingleStreamFileSystem
-
- delete(Path) - Method in class org.kitesdk.morphline.hadoop.rcfile.SingleStreamFileSystem
-
- deleteAll() - Method in interface org.kitesdk.data.View
-
- deleteById(String) - Method in interface org.kitesdk.morphline.solr.DocumentLoader
-
Deletes the document with the given id from the destination
- deleteById(String) - Method in class org.kitesdk.morphline.solr.SolrServerDocumentLoader
-
- deleteByQuery(String) - Method in interface org.kitesdk.morphline.solr.DocumentLoader
-
Deletes the documents matching the given query from the destination
- deleteByQuery(String) - Method in class org.kitesdk.morphline.solr.SolrServerDocumentLoader
-
- dependencies() - Method in class org.kitesdk.minicluster.FlumeService
-
- dependencies() - Method in class org.kitesdk.minicluster.HBaseService
-
- dependencies() - Method in class org.kitesdk.minicluster.HdfsService
-
- dependencies() - Method in class org.kitesdk.minicluster.HiveService
-
- dependencies() - Method in interface org.kitesdk.minicluster.Service
-
Get the other Service implementations this Service has as a dependency.
- dependencies() - Method in class org.kitesdk.minicluster.ZookeeperService
-
- detectCharset(Record, Charset) - Method in class org.kitesdk.morphline.stdio.AbstractParser
-
- DetectMimeTypeBuilder - Class in org.kitesdk.morphline.tika
-
Command that auto-detects the MIME type of the first attachment, if no MIME type is defined yet.
- DetectMimeTypeBuilder() - Constructor for class org.kitesdk.morphline.tika.DetectMimeTypeBuilder
-
- DocumentLoader - Interface in org.kitesdk.morphline.solr
-
A vehicle to load (or delete) Solr documents into some kind of destination,
such as a SolrServer or MapReduce RecordWriter.
This class should be considered private and its API is subject to change without notice.
- doNotify(Record) - Method in class org.kitesdk.morphline.base.AbstractCommand
-
Processes the given notification on the control plane of the subtree rooted at this command.
- doProcess(Record) - Method in class org.kitesdk.morphline.base.AbstractCommand
-
Processes the given record on the data plane of this command.
- doProcess(Record) - Method in class org.kitesdk.morphline.stdio.AbstractParser
-
- doProcess(Record, InputStream) - Method in class org.kitesdk.morphline.stdio.AbstractParser
-
- doProcess(Record) - Method in class org.kitesdk.morphline.stdlib.AbstractFieldTransformCommand
-
- DownloadHdfsFileBuilder - Class in org.kitesdk.morphline.hadoop.core
-
Command for transferring HDFS files, for example to help with centralized configuration file
management.
- DownloadHdfsFileBuilder() - Constructor for class org.kitesdk.morphline.hadoop.core.DownloadHdfsFileBuilder
-
- DropRecordBuilder - Class in org.kitesdk.morphline.stdlib
-
Command that silently consumes records without ever emitting any record - think /dev/null.
- DropRecordBuilder() - Constructor for class org.kitesdk.morphline.stdlib.DropRecordBuilder
-
- DRY_RUN_SETTING_NAME - Static variable in class org.kitesdk.morphline.api.TypedSettings
-
- ELAPSED_TIME - Static variable in class org.kitesdk.morphline.base.Metrics
-
- equals(Object) - Method in class org.kitesdk.data.ColumnMapping
-
- equals(Object) - Method in class org.kitesdk.data.DatasetDescriptor
-
- equals(Object) - Method in class org.kitesdk.data.FieldMapping
-
- equals(Object) - Method in class org.kitesdk.data.Format
-
- equals(Object) - Method in class org.kitesdk.data.Key
-
- equals(Object) - Method in class org.kitesdk.data.PartitionStrategy
-
- equals(Object) - Method in class org.kitesdk.morphline.api.Record
-
- EqualsBuilder - Class in org.kitesdk.morphline.stdlib
-
Command that succeeds if all field values of the given named fields are equal to the given
values, and fails otherwise.
- EqualsBuilder() - Constructor for class org.kitesdk.morphline.stdlib.EqualsBuilder
-
- evaluate(Record) - Method in class org.kitesdk.morphline.base.FieldExpression
-
Returns the values of a Record referred to by the given field expression
- evaluate(Object...) - Method in class org.kitesdk.morphline.scriptengine.java.ScriptEvaluator
-
- ExceptionHandler - Interface in org.kitesdk.morphline.api
-
Morphline-wide default handler that commands can choose to use to handle exceptions.
- exists(URI) - Static method in class org.kitesdk.data.Datasets
-
Check whether a Dataset identified by the given URI exists.
- exists(String) - Static method in class org.kitesdk.data.Datasets
-
Check whether a Dataset identified by the given URI string exists.
- ExtractAvroPathsBuilder - Class in org.kitesdk.morphline.avro
-
Command that uses zero or more avro path expressions to extract values from an Avro object.
- ExtractAvroPathsBuilder() - Constructor for class org.kitesdk.morphline.avro.ExtractAvroPathsBuilder
-
- ExtractAvroTreeBuilder - Class in org.kitesdk.morphline.avro
-
Command that converts an attached Avro datum to a morphline record by recursively walking the
Avro tree and extracting all data into a single morphline record, with fields named by their path
in the Avro tree.
- ExtractAvroTreeBuilder() - Constructor for class org.kitesdk.morphline.avro.ExtractAvroTreeBuilder
-
- ExtractJsonPathsBuilder - Class in org.kitesdk.morphline.json
-
Command that uses zero or more JSON path expressions to extract values from a JSON object.
- ExtractJsonPathsBuilder() - Constructor for class org.kitesdk.morphline.json.ExtractJsonPathsBuilder
-
- ExtractProtobufPathsBuilder - Class in org.kitesdk.morphline.protobuf
-
Command that uses zero or more protocol buffer path expressions to extract
values from a protocol buffer object.
- ExtractProtobufPathsBuilder() - Constructor for class org.kitesdk.morphline.protobuf.ExtractProtobufPathsBuilder
-
- ExtractURIComponentBuilder - Class in org.kitesdk.morphline.stdlib
-
A command that extracts a subcomponent from the URIs contained in the given input field and
appends it to the given output fields.
- ExtractURIComponentBuilder() - Constructor for class org.kitesdk.morphline.stdlib.ExtractURIComponentBuilder
-
- ExtractURIComponentsBuilder - Class in org.kitesdk.morphline.stdlib
-
A command that extracts subcomponents from the URIs contained in the given input field and
appends them to output fields with the given prefix, namely scheme, authority, host, port, path,
query, fragment, schemeSpecificPart, userInfo.
- ExtractURIComponentsBuilder() - Constructor for class org.kitesdk.morphline.stdlib.ExtractURIComponentsBuilder
-
- ExtractURIQueryParametersBuilder - Class in org.kitesdk.morphline.stdlib
-
A command that extracts the query parameters with a given name from the URIs contained in the
given input field and appends them to the given output field.
- ExtractURIQueryParametersBuilder() - Constructor for class org.kitesdk.morphline.stdlib.ExtractURIQueryParametersBuilder
-
- FaultTolerance - Class in org.kitesdk.morphline.base
-
Mission critical, large-scale online production systems need to make progress without downtime
despite some issues.
- FaultTolerance(boolean, boolean) - Constructor for class org.kitesdk.morphline.base.FaultTolerance
-
- FaultTolerance(boolean, boolean, String) - Constructor for class org.kitesdk.morphline.base.FaultTolerance
-
- FIELD_NAME - Static variable in class org.kitesdk.morphline.stdlib.AddCurrentTimeBuilder
-
- FIELD_NAME - Static variable in class org.kitesdk.morphline.stdlib.AddLocalHostBuilder
-
- FIELD_NAME - Static variable in class org.kitesdk.morphline.stdlib.GenerateUUIDBuilder
-
- FieldExpression - Class in org.kitesdk.morphline.base
-
Helper to fetch the values of a field of a Record referred to by a field expression,
which is a String of the form @{fieldname}.
- FieldExpression(String, Config) - Constructor for class org.kitesdk.morphline.base.FieldExpression
-
- fieldMapping(FieldMapping) - Method in class org.kitesdk.data.ColumnMapping.Builder
-
- FieldMapping - Class in org.kitesdk.data
-
Represents how to store a Schema field.
- FieldMapping.MappingType - Enum in org.kitesdk.data
-
The supported Mapping Types, which control how an entity field maps to
columns in an HBase table.
- fieldMappings(Collection<FieldMapping>) - Method in class org.kitesdk.data.ColumnMapping.Builder
-
- Fields - Class in org.kitesdk.morphline.base
-
Common record field names.
- Fields() - Constructor for class org.kitesdk.morphline.base.Fields
-
- FILE_UPLOAD_URL - Static variable in class org.kitesdk.morphline.hadoop.parquet.avro.ReadAvroParquetFileBuilder
-
The morphline record field containing the HDFS Path of the Parquet file to read
- find(String, Config, String) - Method in class org.kitesdk.morphline.base.Compiler
-
Finds the given morphline id within the given morphline config, using the given nameForErrorMsg
for error reporting.
- FindReplaceBuilder - Class in org.kitesdk.morphline.stdlib
-
Command that examines each string value in a given field and replaces each substring of the string value
that matches the given string literal or grok pattern with the given replacement.
- FindReplaceBuilder() - Constructor for class org.kitesdk.morphline.stdlib.FindReplaceBuilder
-
- FLUME_AGENT_NAME - Static variable in class org.kitesdk.minicluster.MiniCluster
-
- FLUME_CONFIGURATION - Static variable in class org.kitesdk.minicluster.MiniCluster
-
- flumeAgentName(String) - Method in class org.kitesdk.minicluster.MiniCluster.Builder
-
- flumeConfiguration(String) - Method in class org.kitesdk.minicluster.MiniCluster.Builder
-
- FlumeService - Class in org.kitesdk.minicluster
-
- FlumeService() - Constructor for class org.kitesdk.minicluster.FlumeService
-
- flush() - Method in interface org.kitesdk.data.Flushable
-
Ensure that data has been flushed to OS buffers on all replica data nodes.
- Flushable - Interface in org.kitesdk.data
-
A writer that can guarantee data is present on data nodes.
- format(Format) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's format (optional).
- format(String) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's format from a format name String (optional).
- format(String, Object...) - Static method in exception org.kitesdk.data.DatasetException
-
- Format - Class in org.kitesdk.data
-
The data format used for encoding the data in a Dataset.
- Formats - Class in org.kitesdk.data
-
Contains constant definitions for the standard Format instances
supported by the library.
- forName(String) - Static method in enum org.kitesdk.data.CompressionType
-
Return a CompressionType for the compression type name specified.
- from(String, Comparable) - Method in interface org.kitesdk.data.RefinableView
-
Creates a sub-View, restricted to entities whose name
field is greater than or equal to the given value.
- fromAfter(String, Comparable) - Method in interface org.kitesdk.data.RefinableView
-
Creates a sub-View, restricted to entities whose name
field is greater than the given value.
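The only difference between from and fromAfter is whether the lower bound is inclusive or exclusive. The sketch below shows that distinction with plain predicates. It is not the RefinableView API: LowerBoundSketch and its methods are hypothetical names, and the field lookup by name is left out.

```java
import java.util.Arrays;
import java.util.List;
import java.util.function.Predicate;
import java.util.stream.Collectors;

// Hypothetical stand-in for illustration; NOT the Kite RefinableView API.
final class LowerBoundSketch {

    // Inclusive lower bound: keeps candidates >= value (mirrors "from").
    static <T extends Comparable<T>> Predicate<T> from(T value) {
        return candidate -> candidate.compareTo(value) >= 0;
    }

    // Exclusive lower bound: keeps candidates > value (mirrors "fromAfter").
    static <T extends Comparable<T>> Predicate<T> fromAfter(T value) {
        return candidate -> candidate.compareTo(value) > 0;
    }

    static <T> List<T> filter(List<T> values, Predicate<T> predicate) {
        return values.stream().filter(predicate).collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<Integer> years = Arrays.asList(2012, 2013, 2014);
        System.out.println(filter(years, from(2013)));      // [2013, 2014]
        System.out.println(filter(years, fromAfter(2013))); // [2014]
    }
}
```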
- fromString(String) - Static method in class org.kitesdk.data.Formats
-
Return a Format for the format name specified.
- GenerateSolrSequenceKeyBuilder - Class in org.kitesdk.morphline.solr
-
A command that assigns a record unique key that is the concatenation of the given
baseIdField record field, followed by a running count of the record number within
the current session.
- GenerateSolrSequenceKeyBuilder() - Constructor for class org.kitesdk.morphline.solr.GenerateSolrSequenceKeyBuilder
-
- GenerateUUIDBuilder - Class in org.kitesdk.morphline.stdlib
-
A command that sets a universally unique identifier on all records that are intercepted.
- GenerateUUIDBuilder() - Constructor for class org.kitesdk.morphline.stdlib.GenerateUUIDBuilder
-
- GeoIPBuilder - Class in org.kitesdk.morphline.maxmind
-
Command that returns geolocation information for a given IP address, using an efficient in-memory
MaxMind database lookup.
- GeoIPBuilder() - Constructor for class org.kitesdk.morphline.maxmind.GeoIPBuilder
-
- get(int) - Method in class org.kitesdk.data.Key
-
Returns the value for index.
- get(Key) - Method in interface org.kitesdk.data.RandomAccessDataset
-
Return the entity stored in the dataset at the row specified with Key key.
- get(String) - Method in class org.kitesdk.minicluster.Service.ServiceConfig
-
- get(String) - Method in class org.kitesdk.morphline.api.Record
-
Returns a view of the values associated with the given key.
- getBatchSize() - Method in class org.kitesdk.morphline.solr.SolrLocator
-
- getBoolean(String, boolean) - Method in class org.kitesdk.morphline.api.TypedSettings
-
- getBoolean(Config, String, boolean) - Method in class org.kitesdk.morphline.base.Configs
-
- getBoolean(Config, String) - Method in class org.kitesdk.morphline.base.Configs
-
- getCardinality() - Method in class org.kitesdk.data.PartitionStrategy
-
Return the cardinality produced by the contained field partitioners.
- getCharset(Config, String, Charset) - Method in class org.kitesdk.morphline.base.Configs
-
- getChild() - Method in class org.kitesdk.morphline.base.AbstractCommand
-
Returns the child of this command.
- getCollectionName() - Method in class org.kitesdk.morphline.solr.SolrLocator
-
- getColumnMapping() - Method in class org.kitesdk.data.DatasetDescriptor
-
- getCommandBuilder(String) - Method in class org.kitesdk.morphline.api.MorphlineContext
-
- getCompressionType() - Method in class org.kitesdk.data.DatasetDescriptor
-
- getConf() - Method in class org.kitesdk.data.mapreduce.DatasetKeyInputFormat
-
- getConfig() - Method in class org.kitesdk.morphline.base.AbstractCommand
-
Returns the JSON configuration of this command.
- getConfig(Config, String, Config) - Method in class org.kitesdk.morphline.base.Configs
-
- getConfig(Config, String) - Method in class org.kitesdk.morphline.base.Configs
-
- getConfigList(Config, String, List<? extends Config>) - Method in class org.kitesdk.morphline.base.Configs
-
- getConfigList(Config, String) - Method in class org.kitesdk.morphline.base.Configs
-
- getConfigs() - Method in class org.kitesdk.morphline.base.AbstractCommand
-
Returns a helper for convenient access to the JSON configuration of this command.
- getContext() - Method in class org.kitesdk.morphline.base.AbstractCommand
-
Returns the context of this command.
- getCount() - Method in class org.kitesdk.tools.TransformTask
-
- getCounter(String...) - Method in class org.kitesdk.morphline.base.AbstractCommand
-
Returns a counter for maintaining main-memory metrics.
- getDataset() - Method in interface org.kitesdk.data.View
-
Returns the underlying Dataset for the View.
- getDefaultCompressionType() - Method in class org.kitesdk.data.Format
-
- getDescriptor() - Method in interface org.kitesdk.data.Dataset
-
- getDocumentLoader() - Method in class org.kitesdk.morphline.solr.SolrMorphlineContext
-
- getDouble(String, double) - Method in class org.kitesdk.morphline.api.TypedSettings
-
- getDouble(Config, String, double) - Method in class org.kitesdk.morphline.base.Configs
-
- getDouble(Config, String) - Method in class org.kitesdk.morphline.base.Configs
-
- getEntrySet(Config) - Method in class org.kitesdk.morphline.base.Configs
-
- getExceptionHandler() - Method in class org.kitesdk.morphline.api.MorphlineContext
-
- getExtension() - Method in class org.kitesdk.data.Format
-
Get the extension for use in filenames.
- getFamily() - Method in class org.kitesdk.data.FieldMapping
-
- getFamilyAsString() - Method in class org.kitesdk.data.FieldMapping
-
- getFieldMapping(String) - Method in class org.kitesdk.data.ColumnMapping
-
- getFieldMappings() - Method in class org.kitesdk.data.ColumnMapping
-
- getFieldName() - Method in class org.kitesdk.data.FieldMapping
-
- getFields() - Method in class org.kitesdk.morphline.api.Record
-
Returns the fields that are stored in this record.
- getFileStatus(Path) - Method in class org.kitesdk.morphline.hadoop.rcfile.SingleStreamFileSystem
-
- getFirstValue(String) - Method in class org.kitesdk.morphline.api.Record
-
Returns the first value associated with the given key, or null if no such value exists.
- getFloat(String, float) - Method in class org.kitesdk.morphline.api.TypedSettings
-
- getFormat() - Method in class org.kitesdk.data.DatasetDescriptor
-
Get the associated Format the data is stored in.
- getHadoopConf() - Method in class org.kitesdk.minicluster.FlumeService
-
- getHadoopConf() - Method in class org.kitesdk.minicluster.HBaseService
-
- getHadoopConf() - Method in class org.kitesdk.minicluster.HdfsService
-
- getHadoopConf() - Method in class org.kitesdk.minicluster.HiveService
-
- getHadoopConf() - Method in interface org.kitesdk.minicluster.Service
-
Get the Hadoop configuration object for this service.
- getHadoopConf() - Method in class org.kitesdk.minicluster.Service.ServiceConfig
-
- getHadoopConf() - Method in class org.kitesdk.minicluster.ZookeeperService
-
- getHealthCheckRegistry() - Method in class org.kitesdk.morphline.api.MorphlineContext
-
- getHistogram(String...) - Method in class org.kitesdk.morphline.base.AbstractCommand
-
Returns a histogram for maintaining main-memory metrics.
- getIndexSchema() - Method in class org.kitesdk.morphline.solr.SolrLocator
-
- getIndexSchema() - Method in class org.kitesdk.morphline.solr.SolrMorphlineContext
-
- getInt(String, int) - Method in class org.kitesdk.morphline.api.TypedSettings
-
- getInt(Config, String, int) - Method in class org.kitesdk.morphline.base.Configs
-
- getInt(Config, String) - Method in class org.kitesdk.morphline.base.Configs
-
- getIOException() - Method in exception org.kitesdk.data.DatasetIOException
-
- getLifecycleEvents(Record) - Static method in class org.kitesdk.morphline.base.Notifications
-
Get all lifecycle events from the given record.
- getLoader() - Method in class org.kitesdk.morphline.solr.SolrLocator
-
- getLocale(Config, String, Locale) - Method in class org.kitesdk.morphline.base.Configs
-
- getLocale(Config, String) - Method in class org.kitesdk.morphline.base.Configs
-
- getLocation() - Method in class org.kitesdk.data.DatasetDescriptor
-
Get the URL location where the data for this Dataset is stored (optional).
- getLong(String, int) - Method in class org.kitesdk.morphline.api.TypedSettings
-
- getLong(Config, String, long) - Method in class org.kitesdk.morphline.base.Configs
-
- getLong(Config, String) - Method in class org.kitesdk.morphline.base.Configs
-
- getMappingType() - Method in class org.kitesdk.data.FieldMapping
-
- getMeter(String...) - Method in class org.kitesdk.morphline.base.AbstractCommand
-
Returns a meter for maintaining main-memory metrics.
- getMetricRegistry() - Method in class org.kitesdk.morphline.api.MorphlineContext
-
- getName() - Method in enum org.kitesdk.data.CompressionType
-
Get the String name for this compression type.
- getName() - Method in interface org.kitesdk.data.Dataset
-
Get the name of a Dataset.
- getName() - Method in class org.kitesdk.data.Format
-
Get the format's name.
- getNames() - Method in interface org.kitesdk.morphline.api.CommandBuilder
-
Returns the names with which this command can be invoked.
- getNames() - Method in class org.kitesdk.morphline.avro.ExtractAvroPathsBuilder
-
- getNames() - Method in class org.kitesdk.morphline.avro.ExtractAvroTreeBuilder
-
- getNames() - Method in class org.kitesdk.morphline.avro.ReadAvroBuilder
-
- getNames() - Method in class org.kitesdk.morphline.avro.ReadAvroContainerBuilder
-
- getNames() - Method in class org.kitesdk.morphline.avro.ToAvroBuilder
-
- getNames() - Method in class org.kitesdk.morphline.avro.ToAvroMapBuilder
-
- getNames() - Method in class org.kitesdk.morphline.avro.WriteAvroToByteArrayBuilder
-
- getNames() - Method in class org.kitesdk.morphline.hadoop.core.DownloadHdfsFileBuilder
-
- getNames() - Method in class org.kitesdk.morphline.hadoop.core.OpenHdfsFileBuilder
-
- getNames() - Method in class org.kitesdk.morphline.hadoop.parquet.avro.ReadAvroParquetFileBuilder
-
- getNames() - Method in class org.kitesdk.morphline.hadoop.rcfile.ReadRCFileBuilder
-
- getNames() - Method in class org.kitesdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
-
- getNames() - Method in class org.kitesdk.morphline.json.ExtractJsonPathsBuilder
-
- getNames() - Method in class org.kitesdk.morphline.json.ReadJsonBuilder
-
- getNames() - Method in class org.kitesdk.morphline.maxmind.GeoIPBuilder
-
- getNames() - Method in class org.kitesdk.morphline.metrics.servlets.RegisterJVMMetricsBuilder
-
- getNames() - Method in class org.kitesdk.morphline.metrics.servlets.StartReportingMetricsToHTTPBuilder
-
- getNames() - Method in class org.kitesdk.morphline.protobuf.ExtractProtobufPathsBuilder
-
- getNames() - Method in class org.kitesdk.morphline.protobuf.ReadProtobufBuilder
-
- getNames() - Method in class org.kitesdk.morphline.saxon.ConvertHTMLBuilder
-
- getNames() - Method in class org.kitesdk.morphline.saxon.XQueryBuilder
-
- getNames() - Method in class org.kitesdk.morphline.saxon.XSLTBuilder
-
- getNames() - Method in class org.kitesdk.morphline.solr.GenerateSolrSequenceKeyBuilder
-
- getNames() - Method in class org.kitesdk.morphline.solr.LoadSolrBuilder
-
- getNames() - Method in class org.kitesdk.morphline.solr.SanitizeUnknownSolrFieldsBuilder
-
- getNames() - Method in class org.kitesdk.morphline.solr.TokenizeTextBuilder
-
- getNames() - Method in class org.kitesdk.morphline.solrcell.SolrCellBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdio.ReadBlobBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdio.ReadClobBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdio.ReadCSVBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdio.ReadLineBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdio.ReadMultiLineBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.AddCurrentTimeBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.AddLocalHostBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.AddValuesBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.AddValuesIfAbsentBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.CallParentPipeBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.ContainsBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.ConvertTimestampBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.DecodeBase64Builder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.DropRecordBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.EqualsBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.ExtractURIComponentBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.ExtractURIComponentsBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.ExtractURIQueryParametersBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.FindReplaceBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.GenerateUUIDBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.GrokBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.HeadBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.IfThenElseBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.JavaBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.LogDebugBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.LogErrorBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.LogInfoBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.LogTraceBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.LogWarnBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.NotBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.PipeBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.RemoveFieldsBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.RemoveValuesBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.SampleBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.SeparateAttachmentsBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.SetValuesBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.SplitBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.SplitKeyValueBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.StartReportingMetricsToCSVBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.StartReportingMetricsToJMXBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.StartReportingMetricsToSLF4JBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.ToByteArrayBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.ToStringBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.TranslateBuilder
-
- getNames() - Method in class org.kitesdk.morphline.stdlib.TryRulesBuilder
-
- getNames() - Method in class org.kitesdk.morphline.tika.decompress.DecompressBuilder
-
- getNames() - Method in class org.kitesdk.morphline.tika.decompress.UnpackBuilder
-
- getNames() - Method in class org.kitesdk.morphline.tika.DetectMimeTypeBuilder
-
- getNames() - Method in class org.kitesdk.morphline.twitter.ReadJsonTestTweetsBuilder
-
- getNames() - Method in class org.kitesdk.morphline.useragent.UserAgentBuilder
-
- getNamespace() - Method in interface org.kitesdk.data.Dataset
-
Get the namespace that contains this Dataset.
- getNanoseconds(Config, String, long) - Method in class org.kitesdk.morphline.base.Configs
-
- getNanoseconds(Config, String) - Method in class org.kitesdk.morphline.base.Configs
-
- getObject(String, Object) - Method in class org.kitesdk.morphline.api.TypedSettings
-
- getOutputCommitter(TaskAttemptContext) - Method in class org.kitesdk.data.mapreduce.DatasetKeyOutputFormat
-
- getParent() - Method in interface org.kitesdk.morphline.api.Command
-
Returns the parent of this command.
- getParent() - Method in class org.kitesdk.morphline.base.AbstractCommand
-
- getPartitionStrategy() - Method in class org.kitesdk.data.DatasetDescriptor
-
- getPrefix() - Method in class org.kitesdk.data.FieldMapping
-
- getProperty(String) - Method in class org.kitesdk.data.DatasetDescriptor
-
Get a named property.
- getQualifier() - Method in class org.kitesdk.data.FieldMapping
-
- getQualifierAsString() - Method in class org.kitesdk.data.FieldMapping
-
- getRecordWriter(TaskAttemptContext) - Method in class org.kitesdk.data.mapreduce.DatasetKeyOutputFormat
-
- getRequiredColumnFamilies() - Method in class org.kitesdk.data.ColumnMapping
-
Get the column families required by this schema.
- getRequiredColumns() - Method in class org.kitesdk.data.ColumnMapping
-
Get the columns required by this schema.
- getSchema() - Method in class org.kitesdk.data.DatasetDescriptor
-
- getSchemaUrl() - Method in class org.kitesdk.data.DatasetDescriptor
-
Get a URL from which the Schema can be retrieved (optional).
- getServerUrl() - Method in class org.kitesdk.morphline.solr.SolrLocator
-
- getSettings() - Method in class org.kitesdk.morphline.api.MorphlineContext
-
- getSolrHomeDir() - Method in class org.kitesdk.morphline.solr.SolrLocator
-
- getSolrServer() - Method in class org.kitesdk.morphline.solr.SolrLocator
-
- getSolrServer() - Method in class org.kitesdk.morphline.solr.SolrServerDocumentLoader
-
- getSplits(JobContext) - Method in class org.kitesdk.data.mapreduce.DatasetKeyInputFormat
-
- getString(String, String) - Method in class org.kitesdk.morphline.api.TypedSettings
-
- getString(Config, String, String) - Method in class org.kitesdk.morphline.base.Configs
-
- getString(Config, String) - Method in class org.kitesdk.morphline.base.Configs
-
- getStringList(Config, String, List<String>) - Method in class org.kitesdk.morphline.base.Configs
-
- getStringList(Config, String) - Method in class org.kitesdk.morphline.base.Configs
-
- getSupportedCompressionTypes() - Method in class org.kitesdk.data.Format
-
- getTimer(String...) - Method in class org.kitesdk.morphline.base.AbstractCommand
-
Returns a timer for maintaining main-memory metrics.
- getTimeUnit(Config, String, TimeUnit) - Method in class org.kitesdk.morphline.base.Configs
-
- getTimeUnit(Config, String) - Method in class org.kitesdk.morphline.base.Configs
-
- getTimeUnit(String) - Method in class org.kitesdk.morphline.base.Configs
-
- getType() - Method in interface org.kitesdk.data.View
-
Get the runtime type of entities contained in this View.
- getTypedSettings() - Method in class org.kitesdk.morphline.api.MorphlineContext
-
- getUri() - Method in interface org.kitesdk.data.Dataset
-
Return a URI for this Dataset.
- getUri() - Method in interface org.kitesdk.data.View
-
Get a URI that identifies this View.
- getUri() - Method in class org.kitesdk.morphline.hadoop.rcfile.SingleStreamFileSystem
-
- getWorkingDirectory() - Method in class org.kitesdk.morphline.hadoop.rcfile.SingleStreamFileSystem
-
- getZkHost() - Method in class org.kitesdk.morphline.solr.SolrLocator
-
- GrokBuilder - Class in org.kitesdk.morphline.stdlib
-
The Grok command uses regular expression pattern matching to extract structured fields from
unstructured log data.
- GrokBuilder() - Constructor for class org.kitesdk.morphline.stdlib.GrokBuilder
-
- PARQUET - Static variable in class org.kitesdk.data.Formats
-
- parse(File, Config...) - Method in class org.kitesdk.morphline.base.Compiler
-
Loads the given config file from the local file system.
- partition(PCollection<E>, View<E>) - Static method in class org.kitesdk.data.crunch.CrunchDatasets
-
Partitions collection to be stored efficiently in View.
- partition(PCollection<E>, Dataset<E>) - Static method in class org.kitesdk.data.crunch.CrunchDatasets
-
Partitions collection to be stored efficiently in dataset.
- partition(PCollection<E>, View<E>, int) - Static method in class org.kitesdk.data.crunch.CrunchDatasets
-
Partitions collection to be stored efficiently in View.
- partitionStrategy(PartitionStrategy) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's partitioning strategy (optional).
- partitionStrategy(File) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's partition strategy from a File.
- partitionStrategy(InputStream) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's partition strategy from an InputStream.
- PartitionStrategy - Class in org.kitesdk.data
-
The strategy used to determine how a dataset is partitioned.
- PartitionStrategy.Builder - Class in org.kitesdk.data
-
- PartitionStrategy.Builder() - Constructor for class org.kitesdk.data.PartitionStrategy.Builder
-
- partitionStrategyLiteral(String) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's partition strategy from a String literal.
- partitionStrategyUri(URI) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's partition strategy from a URI.
- partitionStrategyUri(String) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's partition strategy from a String URI.
- ping() - Method in interface org.kitesdk.morphline.solr.DocumentLoader
-
Issues a ping request to check if the server is alive.
- ping() - Method in class org.kitesdk.morphline.solr.SolrServerDocumentLoader
-
- PipeBuilder - Class in org.kitesdk.morphline.stdlib
-
Factory to create morphline pipe instances.
- PipeBuilder() - Constructor for class org.kitesdk.morphline.stdlib.PipeBuilder
-
- populateAvroHeaders(Map<String, String>, Schema, Object) - Method in class org.kitesdk.data.flume.Log4jAppender
-
- PREFIX_NAME - Static variable in class org.kitesdk.morphline.stdlib.GenerateUUIDBuilder
-
- prepareClasspath(Configuration, Path, Class<?>...) - Method in class org.kitesdk.tools.JobClasspathHelper
-
- PRESERVE_EXISTING_NAME - Static variable in class org.kitesdk.morphline.stdlib.AddCurrentTimeBuilder
-
- PRESERVE_EXISTING_NAME - Static variable in class org.kitesdk.morphline.stdlib.AddLocalHostBuilder
-
- PRESERVE_EXISTING_NAME - Static variable in class org.kitesdk.morphline.stdlib.GenerateUUIDBuilder
-
- PrettyPrinter - Class in org.kitesdk.morphline.base
-
Utility to nicely indent and format a morphline config file.
- PrettyPrinter() - Constructor for class org.kitesdk.morphline.base.PrettyPrinter
-
- process(Record) - Method in interface org.kitesdk.morphline.api.Command
-
Processes the given record on the data plane of this command.
- process(Record) - Method in class org.kitesdk.morphline.base.AbstractCommand
-
- property(String, String) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Add a key-value property to the descriptor.
- provided(String) - Method in class org.kitesdk.data.PartitionStrategy.Builder
-
Configure a partitioner that uses values always provided at runtime.
- provided(String, String) - Method in class org.kitesdk.data.PartitionStrategy.Builder
-
Configure a partitioner that uses values always provided at runtime.
- put(E) - Method in interface org.kitesdk.data.RandomAccessDataset
-
Put the entity into the dataset.
- put(String, Object) - Method in class org.kitesdk.morphline.api.Record
-
Adds the given value to the values currently associated with the given key.
- putIfAbsent(String, Object) - Method in class org.kitesdk.morphline.api.Record
-
Adds the given value to the values currently associated with the given key, if and only if the key
isn't already associated with that same value.
- RandomAccessDataset<E> - Interface in org.kitesdk.data
-
A Dataset that supports random access operations on entities.
- range(String, int...) - Method in class org.kitesdk.data.PartitionStrategy.Builder
-
Configure a range partitioner with a set of upperBounds.
- range(String, String...) - Method in class org.kitesdk.data.PartitionStrategy.Builder
-
Configure a range partitioner for strings with a set of upperBounds.
- RC_FILE_META_DATA - Static variable in class org.kitesdk.morphline.hadoop.rcfile.ReadRCFileBuilder
-
- ReadAvroBuilder - Class in org.kitesdk.morphline.avro
-
Command that parses an InputStream that contains Avro data; for each Avro datum, the command
emits a morphline record containing the datum as an attachment in Fields.ATTACHMENT_BODY.
- ReadAvroBuilder() - Constructor for class org.kitesdk.morphline.avro.ReadAvroBuilder
-
- ReadAvroContainerBuilder - Class in org.kitesdk.morphline.avro
-
Command that parses an InputStream that contains Avro binary container file data; for each Avro
datum, the command emits a morphline record containing the datum as an attachment in
Fields.ATTACHMENT_BODY.
- ReadAvroContainerBuilder() - Constructor for class org.kitesdk.morphline.avro.ReadAvroContainerBuilder
-
- ReadAvroParquetFileBuilder - Class in org.kitesdk.morphline.hadoop.parquet.avro
-
Command that parses a Hadoop File that contains Parquet data; for each Parquet Avro
datum, the command emits a morphline record containing the record as an attachment in
Fields.ATTACHMENT_BODY.
- ReadAvroParquetFileBuilder() - Constructor for class org.kitesdk.morphline.hadoop.parquet.avro.ReadAvroParquetFileBuilder
-
- ReadBlobBuilder - Class in org.kitesdk.morphline.stdio
-
Command that emits one record for the entire input stream of the first attachment, interpreting
the stream as a Binary Large Object (BLOB).
- ReadBlobBuilder() - Constructor for class org.kitesdk.morphline.stdio.ReadBlobBuilder
-
- ReadClobBuilder - Class in org.kitesdk.morphline.stdio
-
Command that emits one record for the entire input stream of the first attachment, interpreting
the stream as a Character Large Object (CLOB).
- ReadClobBuilder() - Constructor for class org.kitesdk.morphline.stdio.ReadClobBuilder
-
- ReadCSVBuilder - Class in org.kitesdk.morphline.stdio
-
Command that extracts zero or more records from the input stream of the first attachment,
representing a Comma Separated Values (CSV) file.
- ReadCSVBuilder() - Constructor for class org.kitesdk.morphline.stdio.ReadCSVBuilder
-
- readFrom(URI) - Method in class org.kitesdk.data.mapreduce.DatasetKeyInputFormat.ConfigBuilder
-
Adds configuration for DatasetKeyInputFormat to read from the
given dataset or view URI.
- readFrom(View<?>) - Method in class org.kitesdk.data.mapreduce.DatasetKeyInputFormat.ConfigBuilder
-
Adds configuration for DatasetKeyInputFormat to read from the
given Dataset or View instance.
- readFrom(String) - Method in class org.kitesdk.data.mapreduce.DatasetKeyInputFormat.ConfigBuilder
-
Adds configuration for DatasetKeyInputFormat to read from the
given dataset or view URI string.
- ReadJsonBuilder - Class in org.kitesdk.morphline.json
-
Command that parses an InputStream that contains JSON data; for each JSON object in the stream,
the command emits a morphline record containing the object as an attachment in
Fields.ATTACHMENT_BODY.
- ReadJsonBuilder() - Constructor for class org.kitesdk.morphline.json.ReadJsonBuilder
-
- ReadJsonTestTweetsBuilder - Class in org.kitesdk.morphline.twitter
-
JSON parser that extracts search documents from tweets obtained from the Twitter 1% sample firehose with the delimited=length option.
- ReadJsonTestTweetsBuilder() - Constructor for class org.kitesdk.morphline.twitter.ReadJsonTestTweetsBuilder
-
- ReadLineBuilder - Class in org.kitesdk.morphline.stdio
-
Command that emits one record per line in the input stream of the first attachment.
- ReadLineBuilder() - Constructor for class org.kitesdk.morphline.stdio.ReadLineBuilder
-
- ReadMultiLineBuilder - Class in org.kitesdk.morphline.stdio
-
Multiline log parser that collapses multiline messages into a single record; supports "regex",
"what" and "negate" configuration parameters similar to logstash.
- ReadMultiLineBuilder() - Constructor for class org.kitesdk.morphline.stdio.ReadMultiLineBuilder
-
- ReadProtobufBuilder - Class in org.kitesdk.morphline.protobuf
-
Command that parses an InputStream that contains Protocol buffer data; the
command emits a morphline record containing the object as an attachment in
Fields.ATTACHMENT_BODY.
- ReadProtobufBuilder() - Constructor for class org.kitesdk.morphline.protobuf.ReadProtobufBuilder
-
- ReadRCFileBuilder - Class in org.kitesdk.morphline.hadoop.rcfile
-
Command for reading Record Columnar (RC) Files.
- ReadRCFileBuilder() - Constructor for class org.kitesdk.morphline.hadoop.rcfile.ReadRCFileBuilder
-
- ReadSequenceFileBuilder - Class in org.kitesdk.morphline.hadoop.sequencefile
-
Command that emits one record per sequence file entry in the input stream of the first attachment.
- ReadSequenceFileBuilder() - Constructor for class org.kitesdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
-
- Record - Class in org.kitesdk.morphline.api
-
A record is a set of named fields where each field has a list of one or more values.
- Record() - Constructor for class org.kitesdk.morphline.api.Record
-
Creates a new empty record.
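The Record entries in this index (get, getFirstValue, put, putIfAbsent, removeAll, replaceValues) describe a multimap from field names to lists of values. The sketch below models those described semantics with plain collections. RecordSketch is a hypothetical stand-in, not org.kitesdk.morphline.api.Record. In particular, its get returns a read-only list, which is an assumption of this sketch.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for illustration; NOT org.kitesdk.morphline.api.Record.
final class RecordSketch {
    private final Map<String, List<Object>> fields = new LinkedHashMap<>();

    // Values for the key, or an empty list (read-only here: an assumption of this sketch).
    List<Object> get(String key) {
        List<Object> values = fields.get(key);
        return values == null
            ? Collections.<Object>emptyList()
            : Collections.unmodifiableList(values);
    }

    // Appends the value to the values already associated with the key.
    void put(String key, Object value) {
        fields.computeIfAbsent(key, k -> new ArrayList<>()).add(value);
    }

    // Appends the value only if the key is not already associated with that same value.
    void putIfAbsent(String key, Object value) {
        if (!get(key).contains(value)) {
            put(key, value);
        }
    }

    // Removes all values for the key.
    void removeAll(String key) {
        fields.remove(key);
    }

    // Removes all values for the key, then associates the single given value.
    void replaceValues(String key, Object value) {
        removeAll(key);
        put(key, value);
    }

    // First value for the key, or null if there is none.
    Object getFirstValue(String key) {
        List<Object> values = get(key);
        return values.isEmpty() ? null : values.get(0);
    }

    public static void main(String[] args) {
        RecordSketch record = new RecordSketch();
        record.put("tag", "a");
        record.put("tag", "b");
        record.putIfAbsent("tag", "a");                  // no-op: "a" is already present
        System.out.println(record.get("tag"));           // [a, b]
        record.replaceValues("tag", "c");
        System.out.println(record.get("tag"));           // [c]
        record.removeAll("tag");
        System.out.println(record.getFirstValue("tag")); // null
    }
}
```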
- RECOVERABLE_EXCEPTION_CLASSES - Static variable in class org.kitesdk.morphline.base.FaultTolerance
-
- RefinableView<E> - Interface in org.kitesdk.data
-
A RefinableView specifies a subset of a Dataset by one or
more logical constraints.
- REGIONSERVER_PORT_KEY - Static variable in class org.kitesdk.minicluster.HBaseService
-
- RegisterJVMMetricsBuilder - Class in org.kitesdk.morphline.metrics.servlets
-
Command that registers metrics that are related to the Java Virtual Machine with the
MorphlineContext of the morphline.
- RegisterJVMMetricsBuilder() - Constructor for class org.kitesdk.morphline.metrics.servlets.RegisterJVMMetricsBuilder
-
- registerService(Class<? extends Service>) - Static method in class org.kitesdk.minicluster.MiniCluster
-
- remove() - Method in interface org.kitesdk.data.DatasetReader
-
Remove the last entity from the reader (OPTIONAL).
- removeAll(String) - Method in class org.kitesdk.morphline.api.Record
-
Removes all values that are associated with the given key.
- removeAttachments(Record) - Static method in class org.kitesdk.morphline.stdio.AbstractParser
-
- RemoveFieldsBuilder - Class in org.kitesdk.morphline.stdlib
-
Removes all record fields for which the field name matches at least one of the given blacklist
predicates but none of the given whitelist predicates.
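The blacklist/whitelist rule for RemoveFieldsBuilder (remove a field when its name matches at least one blacklist predicate and no whitelist predicate) can be sketched as below. RemoveFieldsSketch and its methods are hypothetical names, and plain java.util.function.Predicate objects stand in for whatever predicate syntax the morphline config uses. The sketch returns a filtered copy rather than modifying the record.

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Predicate;

// Hypothetical stand-in for illustration; NOT the Kite RemoveFieldsBuilder code.
final class RemoveFieldsSketch {

    // True when the name matches at least one blacklist predicate and no whitelist predicate.
    static boolean shouldRemove(String name,
                                List<Predicate<String>> blacklist,
                                List<Predicate<String>> whitelist) {
        boolean blacklisted = blacklist.stream().anyMatch(p -> p.test(name));
        boolean whitelisted = whitelist.stream().anyMatch(p -> p.test(name));
        return blacklisted && !whitelisted;
    }

    // Returns a copy of the record without the removed fields (the input is left untouched).
    static Map<String, Object> removeFields(Map<String, Object> record,
                                            List<Predicate<String>> blacklist,
                                            List<Predicate<String>> whitelist) {
        Map<String, Object> kept = new LinkedHashMap<>();
        for (Map.Entry<String, Object> entry : record.entrySet()) {
            if (!shouldRemove(entry.getKey(), blacklist, whitelist)) {
                kept.put(entry.getKey(), entry.getValue());
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        Map<String, Object> record = new LinkedHashMap<>();
        record.put("tmp_a", 1);
        record.put("tmp_keep", 2);
        record.put("id", 3);
        List<Predicate<String>> blacklist = Collections.singletonList(n -> n.startsWith("tmp_"));
        List<Predicate<String>> whitelist = Collections.singletonList(n -> n.equals("tmp_keep"));
        System.out.println(removeFields(record, blacklist, whitelist)); // {tmp_keep=2, id=3}
    }
}
```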
- RemoveFieldsBuilder() - Constructor for class org.kitesdk.morphline.stdlib.RemoveFieldsBuilder
-
- RemoveValuesBuilder - Class in org.kitesdk.morphline.stdlib
-
Removes all record field values for which all of the following conditions hold:
1) the field name matches at least one of the given nameBlacklist predicates but none of the
given nameWhitelist predicates.
- RemoveValuesBuilder() - Constructor for class org.kitesdk.morphline.stdlib.RemoveValuesBuilder
-
- rename(Path, Path) - Method in class org.kitesdk.morphline.hadoop.rcfile.SingleStreamFileSystem
-
- replaceValues(String, Object) - Method in class org.kitesdk.morphline.api.Record
-
Removes all values that are associated with the given key, and then associates the given value
with the given key.
- REPO_SCHEME - Static variable in class org.kitesdk.data.URIBuilder
-
- requiresLayout() - Method in class org.kitesdk.data.flume.Log4jAppender
-
- rollbackTransaction() - Method in interface org.kitesdk.morphline.solr.DocumentLoader
-
Performs a rollback of all non-committed documents pending.
- rollbackTransaction() - Method in class org.kitesdk.morphline.solr.SolrServerDocumentLoader
-
- run() - Method in class org.kitesdk.tools.TransformTask
-
- SampleBuilder - Class in org.kitesdk.morphline.stdlib
-
Command that forwards each input record with a given probability to its child command, and
silently ignores all other input records.
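Forwarding each record with a given probability amounts to an independent random draw per record. The sketch below shows one way to express that. SampleSketch and sample are hypothetical names, and passing in a java.util.Random (so a seed can make runs repeatable) is an assumption, not a documented SampleBuilder option.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

// Hypothetical stand-in for illustration; NOT the Kite SampleBuilder code.
final class SampleSketch {

    // Forwards each record independently with the given probability; all others are dropped.
    static <T> List<T> sample(List<T> records, double probability, Random rng) {
        List<T> forwarded = new ArrayList<>();
        for (T record : records) {
            // nextDouble() is uniform in [0.0, 1.0), so 1.0 forwards everything and 0.0 nothing.
            if (rng.nextDouble() < probability) {
                forwarded.add(record);
            }
        }
        return forwarded;
    }
}
```

With a probability of 0.1, roughly one record in ten would reach the child command over a long stream; the exact subset depends on the random sequence.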
- SampleBuilder() - Constructor for class org.kitesdk.morphline.stdlib.SampleBuilder
-
- SanitizeUnknownSolrFieldsBuilder - Class in org.kitesdk.morphline.solr
-
Command that sanitizes record fields that are unknown to Solr schema.xml by either deleting them
(renameToPrefix is absent or a zero length string), or by moving them to a field prefixed with
the given renameToPrefix.
- SanitizeUnknownSolrFieldsBuilder() - Constructor for class org.kitesdk.morphline.solr.SanitizeUnknownSolrFieldsBuilder
-
- schema(Schema) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's schema.
- schema(File) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's schema from a File.
- schema(InputStream) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
- schema(Class<T>) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's schema via a Java class type.
- schemaFromAvroDataFile(File) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's schema by using the schema from an existing Avro
data file.
- schemaFromAvroDataFile(InputStream) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's schema by using the schema from an existing Avro
data file.
- schemaFromAvroDataFile(URI) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's schema by using the schema from an existing Avro
data file.
- schemaLiteral(String) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the dataset's schema from a String.
- SchemaNotFoundException - Exception in org.kitesdk.data
-
Exception thrown to indicate that there is a problem
finding a particular version of a schema.
- SchemaNotFoundException(String) - Constructor for exception org.kitesdk.data.SchemaNotFoundException
-
- SchemaNotFoundException(String, Throwable) - Constructor for exception org.kitesdk.data.SchemaNotFoundException
-
- SchemaNotFoundException(Throwable) - Constructor for exception org.kitesdk.data.SchemaNotFoundException
-
- schemaUri(URI) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the Dataset's schema from a URI.
- schemaUri(String) - Method in class org.kitesdk.data.DatasetDescriptor.Builder
-
Configure the Dataset's schema from a String URI.
- ScriptEvaluator<T> - Class in org.kitesdk.morphline.scriptengine.java
-
Creates and compiles the given Java code block, wrapped into a Java method with the given return
type and parameter types, along with a Java class definition that contains the given import
statements.
- ScriptEvaluator(String, String, Class<T>, String[], Class[], String) - Constructor for class org.kitesdk.morphline.scriptengine.java.ScriptEvaluator
-
- SeparateAttachmentsBuilder - Class in org.kitesdk.morphline.stdlib
-
Command that emits one output record for each attachment in the input record's list of
attachments; the result is a list of records, each of which has at most one attachment.
- SeparateAttachmentsBuilder() - Constructor for class org.kitesdk.morphline.stdlib.SeparateAttachmentsBuilder
-
- SEQUENCE_FILE_META_DATA - Static variable in class org.kitesdk.morphline.hadoop.sequencefile.ReadSequenceFileBuilder
-
- SerializationException - Exception in org.kitesdk.data
-
Exception thrown when an object cannot be serialized to,
or deserialized from, its binary representation.
- SerializationException(String, Throwable) - Constructor for exception org.kitesdk.data.SerializationException
-
- Service - Interface in org.kitesdk.minicluster
-
Interface for MiniCluster Service implementations.
- Service.ServiceConfig - Class in org.kitesdk.minicluster
-
A class that holds configuration settings for minicluster services.
- Service.ServiceConfig() - Constructor for class org.kitesdk.minicluster.Service.ServiceConfig
-
- set(String, String) - Method in class org.kitesdk.minicluster.Service.ServiceConfig
-
- setBatchSize(int) - Method in class org.kitesdk.morphline.solr.SolrLocator
-
- setCollectionName(String) - Method in class org.kitesdk.morphline.solr.SolrLocator
-
- setConf(Configuration) - Method in class org.kitesdk.data.mapreduce.DatasetKeyInputFormat
-
- setDatasetName(String) - Method in class org.kitesdk.data.flume.Log4jAppender
-
- setDatasetNamespace(String) - Method in class org.kitesdk.data.flume.Log4jAppender
-
- setDatasetRepositoryClass(String) - Method in class org.kitesdk.data.flume.Log4jAppender
-
Deprecated.
Use datasetRepositoryUri with a 'repo:' URI.
- setDatasetRepositoryUri(String) - Method in class org.kitesdk.data.flume.Log4jAppender
-
- setDocumentLoader(DocumentLoader) - Method in class org.kitesdk.morphline.solr.SolrMorphlineContext.Builder
-
- setExceptionHandler(ExceptionHandler) - Method in class org.kitesdk.morphline.api.MorphlineContext.Builder
-
- setHadoopConf(Configuration) - Method in class org.kitesdk.minicluster.Service.ServiceConfig
-
- setHealthCheckRegistry(HealthCheckRegistry) - Method in class org.kitesdk.morphline.api.MorphlineContext.Builder
-
- setIndexSchema(IndexSchema) - Method in class org.kitesdk.morphline.solr.SolrMorphlineContext.Builder
-
- setMetricRegistry(MetricRegistry) - Method in class org.kitesdk.morphline.api.MorphlineContext.Builder
-
- setNumWriters(int) - Method in class org.kitesdk.tools.TransformTask
-
- setServerUrl(String) - Method in class org.kitesdk.morphline.solr.SolrLocator
-
- setServiceConfig(String, String) - Method in class org.kitesdk.minicluster.MiniCluster.Builder
-
Service configs are dynamic config name-value pairs that can be
interpreted by the services.
- setSettings(Map<String, Object>) - Method in class org.kitesdk.morphline.api.MorphlineContext.Builder
-
- setSolrHomeDir(String) - Method in class org.kitesdk.morphline.solr.SolrLocator
-
- SetValuesBuilder - Class in org.kitesdk.morphline.stdlib
-
For each input field, set the given record output field to the same values.
- SetValuesBuilder() - Constructor for class org.kitesdk.morphline.stdlib.SetValuesBuilder
-
- setWorkingDirectory(Path) - Method in class org.kitesdk.morphline.hadoop.rcfile.SingleStreamFileSystem
-
- setZkHost(String) - Method in class org.kitesdk.morphline.solr.SolrLocator
-
- shutdown() - Method in interface org.kitesdk.morphline.solr.DocumentLoader
-
Releases allocated resources.
- shutdown() - Method in class org.kitesdk.morphline.solr.SolrServerDocumentLoader
-
- SingleStreamFileSystem - Class in org.kitesdk.morphline.hadoop.rcfile
-
Very simple Filesystem Implementation which serves an InputStream for a given
path.
- SingleStreamFileSystem(InputStream, Path) - Constructor for class org.kitesdk.morphline.hadoop.rcfile.SingleStreamFileSystem
-
- SOLR_LOCATOR_PARAM - Static variable in class org.kitesdk.morphline.solr.LoadSolrBuilder
-
- SolrCellBuilder - Class in org.kitesdk.morphline.solrcell
-
Command that pipes the first attachment of a record into one of the given Tika parsers, then maps
the Tika output back to a record using SolrCell.
- SolrCellBuilder() - Constructor for class org.kitesdk.morphline.solrcell.SolrCellBuilder
-
- SolrLocator - Class in org.kitesdk.morphline.solr
-
Set of configuration parameters that identify the location and schema of a Solr server or
SolrCloud; based on this information this class can return the schema and a corresponding
DocumentLoader.
- SolrLocator(MorphlineContext) - Constructor for class org.kitesdk.morphline.solr.SolrLocator
-
- SolrLocator(Config, MorphlineContext) - Constructor for class org.kitesdk.morphline.solr.SolrLocator
-
- SolrMorphlineContext - Class in org.kitesdk.morphline.solr
-
A context that is specific to Solr.
- SolrMorphlineContext() - Constructor for class org.kitesdk.morphline.solr.SolrMorphlineContext
-
- SolrMorphlineContext.Builder - Class in org.kitesdk.morphline.solr
-
- SolrMorphlineContext.Builder() - Constructor for class org.kitesdk.morphline.solr.SolrMorphlineContext.Builder
-
- SolrServerDocumentLoader - Class in org.kitesdk.morphline.solr
-
A vehicle to load (or delete) documents into a local or remote SolrServer.
- SolrServerDocumentLoader(SolrServer, int) - Constructor for class org.kitesdk.morphline.solr.SolrServerDocumentLoader
-
- SplitBuilder - Class in org.kitesdk.morphline.stdlib
-
Command that divides strings into substrings, by recognizing a separator (a.k.a.
- SplitBuilder() - Constructor for class org.kitesdk.morphline.stdlib.SplitBuilder
-
- SplitKeyValueBuilder - Class in org.kitesdk.morphline.stdlib
-
Command that iterates over the items in a given record input field, interprets each item as a key-value
pair where the key and value are separated by the given separator character, and adds the pair's
value to the record field named after the pair's key.
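The key-value splitting described above might look like this in plain Java. The class `SplitKeyValueSketch` and its method are hypothetical, and skipping items that lack a separator is an assumption of this sketch rather than documented behavior of the command.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration of key-value splitting; not the Kite command.
class SplitKeyValueSketch {
    // Splits each item at the first separator and appends the value
    // to the output field named after the key.
    static Map<String, List<String>> splitKeyValue(List<String> items, char separator) {
        Map<String, List<String>> fields = new LinkedHashMap<>();
        for (String item : items) {
            int pos = item.indexOf(separator);
            if (pos < 0) {
                continue; // assumption: items without a separator are skipped
            }
            String key = item.substring(0, pos);
            String value = item.substring(pos + 1);
            fields.computeIfAbsent(key, k -> new ArrayList<>()).add(value);
        }
        return fields;
    }
}
```

Because a record field can hold several values, repeated keys accumulate values instead of overwriting each other.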
- SplitKeyValueBuilder() - Constructor for class org.kitesdk.morphline.stdlib.SplitKeyValueBuilder
-
- start() - Method in class org.kitesdk.minicluster.FlumeService
-
- start() - Method in class org.kitesdk.minicluster.HBaseService
-
- start() - Method in class org.kitesdk.minicluster.HdfsService
-
- start() - Method in class org.kitesdk.minicluster.HiveService
-
- start() - Method in class org.kitesdk.minicluster.MiniCluster
-
Starts the services in order, passing the previous service's modified
Configuration object to the next.
- start() - Method in interface org.kitesdk.minicluster.Service
-
Start the service.
- start() - Method in class org.kitesdk.minicluster.ZookeeperService
-
- startMetaStore(String, int, HiveConf) - Method in class org.kitesdk.minicluster.HiveService
-
- StartReportingMetricsToCSVBuilder - Class in org.kitesdk.morphline.stdlib
-
Command that starts periodically appending the metrics of the
com.codahale.metrics.MetricRegistry of the morphline context to a set of CSV files,
configured via a com.codahale.metrics.CsvReporter.Builder.
- StartReportingMetricsToCSVBuilder() - Constructor for class org.kitesdk.morphline.stdlib.StartReportingMetricsToCSVBuilder
-
- StartReportingMetricsToHTTPBuilder - Class in org.kitesdk.morphline.metrics.servlets
-
Command that exposes liveness status, health check status, metrics state and thread dumps via a
set of HTTP URLs served by Jetty, using the AdminServlet.
- StartReportingMetricsToHTTPBuilder() - Constructor for class org.kitesdk.morphline.metrics.servlets.StartReportingMetricsToHTTPBuilder
-
- StartReportingMetricsToJMXBuilder - Class in org.kitesdk.morphline.stdlib
-
Command that starts publishing the metrics of the
com.codahale.metrics.MetricRegistry of the morphline context
to JMX, configured via a com.codahale.metrics.JmxReporter.Builder.
- StartReportingMetricsToJMXBuilder() - Constructor for class org.kitesdk.morphline.stdlib.StartReportingMetricsToJMXBuilder
-
- StartReportingMetricsToSLF4JBuilder - Class in org.kitesdk.morphline.stdlib
-
Command that starts periodically logging the metrics of the
com.codahale.metrics.MetricRegistry of the morphline context to SLF4J,
configured via a com.codahale.metrics.Slf4jReporter.Builder.
- StartReportingMetricsToSLF4JBuilder() - Constructor for class org.kitesdk.morphline.stdlib.StartReportingMetricsToSLF4JBuilder
-
- stop() - Method in class org.kitesdk.minicluster.FlumeService
-
- stop() - Method in class org.kitesdk.minicluster.HBaseService
-
- stop() - Method in class org.kitesdk.minicluster.HdfsService
-
- stop() - Method in class org.kitesdk.minicluster.HiveService
-
- stop() - Method in class org.kitesdk.minicluster.MiniCluster
-
Stops the services in reverse of their run order.
- stop() - Method in interface org.kitesdk.minicluster.Service
-
Stop the service.
- stop() - Method in class org.kitesdk.minicluster.ZookeeperService
-
- StripNonCharSolrContentHandlerFactory - Class in org.kitesdk.morphline.solrcell
-
SolrContentHandler and associated factory that strips non-characters and trims on output.
- StripNonCharSolrContentHandlerFactory(Collection<String>) - Constructor for class org.kitesdk.morphline.solrcell.StripNonCharSolrContentHandlerFactory
-
- SUPPORTED_MIME_TYPES - Static variable in class org.kitesdk.morphline.stdio.AbstractParser
-
- sync() - Method in interface org.kitesdk.data.Syncable
-
Ensure that data has been synced to disk on all replica data nodes.
- Syncable - Interface in org.kitesdk.data
-
A writer that can guarantee data is persisted to disk.
- TASK_CONTEXT_SETTING_NAME - Static variable in class org.kitesdk.morphline.api.TypedSettings
-
- TaskUtil - Class in org.kitesdk.tools
-
- TaskUtil() - Constructor for class org.kitesdk.tools.TaskUtil
-
- TaskUtil.ConfigBuilder - Class in org.kitesdk.tools
-
- TIMESTAMP - Static variable in class org.kitesdk.morphline.base.Fields
-
- to(String, Comparable) - Method in interface org.kitesdk.data.RefinableView
-
Creates a sub-View, restricted to entities whose name field is less than or equal to the
given value.
- ToAvroBuilder - Class in org.kitesdk.morphline.avro
-
Command that converts a morphline record to an Avro record.
- ToAvroBuilder() - Constructor for class org.kitesdk.morphline.avro.ToAvroBuilder
-
- ToAvroMapBuilder - Class in org.kitesdk.morphline.avro
-
Command that converts a morphline record to an Avro record that contains a Map with string keys
and array values where the array values can be null, boolean, int, long, float, double, string,
bytes.
- ToAvroMapBuilder() - Constructor for class org.kitesdk.morphline.avro.ToAvroMapBuilder
-
- toBefore(String, Comparable) - Method in interface org.kitesdk.data.RefinableView
-
Creates a sub-View, restricted to entities whose name field is less than the given value.
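The only difference between to and toBefore is whether the upper bound is inclusive. The sketch below shows that difference on a plain list of comparable values. It is an assumption-laden illustration: `ViewBoundsSketch` is a hypothetical class, and the real methods restrict a named field of the entities in a view rather than filtering a list.

```java
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical illustration of inclusive vs. exclusive upper bounds; not the Kite API.
class ViewBoundsSketch {
    // Inclusive upper bound: keeps values less than or equal to upper.
    static <T extends Comparable<T>> List<T> to(List<T> values, T upper) {
        return values.stream()
            .filter(v -> v.compareTo(upper) <= 0)
            .collect(Collectors.toList());
    }

    // Exclusive upper bound: keeps values strictly less than upper.
    static <T extends Comparable<T>> List<T> toBefore(List<T> values, T upper) {
        return values.stream()
            .filter(v -> v.compareTo(upper) < 0)
            .collect(Collectors.toList());
    }
}
```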
- ToByteArrayBuilder - Class in org.kitesdk.morphline.stdlib
-
Command that converts the Java objects in a given field via Object.toString() to
their string representation, and then via String.getBytes(Charset) to their byte
array representation.
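The two-step conversion named above uses only standard JDK calls, so it can be shown directly. `ToByteArraySketch` and its method are hypothetical names for the example; the command itself applies this to each value of a record field.

```java
import java.nio.charset.Charset;

// Hypothetical illustration of the toString-then-getBytes conversion; not the Kite command.
class ToByteArraySketch {
    // Converts the object to its string form, then encodes that string with the charset.
    static byte[] toByteArray(Object value, Charset charset) {
        return value.toString().getBytes(charset);
    }
}
```

Passing the charset explicitly avoids depending on the platform default encoding.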
- ToByteArrayBuilder() - Constructor for class org.kitesdk.morphline.stdlib.ToByteArrayBuilder
-
- toConfig(String) - Method in class org.kitesdk.morphline.solr.SolrLocator
-
- TokenizeTextBuilder - Class in org.kitesdk.morphline.solr
-
A command that uses the embedded Solr/Lucene Analyzer library to generate tokens from a text
string, without sending data to a Solr server.
- TokenizeTextBuilder() - Constructor for class org.kitesdk.morphline.solr.TokenizeTextBuilder
-
- toString() - Method in class org.kitesdk.data.ColumnMapping
-
- toString(boolean) - Method in class org.kitesdk.data.ColumnMapping
-
- toString() - Method in enum org.kitesdk.data.CompressionType
-
- toString() - Method in class org.kitesdk.data.DatasetDescriptor
-
- toString() - Method in class org.kitesdk.data.FieldMapping
-
- toString() - Method in class org.kitesdk.data.Format
-
- toString() - Method in class org.kitesdk.data.Key
-
- toString() - Method in class org.kitesdk.data.PartitionStrategy
-
- toString(boolean) - Method in class org.kitesdk.data.PartitionStrategy
-
- toString() - Method in class org.kitesdk.morphline.api.Record
-
- toString() - Method in class org.kitesdk.morphline.solr.SolrLocator
-
- ToStringBuilder - Class in org.kitesdk.morphline.stdlib
-
Command that converts the Java objects in a given field via Object.toString() to
their string representation, and optionally also applies String.trim().
- ToStringBuilder() - Constructor for class org.kitesdk.morphline.stdlib.ToStringBuilder
-
- transformFieldValue(Object) - Method in class org.kitesdk.morphline.stdlib.AbstractFieldTransformCommand
-
Transforms the given input value to some output value.
- TransformTask<S,T> - Class in org.kitesdk.tools
-
- TransformTask(View<S>, View<T>, DoFn<S, T>) - Constructor for class org.kitesdk.tools.TransformTask
-
- TransformTask.CheckEntityClass<E> - Class in org.kitesdk.tools
-
- TransformTask.CheckEntityClass(Class<?>) - Constructor for class org.kitesdk.tools.TransformTask.CheckEntityClass
-
- TranslateBuilder - Class in org.kitesdk.morphline.stdlib
-
Command that examines each string value in a given field and replaces it with the replacement
value defined in a given dictionary aka hash table.
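A dictionary lookup of this kind can be sketched as below. `TranslateSketch` and its method are hypothetical, and the `fallback` parameter used for values missing from the dictionary is an assumption of this sketch, not something stated in the description above.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical illustration of dictionary-based replacement; not the Kite command.
class TranslateSketch {
    // Replaces each value with its dictionary entry, or with the fallback if there is none.
    static List<String> translate(List<String> values,
                                  Map<String, String> dictionary,
                                  String fallback) {
        List<String> translated = new ArrayList<>();
        for (String value : values) {
            translated.add(dictionary.getOrDefault(value, fallback));
        }
        return translated;
    }
}
```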
- TranslateBuilder() - Constructor for class org.kitesdk.morphline.stdlib.TranslateBuilder
-
- TrimSolrContentHandlerFactory - Class in org.kitesdk.morphline.solrcell
-
SolrContentHandler and associated factory that trims field values on output.
- TrimSolrContentHandlerFactory(Collection<String>) - Constructor for class org.kitesdk.morphline.solrcell.TrimSolrContentHandlerFactory
-
- TryRulesBuilder - Class in org.kitesdk.morphline.stdlib
-
A tryRules command consists of zero or more rules.
- TryRulesBuilder() - Constructor for class org.kitesdk.morphline.stdlib.TryRulesBuilder
-
- TypedSettings - Class in org.kitesdk.morphline.api
-
A Map with convenient typed accessors for reading values.
- TypeNotFoundException - Exception in org.kitesdk.data
-
Exception thrown to indicate that there is a problem
finding a given type.
- TypeNotFoundException(String) - Constructor for exception org.kitesdk.data.TypeNotFoundException
-
- TypeNotFoundException(String, Throwable) - Constructor for exception org.kitesdk.data.TypeNotFoundException
-
- TypeNotFoundException(Throwable) - Constructor for exception org.kitesdk.data.TypeNotFoundException
-
- validateArguments() - Method in class org.kitesdk.morphline.base.AbstractCommand
-
Helper that checks if the user provided configuration parameters are valid.
- validateArguments(Config) - Method in class org.kitesdk.morphline.base.Configs
-
- validateEnum(Config, String, Class<T>, T...) - Method in class org.kitesdk.morphline.base.Validator
-
Validates that an enum of the given type with the given value exists, and that this enum is
contained in the given list of permitted choices; finally returns that enum object.
- validateRange(Config, T, Comparable<T>, Comparable<T>) - Method in class org.kitesdk.morphline.base.Validator
-
Validates that the given value is contained in the range [min, max].
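A closed-range check with both bounds inclusive could be written as follows. This is a sketch only: `ValidateRangeSketch` is a hypothetical class, the `Config` argument of the real method is omitted, and `IllegalArgumentException` stands in for whatever exception the library actually throws.

```java
// Hypothetical illustration of an inclusive range check; not the Kite Validator.
class ValidateRangeSketch {
    // Throws if value lies outside the closed interval [min, max].
    static <T> void validateRange(T value, Comparable<T> min, Comparable<T> max) {
        if (min.compareTo(value) > 0 || max.compareTo(value) < 0) {
            throw new IllegalArgumentException(
                "Value " + value + " is outside the range [" + min + ", " + max + "]");
        }
    }
}
```

Both endpoints pass the check, which matches the square-bracket notation in the description.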
- ValidationException - Exception in org.kitesdk.data
-
Exception thrown to indicate that there was a problem
parsing or validating a schema, partition strategy, or column mapping.
- ValidationException(String) - Constructor for exception org.kitesdk.data.ValidationException
-
- ValidationException(Throwable) - Constructor for exception org.kitesdk.data.ValidationException
-
- ValidationException(String, Throwable) - Constructor for exception org.kitesdk.data.ValidationException
-
- Validator<T> - Class in org.kitesdk.morphline.base
-
Simple helper to validate user configurable parameters.
- Validator() - Constructor for class org.kitesdk.morphline.base.Validator
-
- valueOf(String) - Static method in enum org.kitesdk.data.CompressionType
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.kitesdk.data.FieldMapping.MappingType
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.kitesdk.morphline.base.Notifications.LifecycleEvent
-
Returns the enum constant of this type with the specified name.
- values() - Static method in enum org.kitesdk.data.CompressionType
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum org.kitesdk.data.FieldMapping.MappingType
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum org.kitesdk.morphline.base.Notifications.LifecycleEvent
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- version(String) - Method in class org.kitesdk.data.ColumnMapping.Builder
-
Adds an OCC version mapping for the record field name.
- version(String) - Static method in class org.kitesdk.data.FieldMapping
-
- View<E> - Interface in org.kitesdk.data
-
- VIEW_SCHEME - Static variable in class org.kitesdk.data.URIBuilder
-