org.kitesdk.data.mapreduce
Class DatasetKeyInputFormat<E>

java.lang.Object
  extended by org.apache.hadoop.mapreduce.InputFormat<E,Void>
      extended by org.kitesdk.data.mapreduce.DatasetKeyInputFormat<E>
Type Parameters:
E - The type of entities in the Dataset.
All Implemented Interfaces:
Configurable

@Beta
public class DatasetKeyInputFormat<E>
extends InputFormat<E,Void>
implements Configurable

A MapReduce InputFormat for reading from a Dataset. Since a Dataset only contains entities (not key/value pairs), this output format ignores the value.


Field Summary
static String KITE_DATASET_NAME
           
static String KITE_PARTITION_DIR
           
static String KITE_REPOSITORY_URI
           
 
Constructor Summary
DatasetKeyInputFormat()
           
 
Method Summary
 RecordReader<E,Void> createRecordReader(InputSplit inputSplit, TaskAttemptContext taskAttemptContext)
           
 Configuration getConf()
           
 List<InputSplit> getSplits(JobContext jobContext)
           
 void setConf(Configuration configuration)
           
static void setDatasetName(Job job, String name)
           
static void setRepositoryUri(Job job, URI uri)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

KITE_REPOSITORY_URI

public static final String KITE_REPOSITORY_URI
See Also:
Constant Field Values

KITE_DATASET_NAME

public static final String KITE_DATASET_NAME
See Also:
Constant Field Values

KITE_PARTITION_DIR

public static final String KITE_PARTITION_DIR
See Also:
Constant Field Values
Constructor Detail

DatasetKeyInputFormat

public DatasetKeyInputFormat()
Method Detail

setRepositoryUri

public static void setRepositoryUri(Job job,
                                    URI uri)

setDatasetName

public static void setDatasetName(Job job,
                                  String name)

getConf

public Configuration getConf()
Specified by:
getConf in interface Configurable

setConf

public void setConf(Configuration configuration)
Specified by:
setConf in interface Configurable

getSplits

public List<InputSplit> getSplits(JobContext jobContext)
                           throws IOException,
                                  InterruptedException
Specified by:
getSplits in class InputFormat<E,Void>
Throws:
IOException
InterruptedException

createRecordReader

public RecordReader<E,Void> createRecordReader(InputSplit inputSplit,
                                               TaskAttemptContext taskAttemptContext)
                                        throws IOException,
                                               InterruptedException
Specified by:
createRecordReader in class InputFormat<E,Void>
Throws:
IOException
InterruptedException


Copyright © 2013–2014. All rights reserved.