java.lang.Object
  org.kitesdk.data.spi.AbstractDatasetRepository
    org.kitesdk.data.spi.filesystem.FileSystemDatasetRepository
      org.kitesdk.data.hcatalog.HCatalogDatasetRepository
public class HCatalogDatasetRepository
 A DatasetRepository that uses the Hive/HCatalog metastore for metadata,
 and stores data in a Hadoop FileSystem.
 
 The location of the data directory is either chosen by Hive/HCatalog (so-called
 "managed tables") or specified when creating an instance of this class by providing
 a FileSystem and a root directory in the constructor ("external tables").
 
 The primary methods of interest are
 create(String, DatasetDescriptor), FileSystemDatasetRepository.load(String), and
 delete(String), which create a new dataset, load an existing
 dataset, and delete an existing dataset, respectively. Once a dataset has been created
 or loaded, users can invoke the appropriate Dataset methods to get a reader
 or writer as needed.
 
See Also: DatasetRepository, Dataset

| Nested Class Summary | |
|---|---|
| static class | HCatalogDatasetRepository.Builder: A fluent builder to aid in the construction of HCatalogDatasetRepository instances. |
| Method Summary | ||
|---|---|---|
| <E> Dataset<E> | create(String name, DatasetDescriptor descriptor): Create a Dataset with the supplied descriptor. | |
| boolean | delete(String name): Delete data for the Dataset named name and remove its DatasetDescriptor from the underlying metadata provider. | |
| URI | getUri(): Return the URI of this repository. | |
| Methods inherited from class org.kitesdk.data.spi.filesystem.FileSystemDatasetRepository | 
|---|
| exists, getTemporaryRepository, list, load, partitionKeyForPath, toString, update | 
| Methods inherited from class java.lang.Object | 
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait | 
| Method Detail | 
|---|
public <E> Dataset<E> create(String name,
                             DatasetDescriptor descriptor)
Description copied from interface: DatasetRepository
 Create a Dataset with the supplied descriptor. Depending on
 the underlying dataset storage, some schema types or configurations might
 not be supported. If you supply an illegal schema, the implementing class
 throws an exception. It is illegal to create more than one dataset with the
 same name. If you provide a duplicate name, the implementing class throws
 an exception.
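
The duplicate-name rule described above can be illustrated with a small in-memory stand-in. This is a minimal sketch and not Kite code: `InMemoryRepository`, `DuplicateNameException`, and the use of a plain `String` in place of a `DatasetDescriptor` are hypothetical, introduced only for illustration. The exception type that the implementing class actually throws is not stated in this page.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical exception type; the text above only says "an exception".
class DuplicateNameException extends RuntimeException {
    DuplicateNameException(String name) {
        super("Dataset already exists: " + name);
    }
}

// Hypothetical in-memory stand-in for a repository.
// A String plays the role of the dataset descriptor.
class InMemoryRepository {
    private final Map<String, String> descriptors = new HashMap<>();

    // Registers a dataset, rejecting a name that is already in use.
    String create(String name, String descriptor) {
        if (descriptors.containsKey(name)) {
            throw new DuplicateNameException(name);
        }
        descriptors.put(name, descriptor);
        return name;
    }

    boolean exists(String name) {
        return descriptors.containsKey(name);
    }
}

class CreateContractDemo {
    public static void main(String[] args) {
        InMemoryRepository repo = new InMemoryRepository();
        repo.create("users", "schema-v1");
        try {
            repo.create("users", "schema-v2"); // same name: rejected
        } catch (DuplicateNameException e) {
            System.out.println("rejected: " + e.getMessage());
        }
    }
}
```

Callers that cannot rule out an existing dataset may prefer to check the inherited `exists` method first rather than rely on catching the exception.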
Specified by: create in interface DatasetRepository
Overrides: create in class org.kitesdk.data.spi.filesystem.FileSystemDatasetRepository
Parameters:
 name - The fully qualified dataset name
 descriptor - A descriptor that describes the schema and other properties of the dataset
public boolean delete(String name)
Description copied from interface: DatasetRepository
 Delete data for the Dataset named name and remove its
 DatasetDescriptor from the underlying metadata provider.
 After this method is called, there is no Dataset with the given
 name, unless an exception is thrown. If either data or metadata
 are removed, this method returns true. If there is no
 Dataset corresponding to the given name, this
 method makes no changes and returns false.
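
The return-value rule above (true if either data or metadata is removed, false if nothing matched) can be sketched with a model that tracks the two separately. This is an illustration under stated assumptions, not the Kite implementation: `DeleteContractModel` and its `putMetadata` and `putData` helpers are hypothetical names.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical stand-in that tracks metadata and data separately.
class DeleteContractModel {
    private final Map<String, String> metadata = new HashMap<>(); // name -> descriptor
    private final Set<String> dataDirs = new HashSet<>();         // names that have data

    void putMetadata(String name, String descriptor) {
        metadata.put(name, descriptor);
    }

    void putData(String name) {
        dataDirs.add(name);
    }

    // Returns true if either metadata or data was removed, false if nothing existed.
    boolean delete(String name) {
        boolean metadataRemoved = metadata.remove(name) != null;
        boolean dataRemoved = dataDirs.remove(name);
        return metadataRemoved || dataRemoved;
    }

    boolean exists(String name) {
        return metadata.containsKey(name) || dataDirs.contains(name);
    }
}

class DeleteContractDemo {
    public static void main(String[] args) {
        DeleteContractModel repo = new DeleteContractModel();
        repo.putMetadata("users", "schema-v1");
        repo.putData("users");
        System.out.println(repo.delete("users")); // prints "true": both parts removed
        System.out.println(repo.delete("users")); // prints "false": nothing left to remove
    }
}
```

Because a second call returns false instead of throwing, the operation is safe to repeat in cleanup code.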
Specified by: delete in interface DatasetRepository
Overrides: delete in class org.kitesdk.data.spi.filesystem.FileSystemDatasetRepository
Parameters:
 name - The name of the dataset to delete.
Returns: true if any data or metadata is removed, false if no action is taken.
public URI getUri()
Description copied from interface: DatasetRepository
 Return the URI of this repository. When passed to DatasetRepositories.open(java.net.URI)
 (or DatasetRepositories.openRandomAccess(java.net.URI)), the URI yields a
 DatasetRepository equivalent to this one.
Specified by: getUri in interface DatasetRepository
Overrides: getUri in class org.kitesdk.data.spi.filesystem.FileSystemDatasetRepository
Returns: the URI of this repository
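
The round-trip property of getUri can be sketched as follows. This is a hypothetical model, not the Kite API: `UriBackedRepository` and its static `open` factory merely stand in for a repository and an open-by-URI entry point, and `repo:example` is a placeholder URI, not a documented repository URI.

```java
import java.net.URI;
import java.util.Objects;

// Hypothetical repository type identified solely by its URI.
class UriBackedRepository {
    private final URI uri;

    private UriBackedRepository(URI uri) {
        this.uri = uri;
    }

    // Hypothetical factory standing in for an "open by URI" entry point.
    static UriBackedRepository open(URI uri) {
        return new UriBackedRepository(Objects.requireNonNull(uri));
    }

    URI getUri() {
        return uri;
    }

    // Two repositories are equivalent when they were opened from equal URIs.
    @Override
    public boolean equals(Object other) {
        return other instanceof UriBackedRepository
            && uri.equals(((UriBackedRepository) other).uri);
    }

    @Override
    public int hashCode() {
        return uri.hashCode();
    }
}

class UriRoundTripDemo {
    public static void main(String[] args) {
        // Placeholder URI used only for this illustration.
        UriBackedRepository original = UriBackedRepository.open(URI.create("repo:example"));
        UriBackedRepository reopened = UriBackedRepository.open(original.getUri());
        System.out.println(original.equals(reopened));
    }
}
```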