java.lang.Object
  org.kitesdk.data.spi.AbstractDatasetRepository
    org.kitesdk.data.spi.filesystem.FileSystemDatasetRepository
      org.kitesdk.data.hcatalog.HCatalogDatasetRepository
public class HCatalogDatasetRepository
A DatasetRepository that uses the Hive/HCatalog metastore for metadata and stores data in a Hadoop FileSystem.
The location of the data directory is either chosen by Hive/HCatalog (so-called "managed tables") or specified when creating an instance of this class by providing a FileSystem and a root directory in the constructor ("external tables").
The primary methods of interest are create(String, DatasetDescriptor), AbstractDatasetRepository.load(String), and delete(String), which create a new dataset, load an existing dataset, and delete an existing dataset, respectively. Once a dataset has been created or loaded, users can invoke the appropriate Dataset methods to get a reader or writer as needed.
See Also: DatasetRepository, Dataset
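The create, load, and delete lifecycle described above can be sketched with a small in-memory stand-in. This is not the Kite API: the class InMemoryRepository, its String descriptor, and the exception types are hypothetical names chosen for illustration, and the behavior of load for a missing name is an assumption. The sketch only models the documented contract that duplicate names are rejected and that delete reports whether anything was removed.

```java
import java.util.HashMap;
import java.util.Map;

public class RepositoryLifecycleSketch {

    /** Hypothetical in-memory stand-in for a dataset repository (not the Kite API). */
    static final class InMemoryRepository {
        // Maps a dataset name to a descriptor string (stand-in for DatasetDescriptor).
        private final Map<String, String> descriptors = new HashMap<>();

        /** Registers a new dataset; a duplicate name is rejected with an exception. */
        String create(String name, String descriptor) {
            if (descriptors.containsKey(name)) {
                throw new IllegalStateException("Dataset already exists: " + name);
            }
            descriptors.put(name, descriptor);
            return descriptor;
        }

        /** Returns the stored descriptor; throwing for a missing name is an assumption. */
        String load(String name) {
            String descriptor = descriptors.get(name);
            if (descriptor == null) {
                throw new IllegalArgumentException("No such dataset: " + name);
            }
            return descriptor;
        }

        /** Returns true if something was removed, false if the name was unknown. */
        boolean delete(String name) {
            return descriptors.remove(name) != null;
        }
    }

    public static void main(String[] args) {
        InMemoryRepository repo = new InMemoryRepository();
        repo.create("events", "schema-v1");
        System.out.println(repo.load("events"));   // schema-v1
        System.out.println(repo.delete("events")); // true
        System.out.println(repo.delete("events")); // false
    }
}
```

In the real class the descriptor lives in the Hive/HCatalog metastore and the data in a Hadoop FileSystem, so delete has to remove both; the single map here collapses that into one step.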
Nested Class Summary | |
---|---|
static class | HCatalogDatasetRepository.Builder: A fluent builder to aid in the construction of HCatalogDatasetRepository instances. |
Method Summary | |
---|---|
<E> Dataset<E> | create(String name, DatasetDescriptor descriptor): Create a Dataset with the supplied descriptor. |
boolean | delete(String name): Delete data for the Dataset named name and remove its DatasetDescriptor from the underlying metadata provider. |
URI | getUri(): Return the URI of this repository. |
Methods inherited from class org.kitesdk.data.spi.filesystem.FileSystemDatasetRepository |
---|
create, exists, getTemporaryRepository, list, load, partitionKeyForPath, toString, update |
Methods inherited from class org.kitesdk.data.spi.AbstractDatasetRepository |
---|
load, update |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
Method Detail |
---|
public <E> Dataset<E> create(String name, DatasetDescriptor descriptor)
Description copied from interface: DatasetRepository
Create a Dataset with the supplied descriptor. Depending on the underlying dataset storage, some schema types or configurations might not be supported. If you supply an illegal schema, the implementing class throws an exception. It is illegal to create more than one dataset with the same name. If you provide a duplicate name, the implementing class throws an exception.
Specified by:
create in interface DatasetRepository
Overrides:
create in class org.kitesdk.data.spi.AbstractDatasetRepository
Parameters:
name - The fully qualified dataset name
descriptor - A descriptor that describes the schema and other properties of the dataset
public boolean delete(String name)
Description copied from interface: DatasetRepository
Delete data for the Dataset named name and remove its DatasetDescriptor from the underlying metadata provider. After this method is called, there is no Dataset with the given name, unless an exception is thrown. If either data or metadata is removed, this method returns true. If there is no Dataset corresponding to the given name, this method makes no changes and returns false.
Specified by:
delete in interface DatasetRepository
Overrides:
delete in class org.kitesdk.data.spi.filesystem.FileSystemDatasetRepository
Parameters:
name - The name of the dataset to delete.
Returns:
true if any data or metadata is removed, false
if no action is taken.
public URI getUri()
Description copied from interface: DatasetRepository
Return the URI of this repository. Passing this URI to DatasetRepositories.open(java.net.URI) (or DatasetRepositories.openRandomAccess(java.net.URI)) returns a DatasetRepository equivalent to this one.
Specified by:
getUri in interface DatasetRepository
Overrides:
getUri in class org.kitesdk.data.spi.filesystem.FileSystemDatasetRepository
Returns:
the URI of this repository
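The round-trip property of getUri() can be illustrated with a minimal sketch built only on java.net.URI. Everything here is a stand-in rather than Kite code: SketchRepository and open are hypothetical, and the URI strings ("repo:hive" for managed tables, "repo:hive:" followed by a root path for external tables) are assumptions made for illustration, not formats confirmed by this page.

```java
import java.net.URI;
import java.util.Objects;

public class RepositoryUriSketch {

    /** Hypothetical repository identified by an optional root directory. */
    static final class SketchRepository {
        // null models a "managed tables" repository; a path models "external tables".
        private final String rootDirectory;

        SketchRepository(String rootDirectory) {
            this.rootDirectory = rootDirectory;
        }

        /** Builds a URI that carries everything needed to reopen this repository. */
        URI getUri() {
            // Assumed URI layout, for illustration only.
            return URI.create(rootDirectory == null ? "repo:hive" : "repo:hive:" + rootDirectory);
        }

        @Override
        public boolean equals(Object other) {
            return other instanceof SketchRepository
                && Objects.equals(rootDirectory, ((SketchRepository) other).rootDirectory);
        }

        @Override
        public int hashCode() {
            return Objects.hashCode(rootDirectory);
        }
    }

    /** Hypothetical stand-in for DatasetRepositories.open(URI). */
    static SketchRepository open(URI uri) {
        if (!"repo".equals(uri.getScheme())) {
            throw new IllegalArgumentException("Not a repository URI: " + uri);
        }
        String rest = uri.getSchemeSpecificPart();
        if (rest.equals("hive")) {
            return new SketchRepository(null);
        }
        if (rest.startsWith("hive:")) {
            return new SketchRepository(rest.substring("hive:".length()));
        }
        throw new IllegalArgumentException("Unsupported repository URI: " + uri);
    }

    public static void main(String[] args) {
        SketchRepository external = new SketchRepository("/data/root");
        // Reopening from the URI yields an equivalent repository.
        System.out.println(open(external.getUri()).equals(external)); // true
    }
}
```

The point of the sketch is the invariant open(repo.getUri()).equals(repo): the URI must encode every setting that distinguishes one repository from another.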