title | slug | date | keyword | license |
---|---|---|---|---|
Hadoop catalog |
/hadoop-catalog |
2024-04-02 |
hadoop catalog |
Copyright 2024 Datastrato Pvt Ltd. This software is licensed under the Apache License version 2. |
Hadoop catalog is a fileset catalog that using Hadoop Compatible File System (HCFS) to manage
the storage location of the fileset. Currently, it supports local filesystem and HDFS. For
object storage like S3, GCS, and Azure Blob Storage, you can put the hadoop object store jar like
hadoop-aws into the $GRAVITINO_HOME/catalogs/hadoop/libs
directory to enable the support.
Gravitino itself hasn't yet tested the object storage support, so if you have any issue,
please create an issue.
Note that Gravitino uses Hadoop 3 dependencies to build Hadoop catalog. Theoretically, it should be compatible with both Hadoop 2.x and 3.x, since Gravitino doesn't leverage any new features in Hadoop 3. If there's any compatibility issue, please create an issue.
Property Name | Description | Default Value | Required | Since Version |
---|---|---|---|---|
location |
The storage location managed by Hadoop catalog. | (none) | No | 0.5.0 |
Refer to Catalog operations for more details.
The Hadoop catalog supports creating, updating, deleting, and listing schema.
Property name | Description | Default value | Required | Since Version |
---|---|---|---|---|
location |
The storage location managed by Hadoop schema. | (none) | No | 0.5.0 |
Refer to Schema operation for more details.
- The Hadoop catalog supports creating, updating, deleting, and listing filesets.
None.
Refer to Fileset operations for more details.