Starting the DataHub data catalog

cd /Users/aditya.relangi/Code/datahub/docker

The following commands interact with the Acryl DataHub catalog.

Create a new dataset

curl 'http://localhost:8080/entities?action=ingest' -X POST --data '
                           "owner":"urn:li:corpuser:Zarina Malik",
                           "owner":"urn:li:corpuser:Aravind Rammoorthy",
                           "owner":"urn:li:corpuser:Prasanth Kanaujia",
                           "description":"User object represents a user of the survey funky platform",
                              "actor":"urn:li:corpuser:Zarina Malik"
                 "com.linkedin.dataset.UpstreamLineage": {
                   "upstreams": [
                       "dataset": "urn:li:dataset:(urn:li:dataPlatform:kafka,user,PROD)",
                       "type": "TRANSFORMED"
                      "customProperties": {
                            "html_wrapper": "<b> | </b>"
                        "externalUrl": "noll",
                        "description": "Kaun User7 description",
                        "tags": []
                        "actor":"urn:li:corpuser:Zarina Malik"
                     "primaryKeys": [
                        "name": "sample foreign key",
                        "foreignFields": [
                         "sourceFields": [
                         "foreignDataset": "urn:li:dataset:(urn:li:dataPlatform:kafka,user,PROD)"
                           "description":"Unique identifier of the user",
                           "description":"First name of the user",
                           "description":"Last name of the user",
                           "description":"Date on which this record is created",
                           "description":"Timestamp when this record was most recently updated",
                           "description":"Timestamp of the latest time the user has logged in",

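For comparison, a minimal well-formed ingest request carrying only an ownership aspect can be sketched as follows. This assumes the snapshot-style payload (com.linkedin.metadata.snapshot.DatasetSnapshot) accepted by the /entities?action=ingest endpoint of the locally running DataHub version. The function name ownership_payload and the DATAOWNER ownership type are illustrative choices, not taken from the notes above.

```shell
# Print a minimal DatasetSnapshot ingest payload with a single owner.
# $1 = dataset URN, $2 = owner URN (both supplied by the caller).
# Assumption: the server accepts the snapshot-style "entity.value" envelope.
ownership_payload() {
    cat <<EOF
{
  "entity": {
    "value": {
      "com.linkedin.metadata.snapshot.DatasetSnapshot": {
        "urn": "$1",
        "aspects": [
          {
            "com.linkedin.common.Ownership": {
              "owners": [
                { "owner": "$2", "type": "DATAOWNER" }
              ],
              "lastModified": { "time": 0, "actor": "$2" }
            }
          }
        ]
      }
    }
  }
}
EOF
}

# Usage (needs a running DataHub GMS on localhost:8080):
#   ownership_payload "urn:li:dataset:(urn:li:dataPlatform:fooe,User,PROD)" \
#                     "urn:li:corpuser:Zarina Malik" |
#     curl 'http://localhost:8080/entities?action=ingest' -X POST --data @-
```

Building the payload in a function keeps the braces balanced in one place, so further aspects can be added to the list without rewriting the envelope.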

Retrieve the dataset

curl 'http://localhost:8080/entities/urn:li:dataset:(urn:li:dataPlatform:fooe,User,PROD)'
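
Dataset URNs contain colons, commas, and parentheses, which can be awkward to place in a URL path. The sketch below builds the URN from its three parts and percent-encodes it before it is used in a request. The helper names dataset_urn and urlencode are hypothetical, and the sketch assumes the server decodes percent-encoded path segments, which is worth verifying against the local DataHub version.

```shell
# Build a DataHub dataset URN from its three parts.
# $1 = platform, $2 = dataset name, $3 = environment (e.g. PROD)
dataset_urn() {
    printf 'urn:li:dataset:(urn:li:dataPlatform:%s,%s,%s)' "$1" "$2" "$3"
}

# Percent-encode every character outside the unreserved set.
# Handles ASCII input only; multi-byte characters are not supported.
urlencode() {
    s=$1
    out=
    while [ -n "$s" ]; do
        rest=${s#?}          # everything after the first character
        c=${s%"$rest"}       # the first character itself
        case $c in
            [A-Za-z0-9._~-]) out=$out$c ;;
            *) out=$out$(printf '%%%02X' "'$c") ;;
        esac
        s=$rest
    done
    printf '%s' "$out"
}

# Usage (needs a running DataHub GMS on localhost:8080):
#   curl "http://localhost:8080/entities/$(urlencode "$(dataset_urn fooe User PROD)")"
```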

Elasticsearch commands

Kafka Connect with Elasticsearch

curl -X POST http://localhost:8083/connectors \
-H "Content-Type: application/json" \
--data-raw '{
  "name": "SINK_ELASTIC_TEST_05",
  "config": {
    "connector.class"                     : "io.confluent.connect.elasticsearch.ElasticsearchSinkConnector",
    "connection.url"                      : "http://elasticsearch:9200",
    "value.converter"                     : "io.confluent.connect.avro.AvroConverter",
    "value.converter.schema.registry.url" : "http://schemaregistry:8082",
    "type.name"                           : "_doc",
    "topics"                              : "user",
    "key.ignore"                          : "true",
    "schema.ignore"                       : "true"
  }
}'

Query the user index

curl -s http://localhost:9200/user/_search \
    -H 'content-type: application/json' \
    -d '{ "size": 42 }' | jq -c '.hits.hits[]'
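
If the search returns no hits, one thing to check is whether the sink connector is actually running. The sketch below queries the Kafka Connect REST status endpoint for the SINK_ELASTIC_TEST_05 connector and prints the connector state and each task state. The function names and the CONNECT_URL variable are hypothetical helpers; the host and port are taken from the connector command in these notes.

```shell
# Base URL of the Kafka Connect REST API (assumed to be the local worker).
CONNECT_URL=${CONNECT_URL:-http://localhost:8083}

# Print the status endpoint for a connector. $1 = connector name.
connector_status_url() {
    printf '%s/connectors/%s/status' "$CONNECT_URL" "$1"
}

# Fetch the status and print the connector state, then one line per task.
# Requires curl and jq, plus a reachable Connect worker.
connector_state() {
    curl -s "$(connector_status_url "$1")" |
        jq -r '.connector.state, (.tasks[] | "task \(.id): \(.state)")'
}

# Usage (needs a running Connect worker):
#   connector_state SINK_ELASTIC_TEST_05
```

A state other than RUNNING for the connector or any of its tasks usually points to a configuration or connectivity problem rather than an empty topic.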



