Skip to main content

Modifying Domains On Dataset

Why Would You Use Domains?

Domains are curated, top-level folders or categories where related assets can be explicitly grouped. Management of Domains can be centralized, or distributed out to Domain owners Currently, an asset can belong to only one Domain at a time. For more information about domains, refer to About DataHub Domains.

Goal Of This Guide

This guide will show you how to

  • create a domain named Marketing
  • read domains attached to a dataset fct_users_created.
  • add a dataset named fct_users_created to a domain named Marketing.
  • remove the domain Marketing from the fct_users_created dataset.

Prerequisites

For this tutorial, you need to deploy DataHub Quickstart and ingest sample data. For detailed steps, please refer to Datahub Quickstart Guide.

Create Domain

mutation createDomain {
createDomain(input: { name: "Marketing", description: "Entities related to the marketing department" })
}

If you see the following response, the operation was successful:

{
"data": {
"createDomain": "<domain_urn>"
},
"extensions": {}
}

Expected Outcomes of Creating Domain

You can now see Marketing domain has been created under Govern > Domains.

domain-created

Read Domains

query {
dataset(urn: "urn:li:dataset:(urn:li:dataPlatform:hive,fct_users_created,PROD)") {
domain {
associatedUrn
domain {
urn
properties {
name
}
}
}
}
}

If you see the following response, the operation was successful:

{
"data": {
"dataset": {
"domain": {
"associatedUrn": "urn:li:dataset:(urn:li:dataPlatform:hive,fct_users_created,PROD)",
"domain": {
"urn": "urn:li:domain:71b3bf7b-2e3f-4686-bfe1-93172c8c4e10",
"properties": {
"name": "Marketing"
}
}
}
}
},
"extensions": {}
}

Add Domains

mutation setDomain {
setDomain(domainUrn: "urn:li:domain:marketing", entityUrn: "urn:li:dataset:(urn:li:dataPlatform:hive,fct_users_created,PROD)")
}

If you see the following response, the operation was successful:

{
"data": {
"setDomain": true
},
"extensions": {}
}

With GraphQL

note

Please note that there are two available endpoints (:8000, :9002) to access graphql. For more information about the differences between these endpoints, please refer to DataHub Metadata Service

Expected Outcomes of Adding Domain

You can now see CustomerAccount domain has been added to user_name column.

tag-added

Remove Domains

mutation unsetDomain {
unsetDomain(
entityUrn:"urn:li:dataset:(urn:li:dataPlatform:hive,fct_users_created,PROD)"
)
}

Expected Response:

{
"data": {
"removeDomain": true
},
"extensions": {}
}

Expected Outcomes of Removing Domain

You can now see a domain Marketing has been removed from the fct_users_created dataset.

domain-removed