
Objects

ActorFilter

The actors that a DataHub Access Policy applies to

Fields

NameDescription
users
[String!]

A disjunctive set of users to apply the policy to

groups
[String!]

A disjunctive set of groups to apply the policy to

resourceOwners
Boolean!

Whether the filter should return TRUE for owners of a particular resource. Only applies to policies of type METADATA, which have a resource associated with them

allUsers
Boolean!

Whether the filter should apply to all users

allGroups
Boolean!

Whether the filter should apply to all groups
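
The field descriptions above suggest that the clauses of an ActorFilter combine as a disjunction. The sketch below is one possible reading of those semantics, not DataHub's actual policy engine: the function name, the treatment of `allGroups` (matching any actor that belongs to at least one group), and the example urns in the usage are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ActorFilter:
    """Mirrors the ActorFilter fields listed above (snake_case names are ours)."""
    resource_owners: bool
    all_users: bool
    all_groups: bool
    users: Optional[List[str]] = None
    groups: Optional[List[str]] = None


def actor_matches(f: ActorFilter, user_urn: str, group_urns: List[str],
                  is_resource_owner: bool = False) -> bool:
    # Assumed semantics: any single matching clause is enough (disjunction).
    if f.all_users:
        return True
    if user_urn in (f.users or []):
        return True
    # Assumption: allGroups matches any actor belonging to at least one group.
    if f.all_groups and group_urns:
        return True
    if any(g in (f.groups or []) for g in group_urns):
        return True
    # resourceOwners only matters when the caller knows the actor owns the resource.
    return f.resource_owners and is_resource_owner
```

For example, a filter listing one user and one group, with `resource_owners=True`, would match that user, any member of that group, and any owner of the resource in question.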

AggregationMetadata

Information about the aggregation that can be used for filtering, including the field value and number of results

Fields

NameDescription
value
String!

A particular value of a facet field

count
Long!

The number of search results containing the value

entity
Entity

Entity corresponding to the facet field

AnalyticsConfig

Configurations related to the Analytics Feature

Fields

NameDescription
enabled
Boolean!

Whether the Analytics feature is enabled and should be displayed

AppConfig

Config loaded at application boot time. This configuration dictates the behavior of the UI, such as which features are enabled or disabled

Fields

NameDescription
analyticsConfig
AnalyticsConfig!

Configurations related to the Analytics Feature

policiesConfig
PoliciesConfig!

Configurations related to the Policies Feature

identityManagementConfig
IdentityManagementConfig!

Configurations related to the User & Group management

AuditStamp

A time stamp along with an optional actor

Fields

NameDescription
time
Long!

When the audited action took place

actor
String

Who performed the audited action
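
An AuditStamp carries `time` as a `Long` and an optional `actor`. The sketch below shows how a client might render one, assuming (this is not stated above) that `time` is milliseconds since the Unix epoch. The helper names are hypothetical.

```python
from datetime import datetime, timezone


def audit_time_to_datetime(time_millis: int) -> datetime:
    # Assumption: the Long value is epoch milliseconds in UTC.
    seconds, millis = divmod(time_millis, 1000)
    base = datetime.fromtimestamp(seconds, tz=timezone.utc)
    return base.replace(microsecond=millis * 1000)


def describe(stamp: dict) -> str:
    # `actor` is nullable in the schema, so fall back to a placeholder.
    when = audit_time_to_datetime(stamp["time"]).isoformat()
    who = stamp.get("actor") or "unknown actor"
    return f"{when} by {who}"
```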

AuthenticatedUser

Information about the currently authenticated user

Fields

NameDescription
corpUser
CorpUser!

The user information associated with the authenticated user, including properties used in rendering the profile

platformPrivileges
PlatformPrivileges!

The privileges assigned to the currently authenticated user, which dictate which parts of the UI they should be able to use

AutoCompleteMultipleResults

The results returned on a multi entity autocomplete query

Fields

NameDescription
query
String!

The raw query string

suggestions
[AutoCompleteResultForEntity!]!

The autocompletion suggestions

AutoCompleteResultForEntity

An individual auto complete result specific to an individual Metadata Entity Type

Fields

NameDescription
type
EntityType!

Entity type

suggestions
[String!]!

The autocompletion results for specified entity type

AutoCompleteResults

The results returned on a single entity autocomplete query

Fields

NameDescription
query
String!

The query string

suggestions
[String!]!

The autocompletion results

BaseData

Fields

NameDescription
dataset
String!

Dataset used for the Training or Evaluation of the MLModel

motivation
String

Motivation to pick these datasets

preProcessing
[String!]

Details of Data Preprocessing

BooleanBox

Fields

NameDescription
booleanValue
Boolean!

BrowsePath

A hierarchical entity path

Fields

NameDescription
path
[String!]!

The components of the browse path

BrowseResultGroup

A group of Entities under a given browse path

Fields

NameDescription
name
String!

The path name of a group of browse results

count
Long!

The number of entities within the group

BrowseResultMetadata

Metadata about the Browse Paths response

Fields

NameDescription
path
[String!]!

The provided path

totalNumEntities
Long!

The total number of entities under the provided browse path

BrowseResults

The results of a browse path traversal query

Fields

NameDescription
entities
[Entity!]!

The browse results

groups
[BrowseResultGroup!]!

The groups present at the provided browse path

start
Int!

The starting point of paginated results

count
Int!

The number of elements included in the results

total
Int!

The total number of browse results under the path with filters applied

metadata
BrowseResultMetadata!

Metadata containing resulting browse groups
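
The `start`, `count`, and `total` fields of BrowseResults are enough to page through a browse path. The following is a minimal sketch, assuming `start` is a zero-based offset and `count` is the number of items returned in the current page. The function name is hypothetical.

```python
from typing import Optional


def next_start(start: int, count: int, total: int) -> Optional[int]:
    """Return the offset of the next page, or None when the listing is exhausted."""
    following = start + count
    # An empty page or an offset at/after the total means there is nothing left.
    if count <= 0 or following >= total:
        return None
    return following
```

A client would feed the returned value back as the next request's offset and stop once it receives `None`.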

CaveatDetails

Fields

NameDescription
needsFurtherTesting
Boolean

Did the results suggest any further testing

caveatDescription
String

Caveat Description

groupsNotRepresented
[String!]

Relevant groups that were not represented in the evaluation dataset

CaveatsAndRecommendations

Fields

NameDescription
caveats
CaveatDetails

Caveats on using this MLModel

recommendations
String

Recommendations on where this MLModel should be used

idealDatasetCharacteristics
[String!]

Ideal characteristics of an evaluation dataset for this MLModel

Chart

A Chart Metadata Entity

Implements

Fields

NameDescription
urn
String!

The primary key of the Chart

type
EntityType!

A standard Entity Type

tool
String!

The chart tool name. Note that this field will soon be deprecated in favor of a unified notion of Data Platform

chartId
String!

An id unique within the charting tool

properties
ChartProperties

Additional read only properties about the Chart

editableProperties
ChartEditableProperties

Additional read write properties about the Chart

query
ChartQuery

Info about the query which is used to render the chart

ownership
Ownership

Ownership metadata of the chart

status
Status

Status metadata of the chart

tags
GlobalTags

The tags associated with the chart

institutionalMemory
InstitutionalMemory

References to internal resources related to the chart

glossaryTerms
GlossaryTerms

The structured glossary terms associated with the chart

relationships
EntityRelationshipsResult

Edges extending from this entity

Arguments

NameDescription
input
RelationshipsInput!

info
ChartInfo
Deprecated: No longer supported

Deprecated, use properties field instead. Additional read only information about the chart

editableInfo
ChartEditableProperties
Deprecated: No longer supported

Deprecated, use editableProperties field instead. Additional read write information about the Chart

globalTags
GlobalTags
Deprecated: No longer supported

Deprecated, use tags instead The structured tags associated with the chart
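
The Chart type lists three deprecated fields together with their replacements. The sketch below captures that mapping so a client can rewrite an old top-level field selection. The helper name is hypothetical, and it only renames top-level fields: the sub-selections under a deprecated field and its replacement are not guaranteed to be identical.

```python
# Deprecated Chart field -> replacement, as stated in the descriptions above.
CHART_FIELD_REPLACEMENTS = {
    "info": "properties",
    "editableInfo": "editableProperties",
    "globalTags": "tags",
}


def modernize_fields(fields):
    """Swap deprecated field names for their replacements, dropping duplicates."""
    seen = []
    for name in fields:
        current = CHART_FIELD_REPLACEMENTS.get(name, name)
        if current not in seen:
            seen.append(current)
    return seen
```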

ChartEditableProperties

Chart properties that are editable via the UI. This represents logical metadata, as opposed to technical metadata

Fields

NameDescription
description
String

Description of the Chart

ChartInfo

Deprecated, use ChartProperties instead. Additional read only information about the chart

Fields

NameDescription
name
String!

Display name of the chart

description
String

Description of the chart

inputs
[Dataset!]
Deprecated: No longer supported

Deprecated, use relationship Consumes instead. Data sources for the chart

externalUrl
String

Native platform URL of the chart

type
ChartType

Type of the chart

access
AccessLevel

Access level for the chart

customProperties
[StringMapEntry!]

A list of platform specific metadata tuples

lastRefreshed
Long

The time when this chart last refreshed

created
AuditStamp!

An AuditStamp corresponding to the creation of this chart

lastModified
AuditStamp!

An AuditStamp corresponding to the modification of this chart

deleted
AuditStamp

An optional AuditStamp corresponding to the deletion of this chart

ChartProperties

Additional read only properties about the chart

Fields

NameDescription
name
String!

Display name of the chart

description
String

Description of the chart

externalUrl
String

Native platform URL of the chart

type
ChartType

Type of the chart

access
AccessLevel

Access level for the chart

customProperties
[StringMapEntry!]

A list of platform specific metadata tuples

lastRefreshed
Long

The time when this chart last refreshed

created
AuditStamp!

An AuditStamp corresponding to the creation of this chart

lastModified
AuditStamp!

An AuditStamp corresponding to the modification of this chart

deleted
AuditStamp

An optional AuditStamp corresponding to the deletion of this chart

ChartQuery

The query that was used to populate a Chart

Fields

NameDescription
rawQuery
String!

Raw query to build a chart from input datasets

type
ChartQueryType!

The type of the chart query

CorpGroup

A DataHub Group entity, which represents a Group on the Metadata Entity Graph

Implements

Fields

NameDescription
urn
String!

The primary key of the group

type
EntityType!

A standard Entity Type

name
String!

Group name, e.g. wherehows dev, ask_metadata

properties
CorpGroupProperties

Additional read only properties about the group

relationships
EntityRelationshipsResult

Edges extending from this entity

Arguments

NameDescription
input
RelationshipsInput!

info
CorpGroupInfo
Deprecated: No longer supported

Deprecated, use properties field instead. Additional read only info about the group

CorpGroupInfo

Deprecated, use CorpGroupProperties instead. Additional read only info about a group

Fields

NameDescription
displayName
String

The name to display when rendering the group

description
String

The description provided for the group

email
String

Email of this group

admins
[CorpUser!]
Deprecated: No longer supported

Deprecated, do not use. Owners of this group

members
[CorpUser!]
Deprecated: No longer supported

Deprecated, use relationship IsMemberOfGroup instead. List of LDAP urns in this group

groups
[String!]
Deprecated: No longer supported

Deprecated, do not use. List of group urns in this group

CorpGroupProperties

Additional read only properties about a group

Fields

NameDescription
displayName
String

Display name of this group

description
String

The description provided for the group

email
String

Email of this group

CorpUser

A DataHub User entity, which represents a Person on the Metadata Entity Graph

Implements

Fields

NameDescription
urn
String!

The primary key of the user

type
EntityType!

The standard Entity Type

username
String!

A username associated with the user. This uniquely identifies the user within DataHub

properties
CorpUserProperties

Additional read only properties about the corp user

editableProperties
CorpUserEditableProperties

Read write properties about the corp user

status
CorpUserStatus

The status of the user

tags
GlobalTags

The tags associated with the user

relationships
EntityRelationshipsResult

Edges extending from this entity

Arguments

NameDescription
input
RelationshipsInput!

info
CorpUserInfo
Deprecated: No longer supported

Deprecated, use properties field instead. Additional read only info about the corp user

editableInfo
CorpUserEditableInfo
Deprecated: No longer supported

Deprecated, use editableProperties field instead. Read write info about the corp user

globalTags
GlobalTags
Deprecated: No longer supported

Deprecated, use the tags field instead. The structured tags associated with the user

CorpUserEditableInfo

Deprecated, use CorpUserEditableProperties instead. Additional read write info about a user

Fields

NameDescription
aboutMe
String

About me section of the user

teams
[String!]

Teams that the user belongs to

skills
[String!]

Skills that the user possesses

pictureLink
String

A URL pointing to a picture that the user wants to set as a profile photo

CorpUserEditableProperties

Additional read write properties about a user

Fields

NameDescription
aboutMe
String

About me section of the user

teams
[String!]

Teams that the user belongs to

skills
[String!]

Skills that the user possesses

pictureLink
String

A URL pointing to a picture that the user wants to set as a profile photo

CorpUserInfo

Deprecated, use CorpUserProperties instead. Additional read only info about a user

Fields

NameDescription
active
Boolean!

Whether the user is active

displayName
String

Display name of the user

email
String

Email address of the user

title
String

Title of the user

manager
CorpUser

Direct manager of the user

departmentId
Long

Department ID the user belongs to

departmentName
String

Department name the user belongs to

firstName
String

First name of the user

lastName
String

Last name of the user

fullName
String

Common name of this user, format is firstName plus lastName

countryCode
String

Two-letter uppercase country code

CorpUserProperties

Additional read only properties about a user

Fields

NameDescription
active
Boolean!

Whether the user is active

displayName
String

Display name of the user

email
String

Email address of the user

title
String

Title of the user

manager
CorpUser

Direct manager of the user

departmentId
Long

Department ID the user belongs to

departmentName
String

Department name the user belongs to

firstName
String

First name of the user

lastName
String

Last name of the user

fullName
String

Common name of this user, format is firstName plus lastName

countryCode
String

Two-letter uppercase country code

Cost

Fields

NameDescription
costType
CostType!

Type of Cost Code

costValue
CostValue!

Code to which the cost of this entity should be attributed, i.e. organizational cost ID

CostValue

Fields

NameDescription
costId
Float

Organizational Cost ID

costCode
String

Organizational Cost Code

Dashboard

A Dashboard Metadata Entity

Implements

Fields

NameDescription
urn
String!

The primary key of the Dashboard

type
EntityType!

A standard Entity Type

tool
String!

The dashboard tool name. Note that this will soon be deprecated in favor of a standardized notion of Data Platform

dashboardId
String!

An id unique within the dashboard tool

properties
DashboardProperties

Additional read only properties about the dashboard

editableProperties
DashboardEditableProperties

Additional read write properties about the dashboard

ownership
Ownership

Ownership metadata of the dashboard

status
Status

Status metadata of the dashboard

tags
GlobalTags

The tags associated with the dashboard

institutionalMemory
InstitutionalMemory

References to internal resources related to the dashboard

glossaryTerms
GlossaryTerms

The structured glossary terms associated with the dashboard

relationships
EntityRelationshipsResult

Edges extending from this entity

Arguments

NameDescription
input
RelationshipsInput!

info
DashboardInfo
Deprecated: No longer supported

Deprecated, use properties field instead. Additional read only information about the dashboard

editableInfo
DashboardEditableProperties
Deprecated: No longer supported

Deprecated, use editableProperties instead. Additional read write properties about the Dashboard

globalTags
GlobalTags
Deprecated: No longer supported

Deprecated, use tags field instead. The structured tags associated with the dashboard

DashboardEditableProperties

Dashboard properties that are editable via the UI. This represents logical metadata, as opposed to technical metadata

Fields

NameDescription
description
String

Description of the Dashboard

DashboardInfo

Deprecated, use DashboardProperties instead. Additional read only info about a Dashboard

Fields

NameDescription
name
String!

Display name of the dashboard

description
String

Description of the dashboard

charts
[Chart!]!
Deprecated: No longer supported

Deprecated, use relationship Contains instead. Charts that comprise the dashboard

externalUrl
String

Native platform URL of the dashboard

access
AccessLevel

Access level for the dashboard. Note that this will soon be deprecated for low usage

customProperties
[StringMapEntry!]

A list of platform specific metadata tuples

lastRefreshed
Long

The time when this dashboard last refreshed

created
AuditStamp!

An AuditStamp corresponding to the creation of this dashboard

lastModified
AuditStamp!

An AuditStamp corresponding to the modification of this dashboard

deleted
AuditStamp

An optional AuditStamp corresponding to the deletion of this dashboard

DashboardProperties

Additional read only properties about a Dashboard

Fields

NameDescription
name
String!

Display name of the dashboard

description
String

Description of the dashboard

externalUrl
String

Native platform URL of the dashboard

access
AccessLevel

Access level for the dashboard. Note that this will soon be deprecated for low usage

customProperties
[StringMapEntry!]

A list of platform specific metadata tuples

lastRefreshed
Long

The time when this dashboard last refreshed

created
AuditStamp!

An AuditStamp corresponding to the creation of this dashboard

lastModified
AuditStamp!

An AuditStamp corresponding to the modification of this dashboard

deleted
AuditStamp

An optional AuditStamp corresponding to the deletion of this dashboard

DataFlow

A Data Flow Metadata Entity, representing a set of pipelined Data Jobs or Tasks required to produce an output Dataset. Also known as a Data Pipeline

Implements

Fields

NameDescription
urn
String!

The primary key of a Data Flow

type
EntityType!

A standard Entity Type

orchestrator
String!

Workflow orchestrator, e.g. Azkaban, Airflow

flowId
String!

Id of the flow

cluster
String!

Cluster of the flow

properties
DataFlowProperties

Additional read only properties about a Data flow

editableProperties
DataFlowEditableProperties

Additional read write properties about a Data Flow

ownership
Ownership

Ownership metadata of the flow

tags
GlobalTags

The tags associated with the dataflow

status
Status

Status metadata of the dataflow

institutionalMemory
InstitutionalMemory

References to internal resources related to the Data Flow

glossaryTerms
GlossaryTerms

The structured glossary terms associated with the Data Flow

relationships
EntityRelationshipsResult

Edges extending from this entity

Arguments

NameDescription
input
RelationshipsInput!

info
DataFlowInfo
Deprecated: No longer supported

Deprecated, use properties field instead. Additional read only information about a Data flow

globalTags
GlobalTags
Deprecated: No longer supported

Deprecated, use tags field instead. The structured tags associated with the dataflow

dataJobs
DataFlowDataJobsRelationships
Deprecated: No longer supported

Deprecated, use relationship IsPartOf instead. Data Jobs

DataFlowDataJobsRelationships

Deprecated, use relationships query instead

Fields

NameDescription
entities
[EntityRelationshipLegacy]

DataFlowEditableProperties

Data Flow properties that are editable via the UI. This represents logical metadata, as opposed to technical metadata

Fields

NameDescription
description
String

Description of the Data Flow

DataFlowInfo

Deprecated, use DataFlowProperties instead. Additional read only properties about a Data Flow aka Pipeline

Fields

NameDescription
name
String!

Display name of the flow

description
String

Description of the flow

project
String

Optional project or namespace associated with the flow

externalUrl
String

External URL associated with the DataFlow

customProperties
[StringMapEntry!]

A list of platform specific metadata tuples

DataFlowProperties

Additional read only properties about a Data Flow aka Pipeline

Fields

NameDescription
name
String!

Display name of the flow

description
String

Description of the flow

project
String

Optional project or namespace associated with the flow

externalUrl
String

External URL associated with the DataFlow

customProperties
[StringMapEntry!]

A list of platform specific metadata tuples

DataJob

A Data Job Metadata Entity, representing an individual unit of computation or Task to produce an output Dataset. Always part of a parent Data Flow aka Pipeline

Implements

Fields

NameDescription
urn
String!

The primary key of the Data Job

type
EntityType!

A standard Entity Type

dataFlow
DataFlow

Deprecated, use relationship IsPartOf instead. The associated data flow

jobId
String!

Id of the job

properties
DataJobProperties

Additional read only properties associated with the Data Job

editableProperties
DataJobEditableProperties

Additional read write properties associated with the Data Job

tags
GlobalTags

The tags associated with the DataJob

ownership
Ownership

Ownership metadata of the job

status
Status

Status metadata of the DataJob

institutionalMemory
InstitutionalMemory

References to internal resources related to the Data Job

glossaryTerms
GlossaryTerms

The structured glossary terms associated with the Data Job

relationships
EntityRelationshipsResult

Edges extending from this entity

Arguments

NameDescription
input
RelationshipsInput!

info
DataJobInfo
Deprecated: No longer supported

Deprecated, use properties field instead. Additional read only information about a Data processing job

inputOutput
DataJobInputOutput
Deprecated: No longer supported

Deprecated, use relationship Produces, Consumes, DownstreamOf instead. Information about the inputs and outputs of a Data processing job

globalTags
GlobalTags
Deprecated: No longer supported

Deprecated, use the tags field instead. The structured tags associated with the DataJob

DataJobEditableProperties

Data Job properties that are editable via the UI. This represents logical metadata, as opposed to technical metadata

Fields

NameDescription
description
String

Description of the Data Job

DataJobInfo

Deprecated, use DataJobProperties instead. Additional read only information about a Data Job aka Task

Fields

NameDescription
name
String!

Job display name

description
String

Job description

externalUrl
String

External URL associated with the DataJob

customProperties
[StringMapEntry!]

A list of platform specific metadata tuples

DataJobInputOutput

The lineage information for a DataJob. TODO: Rename this to align with other Lineage models

Fields

NameDescription
inputDatasets
[Dataset!]
Deprecated: No longer supported

Deprecated, use relationship Consumes instead. Input datasets consumed by the data job during processing

outputDatasets
[Dataset!]
Deprecated: No longer supported

Deprecated, use relationship Produces instead. Output datasets produced by the data job during processing

inputDatajobs
[DataJob!]
Deprecated: No longer supported

Deprecated, use relationship DownstreamOf instead. Input datajobs that this data job depends on

DataJobProperties

Additional read only properties about a Data Job aka Task

Fields

NameDescription
name
String!

Job display name

description
String

Job description

externalUrl
String

External URL associated with the DataJob

customProperties
[StringMapEntry!]

A list of platform specific metadata tuples

DataPlatform

A Data Platform represents a specific third party Data System or Tool. Examples include warehouses like Snowflake, orchestrators like Airflow, and dashboarding tools like Looker

Implements

Fields

NameDescription
urn
String!

Urn of the data platform

type
EntityType!

A standard Entity Type

name
String!

Name of the data platform

properties
DataPlatformProperties

Additional read only properties associated with a data platform

displayName
String
Deprecated: No longer supported

Deprecated, use properties displayName instead. Display name of the data platform

info
DataPlatformInfo
Deprecated: No longer supported

Deprecated, use properties field instead. Additional properties associated with a data platform

relationships
EntityRelationshipsResult

Edges extending from this entity

Arguments

NameDescription
input
RelationshipsInput!

DataPlatformInfo

Deprecated, use DataPlatformProperties instead. Additional read only information about a Data Platform

Fields

NameDescription
type
PlatformType!

The platform category

displayName
String

Display name associated with the platform

datasetNameDelimiter
String!

The delimiter in the dataset names on the data platform

logoUrl
String

A logo URL associated with the platform

DataPlatformProperties

Additional read only properties about a Data Platform

Fields

NameDescription
type
PlatformType!

The platform category

displayName
String

Display name associated with the platform

datasetNameDelimiter
String!

The delimiter in the dataset names on the data platform

logoUrl
String

A logo URL associated with the platform

Dataset

A Dataset entity, which encompasses Relational Tables, Document store collections, streaming topics, and other sets of data having an independent lifecycle

Implements

Fields

NameDescription
urn
String!

The primary key of the Dataset

type
EntityType!

The standard Entity Type

platform
DataPlatform!

Standardized platform urn where the dataset is defined

name
String!

The Dataset display name

properties
DatasetProperties

An additional set of read only properties

editableProperties
DatasetEditableProperties

An additional set of read write properties

ownership
Ownership

Ownership metadata of the dataset

deprecation
Deprecation

The deprecation status

institutionalMemory
InstitutionalMemory

References to internal resources related to the dataset

schemaMetadata
SchemaMetadata

Schema metadata of the dataset, available by version number

Arguments

NameDescription
version
Long

editableSchemaMetadata
EditableSchemaMetadata

Editable schema metadata of the dataset

status
Status

Status of the Dataset

tags
GlobalTags

Tags used for searching dataset

glossaryTerms
GlossaryTerms

The structured glossary terms associated with the dataset

usageStats
UsageQueryResult

Statistics about how this Dataset is used

Arguments

NameDescription
resource
String!
range
TimeRange

datasetProfiles
[DatasetProfile!]

Profile Stats resource that retrieves the events in a previous unit of time in descending order. If no start or end time is provided, the most recent events will be returned

Arguments

NameDescription
startTimeMillis
Long
endTimeMillis
Long
limit
Int

relationships
EntityRelationshipsResult

Edges extending from this entity

Arguments

NameDescription
input
RelationshipsInput!

schema
Schema
Deprecated: Use `schemaMetadata`

Schema metadata of the dataset

externalUrl
String
Deprecated: No longer supported

Deprecated, use properties field instead. External URL associated with the Dataset

origin
FabricType!
Deprecated: No longer supported

Deprecated, use the properties field instead. Environment to which the dataset belongs or where it was generated. Note that this field will soon be deprecated in favor of a more standardized concept of Environment

description
String
Deprecated: No longer supported

Deprecated, use the properties field instead. Read only technical description for dataset

platformNativeType
PlatformNativeType
Deprecated: No longer supported

Deprecated, do not use this field. The logical type of the dataset, i.e. table, stream, etc.

uri
String
Deprecated: No longer supported

Deprecated, use properties instead. Native Dataset URI. The URI should not include any environment specific properties

globalTags
GlobalTags
Deprecated: No longer supported

Deprecated, use tags field instead. The structured tags associated with the dataset

subTypes
SubTypes

Sub Types that this entity implements

viewProperties
ViewProperties

View related properties. Only relevant if subtypes field contains VIEW.
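
The `datasetProfiles` field on Dataset accepts `startTimeMillis`, `endTimeMillis`, and `limit`. The sketch below builds a JSON request body that uses those arguments. It assumes a root query field `dataset(urn: String!)`, which is not described in this section, and the operation name, variable names, and helper name are hypothetical. Nothing is sent over the network.

```python
import json

# Assumption: a root field `dataset(urn:)` exists; only the nested field and
# its arguments come from the Dataset type documented above.
DATASET_PROFILES_QUERY = """
query datasetProfiles($urn: String!, $start: Long, $end: Long, $limit: Int) {
  dataset(urn: $urn) {
    datasetProfiles(startTimeMillis: $start, endTimeMillis: $end, limit: $limit) {
      timestampMillis
      rowCount
      columnCount
    }
  }
}
"""


def build_request(urn, start_ms=None, end_ms=None, limit=None):
    """Serialize the query plus only those variables the caller supplied."""
    variables = {"urn": urn}
    for key, value in (("start", start_ms), ("end", end_ms), ("limit", limit)):
        if value is not None:
            variables[key] = value
    return json.dumps({"query": DATASET_PROFILES_QUERY, "variables": variables})
```

Leaving out both time bounds matches the documented behavior in which the most recent events are returned.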

DatasetDeprecation

Deprecated, use Deprecation instead. Information about Dataset deprecation status. Note that this model will soon be migrated to a more general purpose Entity status

Fields

NameDescription
deprecated
Boolean!

Whether the dataset has been deprecated by owner

decommissionTime
Long

The time at which the user plans to decommission this dataset

note
String!

Additional information about the dataset deprecation plan

actor
String

The user who will be credited for modifying this deprecation content

DatasetEditableProperties

Dataset properties that are editable via the UI. This represents logical metadata, as opposed to technical metadata

Fields

NameDescription
description
String

Description of the Dataset

DatasetFieldProfile

An individual Dataset Field Profile

Fields

NameDescription
fieldPath
String!

The standardized path of the field

uniqueCount
Long

The unique value count for the field across the Dataset

uniqueProportion
Float

The proportion of rows with unique values across the Dataset

nullCount
Long

The number of NULL row values across the Dataset

nullProportion
Float

The proportion of rows with NULL values across the Dataset

min
String

The min value for the field

max
String

The max value for the field

mean
String

The mean value for the field

median
String

The median value for the field

stdev
String

The standard deviation for the field

sampleValues
[String!]

A set of sample values for the field
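
DatasetFieldProfile reports `min`, `max`, `mean`, `median`, and `stdev` as nullable `String` values, so a client that wants numbers has to parse them. A minimal sketch, assuming the response has already been decoded into a dict keyed by the field names above (the helper names are hypothetical):

```python
from typing import Optional

NUMERIC_STATS = ("min", "max", "mean", "median", "stdev")


def to_float(value: Optional[str]) -> Optional[float]:
    """Parse a string statistic, returning None when absent or non-numeric."""
    if value is None:
        return None
    try:
        return float(value)
    except ValueError:
        return None


def numeric_stats(field_profile: dict) -> dict:
    # Missing keys are treated the same as explicit nulls.
    return {name: to_float(field_profile.get(name)) for name in NUMERIC_STATS}
```

Non-numeric values (for example, the minimum of a text column) are mapped to `None` here rather than raising. That is a design choice of this sketch, not documented behavior.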

DatasetProfile

A Dataset Profile associated with a Dataset, containing profiling statistics about the Dataset

Implements

Fields

NameDescription
timestampMillis
Long!

The time at which the profile was reported

rowCount
Long

An optional row count of the Dataset

columnCount
Long

An optional column count of the Dataset

fieldProfiles
[DatasetFieldProfile!]

An optional set of per field statistics obtained in the profile

partitionSpec
PartitionSpec

Information about the partition that was profiled

DatasetProperties

Additional read only properties about a Dataset

Fields

NameDescription
origin
FabricType!

Environment to which the dataset belongs or where it was generated. Note that this field will soon be deprecated in favor of a more standardized concept of Environment

description
String

Read only technical description for dataset

customProperties
[StringMapEntry!]

Custom properties of the Dataset

externalUrl
String

External URL associated with the Dataset

Deprecation

Information about Metadata Entity deprecation status

Fields

NameDescription
deprecated
Boolean!

Whether the entity has been deprecated by owner

decommissionTime
Long

The time at which the user plans to decommission this entity

note
String!

Additional information about the entity deprecation plan

actor
String

The user who will be credited for modifying this deprecation content

DownstreamEntityRelationships

Deprecated, use relationships query instead

Fields

NameDescription
entities
[EntityRelationshipLegacy]

EditableSchemaFieldInfo

Editable schema field metadata, i.e. descriptions, tags, etc.

Fields

NameDescription
fieldPath
String!

Flattened name of a field identifying the field the editable info is applied to

description
String

Edited description of the field

globalTags
GlobalTags
Deprecated: No longer supported

Deprecated, use tags field instead. Tags associated with the field

tags
GlobalTags

Tags associated with the field

glossaryTerms
GlossaryTerms

Glossary terms associated with the field

EditableSchemaMetadata

Information about schema metadata that is editable via the UI

Fields

NameDescription
editableSchemaFieldInfo
[EditableSchemaFieldInfo!]!

Editable schema field metadata

EditableTagProperties

Additional read write Tag properties

Fields

NameDescription
description
String

A description of the Tag

EntityCountResult

Fields

NameDescription
entityType
EntityType!
count
Int!

EntityCountResults

Fields

NameDescription
counts
[EntityCountResult!]

EntityRelationship

A relationship between two entities. TODO: Migrate all entity relationships to this more generic model

Fields

NameDescription
type
String!

The type of the relationship

direction
RelationshipDirection!

The direction of the relationship relative to the source entity

entity
Entity!

Entity that is related via lineage

created
AuditStamp

An AuditStamp corresponding to the last modification of this relationship

EntityRelationshipLegacy

Deprecated, use relationships query instead

Fields

NameDescription
entity
EntityWithRelationships

Entity that is related via lineage

created
AuditStamp

An AuditStamp corresponding to the last modification of this relationship

EntityRelationshipsResult

A list of relationship information associated with a source Entity

Fields

NameDescription
start
Int

Start offset of the result set

count
Int

Number of results in the returned result set

total
Int

Total number of results in the result set

relationships
[EntityRelationship!]!

Relationships in the result set
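
Several deprecated fields in this reference point to the generic `relationships` field as their replacement, so clients often need to split an EntityRelationshipsResult by relationship type. The sketch below groups a decoded result by `type` and `direction`. It assumes each related entity exposes a `urn`, and the direction strings used in any sample data are assumptions, since the RelationshipDirection enum values are not listed in this section.

```python
from collections import defaultdict


def group_by_type(result: dict) -> dict:
    """Map (relationship type, direction) to the urns of the related entities."""
    grouped = defaultdict(list)
    for rel in result["relationships"]:
        # Assumption: the selection set requested `entity { urn }`.
        grouped[(rel["type"], rel["direction"])].append(rel["entity"]["urn"])
    return dict(grouped)
```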

EthicalConsiderations

Fields

NameDescription
data
[String!]

Does the model use any sensitive data, e.g. protected classes

humanLife
[String!]

Is the model intended to inform decisions about matters central to human life or flourishing, e.g. health or safety

mitigations
[String!]

What risk mitigation strategies were used during model development

risksAndHarms
[String!]

What risks may be present in model usage. Try to identify the potential recipients, likelihood, and magnitude of harms. If these cannot be determined, note that they were considered but remain unknown

useCases
[String!]

Are there any known model use cases that are especially fraught? This may connect directly to the intended use section

FacetMetadata

Contains valid fields to filter search results further on

Fields

NameDescription
field
String!

Name of a field present in the search entity

displayName
String

Display name of the field

aggregations
[AggregationMetadata!]!

Aggregated search result counts by value of the field
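
To illustrate how a client might turn facets into filter choices, here is a small sketch. It assumes the facets list has been decoded into Python dicts with the field names documented above; the function name facet_options and the choice to rank values by count are illustrative assumptions, not DataHub behaviour.

```python
from typing import Any, Dict, List, Optional, Tuple


def facet_options(facets: Optional[List[Dict[str, Any]]]) -> Dict[str, List[Tuple[str, int]]]:
    """Map each facet label to its (value, count) pairs, most frequent first."""
    options: Dict[str, List[Tuple[str, int]]] = {}
    for facet in facets or []:
        # displayName is nullable, so fall back to the raw field name.
        label = facet.get("displayName") or facet["field"]
        ranked = sorted(facet["aggregations"], key=lambda agg: agg["count"], reverse=True)
        options[label] = [(agg["value"], agg["count"]) for agg in ranked]
    return options
```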

FieldUsageCounts

The usage for a particular Dataset field

Fields

NameDescription
fieldName
String

The path of the field

count
Int

The count of usages

FloatBox

Fields

NameDescription
floatValue
Float!

ForeignKeyConstraint

Metadata around a foreign key constraint between two datasets

Fields

NameDescription
name
String

The human-readable name of the constraint

foreignFields
[SchemaFieldEntity]

List of fields in the foreign dataset

sourceFields
[SchemaFieldEntity]

List of fields in this dataset

foreignDataset
Dataset

The foreign dataset for easy reference

GlobalTags

Tags attached to a particular Metadata Entity

Fields

NameDescription
tags
[TagAssociation!]

The set of tags attached to the Metadata Entity

GlossaryTerm

A Glossary Term, or a node in a Business Glossary representing a standardized domain data type

Implements

Fields

NameDescription
urn
String!

The primary key of the glossary term

ownership
Ownership

Ownership metadata of the glossary term

type
EntityType!

A standard Entity Type

name
String!

Display name of the glossary term

hierarchicalName
String!

hierarchicalName of glossary term

properties
GlossaryTermProperties

Additional read only properties associated with the Glossary Term

glossaryTermInfo
GlossaryTermInfo!

Deprecated, use properties field instead. Details of the Glossary Term

relationships
EntityRelationshipsResult

Edges extending from this entity

Arguments

NameDescription
input
RelationshipsInput!

GlossaryTermAssociation

An edge between a Metadata Entity and a Glossary Term. Modeled as a struct to permit additional attributes. TODO: Consider whether this query should be serviced by the relationships field

Fields

NameDescription
term
GlossaryTerm!

The glossary term itself

GlossaryTermInfo

Deprecated, use GlossaryTermProperties instead. Information about a glossary term

Fields

NameDescription
definition
String!

Definition of the glossary term

termSource
String!

Term Source of the glossary term

sourceRef
String

Source Ref of the glossary term

sourceUrl
String

Source Url of the glossary term

customProperties
[StringMapEntry!]

Properties of the glossary term

rawSchema
String

Schema definition of glossary term

GlossaryTermProperties

Additional read only properties about a Glossary Term

Fields

NameDescription
definition
String!

Definition of the glossary term

termSource
String!

Term Source of the glossary term

sourceRef
String

Source Ref of the glossary term

sourceUrl
String

Source Url of the glossary term

customProperties
[StringMapEntry!]

Properties of the glossary term

rawSchema
String

Schema definition of glossary term

GlossaryTerms

Glossary Terms attached to a particular Metadata Entity

Fields

NameDescription
terms
[GlossaryTermAssociation!]

The set of glossary terms attached to the Metadata Entity

HyperParameterMap

Fields

NameDescription
key
String!
value
HyperParameterValueType!

IdentityManagementConfig

Configurations related to Identity Management

Fields

NameDescription
enabled
Boolean!

Whether identity management screen is able to be shown in the UI

InstitutionalMemory

Institutional memory metadata, meaning internal links and pointers related to an Entity

Fields

NameDescription
elements
[InstitutionalMemoryMetadata!]!

List of records that represent the institutional memory or internal documentation of an entity

InstitutionalMemoryMetadata

An institutional memory resource about a particular Metadata Entity

Fields

NameDescription
url
String!

Link to a document or wiki page or another internal resource

label
String!

Label associated with the URL

author
CorpUser!

The author of this metadata

created
AuditStamp!

An AuditStamp corresponding to the creation of this resource

description
String!
Deprecated: No longer supported

Deprecated, use label instead. Description of the resource

IntBox

Fields

NameDescription
intValue
Int!

IntendedUse

Fields

NameDescription
primaryUses
[String!]

Primary Use cases for the model

primaryUsers
[IntendedUserType!]

Primary Intended Users

outOfScopeUses
[String!]

Out of scope uses of the MLModel

KeyValueSchema

Information about a raw Key Value Schema

Fields

NameDescription
keySchema
String!

Raw key schema

valueSchema
String!

Raw value schema

ListGroupsResult

The result obtained when listing DataHub Groups

Fields

NameDescription
start
Int!

The starting offset of the result set returned

count
Int!

The number of groups in the returned result set

total
Int!

The total number of groups in the result set

groups
[CorpGroup!]!

The groups themselves

ListPoliciesResult

The result obtained when listing DataHub Access Policies

Fields

NameDescription
start
Int!

The starting offset of the result set returned

count
Int!

The number of Policies in the returned result set

total
Int!

The total number of Policies in the result set

policies
[Policy!]!

The Policies themselves

ListUsersResult

The result obtained when listing DataHub Users

Fields

NameDescription
start
Int!

The starting offset of the result set returned

count
Int!

The number of users in the returned result set

total
Int!

The total number of users in the result set

users
[CorpUser!]!

The users themselves
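
ListGroupsResult, ListPoliciesResult and ListUsersResult share the same start, count and total fields, so one paging loop can serve all three. The sketch below assumes a caller-supplied fetch_page(start, count) callable that returns one decoded result as a dict; that callable, the collect_all name and the default page size are hypothetical and stand in for whatever client code issues the actual query.

```python
from typing import Any, Callable, Dict, List

Page = Dict[str, Any]


def collect_all(fetch_page: Callable[[int, int], Page], items_key: str, page_size: int = 2) -> List[Any]:
    """Walk every page of a List*Result and return the concatenated items."""
    items: List[Any] = []
    start = 0
    while True:
        page = fetch_page(start, page_size)
        batch = page[items_key]  # "groups", "policies" or "users"
        items.extend(batch)
        # Advance by what the server reports it returned, not by page_size.
        start = page["start"] + page["count"]
        # Stop on an empty page too, so a miscounted total cannot loop forever.
        if not batch or start >= page["total"]:
            return items
```

For example, collect_all(fetch_users, "users") would gather every CorpUser, given a fetch_users function that wraps the user listing query.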

MatchedField

An overview of the field that was matched in the entity search document

Fields

NameDescription
name
String!

Name of the field that matched

value
String!

Value of the field that matched

Metrics

Fields

NameDescription
performanceMeasures
[String!]

Measures of ML Model performance

decisionThreshold
[String!]

Decision Thresholds used if any

MLFeature

An ML Feature Metadata Entity. Note that this entity is incubating

Implements

Fields

NameDescription
urn
String!

The primary key of the ML Feature

type
EntityType!

A standard Entity Type

name
String!

The display name for the ML Feature

featureNamespace
String!

MLFeature featureNamespace

description
String

The description about the ML Feature

dataType
MLFeatureDataType

MLFeature data type

ownership
Ownership

Ownership metadata of the MLFeature

featureProperties
MLFeatureProperties

ModelProperties metadata of the MLFeature

institutionalMemory
InstitutionalMemory

References to internal resources related to the MLFeature

status
Status

Status metadata of the MLFeature

deprecation
Deprecation

Deprecation

relationships
EntityRelationshipsResult

Edges extending from this entity

Arguments

NameDescription
input
RelationshipsInput!

MLFeatureProperties

Fields

NameDescription
description
String
dataType
MLFeatureDataType
version
VersionTag
sources
[Dataset]

MLFeatureTable

An ML Feature Table Entity. Note that this entity is incubating

Implements

Fields

NameDescription
urn
String!

The primary key of the ML Feature Table

type
EntityType!

A standard Entity Type

name
String!

The display name

platform
DataPlatform!

Standardized platform urn where the MLFeatureTable is defined

description
String

MLFeatureTable description

ownership
Ownership

Ownership metadata of the MLFeatureTable

properties
MLFeatureTableProperties

Additional read only properties associated with the ML Feature Table

featureTableProperties
MLFeatureTableProperties
Deprecated: No longer supported

Deprecated, use properties field instead. ModelProperties metadata of the MLFeature

institutionalMemory
InstitutionalMemory

References to internal resources related to the MLFeatureTable

status
Status

Status metadata of the MLFeatureTable

deprecation
Deprecation

Deprecation

relationships
EntityRelationshipsResult

Edges extending from this entity

Arguments

NameDescription
input
RelationshipsInput!

MLFeatureTableProperties

Fields

NameDescription
description
String
mlFeatures
[MLFeature]
mlPrimaryKeys
[MLPrimaryKey]

MLHyperParam

Fields

NameDescription
name
String
description
String
value
String
createdAt
Long

MLMetric

Fields

NameDescription
name
String
description
String
value
String
createdAt
Long

MLModel

An ML Model Metadata Entity. Note that this entity is incubating

Implements

Fields

NameDescription
urn
String!

The primary key of the ML model

type
EntityType!

A standard Entity Type

name
String!

ML model display name

platform
DataPlatform!

Standardized platform urn where the MLModel is defined

origin
FabricType!

Fabric type that the mlmodel belongs to or where it was generated

description
String

Human readable description for mlmodel

globalTags
GlobalTags
Deprecated: No longer supported

Deprecated, use tags field instead. The standard tags for the ML Model

tags
GlobalTags

The standard tags for the ML Model

ownership
Ownership

Ownership metadata of the mlmodel

properties
MLModelProperties

Additional read only information about the ML Model

intendedUse
IntendedUse

Intended use of the mlmodel

factorPrompts
MLModelFactorPrompts

Factors metadata of the mlmodel

metrics
Metrics

Metrics metadata of the mlmodel

evaluationData
[BaseData!]

Evaluation Data of the mlmodel

trainingData
[BaseData!]

Training Data of the mlmodel

quantitativeAnalyses
QuantitativeAnalyses

Quantitative Analyses of the mlmodel

ethicalConsiderations
EthicalConsiderations

Ethical Considerations of the mlmodel

caveatsAndRecommendations
CaveatsAndRecommendations

Caveats and Recommendations of the mlmodel

institutionalMemory
InstitutionalMemory

References to internal resources related to the mlmodel

sourceCode
SourceCode

Source Code

status
Status

Status metadata of the mlmodel

cost
Cost

Cost Aspect of the mlmodel

deprecation
Deprecation

Deprecation

relationships
EntityRelationshipsResult

Edges extending from this entity

Arguments

NameDescription
input
RelationshipsInput!

MLModelFactorPrompts

Fields

NameDescription
relevantFactors
[MLModelFactors!]

What are foreseeable salient factors for which MLModel performance may vary, and how were these determined

evaluationFactors
[MLModelFactors!]

Which factors are being reported, and why were these chosen

MLModelFactors

Fields

NameDescription
groups
[String!]

Distinct categories with similar characteristics that are present in the evaluation data instances

instrumentation
[String!]

Instrumentation used for MLModel

environment
[String!]

Environment in which the MLModel is deployed

MLModelGroup

An ML Model Group Metadata Entity. Note that this entity is incubating

Implements

Fields

NameDescription
urn
String!

The primary key of the ML Model Group

type
EntityType!

A standard Entity Type

name
String!

The display name for the Entity

platform
DataPlatform!

Standardized platform urn where the MLModelGroup is defined

origin
FabricType!

Fabric type that the MLModelGroup belongs to or where it was generated

description
String

Human readable description for MLModelGroup

properties
MLModelGroupProperties

Additional read only properties about the ML Model Group

ownership
Ownership

Ownership metadata of the MLModelGroup

status
Status

Status metadata of the MLModelGroup

deprecation
Deprecation

Deprecation

relationships
EntityRelationshipsResult

Edges extending from this entity

Arguments

NameDescription
input
RelationshipsInput!

MLModelGroupProperties

Fields

NameDescription
description
String
createdAt
Long
version
VersionTag

MLModelProperties

Fields

NameDescription
description
String
date
Long
version
String
type
String
hyperParameters
HyperParameterMap
hyperParams
[MLHyperParam]
trainingMetrics
[MLMetric]
mlFeatures
[String!]
tags
[String!]
groups
[MLModelGroup]
customProperties
[StringMapEntry!]

MLPrimaryKey

An ML Primary Key Entity. Note that this entity is incubating

Implements

Fields

NameDescription
urn
String!

The primary key of the ML Primary Key

type
EntityType!

A standard Entity Type

name
String!

The display name

featureNamespace
String!

MLPrimaryKey featureNamespace

description
String

MLPrimaryKey description

dataType
MLFeatureDataType

MLPrimaryKey data type

properties
MLPrimaryKeyProperties

Additional read only properties of the ML Primary Key

primaryKeyProperties
MLPrimaryKeyProperties
Deprecated: No longer supported

Deprecated, use properties field instead. MLPrimaryKeyProperties

ownership
Ownership

Ownership metadata of the MLPrimaryKey

institutionalMemory
InstitutionalMemory

References to internal resources related to the MLPrimaryKey

status
Status

Status metadata of the MLPrimaryKey

deprecation
Deprecation

Deprecation

relationships
EntityRelationshipsResult

Edges extending from this entity

Arguments

NameDescription
input
RelationshipsInput!

MLPrimaryKeyProperties

Fields

NameDescription
description
String
dataType
MLFeatureDataType
version
VersionTag
sources
[Dataset]

Owner

An owner of a Metadata Entity

Fields

NameDescription
owner
OwnerType!

Owner object

type
OwnershipType!

The type of the ownership

source
OwnershipSource

Source information for the ownership

Ownership

Ownership information about a Metadata Entity

Fields

NameDescription
owners
[Owner!]

List of owners of the entity

lastModified
AuditStamp!

Audit stamp containing who last modified the record and when

OwnershipSource

Information about the source of Ownership metadata about a Metadata Entity

Fields

NameDescription
type
OwnershipSourceType!

The type of the source

url
String

An optional reference URL for the source

PartitionSpec

Information about the partition being profiled

Fields

NameDescription
partition
String!

The partition identifier

timePartition
TimeWindow

The optional time window partition information

PlatformPrivileges

The platform privileges that the currently authenticated user has

Fields

NameDescription
viewAnalytics
Boolean!

Whether the user should be able to view analytics

managePolicies
Boolean!

Whether the user should be able to manage policies

manageIdentities
Boolean!

Whether the user should be able to manage users & groups
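
The three privilege flags pair naturally with the enabled flags of AppConfig documented earlier in this reference. As a hedged sketch of how a UI might combine them, the function below shows a section only when the feature is enabled and the user holds the matching privilege. The visible_sections name and the "analytics", "policies" and "identities" keys are made up for illustration; the AND rule is an assumption about UI gating, not documented DataHub behaviour.

```python
from typing import Any, Dict


def visible_sections(app_config: Dict[str, Any], privileges: Dict[str, bool]) -> Dict[str, bool]:
    """Decide which UI sections to render for the authenticated user."""
    return {
        # Feature must be switched on AND the user must hold the privilege.
        "analytics": app_config["analyticsConfig"]["enabled"] and privileges["viewAnalytics"],
        "policies": app_config["policiesConfig"]["enabled"] and privileges["managePolicies"],
        "identities": app_config["identityManagementConfig"]["enabled"] and privileges["manageIdentities"],
    }
```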

PoliciesConfig

Configurations related to the Policies Feature

Fields

NameDescription
enabled
Boolean!

Whether the policies feature is enabled and should be displayed in the UI

platformPrivileges
[Privilege!]!

A list of platform privileges to display in the Policy Builder experience

resourcePrivileges
[ResourcePrivileges!]!

A list of resource privileges to display in the Policy Builder experience

Policy

A DataHub Platform Access Policy. Access Policies determine who can perform what actions against which resources on the platform

Fields

NameDescription
urn
String!

The primary key of the Policy

type
PolicyType!

The type of the Policy

name
String!

The name of the Policy

state
PolicyState!

The present state of the Policy

description
String

The description of the Policy

resources
ResourceFilter

The resources that the Policy privileges apply to

privileges
[String!]!

The privileges that the Policy grants

actors
ActorFilter!

The actors that the Policy grants privileges to

editable
Boolean!

Whether the Policy is editable or not; system policies, for example, are not editable
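
The actors field carries an ActorFilter, whose users and groups lists are described as disjunctive. One possible client-side reading of that filter is sketched below. This is an interpretation for illustration only and not DataHub's policy engine: the actor_matches name is hypothetical, and treating allGroups as "any user who belongs to at least one group" is an assumption.

```python
from typing import Any, Dict, Set


def actor_matches(actors: Dict[str, Any], user_urn: str, group_urns: Set[str],
                  is_resource_owner: bool = False) -> bool:
    """Return True if any one clause of the ActorFilter covers the user."""
    if actors.get("allUsers"):
        return True
    if user_urn in (actors.get("users") or []):
        return True
    # Assumption: allGroups covers every user that is in at least one group.
    if actors.get("allGroups") and group_urns:
        return True
    if group_urns & set(actors.get("groups") or []):
        return True
    # resourceOwners only makes sense when the caller knows the resource.
    return bool(actors.get("resourceOwners")) and is_resource_owner
```

Because the clauses are disjunctive, the order of the checks does not change the result; it only lets the cheapest checks run first.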

Privilege

An individual DataHub Access Privilege

Fields

NameDescription
type
String!

Standardized privilege type, serving as a unique identifier for a privilege, e.g. EDIT_ENTITY

displayName
String

The name to appear when displaying the privilege, e.g. Edit Entity

description
String

A description of the privilege to display

QuantitativeAnalyses

Fields

NameDescription
unitaryResults
ResultsType

Link to a dashboard with results showing how the model performed with respect to each factor

intersectionalResults
ResultsType

Link to a dashboard with results showing how the model performed with respect to the intersection of evaluated factors

ResourceFilter

The resources that a DataHub Access Policy applies to

Fields

NameDescription
type
String!

The type of the resource the policy should apply to. Not required because in the future we want to support filtering by type OR by domain

resources
[String!]

A list of specific resource urns to apply the filter to

allResources
Boolean!

Whether or not to apply the filter to all resources of the type
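
Read together, the three fields suggest a simple check: the resource type must match, and then either allResources is set or the urn appears in resources. A minimal sketch of that reading follows; the resource_matches name is hypothetical and the logic is an interpretation of the field descriptions rather than DataHub's own evaluation code.

```python
from typing import Any, Dict


def resource_matches(resource_filter: Dict[str, Any], resource_type: str, resource_urn: str) -> bool:
    """Return True if the filter covers the given resource."""
    if resource_filter.get("type") != resource_type:
        return False
    if resource_filter.get("allResources"):
        return True
    # resources is nullable, so an absent list matches nothing.
    return resource_urn in (resource_filter.get("resources") or [])
```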

ResourcePrivileges

A privilege associated with a particular resource type. A resource is most commonly a DataHub Metadata Entity

Fields

NameDescription
resourceType
String!

Resource type associated with the Access Privilege, e.g. dataset

resourceTypeDisplayName
String

The name to use for displaying the resourceType

entityType
EntityType

An optional entity type to use when performing search and navigation to the entity

privileges
[Privilege!]!

A list of privileges that are supported against this resource

Schema

Deprecated, use SchemaMetadata instead. Metadata about a Dataset schema

Fields

NameDescription
datasetUrn
String

Dataset this schema metadata is associated with

name
String!

Schema name

platformUrn
String!

Platform this schema metadata is associated with

version
Long!

The version of the GMS Schema metadata

cluster
String

The cluster this schema metadata is derived from

hash
String!

The SHA1 hash of the schema content

platformSchema
PlatformSchema

The native schema in the dataset's platform, schemaless if it was not provided

fields
[SchemaField!]!

Client-provided list of fields from the value schema

primaryKeys
[String!]

Client-provided list of fields that define the primary keys used to access a record

SchemaField

Information about an individual field in a Dataset schema

Fields

NameDescription
fieldPath
String!

Flattened name of the field computed from jsonPath field

jsonPath
String

Flattened name of a field in JSON Path notation

nullable
Boolean!

Indicates if this field is optional or nullable

description
String

Description of the field

type
SchemaFieldDataType!

Platform independent field type of the field

nativeDataType
String

The native type of the field in the dataset's platform as declared by the platform schema

recursive
Boolean!

Whether the field references its own type recursively

globalTags
GlobalTags
Deprecated: No longer supported

Deprecated, use tags field instead. Tags associated with the field

tags
GlobalTags

Tags associated with the field

glossaryTerms
GlossaryTerms

Glossary terms associated with the field

isPartOfKey
Boolean

Whether the field is part of a key schema
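
Since globalTags is deprecated in favour of tags, a client that must cope with responses containing either field might prefer tags and fall back to globalTags. The sketch below assumes a SchemaField decoded into a dict, with each TagAssociation exposing its tag and the tag exposing name as documented in this reference; the field_tag_names helper is hypothetical.

```python
from typing import Any, Dict, List


def field_tag_names(field: Dict[str, Any]) -> List[str]:
    """Return the tag names on a schema field, preferring the non-deprecated field."""
    # Both fields hold a GlobalTags object, whose own "tags" list is nullable.
    container = field.get("tags") or field.get("globalTags") or {}
    return [association["tag"]["name"] for association in container.get("tags") or []]
```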

SchemaFieldEntity

Standalone schema field entity. Differs from the SchemaField struct because it is not directly nested inside a schema field

Fields

NameDescription
urn
String!

Primary key of the schema field

fieldPath
String!

Field path identifying the field in its dataset

parent
String!

The primary key of the field's parent.

SchemaMetadata

Metadata about a Dataset schema

Implements

Fields

NameDescription
aspectVersion
Long

The logical version of the schema metadata, where zero represents the latest version with otherwise monotonic ordering starting at one

datasetUrn
String

Dataset this schema metadata is associated with

name
String!

Schema name

platformUrn
String!

Platform this schema metadata is associated with

version
Long!

The version of the GMS Schema metadata

cluster
String

The cluster this schema metadata is derived from

hash
String!

The SHA1 hash of the schema content

platformSchema
PlatformSchema

The native schema in the dataset's platform, schemaless if it was not provided

fields
[SchemaField!]!

Client-provided list of fields from the value schema

primaryKeys
[String!]

Client-provided list of fields that define the primary keys used to access a record

foreignKeys
[ForeignKeyConstraint]

Client-provided list of foreign key constraints

createdAt
Long

The time at which the schema metadata information was created
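
The aspectVersion convention (zero is the latest, every other version counts up from one) means a plain numeric sort puts the newest version first rather than last. A small sketch of a sort key that restores oldest-to-newest order, assuming versions arrive as Python ints; the chronological_key name is hypothetical.

```python
from typing import Tuple


def chronological_key(aspect_version: int) -> Tuple[int, int]:
    """Sort key that orders aspect versions oldest first, with 0 (latest) last."""
    # Version 0 always denotes the latest, so push it behind all numbered versions.
    if aspect_version == 0:
        return (1, 0)
    return (0, aspect_version)


def oldest_to_newest(versions):
    """Return the given aspect versions in chronological order."""
    return sorted(versions, key=chronological_key)
```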

SearchInsight

Insights about why a search result was returned or ranked in the way that it was

Fields

NameDescription
text
String!

The insight to display

icon
String

An optional emoji to display in front of the text

SearchResult

An individual search result hit

Fields

NameDescription
entity
Entity!

The resolved DataHub Metadata Entity matching the search query

insights
[SearchInsight!]

Insights about why the search result was matched

matchedFields
[MatchedField!]!

Matched field hint

SearchResults

Results returned by issuing a search query

Fields

NameDescription
start
Int!

The offset of the result set

count
Int!

The number of entities included in the result set

total
Int!

The total number of search results matching the query and filters

searchResults
[SearchResult!]!

The search result entities

facets
[FacetMetadata!]

Candidate facet aggregations used for search filtering

SourceCode

Fields

NameDescription
sourceCode
[SourceCodeUrl!]

Source Code along with types

SourceCodeUrl

Fields

NameDescription
type
SourceCodeUrlType!

Source Code Url Types

sourceCodeUrl
String!

Source Code Url

Status

The status of a particular Metadata Entity

Fields

NameDescription
removed
Boolean!

Whether the entity is removed or not

StringBox

Fields

NameDescription
stringValue
String!

StringMapEntry

An entry in a string-to-string map, represented as a tuple

Fields

NameDescription
key
String!

The key of the map entry

value
String

The value of the map entry
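
Fields such as customProperties return a list of these entries rather than a map. A minimal sketch of folding the list back into a dictionary, assuming entries decoded as dicts; the entries_to_dict name is hypothetical, and letting a later duplicate key overwrite an earlier one is a choice made here, not something the schema specifies.

```python
from typing import Dict, List, Optional


def entries_to_dict(entries: Optional[List[Dict[str, Optional[str]]]]) -> Dict[str, Optional[str]]:
    """Convert a list of StringMapEntry objects into a plain dict."""
    # value is nullable in the schema, so missing values become None.
    return {entry["key"]: entry.get("value") for entry in entries or []}
```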

SubTypes

Fields

NameDescription
typeNames
[String!]

The sub-types that this entity implements, e.g. Datasets that are views will implement the "view" subtype

TableSchema

Information about a raw Table Schema

Fields

NameDescription
schema
String!

Raw table schema

Tag

A Tag Entity, which can be associated with other Metadata Entities and subresources

Implements

Fields

NameDescription
urn
String!

The primary key of the Tag

type
EntityType!

A standard Entity Type

name
String!

The display name of the tag

editableProperties
EditableTagProperties

Additional read write properties about the Tag

ownership
Ownership

Ownership metadata of the tag

relationships
EntityRelationshipsResult

Edges extending from this entity

Arguments

NameDescription
input
RelationshipsInput!
description
String
Deprecated: No longer supported

Deprecated, use editableProperties field instead. Description of the tag

TagAssociation

An edge between a Metadata Entity and a Tag. Modeled as a struct to permit additional attributes. TODO: Consider whether this query should be serviced by the relationships field

Fields

NameDescription
tag
Tag!

The tag itself

TimeWindow

A time window with a finite start and end time

Fields

NameDescription
startTimeMillis
Long!

The start time of the time window

durationMillis
Long!

The duration of the time window, in milliseconds
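
Going by the field names, the end of a window can be derived as start plus duration. The sketch below makes two assumptions that this reference does not confirm: that startTimeMillis counts milliseconds from the Unix epoch in UTC, and that durationMillis is a length rather than an absolute timestamp. The window_bounds name is hypothetical.

```python
from datetime import datetime, timedelta, timezone
from typing import Dict, Tuple

# Assumption: timestamps are milliseconds since the Unix epoch, in UTC.
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def window_bounds(window: Dict[str, int]) -> Tuple[datetime, datetime]:
    """Return (start, end) datetimes for a decoded TimeWindow."""
    start = EPOCH + timedelta(milliseconds=window["startTimeMillis"])
    # Assumption: durationMillis is a span added to the start.
    end = start + timedelta(milliseconds=window["durationMillis"])
    return start, end
```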

UpstreamEntityRelationships

Deprecated, use relationships query instead

Fields

NameDescription
entities
[EntityRelationshipLegacy]

UsageAggregation

An aggregation of Dataset usage statistics

Fields

NameDescription
bucket
Long

The time window start time

duration
WindowDuration

The time window span

resource
String

The resource urn associated with the usage information, e.g. a Dataset urn

metrics
UsageAggregationMetrics

The rolled up usage metrics

UsageAggregationMetrics

Rolled up metrics about Dataset usage over time

Fields

NameDescription
uniqueUserCount
Int

The unique number of users who have queried the dataset within the time range

users
[UserUsageCounts]

Usage statistics within the time range by user

totalSqlQueries
Int

The total number of queries issued against the dataset within the time range

topSqlQueries
[String]

A set of common queries issued against the dataset within the time range

fields
[FieldUsageCounts]

Per field usage statistics within the time range

UsageQueryResult

The result of a Dataset usage query

Fields

NameDescription
buckets
[UsageAggregation]

A set of relevant time windows for use in displaying usage statistics

aggregations
UsageQueryResultAggregations

A set of rolled up aggregations about the Dataset usage

UsageQueryResultAggregations

A set of rolled up aggregations about the Dataset usage

Fields

NameDescription
uniqueUserCount
Int

The count of unique Dataset users within the queried time range

users
[UserUsageCounts]

The specific per user usage counts within the queried time range

fields
[FieldUsageCounts]

The specific per field usage counts within the queried time range

totalSqlQueries
Int

The total number of queries executed within the queried time range. Note that this field will likely be deprecated in favor of a totalQueries field

UserUsageCounts

Information about individual user usage of a Dataset

Fields

NameDescription
user
CorpUser

The user of the Dataset

count
Int

The number of queries issued by the user

userEmail
String

The extracted user email. Note that this field will soon be deprecated and merged with user
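
As an illustration of consuming these counts, the sketch below ranks the users list of a decoded UsageQueryResultAggregations (or UsageAggregationMetrics) by query count. The list, its elements and the count field are all nullable in the schema, so the code skips anything missing. The top_users name and the default limit are hypothetical, and userEmail is used as the label only because it is a plain string; it is flagged above as soon to be deprecated.

```python
from typing import Any, Dict, List, Optional, Tuple


def top_users(aggregations: Dict[str, Any], limit: int = 3) -> List[Tuple[Optional[str], int]]:
    """Return up to `limit` (userEmail, count) pairs, heaviest users first."""
    rows = [
        row for row in (aggregations.get("users") or [])
        # Skip null list elements and rows without a count.
        if row is not None and row.get("count") is not None
    ]
    rows.sort(key=lambda row: row["count"], reverse=True)
    return [(row.get("userEmail"), row["count"]) for row in rows[:limit]]
```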

VersionTag

The technical version associated with a given Metadata Entity

Fields

NameDescription
versionTag
String

ViewProperties

Properties about a Dataset of type view

Fields

NameDescription
materialized
Boolean!

Whether the view is materialized or not

logic
String!

The logic associated with the view, most commonly a SQL statement

language
String!

The language in which the view logic is written, for example SQL