Release Highlights
User Experience
-
You can now create and assign Custom Ownership types within DataHub; plus, we now display the owner type on an Entity Page
-
Various bug fixes to Column Level Lineage visualization
Metadata ingestion
- You can now define column-level lineage (aka fine-grained lineage) via our file-based lineage source
- Looker: Ingest Looks that are not part of a Dashboard
- Glue: Error reporting now includes lineage failures
- BigQuery: Now support deduplicating LogEntries based on insertId, timestamp, and logName
Docs
- CSV Enricher: improvements to sample CSV and recipe
- Guide for changing default DataHub credentials
- Updated guide to apply time-based filters on Lineage
What's Changed
- ci(ingest/kafka): improve kafka integration test reliability by @hsheth2 in #8085
- fix(ingest/bigquery): Deduplicate LogEntries based on insertId, timestamp, logName by @asikowitz in #8132
- feat(ingest/glue): report glue job lineage failures, update doc by @mayurinehate in #8126
- feat(lineage source): add fine grained lineage support by @anshbansal in #7904
- docs(glue): fix broken link by @mayurinehate in #8135
- feat(custom ownership): Adds Custom ownership types as a top level entity by @pedro93 in #8045
- Update updating-datahub.md for v0.10.3 release by @iprentic in #8139
- feat: add dbt-athena adapter support for column types mapping by @svdimchenko in #8116
- docs(csv-enricher): add example csv file & recipe by @gabe-lyons in #8141
- chore(ci): update base requirements file by @anshbansal in #8144
- fix(ingest/s3): Path spec aware folder traversal by @treff7es in #8095
- fix(ui) Fix selecting columns in Lineage tab for CLL by @chriscollins3456 in #8129
- feat(search): adding support for
_entityType
filter in the application layer + frontend by @gabe-lyons in #8102 - docs(ingest/nifi): fix broken links by @mayurinehate in #8143
- fix(scroll): fix scroll cache key for hazelcast by @RyanHolstien in #8149
- chore(json): fix json vulnerability by @RyanHolstien in #8150
- fix(ingest/json-schema): handle property inheritance in unions by @hsheth2 in #8121
- chore(log): fix log as error instead of info by @anshbansal in #8146
- fix(lineagecounts) Include entities that are filtered out due to sibling logic in the filtered count of lineage counts by @iprentic in #8152
- fix(stats): display consistent query count on stats tab by @joshuaeilers in #8151
- fix(ingest): remove
original_table_name
logic in sql source by @hsheth2 in #8130 - feat(ingest): add more fail-safes to stateful ingestion by @hsheth2 in #8111
- feat(ingest/snowflake): support for more operation types by @mayurinehate in #8158
- fix(ui) Show Entities first on Domain pages again by @chriscollins3456 in #8159
- fix(ingest/nifi): allow nifi site url with context path by @mayurinehate in #8156
- feat(ingest): Create Browse Paths V2 under flag by @asikowitz in #8120
- fix(ingestion/looker): set project-name for imported_projects views by @mohdsiddique in #8086
- fix(docs): Fix ownership type typos by @pedro93 in #8155
- docs(townhall) feb and march town hall agenda and recording by @maggiehays in #7676
- feat(ingest/unity): Add qualified name to dataset properties by @asikowitz in #8164
- feat(ingest/bigquery_v2): enable platform instance using project id by @Khurzak in #8142
- feat(ingest/snowflake): Deprecate legacy lineage and optimize query history joins by @asikowitz in #8176
- fix(ingest/kafka): Fixing error printing in Kafka properties get call by @treff7es in #8145
- fix(ingest/snowflake): set use_quoted_name to profile lowercase tables by @mayurinehate in #8168
- feat(classification): support for regex based custom infotypes by @mayurinehate in #8177
- fix(restli): update base client retry logic by @david-leifker in #8172
- fix(ingest): Fix modeldocgen; bump feast to relax pyarrow constraint by @asikowitz in #8178
- refactor(ci): move from sleep to kafka lag based testing by @shirshanka in #8094
- docs(lineage): document timestamp filtering in lineage feature by @iprentic in #8174
- build(ingest/feast): Pin feast to minor version by @asikowitz in #8180
- feat(ingest/snowflake): Okta OAuth support; update docs by @asikowitz in #8157
- feat(ingest/presto-on-hive): add support for extra properties and merge property capabilities by @treff7es in #8147
- docs(managed datahub): release notes for v0.2.8 by @anshbansal in #8185
- fix(nocode): fix DeleteLegacyGraphRelationshipsStep for Elasticsearch by @david-leifker in #8181
- feat(docker):Add the jattach tool to the docker container(#7538) by @yangjiandan in #8040
- refactor: Return original exception as caused by by @Jorricks in #7722
- docs(ingest) Add MetadataChangeProposalWrapper import to example code by @iprentic in #8175
- fix(ingest/kafka): Better error handling around topic and topic description extraction by @asikowitz in #8183
- fix(vulnerabilities)/vulnerabilities_fixes_datahub (#8075) by @david-leifker in #8189
- fix: add dedicated guide on changing default credentials by @yoonhyejin in #8153
- feat(classification): configurable minimum values threshold by @mayurinehate in #8186
- fix(ingestion/looker): ingest looks not part of dashboard by @mohdsiddique in #8140
- fix(ingest/profiling): only apply monkeypatches once when profiling by @hsheth2 in #8160
- docs(tableau): site config is required for tableau cloud / tableau online by @mohdsiddique in #8041
- fix(ingest/bigquery): Swap log order to avoid confusion by @asikowitz in #8197
- fix(ingest/redshift): Adding env parameter where it was missing for urn generation by @treff7es in #8199
- revert(ingest/bigquery): Do not emit DataPlatformInstance; remove references to platform_instance by @asikowitz in #8196
- docs(managed datahub): add docs link to v0.2.8 by @anshbansal in #8202
- Add combined health check endpoint which can check multiple components by @iprentic in #8191
- chore(cp-schema-registry): bump minor version by @david-leifker in #8192
- feat(ingest): Produce browse paths v2 on demand and with platform instance by @asikowitz in #8173
New Contributors
- @svdimchenko made their first contribution in #8116
- @Khurzak made their first contribution in #8142
- @Jorricks made their first contribution in #7722
Full Changelog: v0.10.3...v0.10.4