github datahub-project/datahub v0.10.4

latest releases: v0.14.1, v0.14.1rc2, v0.14.1rc1...
17 months ago

Release Highlights

User Experience

  • You can now create and assign Custom Ownership types within DataHub; plus, we now display the owner type on an Entity Page
    ownershiptype-displayed

  • Various bug fixes to Column Level Lineage visualization

Metadata ingestion

  • You can now define column-level lineage (aka fine-grained lineage) via our file-based lineage source
  • Looker: Ingest Looks that are not part of a Dashboard
  • Glue: Error reporting now includes lineage failures
  • BigQuery: Now support deduplicating LogEntries based on insertId, timestamp, and logName

Docs

  • CSV Enricher: improvements to sample CSV and recipe
  • Guide for changing default DataHub credentials
  • Updated guide to apply time-based filters on Lineage

What's Changed

  • ci(ingest/kafka): improve kafka integration test reliability by @hsheth2 in #8085
  • fix(ingest/bigquery): Deduplicate LogEntries based on insertId, timestamp, logName by @asikowitz in #8132
  • feat(ingest/glue): report glue job lineage failures, update doc by @mayurinehate in #8126
  • feat(lineage source): add fine grained lineage support by @anshbansal in #7904
  • docs(glue): fix broken link by @mayurinehate in #8135
  • feat(custom ownership): Adds Custom ownership types as a top level entity by @pedro93 in #8045
  • Update updating-datahub.md for v0.10.3 release by @iprentic in #8139
  • feat: add dbt-athena adapter support for column types mapping by @svdimchenko in #8116
  • docs(csv-enricher): add example csv file & recipe by @gabe-lyons in #8141
  • chore(ci): update base requirements file by @anshbansal in #8144
  • fix(ingest/s3): Path spec aware folder traversal by @treff7es in #8095
  • fix(ui) Fix selecting columns in Lineage tab for CLL by @chriscollins3456 in #8129
  • feat(search): adding support for _entityType filter in the application layer + frontend by @gabe-lyons in #8102
  • docs(ingest/nifi): fix broken links by @mayurinehate in #8143
  • fix(scroll): fix scroll cache key for hazelcast by @RyanHolstien in #8149
  • chore(json): fix json vulnerability by @RyanHolstien in #8150
  • fix(ingest/json-schema): handle property inheritance in unions by @hsheth2 in #8121
  • chore(log): fix log as error instead of info by @anshbansal in #8146
  • fix(lineagecounts) Include entities that are filtered out due to sibling logic in the filtered count of lineage counts by @iprentic in #8152
  • fix(stats): display consistent query count on stats tab by @joshuaeilers in #8151
  • fix(ingest): remove original_table_name logic in sql source by @hsheth2 in #8130
  • feat(ingest): add more fail-safes to stateful ingestion by @hsheth2 in #8111
  • feat(ingest/snowflake): support for more operation types by @mayurinehate in #8158
  • fix(ui) Show Entities first on Domain pages again by @chriscollins3456 in #8159
  • fix(ingest/nifi): allow nifi site url with context path by @mayurinehate in #8156
  • feat(ingest): Create Browse Paths V2 under flag by @asikowitz in #8120
  • fix(ingestion/looker): set project-name for imported_projects views by @mohdsiddique in #8086
  • fix(docs): Fix ownership type typos by @pedro93 in #8155
  • docs(townhall) feb and march town hall agenda and recording by @maggiehays in #7676
  • feat(ingest/unity): Add qualified name to dataset properties by @asikowitz in #8164
  • feat(ingest/bigquery_v2): enable platform instance using project id by @Khurzak in #8142
  • feat(ingest/snowflake): Deprecate legacy lineage and optimize query history joins by @asikowitz in #8176
  • fix(ingest/kafka): Fixing error printing in Kafka properties get call by @treff7es in #8145
  • fix(ingest/snowflake): set use_quoted_name to profile lowercase tables by @mayurinehate in #8168
  • feat(classification): support for regex based custom infotypes by @mayurinehate in #8177
  • fix(restli): update base client retry logic by @david-leifker in #8172
  • fix(ingest): Fix modeldocgen; bump feast to relax pyarrow constraint by @asikowitz in #8178
  • refactor(ci): move from sleep to kafka lag based testing by @shirshanka in #8094
  • docs(lineage): document timestamp filtering in lineage feature by @iprentic in #8174
  • build(ingest/feast): Pin feast to minor version by @asikowitz in #8180
  • feat(ingest/snowflake): Okta OAuth support; update docs by @asikowitz in #8157
  • feat(ingest/presto-on-hive): add support for extra properties and merge property capabilities by @treff7es in #8147
  • docs(managed datahub): release notes for v0.2.8 by @anshbansal in #8185
  • fix(nocode): fix DeleteLegacyGraphRelationshipsStep for Elasticsearch by @david-leifker in #8181
  • feat(docker):Add the jattach tool to the docker container(#7538) by @yangjiandan in #8040
  • refactor: Return original exception as caused by by @Jorricks in #7722
  • docs(ingest) Add MetadataChangeProposalWrapper import to example code by @iprentic in #8175
  • fix(ingest/kafka): Better error handling around topic and topic description extraction by @asikowitz in #8183
  • fix(vulnerabilities)/vulnerabilities_fixes_datahub (#8075) by @david-leifker in #8189
  • fix: add dedicated guide on changing default credentials by @yoonhyejin in #8153
  • feat(classification): configurable minimum values threshold by @mayurinehate in #8186
  • fix(ingestion/looker): ingest looks not part of dashboard by @mohdsiddique in #8140
  • fix(ingest/profiling): only apply monkeypatches once when profiling by @hsheth2 in #8160
  • docs(tableau): site config is required for tableau cloud / tableau online by @mohdsiddique in #8041
  • fix(ingest/bigquery): Swap log order to avoid confusion by @asikowitz in #8197
  • fix(ingest/redshift): Adding env parameter where it was missing for urn generation by @treff7es in #8199
  • revert(ingest/bigquery): Do not emit DataPlatformInstance; remove references to platform_instance by @asikowitz in #8196
  • docs(managed datahub): add docs link to v0.2.8 by @anshbansal in #8202
  • Add combined health check endpoint which can check multiple components by @iprentic in #8191
  • chore(cp-schema-registry): bump minor version by @david-leifker in #8192
  • feat(ingest): Produce browse paths v2 on demand and with platform instance by @asikowitz in #8173

New Contributors

Full Changelog: v0.10.3...v0.10.4

Don't miss a new datahub release

NewReleases is sending notifications on new releases.