Skip to content

Releases: datahub-project/datahub

DataHub v0.8.13

16 Sep 00:48
f665ffc
Compare
Choose a tag to compare

Release Highlights

  • Support for aggregated statistics wrt the timeseries aspect. Moved usage stats functionality to use the new framework.
  • Auto-ingest common data platforms on GMS boot! No more generic logos.
  • Fixes re-ingestion of modified policies at startup
  • Full changelog below

Breaking Changes

  • Usage stats endpoint now uses the time-series aspect index in Elastic, meaning that statistics ingested previously will be lost. Please re-run usage ingestion (e.g. bigquery-usage / snowflake-usage) etc. to backfill your usage statistics history.

Changelog

DataHub v0.8.12

09 Sep 05:16
940cbb1
Compare
Choose a tag to compare

Release Highlights

  • RBAC Phase 1: Added abilities to control access through policies in the UI and backend
  • Dataset page refresh!!! + improved home page, search and browse screens
  • Added the ability to monitor DataHub through Prometheus and provided example Grafana dashboards
  • GraphQL API browser hosted on /api/graphql endpoint.
  • Support for Business Glossary ingestion through yml file
  • Support for Azure AD ingestion source

Notable Changes

  • Fixed unicode rendering bug introduced in v0.8.11
  • Added the ability to search by properties in the customProperties bag: supports case-insensitive matches of the form ‘key=value’
    • For instance, query “encoding=utf-8” will return entities with “encoding”: “utf-8” in the property bag
  • Full changelog below

Changelog

DataHub v0.8.11

25 Aug 05:35
d1b5792
Compare
Choose a tag to compare

Release Highlights

  • Business Glossary: Phase 1 is feature complete. Full support for UI viewing and API-based edits, no support for UI edits.
  • Users and Groups: Just-in-time User and Group provisioning on login (SSO/OIDC), basic Group pages with membership information
  • New Integrations: Redash

Notable Changes

  • GraphQL and REST API-s are now both served by datahub-metadata-service (new name for gms). Frontend is now a proxy. Container names are not changed.
  • Kafka source will no longer tokenize on . in the topic name. This will result in a flat browse experience in UI.
  • Airflow lineage emission will only populate specific properties of Tasks and DAGs to limit bloat and avoid leaking environment variables.
  • Schema history feature turned off in UI based on feedback from the community. Will re-emerge in a future release!
  • Mongodb collections with extremely wide schemas will have schema fields sampled to keep UI responsive.
  • Full changelog below.

ChangeLog

DataHub v0.8.10

13 Aug 18:16
39a0081
Compare
Choose a tag to compare

Release Highlights

Bugfix release for 0.8.9

  • [#3096] Fix dependency injection issue introduced by this PR
  • Increase REST emitter timeout to 30 seconds by default

ChangeLog

  • #3095 @shirshanka fix(ingest): increasing default ingestion REST timeout to 30 seconds
  • #3096 @dexter-mh-lee fix(upgrade): Fix MAE consumer and upgrade's dependency issue
  • #3092 @jensenity fix(postgres): fix postgres setup to handle existing database

DataHub v0.8.9

13 Aug 05:01
c13d83b
Compare
Choose a tag to compare
DataHub v0.8.9 Pre-release
Pre-release

Release Highlights

  • Support for nested structs, union types and key-value schemas in Kafka
  • Support for JDBC Connector based sources in Kafka Connect
  • Support for Okta as a source for User and Group metadata
  • Support for using AWS Glue schema registry

Breaking Changes

  • [#3079] : Introduces a change to fieldPath encoding in schema metadata. Note: This is a backwards compatible change for the storage layer. Old fieldPaths will still be rendered correctly. At read time, fieldPaths in the new encoding will be translated to the old encoding to discover tags written before this change. Tags and Descriptions applied to fields earlier (which were being stored in the old format) will be migrated on applying new tags or editing descriptions.

Important Bug Fixes

  • [#3070] Charts and Dataset lineage was broken in release 0.8.8. This has been fixed via [gma-125]

ChangeLog

DataHub v0.8.8

04 Aug 20:09
3d06116
Compare
Choose a tag to compare

Release Notes

  • Bugfix release for release 0.8.7
  • Fixes issues with Airflow emitters, Glue default dependencies and handling system_metadata column correctly
  • Adds feature to handle redirects for non-logged in users

Changelog

  • #3032 @kevinhu fix(ingest): glue import type stubs only for testing
  • #3030 @gabe-lyons fix(gms): handling partial system metadata in gms
  • #3026 @jjoyce0510 feat(frontend): encode Original URI in Authentication Redirect
  • #3029 @gabe-lyons fix(restore-indices): add system metadata restoration to restore-indices

DataHub v0.8.7

03 Aug 19:56
3701931
Compare
Choose a tag to compare
DataHub v0.8.7 Pre-release
Pre-release

Release Stability

  • There are a few bugs reported on this release that are fixed in 0.8.8. Users are highly recommended to skip past this release!

Release Highlights

  • Dataset Profiling and support for time-series metadata
  • UI for ML Models, Features; support for AWS SageMaker and Feast
  • Cli: support for rollback operations after ingestion
  • Integration fixes for Looker, dbt, and many more.
  • Demos for all these features are available in our July Townhall video

ChangeLog

Read more

DataHub v0.8.6

12 Jul 17:47
973c08d
Compare
Choose a tag to compare

Release Highlights

  • Fix issue when using Elasticsearch as graph database in certain configurations
  • Fix caching issues in React UI
  • Efficiency improvement for schema aspect storage
  • Improvements and fixes to various ingestion sources

Changelog

DataHub v0.8.5

01 Jul 06:26
ad08113
Compare
Choose a tag to compare

Release Highlights

  • Various stability fixes for v0.8.4
  • Address docker image vulnerabilities
  • New integrations: AWS SageMaker
  • Support for restoring indexes with how-to
  • Ingestion improvements: mongodb, looker, hive, snowflake

Changelog

DataHub v0.8.4

28 Jun 06:22
942196e
Compare
Choose a tag to compare

Release Highlights

  • Dataset Popularity, Recent Queries powered by Usage logs (support for Snowflake, BigQuery)
  • Markdown descriptions and editing
  • New Integrations : Glue Jobs, Feast
  • Versioned API for metadata GETs
  • No neo4j requirement, Elastic for Graph
  • Docker image hardening
  • Improved logging
  • GCP Deployment Guide

Changelog