Skip to content

Releases: datahub-project/datahub

DataHub v0.4.3

20 Aug 16:34
236d5e6
Compare
Choose a tag to compare

Added

  • #1782 improve security of k8s / helm charts
  • #1791 Add description of dataset to the search index
  • #1803 Add an example crawler for MS SQL
  • #1811 Sync our internal backend code externally to HEAD (we're caught up now!)
    • Added ESBulkWriterDAO to bulk write to ElasticSearch. Planned usage is for integration tests.
    • Add Strongly Consistent Secondary Index (SCSI) Implementation for MySQL.
    • Start adding code to generate aspect-entity specific metadata events, rather than our current single event approach.
    • Add support in the GMS to ask for no aspects on entities by setting the aspectNames param to null (omitting the param is still considered as asking for all aspects). Useful if checking the existence of an entity to avoid a large response (i.e. performing a search to just get URNs back, and nothing else).

Changed

  • #1777 Add docker files for development

Fixed

  • #1748 Remove unused model
  • #1788 Remove unused model
  • #1789 Remove unused model

Fixed

  • #1808 Clear dataset description from search index when cleared in source

DataHub v0.4.2

30 Jul 21:30
dfb74c6
Compare
Choose a tag to compare

Added

Changed

Removed

  • #1741 build(frontend): Drop unnecessary DB-related dependencies @mars-lan

Fixed

DataHub v0.4.1

26 Jun 04:11
60b7c63
Compare
Choose a tag to compare

Added

Changed

Fixed

DataHub v0.4.0

28 May 15:40
a614c56
Compare
Choose a tag to compare

Added

Changed

Deprecated

  • #1618 refactor: remove unused legacy model & move DatasetSearchConfig @mars-lan

Removed

Fixed

DataHub v0.3.1

19 Mar 19:51
ff4413b
Compare
Choose a tag to compare

Added

Changed

Removed

Fixed

DataHub v0.3.0

12 Feb 20:23
Compare
Choose a tag to compare
  • Onboarded people as a top level entity
  • Enabled people search
  • Created Docker image for running ingestion pipeline
  • Misc bug fixes
  • Documentation updates
  • Code cleanup

DataHub v0.2.0-alpha

05 Dec 20:33
Compare
Choose a tag to compare
DataHub v0.2.0-alpha Pre-release
Pre-release
  • Added Neo4j graph indexing/querying pipeline
  • Dataset downstream lineage is now powered by graph
  • Added MySQL ETL example
  • Updated docker-compose settings for low resource environments
  • Misc bug fixes

DataHub v0.1.1-alpha

27 Sep 18:54
Compare
Choose a tag to compare
DataHub v0.1.1-alpha Pre-release
Pre-release
  • Added Kafka crawler sample
  • Added support for surfacing downstream dataset lineage using search. This is a stop-gap solution until neo4j support is added

Data Hub v0.1.0-alpha

21 Sep 23:41
Compare
Choose a tag to compare
Data Hub v0.1.0-alpha Pre-release
Pre-release

First official release of Data Hub:

  • Leveraging GMA architecture
  • Backend: GMS implementation - support for dataset & user entities
  • Frontend: Data Hub Web Application
  • Pub-sub: Kafka
  • Stream processing: MXE consumer jobs using Kafka Streams
  • Generic modeling layer with CRUD on MySQL
  • Search support using Elasticsearch
  • Supported metadata sources: LDAP and Hive