github vmware/versatile-data-kit v0.13
Versatile Data Kit 0.13

latest releases: v1.4, v1.3, v1.2...
17 months ago

Major features include:

New plugin: vdk-gdp-execution-id

An installed Generative Data Pack plugin automatically expands the data sent for ingestion.

This GDP plugin detects the execution ID of a Data Job running, and decorates your data product with it. So that,
it is now possible to correlate a data record with a particular ingestion Data Job execution ID.

For more information see the plugin documentation

vdk-dag: pass arguments to jobs in a DAG

Now each job in a DAG can be passed arguments :

{
"job_name": "name-of-job",
"team_name": "team-of-job",
"fail_meta_job_on_error": false,
"arguments": <ARGUMENTS IN DICTIONARY FORMAT HERE>,
"depends_on": ["name-of-job1", "name-of-job2"]
}

vdk-notebook: VDK job input in vdk cells

Users will be able to develop jobs entirely in a Notebook file with all features of VDK available out of the box
After installation of vdk-notebook users can now will have access to job_input interface to execute templates, ingest data and all else.

image

vdk-notebook: vdk and non-vdk cells

To enable separation of product and development code vdk-notebook integration provides a way for users to set which cells are deployable and part of their production code and which are not.

image

quickstart-vdk now includes the Operations UI

When installing quickstart-vdk VDK Server is available for local testing and now includes UI:

pip install quickstart-vdk
vdk server --install 

For more information see here

Versatile Data Kit Frontend npm libraries release

The Versatile Data Kit Frontend provides 2 npm (angular) libraries which can be used to build integrate VDK UI with your own screens:

  • @versatiledatakit/data-pipelines
    Versatile Data Kit Data Pipelines library provides UI screens that helps to manage data jobs via Versatile Data Kit Control Service
  • @versatiledatakit/shared
    Versatile Data Kit Shared library enables reusability of shared features like: NgRx Redux, Error Handlers, Utils, Generic Components, etc.

What's Changed

  • control service: Add supported python version configuration by @doks5 in #1761
  • control-service: fix python api release by @murphp15 in #1946
  • control service: Dynamically set job base image in builder by @doks5 in #1864
  • control-service: Add python_version to Control Service API by @doks5 in #1806
  • control-service: Add python_version to Execution API by @mivanov1988 in #1878
  • control-service: Add python_version to GraphQL API by @mivanov1988 in #1909
  • control-service: Add support for Python 3.11 by @mivanov1988 in #1861
  • control-service: Dynamically set vdk image in JobImageDeployer by @doks5 in #1883
  • control-service: Expose supported python versions in helm by @doks5 in #1935
  • control-service: Remove support for very old k8s apiVersion by @murphp15 in #1860
  • control-service: add the frontend to helm by @murphp15 in #1885
  • control-service: enable usage of aws temporary credentials by @mrMoZ1 in #1787
  • control-service: expose supported python versions by @dakodakov in #1841
  • control-service: fix failing image publisher by @murphp15 in #1810
  • control-service: force job builder deploy by @mrMoZ1 in #1823
  • control-service: new helm release by @murphp15 in #1910
  • control-service: revert job builder python version by @mrMoZ1 in #1840
  • control-service: update helm charts for service account credentials by @mrMoZ1 in #1800
  • control-service: update job builders for aws temporary credentials by @mrMoZ1 in #1799
  • documentation: VDK components explained by @ivakoleva in #1865
  • frontend: Align code formatting in frontend projects by @gorankokin in #1863
  • frontend: Configurable OAuth by @ivakoleva in #1913
  • frontend: Update docs with build/test configuration by @DeltaMichael in #1928
  • frontend: add build.sh by @tozka in #1807
  • frontend: fix npm lint warnings by @DeltaMichael in #1808
  • frontend: increase the amount of resources for build in cicd by @murphp15 in #1931
  • frontend: prepare for official release shared and dp libs by @gorankokin in #1795
  • frontend: publish docker image for ui by @DeltaMichael in #1872
  • frontend: remove unused config in helm chart for frontend dns by @murphp15 in #1932
  • frontend: Stabilization for e2e tests by @gorankokin in #1876
  • frontend: Auth configurations organized by @ivakoleva in #1957
  • frontend: change history link in data job by @gorankokin in #1884
  • specs: VEP-1739 Update status and reorganise document by @doks5 in #1857
  • specs: VEP-1739 updated API section by @mivanov1988 in #1882
  • specs: update Multiple Python versions VEP summary by @tozka in #1792
  • vdk-vep: update vep status by @dakodakov in #1951
  • vdk-cicd: apply limit ranges for storage by @tozka in #1815
  • vdk-cicd: set ephemeral storage request/limits by @tozka in #1813
  • vdk-control-cli: fix circular import dependecy by @tozka in #1820
  • vdk-control-cli: refactor output printing with printer class by @tozka in #1819
  • vdk-control-cli: use assert_click_status by @tozka in #1817
  • vdk-control-cli: use common output printer by @tozka in #1852
  • vdk-control-cli: vdk list -mmm to return executions by @tozka in #1818
  • vdk-control-service: publish python client library by @dakodakov in #1934
  • vdk-dags: improve DAGs user-facing documentation by @yonitoo in #1892
  • vdk-gdp-execution-id: a Generative Data Pack expanding with execution ID by @ivakoleva in #1877
  • vdk-gdp-execution-id: import fix by @ivakoleva in #1961
  • vdk-github-workflows: ubuntu latest update by @ivakoleva in #1943
  • vdk-jupyter: UI test enhancements by @duyguHsnHsn in #1783
  • vdk-jupyter: add UI vdk cell marks by @duyguHsnHsn in #1891
  • vdk-jupyter: job run messages by @duyguHsnHsn in #1908
  • vdk-jupyter: remove react-test-renderer package from package.json by @duyguHsnHsn in #1881
  • vdk-lineage: support for latest version sqllineage library by @tozka in #1816
  • vdk-meta-jobs: Meta Jobs DAG validation by @yonitoo in #1785
  • vdk-meta-jobs: add DAG with args example by @yonitoo in #1859
  • vdk-meta-jobs: add some configurable variable references in the VEP by @yonitoo in #1794
  • vdk-meta-jobs: exec job with arguments by @yonitoo in #1839
  • vdk-meta-jobs: fix DAG image in example by @yonitoo in #1920
  • vdk-meta-jobs: improve DAGs code documentation by @yonitoo in #1873
  • vdk-metajobs: Deprecate plugin by @gageorgiev in #1930
  • vdk-notebook: add hook for saving error information into json file by @duyguHsnHsn in #1842
  • vdk-plugins: add connection hook activity diagram by @tozka in #1786
  • vdk-plugins: test only oldest and newest supported python version by @tozka in #1811
  • vdk-server: quickstart vdk now includes the UI by @murphp15 in #1912

New Contributors

Full Changelog: v0.12...v0.13

Don't miss a new versatile-data-kit release

NewReleases is sending notifications on new releases.