Added
- Implement dataset symlink feature #2066 @pawel-big-lebowski
- Store column lineage facets in separate table #2096 @mzareba382 @pawel-big-lebowski
- Add a lineage graph endpoint for column lineage #2124 @pawel-big-lebowski
- Enrich returned dataset resource with column lineage information #2113 @pawel-big-lebowski
- Add downstream column lineage #2159 @pawel-big-lebowski
- Implement column lineage within Marquez Java client #2163 @pawel-big-lebowski
- Provide dataset_symlinks table for SymlinkDatasetFacet #2087 @pawel-big-lebowski
- Display current run state for job node in lineage graph #2146 @wslulciuc
- Include column lineage in dataset resource #2148 @pawel-big-lebowski
- Add indices on the job table #2161 @phixMe
- Add endpoint to get column lineage by a job #2204 @pawel-big-lebowski
- Add column lineage methods to Python client #2209 @pawel-big-lebowski
Changed
- Update insert job function to avoid joining on symlinks for jobs with no symlinks #2144 @collado-mike
- Increase size of column-lineage.description column #2205 @pawel-big-lebowski
Fixed
- Add support for parentRun facet as reported by older Airflow OpenLineage versions #2130 @collado-mike
- Add fix and tests for handling Airflow DAGs with dots and task groups #2126 @collado-mike @wslulciuc
- Fix version bump in docker/up.sh #2129 @wslulciuc
- Use clean when running shadowJar in Dockerfile #2145 @wslulciuc
- Fix bug that caused a single run event to create multiple jobs #2162 @collado-mike
- Fix column lineage returning multiple entries for a job that ran multiple times #2176 @pawel-big-lebowski
- Fix API spec issues #2178 @phixMe
- Fix downstream recursion #2181 @pawel-big-lebowski
- Update jobs_current_version_uuid_index and jobs_symlink_target_uuid_index to ignore NULL values #2186 @collado-mike