github datafold/data-diff v0.3.0
v0.3.0 - New algorithm for in-db diffing (joindiff) + tons of new features and bugfixes!

21 months ago

Big points:

  • Added a new algorithm for in-db diffing that uses OUTER JOIN, called "joindiff".
    • Much faster than the original "hashdiff" algorithm!
    • Chosen automatically when both tables are in the same database
    • Validates that the key column is unique and contains no NULLs (joindiff only)
    • Use the --algorithm parameter to switch between algorithms explicitly.
  • New feature to materialize joindiff results to the database
  • New feature that diffs the schemas when both tables are in the same database
  • Added DuckDB support (thanks @jardayn!)
  • Better support for alphanumerics
  • Better support for boolean types
  • Added --version switch
  • New and improved database and query interface, named "sqeleton"
  • Tons of bugfixes and improvements!
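
To illustrate the idea behind joindiff, here is a minimal sketch of an outer-join diff with the key validation described above. It is not data-diff's implementation: the helper names check_key and outer_join_diff, the src/dst tables, and the ('-'/'+', key, value) output shape are assumptions made for this example. It uses Python's built-in sqlite3 module and emulates FULL OUTER JOIN with two LEFT JOINs so that it also runs on older SQLite versions.

```python
import sqlite3


def check_key(conn, table, key):
    """Raise ValueError if the key column contains NULLs or duplicates."""
    total, non_null, distinct = conn.execute(
        f"SELECT COUNT(*), COUNT({key}), COUNT(DISTINCT {key}) FROM {table}"
    ).fetchone()
    if non_null != total:
        raise ValueError(f"{table}.{key} contains NULLs")
    if distinct != total:
        raise ValueError(f"{table}.{key} is not unique")


def outer_join_diff(conn, left, right, key, column):
    """Return ('-', key, value) for rows missing or changed on the left side
    and ('+', key, value) for rows missing or changed on the right side."""
    check_key(conn, left, key)
    check_key(conn, right, key)
    # Emulated FULL OUTER JOIN: all left rows with their match (if any),
    # plus the right rows that have no match on the left.
    query = f"""
        SELECT a.{key}, a.{column}, b.{key}, b.{column}
        FROM {left} a LEFT JOIN {right} b ON a.{key} = b.{key}
        UNION ALL
        SELECT a.{key}, a.{column}, b.{key}, b.{column}
        FROM {right} b LEFT JOIN {left} a ON a.{key} = b.{key}
        WHERE a.{key} IS NULL
    """
    diff = []
    for a_key, a_val, b_key, b_val in conn.execute(query):
        if a_key is not None and b_key is not None and a_val == b_val:
            continue  # row exists on both sides with the same value
        if a_key is not None:
            diff.append(("-", a_key, a_val))
        if b_key is not None:
            diff.append(("+", b_key, b_val))
    # Order by key, listing the '-' entry before the '+' entry.
    return sorted(diff, key=lambda r: (r[1], r[0] == "+"))


# Hypothetical demo tables living in the same (in-memory) database.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE src (id INTEGER, name TEXT);
    CREATE TABLE dst (id INTEGER, name TEXT);
    INSERT INTO src VALUES (1, 'a'), (2, 'b'), (3, 'c');
    INSERT INTO dst VALUES (1, 'a'), (2, 'B'), (4, 'd');
""")
result = outer_join_diff(conn, "src", "dst", "id", "name")
for row in result:
    print(row)
```

Because the join runs inside the database, only differing rows come back to the client. That is only possible when both tables live in the same database.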

Full Changelog: v0.2.8...v0.3.0

Let us know what you think in Discussions!