✨ Highlights ✨
Granular control of pandas validation #1490
There is now support for granular control of schema-level or data-level validations. This can be done via the PANDERA_VALIDATION_DEPTH
environment variable. Schema-level (or metadata) validation includes things like column name checks and column data types, while data-level validation involves checks that operate on actual data values.
export PANDERA_VALIDATION_DEPTH= SCHEMA_AND_DATA # check schema- and data-level checks (default)
export PANDERA_VALIDATION_DEPTH=SCHEMA_ONLY # only do schema-level checks
export PANDERA_VALIDATION_DEPTH=DATA_ONLY # only do data-level checks
Efficient Hypothesis strategies #1503
Pandas data synthesis strategies now uses comparison operator functions for more efficient data synthesis. It also updates the minimum hypothesis
version to 6.92.7
.
What's Changed
- Fix copy-pasted docstring in PySpark accessor test by @deepyaman in #1448
- Mypy precommit by @cosmicBboy in #1468
- @check_types now properly passes in *args **kwargs and checks their types by @ecthompson99 in #1336
- Bump starlette from 0.27.0 to 0.36.2 in /dev by @dependabot in #1484
- Bump fastapi from 0.103.0 to 0.109.1 by @dependabot in #1482
- Bump actions/cache from 3 to 4 by @dependabot in #1478
- Bump codecov/codecov-action from 3 to 4 by @dependabot in #1477
- Bump jinja2 from 3.1.2 to 3.1.3 by @dependabot in #1459
- fix: pin multimethod dep version (#1485) by @schatimo in #1486
- Fix issue where str dtype in a multiindex dataframe schema results in invalid example by @gsugar87 in #1050
- Bump python-multipart from 0.0.6 to 0.0.7 by @dependabot in #1496
- Bump python-multipart from 0.0.6 to 0.0.7 in /dev by @dependabot in #1495
- Bump python-multipart from 0.0.6 to 0.0.7 in /ci by @dependabot in #1494
- Bump jinja2 from 3.1.2 to 3.1.3 in /ci by @dependabot in #1457
- Bump starlette from 0.27.0 to 0.36.2 in /dev by @dependabot in #1489
- Bugfix/1463 Pandas 2.2.0 FutureWarning resolution by using assignment instead of … by @derinwalters in #1464
- Bump jinja2 from 3.1.2 to 3.1.3 in /dev by @dependabot in #1458
- add pandas 2.2.0 to tests, use uv for pip compile by @cosmicBboy in #1502
- Efficient Hypothesis strategies by @Zac-HD in #1503
- remove headers in requirements files by @cosmicBboy in #1512
- Granular validations on pandas dfs by @kykyi in #1490
New Contributors
- @deepyaman made their first contribution in #1448
- @ecthompson99 made their first contribution in #1336
- @schatimo made their first contribution in #1486
- @gsugar87 made their first contribution in #1050
- @Zac-HD made their first contribution in #1503
Full Changelog: v0.18.0...v0.18.1