Implemented new interfaces to test data and models in a batch: Test Suite.
Implemented the following Individual tests:
- TestNumberOfColumns()
- TestNumberOfRows()
- TestColumnNANShare()
- TestShareOfOutRangeValues()
- TestNumberOfOutListValues()
- TestMeanInNSigmas()
- TestMostCommonValueShare()
- TestNumberOfConstantColumns()
- TestNumberOfDuplicatedColumns()
- TestNumberOfDuplicatedRows()
- TestHighlyCorrelatedFeatures()
- TestTargetFeaturesCorrelations()
- TestShareOfDriftedFeatures()
- TestValueDrfit()
- TestColumnsType()
Implemented the following test presets:
- Data Quality. This preset is focused on the data quality issues like duplicate rows or null values.
- Data Stability. This preset identifies the changes in the data or differences between the batches.
- Data Drift. This one compares feature distributions using statistical tests and distance metrics.
- NoTargetPerformance. This preset combines several checks to run when there are model predictions, there are no actuals or ground truth labels. This includes checking for prediction drift and some of the data quality and stability checks.