Added
sniff
: sniff field names as well in addition to field data types in #317sniff
: intelligent sampling. In addition to specifying the number of first n rows to sample, when--sample
is between 0 and 1 exclusive, its treated as a percentage of the CSV to sample (e.g. 0.20 is 20 percent).
If its zero, the entire file is sampled. #318schema
: add --stdout option in #321stats
: smart date inferencing with field-name date whitelist. Also did some minor tweaks for a little more performance in #327rename
: added toqsvdp
- DataPusher+-optimized qsv binary
Changed
- Switch to qsv_sniffer fork of csv_sniffer. qsv_sniffer has several optimizations (field name sniffing, utf-8 encoding detection,
SIMD speedups, etc.) that enabled the addedsniff
features above. #320 - Bump uuid from 1.0.0 to 1.1.0 by @dependabot in #323
- Improved Performance Tuning section with more details about UTF-8 encoding, and Nightly builds
- Updated list of commands that use an index
- cargo update bump dependencies, notably jql 4.0.3 to 4.0.4, and cookie_store from 0.16.0 to 0.16.1
Fixed
- pinned Rust Nightly to 2022-05-23. Later Rust Nightly releases "broke" packed-simd dependency
which prevented us from building qsv's nightly build. (see apache/arrow-rs#1734)
Full Changelog: 0.50.1...0.51.0