github jqnatividad/qsv 0.136.0

14 hours ago

🎉 qsv pro is now available in the Microsoft Store! 🎉

It's Data Wrangling Democratized on the Desktop, featuring:

  • 📊 Familiar Spreadsheet Interface
    tap the power of qsv to query, analyze, enrich, scrub and transform huge Excel files and multi-gigabyte CSV files in seconds, without having to deal with the command-line.
  • CKAN CKAN desktop client
    designed to make data publishing easier for portal operators and data stewards using the CKAN CKAN platform.
  • 📥 Flow
    allows you to build custom node-based flows and data pipelines using a visual interface.
  • 🔧 Toolbox
    features an ever-expanding library of reusable scripts for common data-wrangling use cases.
  • ⭐ and more!
    Natural Language Interface (RAG), Polars SQL query support, an API, Python/Luau support, automatic Data Dictionaries, DCAT 3 metadata profile inferencing, along with a retinue of other cloud-based services (e.g. customizable street-level geocoding, data feeds, reference data lookups, geo-ip lookups, cloud storage support, .qsv file format, etc.) that will be unveiled in future versions.

Like qsv, we're iterating rapidly with qsv pro, so your feedback is essential. Give it a try!

Get it from https://qsvpro.dathere.com or

Other highlights:

  • excel: new --table option for XLSX files; new --header-row option; expanded --range option, adding support for Named Ranges and absolute ranges (e.g. Sheet2!$A$1:$J$10); and expanded metadata export now including Named Ranges and Tables (for XLSX files)
  • Improved performance for several commands (apply, datefmt, tojsonl and validate) through automatic batch size optimization
  • validate: dynamicEnum custom JSON Schema keyword in validate command (renamed from dynenum) and enhanced email validation
  • schema: automatic JSON Schema const inferencing for columns with just one value
  • Significant dependency updates, including latest upstream versions of Polars, jsonschema, and serde_json with unreleased performance upgrades, new features and fixes

NOTE: You can see qsv & qsv pro in action in our "The Problem with Data Portals" webinar Wed, Oct 23, 2024. 1-2pm EDT


Added

  • 🎉 qsv pro is now in the Microsoft Store!!! 🎉
  • apply, datefmt, tojsonl, validate: added logic to automatically determine optimal batch size for better parallelization #2178
  • enum: added --new-column support for all enum modes, not just --increment #2173
  • excel: new --table option for XLSX files #2194
  • excel: new --header-row option 458f79a
  • excel: expanded range and metadata options #2195
  • schema: added JSON Schema automatic const inferencing #2180
  • Add signing step to qsv MSI installer GitHub Action by @rzmk in #2182
  • contrib(completions): add --table option to qsv excel by @rzmk in #2197
  • completions: add --header-row option to qsv excel e8794d5
  • added new apply operations sentiment benchmark b745e64
  • docs: added indexing section to PERFORMANCE.md 804145a

Changed

Fixed

  • schema: fix enum so it only adds a list when the number of unique values > --enum-threshold #2180
  • Upload artifact fix for Debian package publishing by @tino097 in #2168
  • fixed typos configuration 627de89
  • fixed various GitHub Actions publishing workflow issues

Full Changelog: 0.135.0...0.136.0

Don't miss a new qsv release

NewReleases is sending notifications on new releases.