github tensorchord/VectorChord-bm25 0.2.0

8 months ago

VectorChord-bm25 0.2 Release Notes

We are excited to announce the release of VectorChord-bm25 version 0.2. This release introduces significant improvements to the tokenizer functionality, enhancing flexibility, customization, and multi-language support. Below are the key updates in this release.

🚀 New Features and Improvements

Decoupled tokenizer

The tokenizer-related API has been extracted into a dedicated repository: pg_tokenizer.rs. This change introduces several key improvements:

  1. Enhanced Customization

    • The tokenizer now offers more flexible configuration options to suit a wide range of use cases.
    • Users can now customize stopwords, synonyms, and stemmers to better align with their specific requirements.
  2. Multi-Language Support

    • Support for multiple languages has been significantly enhanced, making the extension more versatile for global applications.
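To make the customization points above concrete, here is a minimal conceptual sketch in Python of how stopwords, synonyms, and a stemmer typically combine in a text-analysis pipeline. It is not the pg_tokenizer.rs API: every name in it (STOPWORDS, SYNONYMS, naive_stem, analyze) is hypothetical, and the suffix-stripping stemmer is a toy stand-in for a real one. Consult the pg_tokenizer.rs repository for the actual configuration syntax.

```python
# Conceptual illustration only: this is NOT the pg_tokenizer.rs API.
# All names below are hypothetical and chosen for this sketch.
import re

# Words dropped because they carry little ranking signal.
STOPWORDS = {"the", "a", "an", "of", "is", "for"}

# Variant spellings mapped to one canonical token.
SYNONYMS = {"postgres": "postgresql", "pg": "postgresql"}


def naive_stem(token: str) -> str:
    """Strip a few common English suffixes (toy stand-in for a real stemmer)."""
    for suffix in ("ing", "es", "s"):
        # Keep at least three characters so short words are left alone.
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def analyze(text: str) -> list[str]:
    """Lowercase, split, drop stopwords, map synonyms, then stem."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    tokens = [t for t in tokens if t not in STOPWORDS]
    tokens = [SYNONYMS.get(t, t) for t in tokens]
    return [naive_stem(t) for t in tokens]


if __name__ == "__main__":
    print(analyze("Searching for a pg tokenizer"))
```

Swapping the stopword set, the synonym map, or the stemming function changes which terms reach the BM25 index, which is why these three settings are the usual levers for tuning a tokenizer to a given language or domain.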

🔧 Upgrade Guide

  • Review the Updated API: Users are encouraged to review the updated API in the pg_tokenizer.rs repository for detailed information on the new features and customization options.

Full Changelog: 0.1.1...0.2.0
