What's new?
- Add `audio-classification` with MMS and Wav2Vec2 in #220. Example usage:

  ```javascript
  // npm i @xenova/transformers
  import { pipeline } from '@xenova/transformers';

  // Create audio classification pipeline
  let classifier = await pipeline('audio-classification', 'Xenova/mms-lid-4017');

  // Run inference
  let url = 'https://huggingface.co/datasets/Xenova/transformers.js-docs/resolve/main/jeanNL.wav';
  let output = await classifier(url);
  // [
  //   { label: 'fra', score: 0.9995712041854858 },
  //   { label: 'hat', score: 0.00003788191679632291 },
  //   { label: 'lin', score: 0.00002646935718075838 },
  //   { label: 'hun', score: 0.000015628289474989288 },
  //   { label: 'bre', score: 0.000007014674793026643 }
  // ]
  ```
- Add `automatic-speech-recognition` for Wav2Vec2 models in #220 (MMS coming soon).
- Add support for multi-label classification problem type in #249. Thanks @KiterWork for reporting!
- Add M2M100 tokenizer in #250. Thanks @AAnirudh07 for the feature request!
- Documentation improvements
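
The multi-label problem type from #249 affects how raw logits are turned into scores. Here is a minimal sketch of the difference, assuming multi-label models are scored with an independent sigmoid per label instead of a softmax across labels (the usual convention in Hugging Face Transformers). The labels, logits, and 0.5 threshold are hypothetical, and the code is plain JavaScript that does not call any library API:

```javascript
// Illustrative only: plain JavaScript, not the library's internal code.

// Softmax: scores compete with each other and sum to 1 (single-label).
function softmax(logits) {
  const max = Math.max(...logits); // subtract the max for numerical stability
  const exps = logits.map((x) => Math.exp(x - max));
  const sum = exps.reduce((a, b) => a + b, 0);
  return exps.map((e) => e / sum);
}

// Sigmoid: each label is scored independently (multi-label).
function sigmoid(logits) {
  return logits.map((x) => 1 / (1 + Math.exp(-x)));
}

// Hypothetical labels and logits, chosen for illustration.
const labels = ['sports', 'politics', 'tech'];
const logits = [2.0, 1.5, -3.0];

const singleLabel = softmax(logits); // one distribution over all labels
const multiLabel = sigmoid(logits);  // one independent score per label

// Hypothetical threshold: keep every label whose score exceeds 0.5.
const predicted = labels.filter((_, i) => multiLabel[i] > 0.5);
console.log(predicted); // [ 'sports', 'politics' ]
```

With softmax, a second strong label lowers the score of the first. With sigmoid, several labels can score highly at once, which is what a multi-label problem needs.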
New Contributors
- @celsodias12 made their first contribution in #247
Full Changelog: 2.5.1...2.5.2