github huggingface/datasets 2.2.2

latest releases: 2.20.0, 2.19.2, 2.19.1...
2 years ago

Datasets fixes

Bug fixes

  • Support lists of multi-dimensional numpy arrays by @albertvillanova in #4194
  • Check if dataset features match before push in DatasetDict.push_to_hub by @mariosasko in #4372
  • Pin dill by @albertvillanova in #4380
    • dill 0.3.5 has some issues in transformers - pinning the version to <0.3.5 for now

Dataset Cards

  • Adding eval metadata for ade v2 by @sashavor in #4319
  • Adding eval metadata for AG News by @sashavor in #4329
  • Adding eval metadata to Allociné dataset by @sashavor in #4330
  • Adding eval metadata to Amazon Polarity by @sashavor in #4331
  • Adding eval metadata for arabic speech corpus by @sashavor in #4332
  • Adding eval metadata for Banking 77 by @sashavor in #4333
  • Eval metadata Batch 4: Tweet Eval, Tweets Hate Speech Detection, VCTK, Weibo NER, Wisesight Sentiment, XSum, Yahoo Answers Topics, Yelp Polarity, Yelp Review Full by @sashavor in #4338
  • Eval metadata batch 3: Reddit, Rotten Tomatoes, SemEval 2010, Sentiment 140, SMS Spam, Snips, SQuAD, SQuAD v2, Timit ASR by @sashavor in #4337
  • Eval metadata batch 1: BillSum, CoNLL2003, CoNLLPP, CUAD, Emotion, GigaWord, GLUE, Hate Speech 18, Hate Speech by @sashavor in #4335
  • Eval metadata batch 2 : Health Fact, Jigsaw Toxicity, LIAR, LJ Speech, MSRA NER, Multi News, NCBI Disease, Poem Sentiment by @sashavor in #4336

Docs

  • Add API code examples for Builder classes by @stevhliu in #4313
  • Add redirect to dataset script in the repo structure page by @lhoestq in #4369

Other improvements and bug fixes

New Contributors

Full Changelog: 2.2.1...2.2.2

Don't miss a new datasets release

NewReleases is sending notifications on new releases.