pypi catboost 0.20
v0.20

latest releases: 1.2.5, 1.2.3, 1.2.2...
4 years ago

New submodule for text processing!
It contains two classes to help you make text features ready for training:

  • Tokenizer -- use this class to split text into tokens (automatic lowercase and punctuation removal)
  • Dictionary -- with this class you create a dictionary which maps tokens to numeric identifiers. You then use these identifiers as new features.

New features:

  • Enabled boost_from_average for MAPE loss function

Bug fixes:

  • Fixed Pool creation from pandas.DataFrame with discontinuous columns, #1079
  • Fixed standalone_evaluator, PR #1083

Speedups:

  • Huge speedup of preprocessing in python-package for datasets with many samples (>10 mln)

We also release precompiled packages for Python 3.8

Don't miss a new catboost release

NewReleases is sending notifications on new releases.