huggingface/transformers v0.2.0
Small improvements and a few bug fixes.

5 years ago

Improvement:

Added a cache_dir option to the from_pretrained() function to select the path where the pre-trained model weights are downloaded and cached. This is useful for distributed training (see the readme). Fixes issue #44.
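
As a rough sketch of the distributed-training use case, each process can be given its own cache directory so that workers do not write to the same files at the same time. The helper name per_rank_cache_dir, the base path, and the distributed_<rank> naming are illustrative assumptions, and the commented-out call assumes a model class that exposes from_pretrained() with the cache_dir option described above.

```python
from pathlib import Path


def per_rank_cache_dir(base_dir, local_rank):
    """Return a cache directory that is unique to one training process.

    Hypothetical helper: the naming scheme is an assumption for illustration,
    not part of the library.
    """
    return Path(base_dir) / "distributed_{}".format(local_rank)


cache_dir = per_rank_cache_dir("/tmp/bert_cache", local_rank=0)
print(cache_dir)

# Assumed usage with a model class from the library (left commented out
# because it would download weights):
# model = BertModel.from_pretrained("bert-base-uncased", cache_dir=str(cache_dir))
```

Giving every rank a separate directory trades some disk space and download time for not having to coordinate access to a shared cache.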

Bug fixes in model training and tokenizer loading:

Fixed error in CrossEntropyLoss reshaping (issue #55).
Fixed unicode error in vocabulary loading (issue #52).
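
The note does not say how the unicode error was fixed. A common cause of such errors is reading a vocabulary file with the platform's default encoding. The sketch below shows the general technique of reading the file as UTF-8 explicitly; it is an assumption about the kind of fix involved, not the library's actual code. The function name load_vocab and the one-token-per-line file format are assumed for illustration.

```python
import collections
import os
import tempfile


def load_vocab(vocab_file):
    """Map each token (one per line) to its line index, reading the file as UTF-8."""
    vocab = collections.OrderedDict()
    # An explicit encoding avoids depending on the platform default.
    with open(vocab_file, "r", encoding="utf-8") as reader:
        for index, line in enumerate(reader):
            vocab[line.rstrip("\n")] = index
    return vocab


# Small demonstration with non-ASCII tokens in a temporary file.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "vocab.txt")
    with open(path, "w", encoding="utf-8") as writer:
        writer.write("[PAD]\ncafé\n日本\n")
    vocab = load_vocab(path)

print(vocab["café"])  # prints 1
```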

Bug fixes in examples:

Fixed weight decay in examples (previously, bias and layer norm weights were also decayed because of an erroneous check in the training loop).
Fixed the fp16 "grad norm is None" error in examples (issue #43).
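
The weight decay fix can be illustrated without any training code. The sketch below shows one way such a check can go wrong: testing whether a full parameter name is a member of a list of short fragments never matches, so every parameter ends up in the decayed group. The fragment list, the parameter names, and the function names split_buggy and split_fixed are assumptions for illustration and are not taken from the example scripts.

```python
# Name fragments that mark parameters to exclude from weight decay (assumed).
NO_DECAY = ["bias", "gamma", "beta"]


def split_buggy(names):
    """Erroneous check: compares the full name with the fragments, so nothing matches."""
    decay = [n for n in names if n not in NO_DECAY]
    no_decay = [n for n in names if n in NO_DECAY]
    return decay, no_decay


def split_fixed(names):
    """Substring check: a parameter is exempt if any fragment occurs in its name."""
    decay = [n for n in names if not any(nd in n for nd in NO_DECAY)]
    no_decay = [n for n in names if any(nd in n for nd in NO_DECAY)]
    return decay, no_decay


# Hypothetical parameter names in the style of a transformer encoder layer.
names = [
    "encoder.layer.0.attention.self.query.weight",
    "encoder.layer.0.attention.self.query.bias",
    "encoder.layer.0.output.LayerNorm.gamma",
    "encoder.layer.0.output.LayerNorm.beta",
]

print(split_buggy(names)[1])  # prints []
print(split_fixed(names)[0])  # prints ['encoder.layer.0.attention.self.query.weight']
```

In a training script, the two resulting groups would typically be passed to the optimizer as separate parameter groups, with a non-zero weight decay on the first and zero on the second.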

Updated the readme and docstrings.
