v2.3.0

3 years ago

What's new

Added 🎉

  • Ported the following Huggingface LambdaLR-based schedulers: ConstantLearningRateScheduler, ConstantWithWarmupLearningRateScheduler, CosineWithWarmupLearningRateScheduler, CosineHardRestartsWithWarmupLearningRateScheduler.
  • Added a new sub_token_mode parameter to the pretrained_transformer_mismatched_embedder class to support first sub-token embedding.
  • Added a way to run a multitask model with a dataset reader as part of allennlp predict.
  • Added a new eval_mode parameter to PretrainedTransformerEmbedder. If it is set to True, the transformer is always run in evaluation mode, which, e.g., disables dropout and does not update batch normalization statistics.
  • Added additional parameters to the W&B callback: entity, group, name, notes, and wandb_kwargs.
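
As a rough illustration of what the warmup-based schedulers listed above compute, here is a minimal sketch of the learning-rate multiplier used by LambdaLR-style schedules. It follows the commonly documented Huggingface formulas (linear warmup, then either a constant or a cosine decay). The function names and the step counts are hypothetical and chosen for illustration; this is not the AllenNLP implementation.

```python
import math


def constant_with_warmup(step: int, num_warmup_steps: int) -> float:
    """Multiplier that ramps linearly from 0 to 1, then stays at 1."""
    if step < num_warmup_steps:
        return step / max(1, num_warmup_steps)
    return 1.0


def cosine_with_warmup(
    step: int,
    num_warmup_steps: int,
    num_training_steps: int,
    num_cycles: float = 0.5,
) -> float:
    """Multiplier that ramps linearly to 1, then follows a cosine decay."""
    if step < num_warmup_steps:
        return step / max(1, num_warmup_steps)
    # Fraction of the post-warmup phase that has been completed.
    progress = (step - num_warmup_steps) / max(1, num_training_steps - num_warmup_steps)
    return max(0.0, 0.5 * (1.0 + math.cos(math.pi * num_cycles * 2.0 * progress)))


# Hypothetical settings: 2 warmup steps out of 10 training steps.
base_lr = 1e-3
schedule = [base_lr * cosine_with_warmup(s, 2, 10) for s in range(11)]
```

With the default num_cycles of 0.5, the multiplier reaches 1 at the end of warmup and decays to roughly 0 at the final training step.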

Changed ⚠️

  • Sanity checks in the GradientDescentTrainer can now be turned off by setting the run_sanity_checks parameter to False.
  • Allow the order of examples in the task cards to be specified explicitly.
  • The histogram_interval parameter is now deprecated in TensorboardWriter. Please use distribution_interval instead.
  • Memory usage is no longer logged in tensorboard during training. ConsoleLoggerCallback should be used instead.
  • If you use the min_count parameter of the Vocabulary, but you specify a namespace that does not exist, the vocabulary creation will raise a ConfigurationError.
  • Documentation updates made to SoftmaxLoss regarding padding and the expected shapes of the input and output tensors of forward.
  • Moved the data preparation script for coref into allennlp-models.
  • If a transformer is not in cache but has override weights, the transformer's pretrained weights are no longer downloaded. Only its config.json file is downloaded.
  • SanityChecksCallback now raises SanityCheckError instead of AssertionError when a check fails.
  • jsonpickle removed from dependencies.
  • Improved the error message from Registrable.by_name() when the name passed does not match any registered subclasses.
    The error message will include a suggestion if there is a close match between the name passed and a registered name.
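
The "close match" suggestion described for Registrable.by_name() can be sketched with the standard library. This is a minimal sketch assuming difflib is used for fuzzy matching; the registry contents, the by_name function, and the UnknownNameError class below are hypothetical stand-ins, and the actual AllenNLP implementation and message wording may differ.

```python
import difflib
from typing import Dict


class UnknownNameError(Exception):
    """Hypothetical stand-in for the error raised on an unknown name."""


# Hypothetical registry mapping registered names to class names.
_REGISTRY: Dict[str, str] = {
    "relu": "ReluActivation",
    "linear": "LinearActivation",
    "tanh": "TanhActivation",
}


def by_name(name: str) -> str:
    """Look up a registered name, suggesting a close match on failure."""
    if name in _REGISTRY:
        return _REGISTRY[name]
    message = f"'{name}' is not a registered name."
    # Ask for at most one candidate above difflib's default similarity cutoff.
    close = difflib.get_close_matches(name, list(_REGISTRY), n=1)
    if close:
        message += f" Did you mean '{close[0]}'?"
    raise UnknownNameError(message)


try:
    by_name("linaer")
except UnknownNameError as err:
    suggestion_message = str(err)
```

A misspelling such as "linaer" yields a message that suggests "linear", while a name with no similar registered entry gets the plain error without a suggestion.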

Fixed ✅

  • Fixed a bug where some Activation implementations could not be pickled due to involving a lambda function.
  • Fixed __str__() method on ModelCardInfo class.
  • Fixed a stall when using distributed training and gradient accumulation at the same time.
  • Fixed an issue where using the from_pretrained_transformer Vocabulary constructor in distributed training via the allennlp train command
    would result in the data being iterated through unnecessarily.
  • Fixed a bug regarding token indexers with the InterleavingDatasetReader when used with multi-process data loading.
  • Fixed a warning from transformers when using max_length in the PretrainedTransformerTokenizer.
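
The pickling fix above concerns a general Python limitation: pickle stores functions by their importable name, so a lambda cannot be serialized while a named, importable callable can. The sketch below demonstrates the limitation using only the standard library; the can_pickle helper and the tanh-based "activations" are hypothetical illustrations, not the Activation classes that were changed.

```python
import math
import pickle


def can_pickle(obj: object) -> bool:
    """Return True if pickle can serialize obj, False otherwise."""
    try:
        pickle.dumps(obj)
        return True
    except (pickle.PicklingError, AttributeError):
        # Raised when the object cannot be found under an importable name.
        return False


# An "activation" defined as a lambda has no importable name.
lambda_activation = lambda x: math.tanh(x)

# The same behavior exposed through a named, importable function.
named_activation = math.tanh

lambda_ok = can_pickle(lambda_activation)
named_ok = can_pickle(named_activation)

# The named callable survives a full round trip.
restored = pickle.loads(pickle.dumps(named_activation))
```

This matters for multi-process data loading and distributed training, where objects are often pickled to be sent between processes.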

Removed 👋

  • Removed the stride parameter from PretrainedTransformerTokenizer. This parameter had no effect.


