github tensorflow/tensor2tensor v1.14.0

4 years ago

Models / Layers:

  • NeuralStack and NeuralQueue added, in 838aca4 - thanks @narphorium !
  • Open Sourcing the Search Space used in EvolvedTransformer - 4ce3661
  • Masked local n-D attention added in 2da59d2

Problems:

Bug Fixes:

  • Fixed the loss being multiplied by loss_coef twice (#1627), by @davidmrau - thanks a lot David!
  • Fix log_prob accumulation during decoding, thanks @lmthang !
  • Fixed high usage of TPU HBM "Arguments" during serving in d38f343, thanks @ziy !
  • Summaries are no longer generated during decoding in dot_product_relative_attention (#1618), thanks @phamthuonghai !

Misc changes:

  • Implement sequence packing as a tf.data.Dataset transformation - 560c008 thanks @robieta !
  • Lots of work on t2t_distill and model exporting by @ziy - thanks @ziy !
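
For readers unfamiliar with sequence packing, the idea is to concatenate several short examples into one fixed-length row so that less of each batch is padding. The following is a minimal plain-Python sketch of that idea only, not the tf.data.Dataset transformation from 560c008. The function names, the greedy strategy, and the output keys ("tokens", "segments", "positions") are assumptions made for illustration.

```python
from typing import Dict, List


def _finish_row(tokens, segments, positions, max_len, pad_id):
    """Pad one packed row out to max_len (hypothetical helper)."""
    pad = max_len - len(tokens)
    return {
        "tokens": tokens + [pad_id] * pad,
        # Segment id 0 marks padding; real sequences are numbered from 1.
        "segments": segments + [0] * pad,
        "positions": positions + [0] * pad,
    }


def pack_sequences(sequences: List[List[int]], max_len: int,
                   pad_id: int = 0) -> List[Dict[str, List[int]]]:
    """Greedily pack token sequences into rows of length max_len."""
    rows = []
    tokens, segments, positions = [], [], []
    segment_id = 0
    for seq in sequences:
        seq = seq[:max_len]  # truncate anything longer than a full row
        if not seq:
            continue  # skip empty sequences
        # Start a new row when the next sequence would overflow this one.
        if tokens and len(tokens) + len(seq) > max_len:
            rows.append(_finish_row(tokens, segments, positions, max_len, pad_id))
            tokens, segments, positions = [], [], []
            segment_id = 0
        segment_id += 1
        tokens.extend(seq)
        segments.extend([segment_id] * len(seq))
        positions.extend(range(len(seq)))  # positions restart per sequence
    if tokens:
        rows.append(_finish_row(tokens, segments, positions, max_len, pad_id))
    return rows


packed = pack_sequences([[1, 2, 3], [4, 5], [6, 7, 8, 9]], max_len=6)
```

In this sketch the segment ids let an attention mask keep packed sequences from attending to each other, and the per-sequence positions keep position embeddings aligned with each original example.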

RL:

  • Introduce Rainbow (#1607) by @konradczechowski
  • Changes to MBRL by @konradczechowski and @koz4k in multiple PRs.

PRs:

TRAX:

Base

  • Forked optimizers from JAX and made them objects in 1c7c10c
  • Trax layers are now stateful and support custom gradients.
  • Multi-device capability added.
  • Memory efficient trainer added in b2615aa ! Thanks Nikita Kitaev!
  • Adafactor optimizer added in TRAX - 63c015f
  • Demo Colab added in cec26db thanks @levskaya
  • Demo colab for trax layers - 7632ed0
  • Transformer, TransformerLM, Reversible Transformer, PositionLookupTransformer and Resnet50 are some of the models that TRAX now supports.

RL

  • Many PPO changes to be able to work on Atari.
  • Distributed PPO where the envs can run in multiple parallel machines using gRPC
  • SimulatedEnvProblem by @koz4k - a gym env that simulates a step taken by a trainer of a Neural Network in 2c76178
  • Implement SerializedSequenceSimulatedEnvProblem by @koz4k
  • Transformer can be used as a policy now, thanks to @koz4k in 33783fd !
