## Knowledge distillation
Knowledge distillation was introduced (#8). To perform distillation, an `IncDistiller` must be instantiated with the appropriate configuration.
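A minimal sketch of what instantiation might look like. Only the `IncDistiller` class name comes from these notes; the import path, the YAML-based configuration, and the `teacher_model` argument are illustrative assumptions, not the confirmed API:

```python
from transformers import AutoModelForSequenceClassification
from optimum.intel.neural_compressor import IncDistiller  # import path assumed

# Teacher: a larger fine-tuned model whose outputs the student learns to mimic.
teacher = AutoModelForSequenceClassification.from_pretrained(
    "textattack/bert-base-uncased-SST-2"
)

# The configuration file name and the `teacher_model` keyword are
# assumptions; consult the library documentation for the actual schema.
distiller = IncDistiller(config="distillation.yml", teacher_model=teacher)
```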
## One-shot optimization
Compression techniques such as pruning, knowledge distillation and quantization-aware training can now be combined in one shot during training (#7). One-shot optimization is enabled by default, but can be disabled by setting the `one_shot_optimization` parameter to `False` when instantiating the `IncOptimizer`.
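A minimal sketch of disabling it, assuming `IncOptimizer` takes the model as its first argument (only the `one_shot_optimization` parameter itself is confirmed above):

```python
from transformers import AutoModelForSequenceClassification
from optimum.intel.neural_compressor import IncOptimizer  # import path assumed

model = AutoModelForSequenceClassification.from_pretrained("distilbert-base-uncased")

# `one_shot_optimization` defaults to True; passing False turns off the
# combined one-shot behavior described above.
optimizer = IncOptimizer(model, one_shot_optimization=False)
```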
## Seq2Seq models support
Both quantization and pruning can now be applied to Seq2Seq models (#14).
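A hypothetical sketch of the flow for a Seq2Seq model; the checkpoint and the constructor signature are assumptions, and only the class name comes from these notes:

```python
from transformers import AutoModelForSeq2SeqLM
from optimum.intel.neural_compressor import IncOptimizer  # import path assumed

# A Seq2Seq model (e.g. T5) loads like any other architecture and can be
# handed to the same optimization entry point used above.
model = AutoModelForSeq2SeqLM.from_pretrained("t5-small")
optimizer = IncOptimizer(model)  # constructor signature assumed
```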