github hiyouga/LLaMA-Factory v0.9.1
v0.9.1: Many Vision Models, Qwen2.5 Coder, Gradient Fix

12 hours ago

New features

Note: now you can install transformers>=4.46.0,<=4.46.1 to make the gradient accumulation fix enabled.

New models

  • Base models
    • Qwen2.5 (0.5B/1.5B/3B/7B/14B/32B/72B) πŸ“„
    • Qwen2.5-Coder (0.5B/1.5B/3B/7B/14B/32B) πŸ“„πŸ–₯️
    • Llama-3.2 (1B/3B) πŸ“„
    • OpenCoder (1.5B/8B) πŸ“„πŸ–₯️
    • Index (1.9B) πŸ“„
  • Instruct/Chat models
    • Qwen2.5-Instruct (0.5B/1.5B/3B/7B/14B/32B/72B) πŸ“„πŸ€–
    • Qwen2.5-Coder-Instruct (0.5B/1.5B/3B/7B/14B/32B) πŸ“„πŸ€–πŸ–₯️
    • Llama-3.2-Instruct (1B/3B) πŸ“„πŸ€–
    • OpenCoder-Instruct (1.5B/8B) πŸ“„πŸ€–πŸ–₯️
    • Index-Chat (1.9B) πŸ“„πŸ€–
    • LLaVA-NeXT (7B/8B/13B/34B/72B/110B) πŸ“„πŸ€–πŸ–ΌοΈ
    • LLaVA-NeXT-Video (7B/34B) πŸ“„πŸ€–πŸ–ΌοΈ
    • Video-LLaVA (7B) πŸ“„πŸ€–πŸ–ΌοΈ
    • Pixtral (12B) πŸ“„πŸ€–πŸ–ΌοΈ
    • EXAONE-3.0-Instruct (8B) πŸ“„πŸ€–

Security fix

Bug fix

Full Changelog: v0.9.0...v0.9.1

Don't miss a new LLaMA-Factory release

NewReleases is sending notifications on new releases.