github hiyouga/LLaMA-Factory v0.5.2
v0.5.2: Block Expansion, Qwen1.5 Models

latest releases: v0.9.0, v0.8.3, v0.8.2...
8 months ago

New features

  • Support block expansion in LLaMA Pro, see tests/llama_pro.py for usage
  • Add use_rslora option for the LoRA method

New models

  • Base models
    • Qwen1.5 (0.5B/1.8B/4B/7B/14B/72B)
    • DeepSeekMath-7B-Base
    • DeepSeekCoder-7B-Base-v1.5
    • Orion-14B-Base
  • Instruct/Chat models
    • Qwen1.5-Chat (0.5B/1.8B/4B/7B/14B/72B)
    • MiniCPM-2B-SFT/DPO
    • DeepSeekMath-7B-Instruct
    • DeepSeekCoder-7B-Instruct-v1.5
    • Orion-14B-Chat
    • Orion-14B-Long-Chat
    • Orion-14B-RAG-Chat
    • Orion-14B-Plugin-Chat

New datasets

  • Supervised fine-tuning datasets
    • SlimOrca (en)
    • Dolly (de)
    • Dolphin (de)
    • Airoboros (de)
  • Preference datasets
    • Orca DPO (de)

Bug fix

Don't miss a new LLaMA-Factory release

NewReleases is sending notifications on new releases.