github hiyouga/LlamaFactory v0.9.0
v0.9.0: Qwen2-VL, Liger-Kernel, Adam-mini

16 months ago

Congratulations on 30,000 stars 🎉 Follow us at X (twitter)

New features

New models

  • Base models
    • Qwen2-Math (1.5B/7B/72B) 📄🔢
    • Yi-Coder (1.5B/9B) 📄
    • InternLM2.5 (1.8B/7B/20B) 📄
    • Gemma-2-2B 📄
    • Meta-Llama-3.1 (8B/70B) 📄
  • Instruct/Chat models
    • MiniCPM/MiniCPM3 (1B/2B/4B) by @LDLINGLINGLING in #4996 #5372 📄🤖
    • Qwen2-Math-Instruct (1.5B/7B/72B) 📄🤖🔢
    • Yi-Coder-Chat (1.5B/9B) 📄🤖
    • InternLM2.5-Chat (1.8B/7B/20B) 📄🤖
    • Qwen2-VL-Instruct (2B/7B) 📄🤖🖼️
    • Gemma-2-2B-it by @codemayq in #5037 📄🤖
    • Meta-Llama-3.1-Instruct (8B/70B) 📄🤖
    • Mistral-Nemo-Instruct (12B) 📄🤖

New datasets

  • Supervised fine-tuning datasets
    • Magpie-ultra-v0.1 (en) 📄
    • Pokemon-gpt4o-captions (en&zh) 📄🖼️
  • Preference datasets
    • RLHF-V (en) 📄🖼️
    • VLFeedback (en) 📄🖼️

Changes

  • For compatibility reasons, fine-tuning vision language models (VLMs) requires transformers>=4.35.0.dev0; install it with pip install git+https://github.com/huggingface/transformers.git.
  • visual_inputs has been deprecated; you no longer need to specify this argument.
  • LlamaFactory now uses lazy loading for multimodal inputs; see #5346 for details. Use preprocessing_batch_size to limit the batch size during dataset pre-processing (supported by @naem1023 in #5323).
  • LlamaFactory now supports lmf (equivalent to llamafactory-cli) as a shortcut command.
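
The transformers requirement in the first item above can be checked before starting a VLM fine-tuning run. The following is a minimal sketch, not part of LlamaFactory: the helper names (parse_version, meets_requirement, installed_transformers_version) are hypothetical, and the parser only understands plain X.Y.Z versions with an optional .devN suffix. For anything beyond that (release candidates, post releases, local versions), a full PEP 440 implementation such as packaging.version is the safer choice.

```python
import re
from importlib import metadata

# Minimum version quoted in the release note above.
REQUIRED = "4.35.0.dev0"

# Matches "X.Y.Z" with an optional ".devN" suffix; anything after is ignored.
_VERSION_RE = re.compile(r"^(\d+(?:\.\d+)*)(?:\.dev(\d+))?")


def parse_version(text):
    """Return a sortable key for 'X.Y.Z' or 'X.Y.Z.devN' strings (hypothetical helper)."""
    match = _VERSION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognized version string: {text!r}")
    release = tuple(int(part) for part in match.group(1).split("."))
    # Pad to three components so "4.35" and "4.35.0" compare equal.
    release = release + (0,) * (3 - len(release))
    dev = match.group(2)
    # A dev build sorts before the final release with the same number.
    dev_key = (0, int(dev)) if dev is not None else (1, 0)
    return release, dev_key


def meets_requirement(installed, required=REQUIRED):
    """True if the installed version is at least the required one."""
    return parse_version(installed) >= parse_version(required)


def installed_transformers_version():
    """Return the installed transformers version string, or None if it is absent."""
    try:
        return metadata.version("transformers")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_transformers_version()
    if version is None or not meets_requirement(version):
        print("transformers is missing or too old for VLM fine-tuning:", version)
    else:
        print("transformers", version, "satisfies >=", REQUIRED)
```

If the check fails, installing transformers from source as described in the first item is the remedy the release note suggests.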

Bug fix
