github icereed/paperless-gpt v0.20.0

latest releases: v0.22.0, v0.21.0
3 months ago

Release Highlights 🚀

New Features

🧠 Google Gemini AI Integration

  • Added Google Gemini AI support - Paperless-GPT now supports Google's Gemini AI models as a new LLM provider option
  • Thinking budget support - Leverages Gemini's new thinking capabilities for enhanced document processing
  • Enhanced error handling - Improved API response validation and error management for Google AI services
  • Set LLM_PROVIDER: "googleai" and configure your Google AI API credentials to get started

Improvements & Refinements

🔧 LLM Prompt Optimization

  • Enhanced prompt structure - Added XML-like separators to LLM prompts for improved parsing accuracy and consistency
  • Better data organization - Input data is now enclosed in structured tags for clearer LLM interpretation
  • Improved reliability - More consistent results across different document types and LLM providers

📄 PDF Processing Fixes

  • Fixed split PDF logic - Updated file naming pattern to match pdfcpu output format (removes zero-padding)
  • Consistent naming - Split files now use simplified naming: original_1.pdf, original_2.pdf, etc.
  • Better workflow integration - Improved compatibility with existing PDF processing pipelines

Documentation Updates

📚 Model Recommendations

  • Updated model suggestions - Documentation now recommends qwen3:8b instead of deepseek-r1:8b for Ollama users
  • Better performance - qwen3:8b offers more recent and powerful reasoning capabilities
  • Improved examples - Updated configuration examples throughout the documentation

Dependencies & Maintenance

🔄 Dependency Updates

  • testcontainers updated to v10.28.0 - Latest testing framework improvements
  • globals updated to v16.2.0 - Enhanced JavaScript globals definitions
  • Automated maintenance - Renovate bot ensures dependencies stay current and secure

Technical Details

What's Changed

  • adds support for new gemini models with thinking budget #441
  • refactor: Add XML-like separators to LLM prompts for improved parsing #442
  • fix: the split pdf logic to be consistent with the output from pdfcpu #435
  • doc: mention qwen3:8b instead of deepseek-r1:8b #439
  • chore(deps): update dependency testcontainers to v10.28.0 #425
  • chore(deps): update dependency globals to v16.2.0 #428

Contributors

Special thanks to @thiswillbeyourgithub, @dawidkulpa, and @moarsmokes for their contributions to this release!

Configuration Notes

  • For Google Gemini AI: Set environment variables for GOOGLE_AI_API_KEY and configure LLM_PROVIDER: "googleai"
  • No breaking changes - existing configurations continue to work as expected
  • Consider updating to qwen3:8b model if using Ollama for better performance

New Contributors

Full Changelog: v0.19.0...v0.20.0

Don't miss a new paperless-gpt release

NewReleases is sending notifications on new releases.