icereed/paperless-gpt v0.20.0

3 months ago

Release Highlights 🚀

New Features

🧠 Google Gemini AI Integration

Added Google Gemini AI support - Paperless-GPT now supports Google's Gemini AI models as a new LLM provider option
Thinking budget support - Supports the new Gemini models that accept a thinking budget (#441)
Enhanced error handling - Improved API response validation and error management for Google AI services
Set LLM_PROVIDER: "googleai" and configure your Google AI API credentials to get started

Improvements & Refinements

🔧 LLM Prompt Optimization

Enhanced prompt structure - Added XML-like separators to LLM prompts for improved parsing accuracy and consistency
Better data organization - Input data is now enclosed in structured tags for clearer LLM interpretation
Improved reliability - More consistent results across different document types and LLM providers
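
To illustrate the idea of XML-like separators, here is a minimal Python sketch of a prompt builder. It is not the project's actual template: the tag names (`original_title`, `content`), the instruction text, and both function names are hypothetical and chosen only to show how input data can be kept apart from instructions.

```python
def wrap(tag: str, text: str) -> str:
    """Enclose text in an XML-like tag pair so its boundaries are unambiguous."""
    return f"<{tag}>\n{text.strip()}\n</{tag}>"


def build_title_prompt(document_text: str, original_title: str) -> str:
    """Assemble a prompt with instructions kept apart from the tagged input data.

    The tag names and instruction wording are illustrative assumptions,
    not the templates shipped with paperless-gpt.
    """
    instructions = (
        "Suggest a concise title for the document below. "
        "Reply with the title only."
    )
    return "\n\n".join([
        instructions,
        wrap("original_title", original_title),
        wrap("content", document_text),
    ])


if __name__ == "__main__":
    print(build_title_prompt("Invoice no. 1234 from ACME Corp ...", "scan_0001"))
```

Because each input sits inside its own tag pair, stray text in a document is less likely to be read as part of the instructions.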

📄 PDF Processing Fixes

Fixed split PDF logic - Updated file naming pattern to match pdfcpu output format (removes zero-padding)
Consistent naming - Split files now use simplified naming: original_1.pdf, original_2.pdf, etc.
Better workflow integration - Improved compatibility with existing PDF processing pipelines
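
The naming scheme above can be sketched in a few lines of Python. This is only an illustration of the unpadded `original_1.pdf` pattern described in these notes; the helper names are hypothetical and the project itself does not necessarily compute names this way.

```python
from pathlib import Path


def split_page_name(original: str, page: int) -> str:
    """Return the expected name of one split page, without zero-padding."""
    if page < 1:
        raise ValueError("page numbers start at 1")
    # Path(...).stem drops both the directory part and the .pdf extension.
    return f"{Path(original).stem}_{page}.pdf"


def expected_split_files(original: str, page_count: int) -> list[str]:
    """List the file names expected after splitting a PDF into single pages."""
    return [split_page_name(original, n) for n in range(1, page_count + 1)]


if __name__ == "__main__":
    for name in expected_split_files("original.pdf", 3):
        print(name)
```

Code that looks for split files by a zero-padded name would miss these outputs, which is the mismatch the fix addresses.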

Documentation Updates

📚 Model Recommendations

Updated model suggestions - Documentation now recommends qwen3:8b instead of deepseek-r1:8b for Ollama users
Better performance - qwen3:8b is a more recent model with stronger reasoning capabilities
Improved examples - Updated configuration examples throughout the documentation

Dependencies & Maintenance

🔄 Dependency Updates

testcontainers updated to v10.28.0 - Latest testing framework improvements
globals updated to v16.2.0 - Enhanced JavaScript globals definitions
Automated maintenance - Renovate bot ensures dependencies stay current and secure

Technical Details

What's Changed

adds support for new gemini models with thinking budget #441
refactor: Add XML-like separators to LLM prompts for improved parsing #442
fix: the split pdf logic to be consistent with the output from pdfcpu #435
doc: mention qwen3:8b instead of deepseek-r1:8b #439
chore(deps): update dependency testcontainers to v10.28.0 #425
chore(deps): update dependency globals to v16.2.0 #428

Contributors

Special thanks to @thiswillbeyourgithub, @dawidkulpa, and @moarsmokes for their contributions to this release!

Configuration Notes

For Google Gemini AI: Set the GOOGLE_AI_API_KEY environment variable and set LLM_PROVIDER to "googleai"
No breaking changes - existing configurations continue to work as expected
If you use Ollama, consider switching to the qwen3:8b model for better performance
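
As a rough pre-flight check for the Gemini settings, the following Python sketch reports which variables are missing from an environment. The variable names are copied from these notes and should be verified against the project README; the function name and the tuple of required settings are hypothetical.

```python
import os
from typing import Mapping

# Names as written in these release notes; confirm them in the README.
REQUIRED_FOR_GOOGLEAI = ("GOOGLE_AI_API_KEY",)


def missing_settings(env: Mapping[str, str]) -> list[str]:
    """Return required variables that are unset or empty for the googleai provider."""
    if env.get("LLM_PROVIDER") != "googleai":
        # Other providers are outside the scope of this sketch.
        return []
    return [name for name in REQUIRED_FOR_GOOGLEAI if not env.get(name)]


if __name__ == "__main__":
    missing = missing_settings(os.environ)
    if missing:
        print("Missing settings:", ", ".join(missing))
    else:
        print("Nothing missing for the selected provider.")
```

Existing configurations that use another provider pass through unchanged, in line with the note that this release has no breaking changes.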

New Contributors

@thiswillbeyourgithub made their first contribution in #439
@moarsmokes made their first contribution in #435

Full Changelog: v0.19.0...v0.20.0
