Release Highlights 🚀
New Features
🧠 Google Gemini AI Integration
- Added Google Gemini AI support - Paperless-GPT now supports Google's Gemini AI models as a new LLM provider option
- Thinking budget support - Leverages Gemini's new thinking capabilities for enhanced document processing
- Enhanced error handling - Improved API response validation and error management for Google AI services
- Set
LLM_PROVIDER: "googleai"
and configure your Google AI API credentials to get started
Improvements & Refinements
🔧 LLM Prompt Optimization
- Enhanced prompt structure - Added XML-like separators to LLM prompts for improved parsing accuracy and consistency
- Better data organization - Input data is now enclosed in structured tags for clearer LLM interpretation
- Improved reliability - More consistent results across different document types and LLM providers
📄 PDF Processing Fixes
- Fixed split PDF logic - Updated file naming pattern to match
pdfcpu
output format (removes zero-padding) - Consistent naming - Split files now use simplified naming:
original_1.pdf
,original_2.pdf
, etc. - Better workflow integration - Improved compatibility with existing PDF processing pipelines
Documentation Updates
📚 Model Recommendations
- Updated model suggestions - Documentation now recommends
qwen3:8b
instead ofdeepseek-r1:8b
for Ollama users - Better performance -
qwen3:8b
offers more recent and powerful reasoning capabilities - Improved examples - Updated configuration examples throughout the documentation
Dependencies & Maintenance
🔄 Dependency Updates
- testcontainers updated to v10.28.0 - Latest testing framework improvements
- globals updated to v16.2.0 - Enhanced JavaScript globals definitions
- Automated maintenance - Renovate bot ensures dependencies stay current and secure
Technical Details
What's Changed
- adds support for new gemini models with thinking budget #441
- refactor: Add XML-like separators to LLM prompts for improved parsing #442
- fix: the split pdf logic to be consistent with the output from pdfcpu #435
- doc: mention qwen3:8b instead of deepseek-r1:8b #439
- chore(deps): update dependency testcontainers to v10.28.0 #425
- chore(deps): update dependency globals to v16.2.0 #428
Contributors
Special thanks to @thiswillbeyourgithub, @dawidkulpa, and @moarsmokes for their contributions to this release!
Configuration Notes
- For Google Gemini AI: Set environment variables for
GOOGLE_AI_API_KEY
and configureLLM_PROVIDER: "googleai"
- No breaking changes - existing configurations continue to work as expected
- Consider updating to
qwen3:8b
model if using Ollama for better performance
New Contributors
- @thiswillbeyourgithub made their first contribution in #439
- @moarsmokes made their first contribution in #435
Full Changelog: v0.19.0...v0.20.0