🚀 GitHub Repository Analysis Enhancements
This release significantly improves GitHub repository scraping with unlimited local analysis, configurable directory exclusions, and numerous bug fixes.
✨ New Features
- Configurable directory exclusions for local repository analysis (#203)
  - `exclude_dirs_additional`: Extend default exclusions with custom directories
  - `exclude_dirs`: Replace default exclusions entirely (advanced users)
  - 19 comprehensive tests covering all scenarios
  - Logging: INFO for extend mode, WARNING for replace mode
- Unlimited local repository analysis via the `local_repo_path` configuration parameter
- Auto-exclusion of virtual environments, build artifacts, and cache directories
- Support for analyzing repositories without GitHub API rate limits (50 → unlimited files)
- Skip llms.txt option - Force HTML scraping even when llms.txt is detected (#198)
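The difference between the two exclusion parameters can be sketched in a few lines of Python. This is a minimal illustration, not the project's actual code: the `DEFAULT_EXCLUDED_DIRS` contents, the function names, and the rule that `exclude_dirs` wins when both keys are present are all assumptions made for the example. Only the two parameter names and the INFO/WARNING log levels come from these notes.

```python
import logging
import os

logger = logging.getLogger("exclusion_sketch")

# Hypothetical defaults; the tool's real default list is not given in these notes.
DEFAULT_EXCLUDED_DIRS = {"venv", ".venv", "node_modules", "__pycache__", ".git", "build", "dist"}


def resolve_excluded_dirs(config):
    """Return the set of directory names to skip for a given config dict."""
    if "exclude_dirs" in config:
        # Replace mode: the defaults are discarded entirely, so log a WARNING.
        excluded = set(config["exclude_dirs"])
        logger.warning("Replacing default exclusions with: %s", sorted(excluded))
        return excluded
    extra = set(config.get("exclude_dirs_additional", []))
    if extra:
        # Extend mode: defaults are kept and the custom names are added.
        logger.info("Extending default exclusions with: %s", sorted(extra))
    return DEFAULT_EXCLUDED_DIRS | extra


def iter_source_files(root, excluded):
    """Yield file paths under root, skipping excluded directories."""
    for dirpath, dirnames, filenames in os.walk(root):
        # Mutating dirnames in place stops os.walk from descending into them.
        dirnames[:] = [d for d in dirnames if d not in excluded]
        for name in filenames:
            yield os.path.join(dirpath, name)
```

Pruning `dirnames` in place means excluded trees such as a virtual environment are never traversed at all, which matters more for large local repositories than filtering paths after the walk.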
🐛 Bug Fixes
- Fixed logger initialization error causing `AttributeError: 'NoneType' object has no attribute 'setLevel'` (#190)
- Fixed 3 NoneType subscriptable errors in release tag parsing
- Fixed relative import paths causing `ModuleNotFoundError`
- Fixed hardcoded 50-file analysis limit preventing comprehensive code analysis
- Fixed GitHub API file tree limitation (140 → 345 files discovered)
- Fixed AST parser "not iterable" errors, eliminating all parsing failures (95 → 0 errors)
- Fixed virtual environment file pollution, reducing file tree noise by 95%
- Fixed `force_rescrape` flag not checked before interactive prompt causing EOFError in CI/CD environments
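The `force_rescrape` fix comes down to ordering: consult the flag before anything reads from standard input. The sketch below shows one way that ordering might look. The function name `should_rescrape`, its parameters, and the fallback of reusing existing data on `EOFError` are hypothetical choices for illustration and are not taken from the project's source.

```python
import os


def should_rescrape(output_dir, force_rescrape=False, ask=input):
    """Decide whether to scrape again when output_dir may already exist."""
    if not os.path.exists(output_dir):
        return True  # nothing cached, so scraping is required
    if force_rescrape:
        return True  # flag is checked first, so no prompt is ever shown
    try:
        answer = ask(f"{output_dir} already exists. Rescrape? (y/n): ")
    except EOFError:
        # stdin is closed (typical in CI/CD): assumed fallback is to reuse data.
        return False
    return answer.strip().lower() in ("y", "yes")
```

Passing the prompt function as `ask` keeps the decision logic testable without a terminal, and the `EOFError` guard covers non-interactive runs that omit the flag.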
📈 Improvements
- Code analysis coverage: 14% → 93.6% (+79.6 percentage points)
- File discovery: 140 → 345 files (+146%)
- Class extraction: 55 → 585 classes (+964%)
- Function extraction: 512 → 2,784 functions (+444%)
- Test suite: Expanded to 427 tests (up from 391)
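The figures above mix two kinds of change: coverage is reported as a difference in percentage points, while the file, class, and function counts are reported as relative increases. A short check of the arithmetic, using only the numbers listed above (the helper name `relative_change` is just for this example):

```python
def relative_change(before, after):
    """Percentage change from before to after."""
    return (after - before) / before * 100


metrics = {"files": (140, 345), "classes": (55, 585), "functions": (512, 2784)}
for name, (before, after) in metrics.items():
    print(f"{name}: {before} -> {after} ({relative_change(before, after):+.0f}%)")
# files: 140 -> 345 (+146%)
# classes: 55 -> 585 (+964%)
# functions: 512 -> 2784 (+444%)

# Coverage is a plain difference between two percentages, in percentage points.
coverage_gain = round(93.6 - 14.0, 1)
print(f"coverage: +{coverage_gain} percentage points")
# coverage: +79.6 percentage points
```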
📦 Installation
# Install from PyPI (recommended)
pip install skill-seekers==2.1.1
# Or upgrade existing installation
pip install --upgrade skill-seekers
📚 Documentation
- CHANGELOG.md - Full changelog
- README.md - Complete documentation
- CLAUDE.md - Technical architecture
Full Changelog: v2.1.0...v2.1.1