github yusufkaraaslan/Skill_Seekers v1.3.0
v1.3.0 - Refactoring & Performance (2-3x Faster)

latest releases: v2.1.1, v2.1.0, v2.0.0...
one month ago

๐Ÿš€ v1.3.0 - Refactoring & Performance Improvements

Major refactoring release with async support, improved code quality, and better package structure.

๐ŸŽฏ Performance Highlights

  • 2-3x faster scraping with async mode (18 pg/s โ†’ 55 pg/s)
  • 66% less memory (120 MB โ†’ 40 MB)
  • 299 tests (92 new tests added)

โœจ New Features

Async/Await Support for Parallel Scraping

# Enable async mode with 8 workers (recommended for large docs)
python3 cli/doc_scraper.py --config configs/react.json --async --workers 8

Performance Comparison:

  • Sync: ~18 pages/sec, 120 MB memory
  • Async: ~55 pages/sec, 40 MB memory
  • 3x faster with 66% less memory!

Python Package Structure

  • Proper __init__.py files for clean imports
  • cli/ package with organized modules
  • skill_seeker_mcp/ package (renamed from mcp/)
  • Better IDE support and maintainability

Centralized Configuration

  • New cli/constants.py with 18 configuration constants
  • All magic numbers centralized and configurable
  • Easy to customize defaults

๐Ÿ”ง Code Quality Improvements

  • 71 print statements โ†’ proper logging (logger.info, logger.warning, logger.error)
  • Type hints added to all DocToSkillConverter methods
  • mypy type checking - all issues fixed
  • Better error handling with comprehensive logging

๐Ÿ“š Documentation

  • New ASYNC_SUPPORT.md - Complete async guide
  • Updated README.md with async examples
  • Updated CLAUDE.md with technical details
  • Comprehensive CHANGELOG.md

๐Ÿงช Testing

  • 299 tests passing (was 207)
  • 92 new tests added:
    • 11 async scraping tests
    • 26 integration tests
    • 13 llms.txt tests
    • 21 constants tests
    • 21 package structure tests
  • 100% test pass rate
  • Fixed test isolation issues

๐Ÿ”„ Breaking Changes

None! This is a backwards-compatible refactoring release.

๐Ÿ“ฆ What's Changed

Added

  • Async/await support with --async flag
  • Connection pooling for better performance
  • asyncio.Semaphore for concurrency control
  • Python package structure with proper imports
  • Centralized configuration module
  • Type hints throughout codebase
  • Comprehensive test coverage

Changed

  • All print() โ†’ logging calls
  • Better IDE support with package structure
  • Code quality improved from 5.5/10 to 6.5/10
  • Test count: 207 โ†’ 299

Fixed

  • Test isolation issues
  • Import issues (no more sys.path.insert hacks)
  • All mypy type checking issues

๐Ÿ“– Full Changelog

See CHANGELOG.md for complete details.

๐Ÿ™ Acknowledgments

This refactoring was completed as Phase 0 of our development roadmap, setting a solid foundation for future features.


Installation:

git clone https://github.com/yusufkaraaslan/Skill_Seekers.git
cd Skill_Seekers
pip install -r requirements.txt

Quick Start:

# Try async mode
python3 cli/doc_scraper.py --config configs/react.json --async --workers 8

๐Ÿค– Generated with Claude Code

Don't miss a new Skill_Seekers release

NewReleases is sending notifications on new releases.