github HKUDS/DeepTutor v1.0.0-beta.1
DeepTutor-1.0.0-beta.1

6 hours ago

๐Ÿš€ DeepTutor v1.0.0-beta1 Release Notes

Release Date: 2026.04.04

We're thrilled to announce DeepTutor v1.0.0-beta1 โ€” the first beta of the brand new DeepTutor architecture. This is a ground-up rewrite that transforms DeepTutor from a monolithic RAG tutor into an agent-native learning platform with a two-layer plugin model (Tools + Capabilities), three unified entry points (CLI / WebSocket / Python SDK), and a completely rebuilt web application shell. Find more surprises by your own!

โš ๏ธ Beta Notice: This is beta 1 of v1.0.0. The core architecture is stable, but some UI interactions and edge-case workflows may still contain bugs. We appreciate your patience and welcome bug reports via Issues.

๐Ÿ“Œ Knowledge Base Note: In this release, the RAG pipeline has been simplified to LlamaIndex only. LightRAG and RAG-Anything pipelines along with their related knowledge base content have been temporarily removed to focus on stability. They will be re-introduced in upcoming releases.

Tip

Call for Feedback: If you encounter any bugs or have feature requests, please open an issue! PRs are welcome โ€” see our Contributing Guide.

Diff Scope: main...dev (903 files changed, 92,701 insertions, 73,749 deletions)


Quick Summary

  • Architecture โ€” Complete rewrite from src/ to deeptutor/ + deeptutor_cli/ with agent-native runtime (Tools + Capabilities).
  • Entry Points โ€” Three unified entry points: standalone CLI (deeptutor), WebSocket API (/api/v1/ws), and Python SDK facade.
  • Capabilities โ€” Five built-in capabilities: chat, deep_solve, deep_question, deep_research, math_animator.
  • Tools โ€” Seven LLM-callable tools: rag, web_search, code_execution, reason, brainstorm, paper_search, geogebra_analysis.
  • Web App โ€” Rebuilt Next.js app with workspace/utility route groups, new Playground, Co-Writer, Agents, and Guide pages.
  • TutorBot โ€” Multi-channel bot agent supporting 12 messaging platforms.
  • Infra โ€” SQLite-backed session persistence, turn runtime, provider-level LLM traffic control and telemetry.

โœจ Highlights

๐Ÿ—๏ธ Agent-Native Runtime (Tools + Capabilities)

Introduced a two-layer plugin model that decouples tool execution from high-level agent workflows:

  • Core Contracts: ToolProtocol, CapabilityProtocol, UnifiedContext, StreamEvent, and StreamBus โ€” the foundation of all runtime execution.
  • ChatOrchestrator: Central coordinator with two registries:
    • ToolRegistry โ€” tool discovery, OpenAI-style schema export, and execution.
    • CapabilityRegistry โ€” capability routing, manifest management, and stage-aware streaming.

๐Ÿ–ฅ๏ธ Unified Entry Points: CLI / WebSocket / Python SDK

Three entry points share a single ChatOrchestrator runtime:

Entry Point Description
CLI (deeptutor) Typer-based CLI with sub-commands: run, chat, bot, kb, memory, session, notebook, plugin, config, provider, serve
WebSocket (/api/v1/ws) Unified endpoint with turn lifecycle: start_turn, subscribe_turn, subscribe_session, resume_from, cancel_turn
Python SDK (deeptutor.app.facade) Programmatic facade for SDK-style integrations

๐Ÿง  Capability Layer

Five built-in capabilities, each a multi-step agent pipeline:

Capability Stages Description
chat responding Default tool-augmented conversation
deep_solve planning โ†’ reasoning โ†’ writing Multi-stage problem solving
deep_question ideation โ†’ evaluation โ†’ generation โ†’ validation Intelligent question generation with follow-up mode
deep_research search โ†’ analyze โ†’ synthesize โ†’ report Multi-agent research with report generation
math_animator analysis โ†’ design โ†’ codegen โ†’ review โ†’ render Manim-based math concept video generation

๐Ÿ”ง Tooling System

Seven unified LLM-callable tools with bilingual prompt hints (en/zh):

Tool Description
rag Knowledge base retrieval via LlamaIndex
web_search 10 search providers: Tavily, Exa, Jina, Serper, Perplexity, Brave, Baidu, SearXNG, DuckDuckGo, OpenRouter
code_execution Sandboxed Python execution with AST-based safety guards
reason Dedicated deep-reasoning LLM call
brainstorm Breadth-first idea exploration with structured rationale
paper_search arXiv academic paper search

๐Ÿค– TutorBot โ€” Multi-Channel Bot Agent

New autonomous bot system (deeptutor/tutorbot/) that brings DeepTutor to messaging platforms:

  • 12 Channels: Telegram, Discord, Slack, WeChat Work (WeCom), Feishu, DingTalk, WhatsApp, Matrix, QQ, Email, MoChat
  • Agent Loop: Tool-augmented LLM loop with memory, subagent spawning, and team collaboration
  • Built-in Tools: Shell, filesystem, web, MCP, cron, and message tools
  • Background Services: Heartbeat health checks and cron-based scheduled tasks

๐ŸŒ Web Application Restructure

Complete rebuild of the Next.js frontend with new route groups:

Workspace Routes ((workspace)/):

Page Description
Home (/) Main chat interface with tool-augmented conversation
Guide (/guide) Interactive learning guide with session history, progress tracking, and completion summaries
Playground (/playground) Unified deep capability UI (deep_solve, deep_question, deep_research, math_animator)
Co-Writer (/co-writer) AI-assisted collaborative writing with edit and narrator agents
Agents (/agents) TutorBot management โ€” create, configure, and chat with custom bots

Utility Routes ((utility)/):

Page Description
Knowledge (/knowledge) Knowledge base management with LlamaIndex pipeline
Memory (/memory) User memory and preference management
Settings (/settings) Unified configuration for LLM, Embedding, TTS, and Search services

๐Ÿญ Service Infrastructure Rebuild

Refactored services into clearer domains:

deeptutor/services/
โ”œโ”€โ”€ config/       # Environment store, model catalog, provider runtime
โ”œโ”€โ”€ llm/          # Multi-provider LLM: factory, registry, traffic control, telemetry
โ”œโ”€โ”€ embedding/    # Adapter-based: OpenAI-compatible, Cohere, Jina, Ollama
โ”œโ”€โ”€ rag/          # LlamaIndex pipeline with component-based architecture
โ”œโ”€โ”€ search/       # 10 web search providers with result consolidation
โ”œโ”€โ”€ session/      # SQLite store, turn runtime, context builder
โ”œโ”€โ”€ memory/       # User memory persistence
โ”œโ”€โ”€ notebook/     # Notebook management
โ”œโ”€โ”€ prompt/       # Bilingual prompt template manager (en/zh)
โ”œโ”€โ”€ settings/     # Interface settings
โ”œโ”€โ”€ setup/        # Application initialization
โ”œโ”€โ”€ tutorbot/     # TutorBot management
โ””โ”€โ”€ path_service  # Centralized data path resolution

๐Ÿ”’ Security & Stability

  • Code Execution Safety: AST-based import/call guards with configurable allowlists.
  • LLM Traffic Control: Provider-level circuit breaker, error rate tracking, and retry mechanisms.
  • Startup Validation: Capability-to-tool consistency checks at boot time.

๐Ÿงช Test Coverage

53+ new test files across all major layers: runtime (tool/capability registry, orchestrator), services (LLM provider/factory/routing/telemetry, RAG pipeline, embedding, search, session, memory, notebook, config), agents (chat, solve, question, math_animator), API (knowledge, memory, solve, WebSocket turn runtime), CLI, and tools (code executor safety).


โš ๏ธ Breaking Changes

  • Package layout: src/ โ†’ deeptutor/ + deeptutor_cli/. Old src/ directory fully removed (140 files).
  • Package renamed: ai-tutor โ†’ deeptutor, version 1.0.0.
  • Runtime model: Capability-native orchestration. chat is the default; deep modes selected explicitly via run command or WebSocket.
  • Web routes: All pages reorganized under (workspace)/ and (utility)/. Legacy pages (/solver, /question, /research, /ideagen, /notebook, /history) removed.
  • RAG pipeline: Only LlamaIndex available. LightRAG and RAG-Anything temporarily removed.
  • Data layout: Runtime data centered under data/user/workspace/....
  • Dependencies: Split into layered requirements: cli.txt, server.txt, dev.txt, math-animator.txt, tutorbot.txt.

๐Ÿ“ฆ What's Changed

  • Complete codebase rewrite with agent-native architecture (DeepTutor 2.0).
  • Two-layer plugin model (Tools + Capabilities) with ChatOrchestrator coordinator.
  • Standalone CLI package (deeptutor_cli/) with 11 sub-commands via Typer.
  • Unified WebSocket endpoint with turn lifecycle and session streaming.
  • 5 built-in capabilities and 7 LLM-callable tools with bilingual prompt hints.
  • TutorBot multi-channel bot agent with 12 platform integrations.
  • Rebuilt web app with workspace/utility route groups and new Playground, Co-Writer, Agents, and Guide pages.
  • Service infrastructure rebuild: LLM provider registry, embedding adapters, SQLite session store, memory, notebook, and search consolidation.
  • AST-based code execution safety, LLM traffic control, and provider telemetry.
  • 53+ test files across runtime, services, agents, API, CLI, and tools.
  • Updated Docker configuration and layered dependency management.

New Contributors

Full Changelog: v0.6.0...v1.0.0-beta.1

Don't miss a new DeepTutor release

NewReleases is sending notifications on new releases.