- Now runs on any available port instead of the hardcoded 3001 (tries 3826 first as the default, since it is rarely used, then looks for another free port if it is busy)
- Added a "Compress" button at the top of each chat that uses the Expert model to summarize a long conversation history, saving tokens as the history grows
- Added support for MCP servers: you can now add multiple MCP servers and use them in chats to provide more tools
- Additional fixes to the token counter: it now updates in real time, even while the agentic loop is still running
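
The port fallback described in the first item can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function name `find_available_port`, the constant `PREFERRED_PORT`, and the choice to let the OS pick the fallback port (by binding to port 0) are all assumptions made for the example.

```python
import socket

# 3826 is the default named in the changelog; everything else here is hypothetical.
PREFERRED_PORT = 3826


def find_available_port(preferred: int = PREFERRED_PORT, host: str = "127.0.0.1") -> int:
    """Return `preferred` if it can be bound, otherwise an OS-assigned free port."""
    # Port 0 asks the operating system to choose any free port.
    for candidate in (preferred, 0):
        with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
            try:
                sock.bind((host, candidate))
            except OSError:
                continue  # port is busy or not permitted; try the next candidate
            # Read back the port that was actually bound (matters for port 0).
            return sock.getsockname()[1]
    raise RuntimeError("no free port found")


if __name__ == "__main__":
    print(f"Listening port would be {find_available_port()}")
```

Note that the probe socket is closed before the port number is returned, so another process could still claim the port before the real server binds it. A production version would more likely bind the server socket directly and retry on failure.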