github Trans-N-ai/swama v1.1.0

3 months ago

🆕 What's New

Text Embeddings Support

  • New /v1/embeddings API endpoint - Full OpenAI compatibility
  • Built-in embedding generation for semantic search and RAG applications
  • Batch processing with automatic padding and optimization
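For readers unfamiliar with batch padding, the idea can be sketched as follows. This is a minimal illustration in Python, not Swama's implementation: `pad_batch` and `pad_id` are hypothetical names, and the token IDs are made up. Shorter sequences are right-padded to the length of the longest one, and a mask records which positions hold real tokens.

```python
from typing import List, Tuple


def pad_batch(
    token_ids: List[List[int]], pad_id: int = 0
) -> Tuple[List[List[int]], List[List[int]]]:
    """Right-pad every sequence to the longest one; return (padded, mask)."""
    max_len = max((len(seq) for seq in token_ids), default=0)
    padded, mask = [], []
    for seq in token_ids:
        gap = max_len - len(seq)
        padded.append(seq + [pad_id] * gap)      # real tokens, then padding
        mask.append([1] * len(seq) + [0] * gap)  # 1 = real token, 0 = padding
    return padded, mask
```

The mask lets a model ignore padded positions when it pools token vectors into a single embedding.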

Enhanced Chat Capabilities

  • System prompts support - Define AI assistant behavior with system messages
  • Multi-turn conversations - Maintain conversation history across requests
  • Full OpenAI Chat Completions API compatibility - Drop-in replacement for chat applications

Intelligent Memory Management

  • Automatic model eviction - Prevents GPU memory exhaustion with smart cleanup
  • Usage-based prioritization - Keeps frequently used models in memory
  • Concurrent inference control - Safe parallel processing with per-model locking
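The three ideas above can be sketched together in a few lines. This is an illustration only, not Swama's actual code: `ModelCache`, `budget_bytes`, and `acquire` are hypothetical names, and least-recently-used ordering stands in for whatever usage-based policy Swama really applies.

```python
import threading
from collections import OrderedDict


class ModelCache:
    """Hypothetical cache that evicts least recently used models over budget."""

    def __init__(self, budget_bytes: int):
        self.budget_bytes = budget_bytes
        self._models = OrderedDict()    # name -> size in bytes, oldest first
        self._locks = {}                # name -> per-model inference lock
        self._guard = threading.Lock()  # protects the two maps above

    def acquire(self, name: str, size_bytes: int) -> threading.Lock:
        """Mark `name` as just used, evict idle models if needed, return its lock."""
        with self._guard:
            if name in self._models:
                self._models.move_to_end(name)
            else:
                self._models[name] = size_bytes
                self._locks[name] = threading.Lock()
            for candidate in list(self._models):
                if sum(self._models.values()) <= self.budget_bytes:
                    break
                if candidate == name or self._locks[candidate].locked():
                    continue  # never evict the requested model or a busy one
                del self._models[candidate]
                del self._locks[candidate]
            return self._locks[name]

    def loaded(self):
        """Return the names of cached models, oldest first."""
        with self._guard:
            return list(self._models)
```

A caller would run inference inside `with cache.acquire(name, size):` so that requests to the same model are serialized while different models can run in parallel.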

🚀 Usage

Generate Embeddings

curl -X POST http://localhost:28100/v1/embeddings \
  -H "Content-Type: application/json" \
  -d '{
    "input": ["Hello world", "Text embeddings"],
    "model": "mlx-community/Qwen3-Embedding-0.6B-4bit-DWQ"
  }'
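The same request can be made from Python and the returned vectors compared for semantic search. The sketch below uses only the standard library. It assumes the response follows the OpenAI embeddings shape (a `data` list of objects with `index` and `embedding` fields), which the compatibility claim above suggests but these notes do not show. The helper names are hypothetical.

```python
import json
import math
import urllib.request

BASE_URL = "http://localhost:28100"  # host and port taken from the curl example


def build_embeddings_request(texts, model):
    """Build (but do not send) a POST request for /v1/embeddings."""
    body = json.dumps({"input": texts, "model": model}).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/v1/embeddings",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def extract_vectors(response_json):
    """Assumed OpenAI shape: {"data": [{"index": i, "embedding": [...]}, ...]}."""
    items = sorted(response_json["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def embed(texts, model):
    """Send the request; needs a running Swama server."""
    with urllib.request.urlopen(build_embeddings_request(texts, model)) as resp:
        return extract_vectors(json.load(resp))
```

With a server running, `embed(["Hello world", "Text embeddings"], "mlx-community/Qwen3-Embedding-0.6B-4bit-DWQ")` would return one vector per input, and `cosine_similarity` can rank them against a query vector.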

Chat with System Prompts

curl -X POST http://localhost:28100/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {"role": "system", "content": "You are a helpful math tutor."},
      {"role": "user", "content": "Explain quadratic equations."}
    ],
    "model": "qwen3"
  }'

Multi-turn Conversations

curl -X POST http://localhost:28100/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {"role": "user", "content": "My name is Alice."},
      {"role": "assistant", "content": "Nice to meet you, Alice!"},
      {"role": "user", "content": "What is my name?"}
    ],
    "model": "qwen3"
  }'
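As the request above shows, the client sends the earlier turns along with each new message. A small helper can keep that history and produce the JSON body. This is a sketch under that assumption, and `Conversation` and its methods are hypothetical names, not part of Swama.

```python
import json


class Conversation:
    """Hypothetical client-side helper that accumulates chat messages."""

    def __init__(self, model, system_prompt=None):
        self.model = model
        self.messages = []
        if system_prompt:
            self.messages.append({"role": "system", "content": system_prompt})

    def add_user(self, text):
        """Record a user turn."""
        self.messages.append({"role": "user", "content": text})

    def add_assistant(self, text):
        """Record the model's reply so the next request includes it."""
        self.messages.append({"role": "assistant", "content": text})

    def payload(self):
        """Return the JSON body for POST /v1/chat/completions."""
        return json.dumps({"messages": self.messages, "model": self.model})
```

After each response, pass the reply text to `add_assistant` before the next `add_user` call. In the OpenAI response format the reply would be found at `choices[0].message.content`, which is assumed here rather than shown in these notes.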

📦 Download

Download Swama v1.1.0

🔄 Upgrade Notes

  • If you are upgrading from a previous version, install the new version, then open Swama from the menu bar and click "Install Command Line Tool…" to update the CLI tools

🔧 Requirements

  • macOS 14.0+
  • Apple Silicon (M1/M2/M3/M4)

Full Changelog: v1.0.0...v1.1.0
