github jundot/omlx v0.1.6
oMLX v0.1.6

one month ago

Breaking Changes

Beautiful Benchmark Tool

One-click performance benchmarking right from the Admin Panel — easily measure the performance of any model you want to use!

What's New

Benchmark

  • Add built-in benchmark tool to the admin panel with prefill (PP) and text generation (TG) tokens-per-second (TPS) metrics
  • Support partial prefix cache hit measurement for realistic benchmarking
  • Add text export for benchmark results
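
For readers unfamiliar with the PP and TG figures, the sketch below shows one common way such numbers are derived from token counts and wall-clock timings. It is an illustration only, not oMLX's implementation: the `BenchmarkRun` class, its field names, and the choice to count all prompt tokens (cached or not) in the prefill rate are assumptions.

```python
from dataclasses import dataclass


@dataclass
class BenchmarkRun:
    """Hypothetical record of one benchmark run (not an oMLX type)."""
    prompt_tokens: int         # tokens in the prompt
    cached_tokens: int         # prompt tokens served from the prefix cache
    generated_tokens: int      # tokens produced during generation
    prefill_seconds: float     # wall-clock time spent on prefill
    generation_seconds: float  # wall-clock time spent generating


def prefill_tps(run: BenchmarkRun) -> float:
    # PP: prompt tokens processed per second of prefill time.
    return run.prompt_tokens / run.prefill_seconds


def generation_tps(run: BenchmarkRun) -> float:
    # TG: generated tokens per second of generation time.
    return run.generated_tokens / run.generation_seconds


def cache_hit_ratio(run: BenchmarkRun) -> float:
    # Fraction of the prompt that was a prefix cache hit.
    return run.cached_tokens / run.prompt_tokens


def format_report(run: BenchmarkRun) -> str:
    # Plain-text export of the three figures, one per line.
    return "\n".join([
        f"PP: {prefill_tps(run):.1f} tok/s",
        f"TG: {generation_tps(run):.1f} tok/s",
        f"Prefix cache hit: {cache_hit_ratio(run):.0%}",
    ])
```

A partial prefix cache hit shortens the prefill time, so reporting the hit ratio next to PP helps explain why two runs with the same prompt length can show different prefill rates.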

Admin panel

  • Split monolithic dashboard into tab-based partials for better maintainability
  • Vendor all CDN dependencies for fully offline admin panel support

Performance

  • Optimize scheduler hot path for higher token throughput (TPS)
  • Add async background write for SSD paged cache to reduce write latency
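
The general idea behind a background cache write can be sketched with a queue and a worker thread: the caller hands off the data and returns immediately, while the worker does the slow disk I/O. This is a minimal illustration under assumed names (`BackgroundWriter`, `submit`, `close`), not the code oMLX uses for its SSD paged cache.

```python
import queue
import threading
from pathlib import Path


class BackgroundWriter:
    """Hypothetical helper: writes byte blobs to disk on a worker thread."""

    def __init__(self, directory):
        self._dir = Path(directory)
        self._queue = queue.Queue()
        self.errors = []  # I/O errors collected by the worker
        self._thread = threading.Thread(target=self._drain, daemon=True)
        self._thread.start()

    def submit(self, name: str, data: bytes) -> None:
        # Returns as soon as the item is queued; the write happens later.
        self._queue.put((name, data))

    def _drain(self) -> None:
        while True:
            item = self._queue.get()
            try:
                if item is None:  # sentinel: stop the worker
                    return
                name, data = item
                (self._dir / name).write_bytes(data)
            except OSError as exc:
                # Record the failure and keep serving later writes.
                self.errors.append(exc)
            finally:
                self._queue.task_done()

    def close(self) -> None:
        # Flush everything queued so far, then stop the worker.
        self._queue.put(None)
        self._queue.join()
        self._thread.join()
```

The trade-off in this design is that a write is not durable when `submit` returns, so callers that need the data on disk must call `close` (or otherwise wait) before relying on it.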

Memory management

  • Add process-level memory enforcement (Memory Limit Total) to prevent system-wide OOM

Multiple model directories

  • Support multiple model directories for organizing models across different paths
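
As a rough illustration of what scanning several model directories can look like, the sketch below merges the results into one name-to-path mapping. The function name `discover_models`, the rule that a model is a subdirectory containing `config.json`, and the first-directory-wins handling of duplicate names are all assumptions made for the example, not documented oMLX behavior.

```python
from pathlib import Path


def discover_models(model_dirs):
    """Map model name -> path across several root directories (hypothetical)."""
    found = {}
    for root in model_dirs:
        root = Path(root).expanduser()
        if not root.is_dir():
            continue  # skip missing or unmounted paths
        for child in sorted(root.iterdir()):
            # Assumption: a model is a folder holding a config.json file.
            is_model = (child / "config.json").is_file()
            if is_model and child.name not in found:
                found[child.name] = child  # earlier directories take priority
    return found
```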

SSD cache

  • Fix CacheList sub_meta_states sanitization for correct KVCache reconstruction

Engine

  • Upgrade mlx-lm to commit 179da77
