Refactor features section in README for improved clarity and organization

2025-09-18 20:14:03 +02:00
parent 8006dd3841
commit 84c3453281

@@ -4,17 +4,24 @@
 **Management server and proxy for multiple llama.cpp and MLX instances with OpenAI-compatible API routing.**

-## Why llamactl?
+## Features

-🚀 **Multiple Model Serving**: Run different models simultaneously (7B for speed, 70B for quality)
-🔗 **OpenAI API Compatible**: Drop-in replacement - route requests by model name
-🍎 **Multi-Backend Support**: Native support for both llama.cpp and MLX (Apple Silicon optimized)
-🌐 **Web Dashboard**: Modern React UI for visual management (unlike CLI-only tools)
-🔐 **API Key Authentication**: Separate keys for management vs inference access
-📊 **Instance Monitoring**: Health checks, auto-restart, log management
-**Smart Resource Management**: Idle timeout, LRU eviction, and configurable instance limits
-💡 **On-Demand Instance Start**: Automatically launch instances upon receiving OpenAI-compatible API requests
-💾 **State Persistence**: Ensure instances remain intact across server restarts
+### 🚀 Easy Model Management
+- **Multiple Model Serving**: Run different models simultaneously (7B for speed, 70B for quality)
+- **On-Demand Instance Start**: Automatically launch instances upon receiving API requests
+- **State Persistence**: Ensure instances remain intact across server restarts
+
+### 🔗 Universal Compatibility
+- **OpenAI API Compatible**: Drop-in replacement - route requests by model name
+- **Multi-Backend Support**: Native support for both llama.cpp and MLX (Apple Silicon optimized)
+
+### 🌐 User-Friendly Interface
+- **Web Dashboard**: Modern React UI for visual management (unlike CLI-only tools)
+- **API Key Authentication**: Separate keys for management vs inference access
+
+### ⚡ Smart Operations
+- **Instance Monitoring**: Health checks, auto-restart, log management
+- **Smart Resource Management**: Idle timeout, LRU eviction, and configurable instance limits

 ![Dashboard Screenshot](docs/images/dashboard.png)
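
Reviewer note: the "Smart Resource Management" bullet names idle timeout, LRU eviction, and configurable instance limits without showing how they interact. The sketch below is one possible reading, not llamactl's actual implementation: `InstancePool`, `touch`, `reap_idle`, and the model names are all hypothetical, and time is passed in explicitly so the behaviour is easy to follow.

```python
from collections import OrderedDict


class InstancePool:
    """Hypothetical pool of running instances (illustrative only, not llamactl's API)."""

    def __init__(self, max_running, idle_timeout):
        self.max_running = max_running    # configurable instance limit
        self.idle_timeout = idle_timeout  # seconds without requests before a stop
        self._last_used = OrderedDict()   # name -> last request time, oldest first

    def touch(self, name, now):
        """Record a request for `name`, starting it on demand.

        Returns the names of instances evicted to make room.
        """
        evicted = []
        if name in self._last_used:
            # Already running: mark it as the most recently used.
            self._last_used.move_to_end(name)
        else:
            # At the limit: evict least recently used instances first.
            while self._last_used and len(self._last_used) >= self.max_running:
                oldest, _ = self._last_used.popitem(last=False)
                evicted.append(oldest)
        self._last_used[name] = now
        return evicted

    def reap_idle(self, now):
        """Stop every instance that has been idle longer than the timeout."""
        idle = [n for n, t in self._last_used.items() if now - t > self.idle_timeout]
        for n in idle:
            del self._last_used[n]
        return idle

    def running(self):
        return list(self._last_used)


pool = InstancePool(max_running=2, idle_timeout=300)
pool.touch("llama-7b", now=0)
pool.touch("llama-70b", now=10)
pool.touch("llama-7b", now=20)            # refreshes llama-7b
evicted = pool.touch("mlx-8b", now=30)    # limit reached, llama-70b is oldest
reaped = pool.reap_idle(now=330)          # llama-7b idle for 310 s
```

With a limit of two, the third distinct model evicts whichever instance was used least recently, and the later sweep stops anything idle past the timeout. A real server would also need to handle in-flight requests before stopping an instance, which this sketch ignores.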