mirror of
https://github.com/lordmathis/llamactl.git
synced 2025-11-06 00:54:23 +00:00
Refactor features section in README for improved clarity and organization
This commit is contained in:
27
README.md
27
README.md
@@ -4,17 +4,24 @@
|
|||||||
|
|
||||||
**Management server and proxy for multiple llama.cpp and MLX instances with OpenAI-compatible API routing.**
|
**Management server and proxy for multiple llama.cpp and MLX instances with OpenAI-compatible API routing.**
|
||||||
|
|
||||||
## Why llamactl?
|
## Features
|
||||||
|
|
||||||
🚀 **Multiple Model Serving**: Run different models simultaneously (7B for speed, 70B for quality)
|
### 🚀 Easy Model Management
|
||||||
🔗 **OpenAI API Compatible**: Drop-in replacement - route requests by model name
|
- **Multiple Model Serving**: Run different models simultaneously (7B for speed, 70B for quality)
|
||||||
🍎 **Multi-Backend Support**: Native support for both llama.cpp and MLX (Apple Silicon optimized)
|
- **On-Demand Instance Start**: Automatically launch instances upon receiving API requests
|
||||||
🌐 **Web Dashboard**: Modern React UI for visual management (unlike CLI-only tools)
|
- **State Persistence**: Ensure instances remain intact across server restarts
|
||||||
🔐 **API Key Authentication**: Separate keys for management vs inference access
|
|
||||||
📊 **Instance Monitoring**: Health checks, auto-restart, log management
|
### 🔗 Universal Compatibility
|
||||||
⚡ **Smart Resource Management**: Idle timeout, LRU eviction, and configurable instance limits
|
- **OpenAI API Compatible**: Drop-in replacement - route requests by model name
|
||||||
💡 **On-Demand Instance Start**: Automatically launch instances upon receiving OpenAI-compatible API requests
|
- **Multi-Backend Support**: Native support for both llama.cpp and MLX (Apple Silicon optimized)
|
||||||
💾 **State Persistence**: Ensure instances remain intact across server restarts
|
|
||||||
|
### 🌐 User-Friendly Interface
|
||||||
|
- **Web Dashboard**: Modern React UI for visual management (unlike CLI-only tools)
|
||||||
|
- **API Key Authentication**: Separate keys for management vs inference access
|
||||||
|
|
||||||
|
### ⚡ Smart Operations
|
||||||
|
- **Instance Monitoring**: Health checks, auto-restart, log management
|
||||||
|
- **Smart Resource Management**: Idle timeout, LRU eviction, and configurable instance limits
|
||||||
|
|
||||||

|

|
||||||
|
|
||||||
|
|||||||
Reference in New Issue
Block a user