Create initial documentation structure

2025-11-06 00:54:23 +00:00 · 2025-08-31 14:27:00 +02:00
parent 7675271370
commit bd31c03f4a
16 changed files with 3514 additions and 0 deletions
--- a/docs/getting-started/configuration.md
+++ b/docs/getting-started/configuration.md
@@ -0,0 +1,154 @@
+# Configuration
+
+LlamaCtl can be configured through various methods to suit your needs.
+
+## Configuration File
+
+Create a configuration file at `~/.llamactl/config.yaml`:
+
+```yaml
+# Server configuration
+server:
+  host: "0.0.0.0"
+  port: 8080
+  cors_enabled: true
+
+# Authentication (optional)
+auth:
+  enabled: false
+  # When enabled, configure your authentication method
+  # jwt_secret: "your-secret-key"
+
+# Default instance settings
+defaults:
+  backend: "llamacpp"
+  timeout: 300
+  log_level: "info"
+
+# Paths
+paths:
+  models_dir: "/path/to/your/models"
+  logs_dir: "/var/log/llamactl"
+  data_dir: "/var/lib/llamactl"
+
+# Instance limits
+limits:
+  max_instances: 10
+  max_memory_per_instance: "8GB"
+```
+
+## Environment Variables
+
+You can also configure LlamaCtl using environment variables:
+
+```bash
+# Server settings
+export LLAMACTL_HOST=0.0.0.0
+export LLAMACTL_PORT=8080
+
+# Paths
+export LLAMACTL_MODELS_DIR=/path/to/models
+export LLAMACTL_LOGS_DIR=/var/log/llamactl
+
+# Limits
+export LLAMACTL_MAX_INSTANCES=5
+```
+
+## Command Line Options
+
+View all available command line options:
+
+```bash
+llamactl --help
+```
+
+Common options:
+
+```bash
+# Specify config file
+llamactl --config /path/to/config.yaml
+
+# Set log level
+llamactl --log-level debug
+
+# Run on different port
+llamactl --port 9090
+```
+
+## Instance Configuration
+
+When creating instances, you can specify various options:
+
+### Basic Options
+
+- `name`: Unique identifier for the instance
+- `model_path`: Path to the GGUF model file
+- `port`: Port for the instance to listen on
+
+### Advanced Options
+
+- `threads`: Number of CPU threads to use
+- `context_size`: Context window size
+- `batch_size`: Batch size for processing
+- `gpu_layers`: Number of layers to offload to GPU
+- `memory_lock`: Lock model in memory
+- `no_mmap`: Disable memory mapping
+
+### Example Instance Configuration
+
+```json
+{
+  "name": "production-model",
+  "model_path": "/models/llama-2-13b-chat.gguf",
+  "port": 8081,
+  "options": {
+    "threads": 8,
+    "context_size": 4096,
+    "batch_size": 512,
+    "gpu_layers": 35,
+    "memory_lock": true
+  }
+}
+```
+
+## Security Configuration
+
+### Enable Authentication
+
+To enable authentication, update your config file:
+
+```yaml
+auth:
+  enabled: true
+  jwt_secret: "your-very-secure-secret-key"
+  token_expiry: "24h"
+```
+
+### HTTPS Configuration
+
+For production deployments, configure HTTPS:
+
+```yaml
+server:
+  tls:
+    enabled: true
+    cert_file: "/path/to/cert.pem"
+    key_file: "/path/to/key.pem"
+```
+
+## Logging Configuration
+
+Configure logging levels and outputs:
+
+```yaml
+logging:
+  level: "info"  # debug, info, warn, error
+  format: "json"  # json or text
+  output: "/var/log/llamactl/app.log"
+```
+
+## Next Steps
+
+- Learn about [Managing Instances](../user-guide/managing-instances.md)
+- Explore [Advanced Configuration](../advanced/monitoring.md)
+- Set up [Monitoring](../advanced/monitoring.md)