| 9ee0a184b3 | Re-validate instance name in DeleteInstance for improved security | 2025-10-09 22:18:53 +02:00 |
| 5436c28a1f | Add instance name validation before deletion for security | 2025-10-09 22:10:40 +02:00 |
| e7a6a7003e | Skip remote instances in checkAllTimeouts and EvictLRUInstance methods | 2025-10-09 21:13:38 +02:00 |
| 2b950ee649 | Implement updateLocalInstanceFromRemote to preserve Nodes field when syncing remote instance data | 2025-10-09 20:39:21 +02:00 |
| 8a16a195de | Fix getting remote instance logs | 2025-10-09 20:22:32 +02:00 |
| 9684a8a09b | Enhance instance management to preserve local state for remote instances | 2025-10-09 19:34:52 +02:00 |
| e281708b20 | Enhance auto-start logic to differentiate between remote and local instances | 2025-10-09 18:56:23 +02:00 |
| 7f6725da96 | Refactor NodeConfig handling to use a map | 2025-10-08 19:24:24 +02:00 |
| 3418735204 | Add stripNodesFromOptions function to prevent routing loops in remote requests | 2025-10-07 20:27:31 +02:00 |
| 2f1cf5acdc | Refactor CreateRemoteInstance and UpdateRemoteInstance to directly use options parameter in API requests | 2025-10-07 19:57:21 +02:00 |
| 01380e6641 | Update instance manager tests to use empty NodeConfig slice | 2025-10-07 19:18:13 +02:00 |
| 16b28bac05 | Merge branch 'main' into feat/multi-host | 2025-10-07 18:04:24 +02:00 |
| d15976e7aa | Implement auto-stop for instances with auto-restart disabled and add corresponding tests | 2025-10-04 21:17:55 +02:00 |
| c30053e51c | Enhance instance loading to support remote instances and handle node configuration | 2025-10-01 22:59:45 +02:00 |
| 347c58e15f | Enhance instance manager to persist remote instances and update tracking on modifications | 2025-10-01 22:58:57 +02:00 |
| 0188f82306 | Implement remote instance creation and deletion in instance manager | 2025-10-01 22:05:18 +02:00 |
| e0f176de10 | Enhance instance manager to support remote instance management and update related tests | 2025-10-01 20:25:06 +02:00 |
| 2759be65a5 | Add remote instance management functionality and configuration support | 2025-09-30 21:09:05 +02:00 |
| ba0f877185 | Fix tests | 2025-09-24 21:35:44 +02:00 |
| 840a7bc650 | Add Docker command handling for backend options and refactor command building | 2025-09-24 21:34:54 +02:00 |
| 313666ea17 | Fix missing vllm proxy setup | 2025-09-22 20:51:00 +02:00 |
| 2a20817078 | Remove redundant LlamaExecutable field from instance configuration in tests | 2025-09-18 20:29:04 +02:00 |
| cc5d8acd92 | Refactor instance and manager tests to use BackendConfig for LlamaExecutable and MLXLMExecutable | 2025-09-16 21:45:50 +02:00 |
| 154b754aff | Add MLX command parsing and routing support | 2025-09-16 21:39:08 +02:00 |
| 468688cdbc | Pass backend options to instances | 2025-09-16 21:37:48 +02:00 |
| 712d28ea42 | Remove port marking logic from CreateInstance method | 2025-09-02 21:56:25 +02:00 |
| d9542ba117 | Refactor instance management to support backend types and options | 2025-09-01 21:59:18 +02:00 |
| 9579930a6a | Simplify LRU eviction tests | 2025-08-31 11:46:16 +02:00 |
| 447f441fd0 | Move LRU eviction to timeout.go | 2025-08-31 11:42:32 +02:00 |
| 27012b6de6 | Split manager tests into multiple test files | 2025-08-31 11:39:44 +02:00 |
| 905e685107 | Add LRU eviction tests for instance management | 2025-08-31 11:30:57 +02:00 |
| d6d4792a0c | Skip eviction for instances without a valid idle timeout | 2025-08-31 00:59:26 +02:00 |
| 894f3c3213 | Refactor StartInstance method to improve max running instances check | 2025-08-31 00:14:29 +02:00 |
| c1fa0faf4b | Add LastRequestTime method and LRU eviction logic for instance management | 2025-08-30 23:59:37 +02:00 |
| 58cb36bd18 | Refactor instance management: replace CanStartInstance with IsMaxRunningInstancesReached method | 2025-08-30 23:12:58 +02:00 |
| 68253be3e8 | Add CanStartInstance method to check instance start conditions | 2025-08-30 22:47:15 +02:00 |
| 74495f8163 | Refactor Shutdown method to improve instance stopping logic and avoid deadlocks | 2025-08-30 22:04:43 +02:00 |
| 9d548e6dda | Remove wrong MaxRunningInstancesError type | 2025-08-28 20:42:56 +02:00 |
| 41d8c41188 | Introduce MaxRunningInstancesError type and handle it in StartInstance handler | 2025-08-28 20:07:03 +02:00 |
| 0b058237fe | Enforce maximum running instances limit in StartInstance method | 2025-08-27 21:18:38 +02:00 |
| ae37055331 | Add onStatusChange callback to instance management for status updates | 2025-08-27 20:54:26 +02:00 |
| 1443746add | Refactor instance status management: replace Running boolean with InstanceStatus enum and update related methods | 2025-08-27 19:44:38 +02:00 |
| 615c2ac54e | Add MaxRunningInstances to InstancesConfig and implement IsRunning method | 2025-08-27 18:42:34 +02:00 |
| 00a3cba717 | Enhance shutdown handling in InstanceManager with proper synchronization and max instances check | 2025-08-19 22:34:48 +02:00 |
| eb1d4ab55f | Enhance timeout functionality tests to validate configuration and logic without starting instances | 2025-08-19 20:52:59 +02:00 |
| a9e3801eae | Refactor logging in checkAllTimeouts | 2025-08-19 19:25:15 +02:00 |
| 78eda77e44 | Enhance timeout handling in InstanceManager with goroutine recovery and shutdown support | 2025-08-17 22:49:28 +02:00 |
| 41eaebc927 | Add TimeoutCheckInterval to instance configuration in tests | 2025-08-17 21:42:52 +02:00 |
| c45fa13206 | Initialize last request time on instance start and update timeout handling logic | 2025-08-17 21:15:28 +02:00 |
| 5e3a28398d | Implement periodic timeout checking for instances | 2025-08-17 21:10:48 +02:00 |