|
|
227ca7927a
|
Refactor SetStatus method to capture onStatusChange callback reference before unlocking mutex
|
2025-08-28 18:59:26 +02:00 |
|
|
|
0b058237fe
|
Enforce maximum running instances limit in StartInstance method
|
2025-08-27 21:18:38 +02:00 |
|
|
|
ae37055331
|
Add onStatusChange callback to instance management for status updates
|
2025-08-27 20:54:26 +02:00 |
|
|
|
b41ebdc604
|
Set instance status to Failed when restart conditions are not met
|
2025-08-27 19:47:36 +02:00 |
|
|
|
1443746add
|
Refactor instance status management: replace Running boolean with InstanceStatus enum and update related methods
|
2025-08-27 19:44:38 +02:00 |
|
|
|
615c2ac54e
|
Add MaxRunningInstances to InstancesConfig and implement IsRunning method
|
2025-08-27 18:42:34 +02:00 |
|
|
|
1939b45312
|
Refactor WaitForHealthy method to use direct health check URL and simplify health check logic
|
2025-08-20 15:58:08 +02:00 |
|
|
|
ddb54763f6
|
Add OnDemandStartTimeout configuration and update OpenAIProxy to use it
|
2025-08-20 14:25:43 +02:00 |
|
|
|
287a5e0817
|
Implement WaitForHealthy method and enhance OpenAIProxy to support on-demand instance start
|
2025-08-20 14:19:12 +02:00 |
|
|
|
7b4adfa0cd
|
Add DefaultOnDemandStart configuration and update instance options
|
2025-08-20 13:50:43 +02:00 |
|
|
|
492c3ff270
|
Remove redundant timeout tests and improve test coverage for instance timeout validation
|
2025-08-20 13:25:56 +02:00 |
|
|
|
00a3cba717
|
Enhance shutdown handling in InstanceManager with proper synchronization and max instances check
|
2025-08-19 22:34:48 +02:00 |
|
|
|
eb1d4ab55f
|
Enhance timeout functionality tests to validate configuration and logic without starting instances
|
2025-08-19 20:52:59 +02:00 |
|
|
|
a9e3801eae
|
Refactor logging in checkAllTimeouts
|
2025-08-19 19:25:15 +02:00 |
|
|
|
78eda77e44
|
Enhance timeout handling in InstanceManager with goroutine recovery and shutdown support
|
2025-08-17 22:49:28 +02:00 |
|
|
|
d70bb634cd
|
Implement instance tests for timeout
|
2025-08-17 21:50:16 +02:00 |
|
|
|
41eaebc927
|
Add TimeoutCheckInterval to instance configuration in tests
|
2025-08-17 21:42:52 +02:00 |
|
|
|
c45fa13206
|
Initialize last request time on instance start and update timeout handling logic
|
2025-08-17 21:15:28 +02:00 |
|
|
|
5e3a28398d
|
Implement periodic timeout checking for instances
|
2025-08-17 21:10:48 +02:00 |
|
|
|
c734bcae4a
|
Move UpdateLastRequestTime method to timeout.go and add ShouldTimeout method for idle timeout handling
|
2025-08-17 20:37:20 +02:00 |
|
|
|
e4e7a82294
|
Implement last request time tracking for instance management
|
2025-08-17 19:44:57 +02:00 |
|
|
|
ccffbca6b2
|
Add timeout check interval and update instance configuration
|
2025-08-17 19:26:21 +02:00 |
|
|
|
902be409d5
|
Add IdleTimeout option to CreateInstanceOptions and update JSON handling
|
2025-08-17 19:06:09 +02:00 |
|
|
|
afef3d0180
|
Update import path for API documentation to use apidocs
|
2025-08-07 19:48:28 +02:00 |
|
|
|
a87652937f
|
Move swagger documentation to apidoc
|
2025-08-07 19:48:03 +02:00 |
|
|
|
e2b64620b5
|
Expose version endpoint
|
2025-08-07 19:10:06 +02:00 |
|
|
|
8154b8d0ab
|
Fix temp in tests
|
2025-08-06 18:49:36 +02:00 |
|
|
|
6203b64045
|
Fix missing or wrong llama server options
|
2025-08-06 18:31:17 +02:00 |
|
|
|
85b3638efb
|
Update ValidateInstanceName to return the validated name and modify tests accordingly
|
2025-08-04 20:46:15 +02:00 |
|
|
|
934d1c5aaa
|
Refactor instance management by moving operations to a new file
|
2025-08-04 20:38:57 +02:00 |
|
|
|
2abe9c282e
|
Rename config and instance struct to avoid awkward naming
|
2025-08-04 19:30:50 +02:00 |
|
|
|
6a7a9a2d09
|
Split large package into subpackages
|
2025-08-04 19:23:56 +02:00 |
|
|
|
7426008ef9
|
Use instance directly in DeleteInstance
|
2025-08-02 23:35:03 +02:00 |
|
|
|
3cbd23a6e2
|
Implement persistence tests
|
2025-08-02 22:52:50 +02:00 |
|
|
|
bed172bf73
|
Implement instance loading and auto-start functionality on manager initialization
|
2025-08-02 21:39:19 +02:00 |
|
|
|
d449255bc9
|
Persist instance state after starting and stopping instances
|
2025-08-02 21:23:31 +02:00 |
|
|
|
de89d0673a
|
Implement instance persistence with JSON file storage and deletion
|
2025-08-02 21:09:16 +02:00 |
|
|
|
dd6ffa548c
|
Refactor configuration structure to replace DataConfig with instance-specific directories and auto-creation options
|
2025-08-02 19:10:40 +02:00 |
|
|
|
7935f19cc1
|
Add data directory configuration with auto-creation option
|
2025-08-02 15:33:33 +02:00 |
|
|
|
b24d744cad
|
Implement graceful shutdown for the server and add Shutdown method to InstanceManager
|
2025-08-01 23:41:18 +02:00 |
|
|
|
8e8056f071
|
Update swagger docs
|
2025-07-30 21:34:46 +02:00 |
|
|
|
bedec089ef
|
Implement middleware tests
|
2025-07-30 21:20:50 +02:00 |
|
|
|
b3540d5b3e
|
Implement api key auth
|
2025-07-30 20:15:14 +02:00 |
|
|
|
e1889a65ac
|
Remove llama server cors headers
|
2025-07-28 22:06:44 +02:00 |
|
|
|
1057265c7f
|
Fix default host value in TestLoadConfig_Defaults
|
2025-07-27 19:11:24 +02:00 |
|
|
|
40d11b12ca
|
Add cors middleware
|
2025-07-27 19:05:15 +02:00 |
|
|
|
f2073f3eb4
|
Fix monitorProcess to properly close and reset monitorDone channel
|
2025-07-27 12:24:17 +02:00 |
|
|
|
e6652e52e1
|
Add OpenAI-compatible endpoints and instance creation timestamp
|
2025-07-27 12:07:59 +02:00 |
|
|
|
815e45e597
|
Fix OpenAIProxy handler
|
2025-07-26 22:46:07 +02:00 |
|
|
|
f8b6cabb6c
|
Implement platform-specific process attribute settings for llama server
|
2025-07-26 21:02:57 +02:00 |
|