|
|
712d28ea42
|
Remove port marking logic from CreateInstance method
|
2025-09-02 21:56:25 +02:00 |
|
|
|
0fd3613798
|
Refactor backend type from LLAMA_SERVER to LLAMA_CPP across components and tests
|
2025-09-02 21:19:22 +02:00 |
|
|
|
4f6bb6292e
|
Implement backend configuration options and refactor related components
|
2025-09-02 21:12:14 +02:00 |
|
|
|
d9542ba117
|
Refactor instance management to support backend types and options
|
2025-09-01 21:59:18 +02:00 |
|
|
|
9a4dafeee8
|
Merge pull request #26 from lordmathis/feat/lru-eviction
feat: Implement least recently used instance eviction
|
2025-08-31 12:44:06 +02:00 |
|
|
|
9579930a6a
|
Simplify LRU eviction tests
|
2025-08-31 11:46:16 +02:00 |
|
|
|
447f441fd0
|
Move LRU eviction to timeout.go
|
2025-08-31 11:42:32 +02:00 |
|
|
|
27012b6de6
|
Split manager tests into multiple test files
|
2025-08-31 11:39:44 +02:00 |
|
|
|
905e685107
|
Add LRU eviction tests for instance management
|
2025-08-31 11:30:57 +02:00 |
|
|
|
d6d4792a0c
|
Skip eviction for instances without a valid idle timeout
|
2025-08-31 00:59:26 +02:00 |
|
|
|
da26f607d4
|
Update README to enhance resource management details and add configuration options for max running instances and LRU eviction
|
2025-08-31 00:56:35 +02:00 |
|
|
|
894f3c3213
|
Refactor StartInstance method to improve max running instances check
|
2025-08-31 00:14:29 +02:00 |
|
|
|
c1fa0faf4b
|
Add LastRequestTime method and LRU eviction logic for instance management
|
2025-08-30 23:59:37 +02:00 |
|
|
|
4581d67165
|
Enhance instance management: improve on-demand start handling and add LRU eviction logic
|
2025-08-30 23:13:08 +02:00 |
|
|
|
58cb36bd18
|
Refactor instance management: replace CanStartInstance with IsMaxRunningInstancesReached method
|
2025-08-30 23:12:58 +02:00 |
|
|
|
68253be3e8
|
Add CanStartInstance method to check instance start conditions
|
2025-08-30 22:47:15 +02:00 |
|
|
|
a9f1c1a619
|
Add LRU eviction configuration for instances
|
2025-08-30 22:26:02 +02:00 |
|
|
|
8fdebb069c
|
Merge pull request #25 from lordmathis/fix/stopping-deadlock
fix: Server stopping deadlock
|
2025-08-30 22:12:07 +02:00 |
|
|
|
fdd46859b9
|
Add environment variables for development configuration in launch.json
|
2025-08-30 22:04:52 +02:00 |
|
|
|
74495f8163
|
Refactor Shutdown method to improve instance stopping logic and avoid deadlocks
|
2025-08-30 22:04:43 +02:00 |
|
|
|
8ec36dd1b7
|
Merge pull request #24 from lordmathis/feat/max-running-instances
feat: Implement max running instances constraint and refactor instances status
|
2025-08-28 20:45:27 +02:00 |
|
|
|
c4ed745ba9
|
Fix comparison operators in useInstanceHealth hook
|
2025-08-28 20:43:41 +02:00 |
|
|
|
9d548e6dda
|
Remove wrong MaxRunningInstancesError type
|
2025-08-28 20:42:56 +02:00 |
|
|
|
41d8c41188
|
Introduce MaxRunningInstancesError type and handle it in StartInstance handler
|
2025-08-28 20:07:03 +02:00 |
|
|
|
7d5c68e671
|
Add launch configuration for Go server in VSCode
|
2025-08-28 19:19:55 +02:00 |
|
|
|
e319731239
|
Remove unnecessary read locks from GetStatus and IsRunning methods
|
2025-08-28 19:19:28 +02:00 |
|
|
|
b698c1d0ea
|
Remove locks from SetStatus
|
2025-08-28 19:08:20 +02:00 |
|
|
|
227ca7927a
|
Refactor SetStatus method to capture onStatusChange callback reference before unlocking mutex
|
2025-08-28 18:59:26 +02:00 |
|
|
|
0b058237fe
|
Enforce maximum running instances limit in StartInstance method
|
2025-08-27 21:18:38 +02:00 |
|
|
|
ae37055331
|
Add onStatusChange callback to instance management for status updates
|
2025-08-27 20:54:26 +02:00 |
|
|
|
a8f3a8e0f5
|
Refactor instance status handling on the frontend
|
2025-08-27 20:11:21 +02:00 |
|
|
|
b41ebdc604
|
Set instance status to Failed when restart conditions are not met
|
2025-08-27 19:47:36 +02:00 |
|
|
|
1443746add
|
Refactor instance status management: replace Running boolean with InstanceStatus enum and update related methods
|
2025-08-27 19:44:38 +02:00 |
|
|
|
615c2ac54e
|
Add MaxRunningInstances to InstancesConfig and implement IsRunning method
|
2025-08-27 18:42:34 +02:00 |
|
|
|
a6e3cb4a9b
|
Merge pull request #23 from lordmathis/feat/start-on-request
feat: On-Demand Instance Start
v0.4.0
|
2025-08-20 16:04:59 +02:00 |
|
|
|
9181c3d7bc
|
Remove unused import from zodFormUtils.ts
|
2025-08-20 16:03:09 +02:00 |
|
|
|
1939b45312
|
Refactor WaitForHealthy method to use direct health check URL and simplify health check logic
|
2025-08-20 15:58:08 +02:00 |
|
|
|
8265a94bf7
|
Add on-demand start configuration to instance options and basic fields
|
2025-08-20 14:56:11 +02:00 |
|
|
|
4bc9362f7a
|
Add default on-demand start settings and timeout configuration to README
|
2025-08-20 14:41:42 +02:00 |
|
|
|
ddb54763f6
|
Add OnDemandStartTimeout configuration and update OpenAIProxy to use it
|
2025-08-20 14:25:43 +02:00 |
|
|
|
496ab3aa5d
|
Update README to clarify on-demand instance start feature
|
2025-08-20 14:22:55 +02:00 |
|
|
|
287a5e0817
|
Implement WaitForHealthy method and enhance OpenAIProxy to support on-demand instance start
|
2025-08-20 14:19:12 +02:00 |
|
|
|
7b4adfa0cd
|
Add DefaultOnDemandStart configuration and update instance options
|
2025-08-20 13:50:43 +02:00 |
|
|
|
651c8b9b2c
|
Merge pull request #22 from lordmathis/feat/timeout
feat: Implement idle instance timeout
|
2025-08-20 13:34:38 +02:00 |
|
|
|
7194e1fdd1
|
Update README to clarify idle timeout management and state persistence features
|
2025-08-20 13:32:03 +02:00 |
|
|
|
492c3ff270
|
Remove redundant timeout tests and improve test coverage for instance timeout validation
|
2025-08-20 13:25:56 +02:00 |
|
|
|
00a3cba717
|
Enhance shutdown handling in InstanceManager with proper synchronization and max instances check
|
2025-08-19 22:34:48 +02:00 |
|
|
|
eb1d4ab55f
|
Enhance timeout functionality tests to validate configuration and logic without starting instances
|
2025-08-19 20:52:59 +02:00 |
|
|
|
a9e3801eae
|
Refactor logging in checkAllTimeouts
|
2025-08-19 19:25:15 +02:00 |
|
|
|
1aaab96cec
|
Add idle timeout configuration to instance options and basic fields
|
2025-08-19 19:24:54 +02:00 |
|