Commit Graph

40 Commits

Author SHA1 Message Date
c038aac91b Remove redundant UpdateLastRequestTime calls 2025-10-25 16:09:57 +02:00
7d9b983f93 Don't strip remote llama-cpp proxy prefix 2025-10-25 16:02:09 +02:00
ff719f3ef9 Remove remote instance proxy handling from handlers 2025-10-25 14:07:11 +02:00
a9fb0d613d Validate instance name in openai proxy 2025-10-22 18:55:57 +02:00
3b8bc658e3 Add name validation to backend handlers 2025-10-22 18:50:51 +02:00
c794e4f98b Move instance name validation to handlers 2025-10-22 18:40:39 +02:00
9da2433a7c Refactor instance and manager tests to use BackendOptions structure 2025-10-19 18:07:14 +02:00
2a7010d0e1 Flatten backends package structure 2025-10-19 15:50:42 +02:00
113b51eda2 Refactor instance node handling to use a map 2025-10-18 00:33:16 +02:00
4b30791be2 Refactor instance options structure and related code 2025-10-16 20:53:24 +02:00
80ca0cbd4f Rename Process to Instance 2025-10-16 19:38:44 +02:00
8a16a195de Fix getting remote instance logs 2025-10-09 20:22:32 +02:00
7f6725da96 Refactor NodeConfig handling to use a map 2025-10-08 19:24:24 +02:00
6298b03636 Refactor RemoteOpenAIProxy to use cached proxies and restore request body handling 2025-10-07 18:57:08 +02:00
aae3f84d49 Implement caching for remote instance proxies and enhance proxy request handling 2025-10-07 18:44:23 +02:00
16b28bac05 Merge branch 'main' into feat/multi-host 2025-10-07 18:04:24 +02:00
Anuruth Lertpiya 997bd1b063 Changed status code to StatusBadRequest (400) if requested invalid model name. 2025-10-05 14:53:20 +00:00
Anuruth Lertpiya fa43f9e967 Added support for proxying llama.cpp native API endpoints via /llama-cpp/{name}/ 2025-10-05 14:28:33 +00:00
8ebdb1a183 Fix double read of json response when content-length header is missing 2025-10-04 22:16:28 +02:00
Anuruth Lertpiya 0e1bc8a352 Added support for configuring CORS headers 2025-10-04 09:13:40 +00:00
670f8ff81b Split up handlers 2025-10-02 23:11:20 +02:00
da56456504 Add node management endpoints to handle listing and retrieving node details 2025-10-02 22:51:41 +02:00
2ed67eb672 Add remote instance proxying functionality to handler 2025-10-01 22:17:19 +02:00
30e40ecd30 Refactor API endpoints to use /backends/llama-cpp path and update related documentation 2025-09-23 21:27:58 +02:00
46622d2107 Update documentation and add README synchronization 2025-09-22 22:37:53 +02:00
4df02a6519 Initial vLLM backend support 2025-09-19 18:05:12 +02:00
154b754aff Add MLX command parsing and routing support 2025-09-16 21:39:08 +02:00
1b5934303b Enhance command parsing in ParseLlamaCommand and improve error handling in ParseCommandRequest 2025-09-15 22:12:56 +02:00
323056096c Implement llama-server command parsing and add UI components for command input 2025-09-15 21:04:14 +02:00
4581d67165 Enhance instance management: improve on-demand start handling and add LRU eviction logic 2025-08-30 23:13:08 +02:00
41d8c41188 Introduce MaxRunningInstancesError type and handle it in StartInstance handler 2025-08-28 20:07:03 +02:00
1443746add Refactor instance status management: replace Running boolean with InstanceStatus enum and update related methods 2025-08-27 19:44:38 +02:00
ddb54763f6 Add OnDemandStartTimeout configuration and update OpenAIProxy to use it 2025-08-20 14:25:43 +02:00
287a5e0817 Implement WaitForHealthy method and enhance OpenAIProxy to support on-demand instance start 2025-08-20 14:19:12 +02:00
e4e7a82294 Implement last request time tracking for instance management 2025-08-17 19:44:57 +02:00
afef3d0180 Update import path for API documentation to use apidocs 2025-08-07 19:48:28 +02:00
a87652937f Move swagger documentation to apidoc 2025-08-07 19:48:03 +02:00
e2b64620b5 Expose version endpoint 2025-08-07 19:10:06 +02:00
2abe9c282e Rename config and instance struct to avoid awkward naming 2025-08-04 19:30:50 +02:00
6a7a9a2d09 Split large package into subpackages 2025-08-04 19:23:56 +02:00