Commit graph

342 commits

Author | SHA1 | Message | Date
ishaan-jaff | e5810a177c | (test) add test for testing queuing | 2023-11-21 16:59:33 -08:00
ishaan-jaff | a5ecf8d7c1 | (test) move test file dir | 2023-11-21 16:59:33 -08:00
Krrish Dholakia | b16646e584 | refactor(proxy_server.py): using celery workers instead of rq for concurrency | 2023-11-21 16:31:56 -08:00
ishaan-jaff | 8c98a2c899 | (feat)proxy: readon config per request | 2023-11-21 16:26:05 -08:00
ishaan-jaff | 6117bcb19f | (feat) proxy: add config col to prisma config | 2023-11-21 16:22:26 -08:00
ishaan-jaff | 5835e6ed04 | (docs) proxy queue config yaml | 2023-11-21 16:22:00 -08:00
Krrish Dholakia | 70f3159a2a | refactor(rq_worker.py): put rq worker behind function call (prevent default import) | 2023-11-21 13:51:42 -08:00
Krrish Dholakia | a1f6b9b531 | refactor(proxy_server.py): refactoring background rq worker | 2023-11-21 13:47:09 -08:00
Krrish Dholakia | 015c7de549 | fix(proxy_server.py): defaulting status to queued | 2023-11-21 12:40:30 -08:00
Krrish Dholakia | 331fe86c8f | refactor(proxy_server.py): experimental endpoint - defined job status | 2023-11-21 12:37:00 -08:00
Krrish Dholakia | d0f228e53f | refactor(proxy_server.py): refactoring experimental endpoint names | 2023-11-21 12:12:15 -08:00
Krrish Dholakia | c6a4744947 | feat(proxy_server.py): EXPERIMENTAL: adding queuing endpoints to openai proxy server | 2023-11-21 12:06:23 -08:00
ishaan-jaff | 2a35ff88a7 | (fix) proxy server LiteLLM warning | 2023-11-21 08:50:31 -08:00
ishaan-jaff | 8969c5f769 | (docs) proxy starting config.yaml | 2023-11-21 08:36:34 -08:00
ishaan-jaff | 6898840218 | (fix) proxy server - raise 500 on exceptions | 2023-11-21 08:36:34 -08:00
Krrish Dholakia | 846a824c47 | fix(proxy_server.py): run ollama serve when ollama in config.yaml | 2023-11-21 08:35:04 -08:00
Krrish Dholakia | 33e47dae8e | feat(proxy_server.py): enable model aliases | 2023-11-20 16:51:04 -08:00
Krrish Dholakia | 1976d0f7d6 | fix(routing.py): update token usage on streaming | 2023-11-20 14:19:25 -08:00
Krrish Dholakia | d8079907b9 | fix(proxy_server.py): fix linting errors | 2023-11-18 17:35:16 -08:00
Krrish Dholakia | eefa66e8f0 | docs(simple_proxy.md): adding token based auth to docs | 2023-11-18 17:34:11 -08:00
ishaan-jaff | 73ce9aceb2 | (fix) linting | 2023-11-18 17:05:46 -08:00
Krrish Dholakia | 7a669a36d2 | fix(proxy_server.py): handle initializing prisma / db connection just once | 2023-11-18 16:45:19 -08:00
Krrish Dholakia | c02794d3ff | feat(proxy_server.py): enable token based authentication for server endpoints | 2023-11-18 16:13:53 -08:00
Krrish Dholakia | 8ae855e008 | fix(proxy_server.py): allow user to connect their proxy to a postgres db | 2023-11-18 15:57:38 -08:00
ishaan-jaff | e006cbbc73 | (docs) update readme proxy server | 2023-11-17 17:40:44 -08:00
ishaan-jaff | d1af0af7bf | (docs) load balancer | 2023-11-17 17:25:46 -08:00
ishaan-jaff | 42432bedaa | (docs) add example load balancer | 2023-11-17 17:25:12 -08:00
Krrish Dholakia | 0ab6b2451d | fix(acompletion): support client side timeouts + raise exceptions correctly for async calls | 2023-11-17 15:39:47 -08:00
ishaan-jaff | c162f8b4b0 | (docs) test proxy | 2023-11-17 10:19:12 -08:00
ishaan-jaff | b8c64f16cd | (chore) linting fix | 2023-11-16 09:57:40 -08:00
ishaan-jaff | d9fb4d494c | (feat) raise exception on proxy server | 2023-11-16 09:51:17 -08:00
ishaan-jaff | 60c1ef7929 | (chore) fix config yaml | 2023-11-15 17:12:28 -08:00
ishaan-jaff | 9493aeff68 | (docs) add example configs to proxy | 2023-11-15 16:41:26 -08:00
ishaan-jaff | a18fe5a1fd | (fix) proxy raise exceptions from litellm.completion() | 2023-11-15 16:26:04 -08:00
ishaan-jaff | e698f8218f | (feat) proxy - show loaded config for users | 2023-11-15 16:26:04 -08:00
ishaan-jaff | 5625fe3d81 | (docs) using langfuse with litellm | 2023-11-15 13:53:12 -08:00
ishaan-jaff | 11872e0d45 | (fix) proxy safe access router model names | 2023-11-15 13:44:17 -08:00
Krrish Dholakia | 8c104e9c6a | fix(azure.py-+-proxy_server.py): fix function calling response object + support router on proxy | 2023-11-15 13:15:16 -08:00
ishaan-jaff | 8f8b35eb34 | add ollama/zephyr to config | 2023-11-15 13:04:34 -08:00
Oliver Smith | 4c980cb161 | need to re-attempt backoff and yaml imports if the first import attempt fails. not sure which import is missing from requirements | 2023-11-15 00:10:22 -08:00
Krrish Dholakia | 1738341dcb | fix(main.py): misrouting ollama models to nlp cloud | 2023-11-14 18:55:08 -08:00
ishaan-jaff | 255c0fab1d | (docs) passing azure_ad_token to config | 2023-11-14 15:27:59 -08:00
Krrish Dholakia | 7ef8611952 | fix(simple_proxy.md): enable setting a master key to protect proxy endpoints | 2023-11-14 12:44:33 -08:00
ishaan-jaff | 1207910522 | (fix) proxy cli maintain back comp with openai < 1.00 | 2023-11-13 11:06:59 -08:00
ishaan-jaff | e125414611 | (fix) proxy cli compatible with openai v1.0.0 | 2023-11-13 10:58:20 -08:00
ishaan-jaff | 16abdf44c9 | (fix) proxy_server convert chunk to dict() | 2023-11-13 10:58:20 -08:00
ishaan-jaff | 18b694f01a | (fix) proxy cli use openai v1.0.0 | 2023-11-13 10:08:48 -08:00
ishaan-jaff | cf0ab7155e | (fix) proxy + docs: use openai.chat.completions.create instead of openai.ChatCompletions | 2023-11-13 08:24:26 -08:00
Krrish Dholakia | 45b6f8b853 | refactor: fixing linting issues | 2023-11-11 18:52:28 -08:00
ishaan-jaff | 833c38edeb | (fix) proxy raise exception when config path does not exist | 2023-11-11 12:36:22 -08:00