Commit graph

3942 commits

Author SHA1 Message Date
ishaan-jaff
7ec9dbc94e (test) add test for testing queuing 2023-11-21 16:59:33 -08:00
ishaan-jaff
0f80ae46a0 (test) move test file dir 2023-11-21 16:59:33 -08:00
Krrish Dholakia
7fb3a71b47 refactor(proxy_server.py): using celery workers instead of rq for concurrency 2023-11-21 16:31:56 -08:00
ishaan-jaff
cfafdb3463 (feat)proxy: readon config per request 2023-11-21 16:26:05 -08:00
ishaan-jaff
c3ff9cd433 (feat) proxy: add config col to prisma config 2023-11-21 16:22:26 -08:00
ishaan-jaff
0c5cfe5d1e (docs) proxy queue config yaml 2023-11-21 16:22:00 -08:00
Krrish Dholakia
5c3ea2a97e refactor(rq_worker.py): put rq worker behind function call (prevent default import) 2023-11-21 13:51:42 -08:00
Krrish Dholakia
68c955409d refactor(proxy_server.py): refactoring background rq worker 2023-11-21 13:47:09 -08:00
Krrish Dholakia
7fb25ef2dd fix(proxy_server.py): defaulting status to queued 2023-11-21 12:40:30 -08:00
Krrish Dholakia
8a77885c1c refactor(proxy_server.py): experimental endpoint - defined job status 2023-11-21 12:37:00 -08:00
Krrish Dholakia
0b0e931e21 refactor(proxy_server.py): refactoring experimental endpoint names 2023-11-21 12:12:15 -08:00
Krrish Dholakia
b8e62f3d0c feat(proxy_server.py): EXPERIMENTAL: adding queuing endpoints to openai proxy server 2023-11-21 12:06:23 -08:00
ishaan-jaff
05f9ff6351 (fix) proxy server LiteLLM warning 2023-11-21 08:50:31 -08:00
ishaan-jaff
8ab13122d3 (docs) proxy starting config.yaml 2023-11-21 08:36:34 -08:00
ishaan-jaff
259f4b977a (fix) proxy server - raise 500 on exceptions 2023-11-21 08:36:34 -08:00
Krrish Dholakia
3e3bb0180d fix(proxy_server.py): run ollama serve when ollama in config.yaml 2023-11-21 08:35:04 -08:00
Krrish Dholakia
2ac804a42f feat(proxy_server.py): enable model aliases 2023-11-20 16:51:04 -08:00
Krrish Dholakia
7472be1529 fix(routing.py): update token usage on streaming 2023-11-20 14:19:25 -08:00
Krrish Dholakia
1812b7ae37 fix(proxy_server.py): fix linting errors 2023-11-18 17:35:16 -08:00
Krrish Dholakia
59b547a65e docs(simple_proxy.md): adding token based auth to docs 2023-11-18 17:34:11 -08:00
ishaan-jaff
2466d26301 (fix) linting 2023-11-18 17:05:46 -08:00
Krrish Dholakia
6a1270ce5a fix(proxy_server.py): handle initializing prisma / db connection just once 2023-11-18 16:45:19 -08:00
Krrish Dholakia
8e599ed4f8 feat(proxy_server.py): enable token based authentication for server endpoints 2023-11-18 16:13:53 -08:00
Krrish Dholakia
229e5ea083 fix(proxy_server.py): allow user to connect their proxy to a postgres db 2023-11-18 15:57:38 -08:00
ishaan-jaff
505a0c354c (docs) update readme proxy server 2023-11-17 17:40:44 -08:00
ishaan-jaff
bae94aced0 (docs) load balancer 2023-11-17 17:25:46 -08:00
ishaan-jaff
a9bb85489f (docs) add example load balancer 2023-11-17 17:25:12 -08:00
Krrish Dholakia
02ed97d0b2 fix(acompletion): support client side timeouts + raise exceptions correctly for async calls 2023-11-17 15:39:47 -08:00
ishaan-jaff
160d8f9c55 (docs) test proxy 2023-11-17 10:19:12 -08:00
ishaan-jaff
900593355b (chore) linting fix 2023-11-16 09:57:40 -08:00
ishaan-jaff
082acfcc53 (feat) raise exception on proxy server 2023-11-16 09:51:17 -08:00
ishaan-jaff
396f0023dd (chore) fix config yaml 2023-11-15 17:12:28 -08:00
ishaan-jaff
ff498be65c (docs) add example configs to proxy 2023-11-15 16:41:26 -08:00
ishaan-jaff
b1deaa1246 (fix) proxy raise exceptions from litellm.completion() 2023-11-15 16:26:04 -08:00
ishaan-jaff
f3827bc6da (feat) proxy - show loaded config for users 2023-11-15 16:26:04 -08:00
ishaan-jaff
7d3b653a99 (docs) using langfuse with litellm 2023-11-15 13:53:12 -08:00
ishaan-jaff
b10e7b7973 (fix) proxy safe access router model names 2023-11-15 13:44:17 -08:00
Krrish Dholakia
e5929f2f7e fix(azure.py-+-proxy_server.py): fix function calling response object + support router on proxy 2023-11-15 13:15:16 -08:00
ishaan-jaff
24cc1f620d add ollama/zephyr to config 2023-11-15 13:04:34 -08:00
Oliver Smith
4e4e0a2251 need to re-attempt backoff and yaml imports if the first import attempt fails. not sure which import is missing from requirements 2023-11-15 00:10:22 -08:00
Krrish Dholakia
5d58bb9cd0 fix(main.py): misrouting ollama models to nlp cloud 2023-11-14 18:55:08 -08:00
ishaan-jaff
9f1d050967 (docs) passing azure_ad_token to config 2023-11-14 15:27:59 -08:00
Krrish Dholakia
1283f345dc fix(simple_proxy.md): enable setting a master key to protect proxy endpoints 2023-11-14 12:44:33 -08:00
ishaan-jaff
8ae511b178 (fix) proxy cli maintain back comp with openai < 1.00 2023-11-13 11:06:59 -08:00
ishaan-jaff
aec5ad0f66 (fix) proxy cli compatible with openai v1.0.0 2023-11-13 10:58:20 -08:00
ishaan-jaff
91e6ac36c3 (fix) proxy_server convert chunk to dict() 2023-11-13 10:58:20 -08:00
ishaan-jaff
cdd40ae45f (fix) proxy cli use openai v1.0.0 2023-11-13 10:08:48 -08:00
ishaan-jaff
523fa7c81c (fix) proxy + docs: use openai.chat.completions.create instead of openai.ChatCompletions 2023-11-13 08:24:26 -08:00
Krrish Dholakia
4b74ddcb17 refactor: fixing linting issues 2023-11-11 18:52:28 -08:00
ishaan-jaff
a861dd69c9 (fix) proxy raise exception when config path does not exist 2023-11-11 12:36:22 -08:00