Krrish Dholakia
|
9d97082eed
|
docs(routing.md): add queueing to docs
|
2023-11-21 18:01:02 -08:00 |
|
ishaan-jaff
|
a2681e353f
|
(fix) explicitly run prisma generate
|
2023-11-21 17:42:42 -08:00 |
|
ishaan-jaff
|
99b2a1bc8a
|
(fix) prisma
|
2023-11-21 17:38:34 -08:00 |
|
ishaan-jaff
|
410c059d52
|
(fix) prisma always installed on deploys
|
2023-11-21 17:27:07 -08:00 |
|
ishaan-jaff
|
213b345a43
|
(docs) example hosted litellm yaml
|
2023-11-21 16:59:33 -08:00 |
|
ishaan-jaff
|
e5810a177c
|
(test) add test for testing queuing
|
2023-11-21 16:59:33 -08:00 |
|
ishaan-jaff
|
a5ecf8d7c1
|
(test) move test file dir
|
2023-11-21 16:59:33 -08:00 |
|
Krrish Dholakia
|
b16646e584
|
refactor(proxy_server.py): using celery workers instead of rq for concurrency
|
2023-11-21 16:31:56 -08:00 |
|
ishaan-jaff
|
8c98a2c899
|
(feat)proxy: readon config per request
|
2023-11-21 16:26:05 -08:00 |
|
ishaan-jaff
|
6117bcb19f
|
(feat) proxy: add config col to prisma config
|
2023-11-21 16:22:26 -08:00 |
|
ishaan-jaff
|
5835e6ed04
|
(docs) proxy queue config yaml
|
2023-11-21 16:22:00 -08:00 |
|
Krrish Dholakia
|
70f3159a2a
|
refactor(rq_worker.py): put rq worker behind function call (prevent default import)
|
2023-11-21 13:51:42 -08:00 |
|
Krrish Dholakia
|
a1f6b9b531
|
refactor(proxy_server.py): refactoring background rq worker
|
2023-11-21 13:47:09 -08:00 |
|
Krrish Dholakia
|
015c7de549
|
fix(proxy_server.py): defaulting status to queued
|
2023-11-21 12:40:30 -08:00 |
|
Krrish Dholakia
|
331fe86c8f
|
refactor(proxy_server.py): experimental endpoint - defined job status
|
2023-11-21 12:37:00 -08:00 |
|
Krrish Dholakia
|
d0f228e53f
|
refactor(proxy_server.py): refactoring experimental endpoint names
|
2023-11-21 12:12:15 -08:00 |
|
Krrish Dholakia
|
c6a4744947
|
feat(proxy_server.py): EXPERIMENTAL: adding queuing endpoints to openai proxy server
|
2023-11-21 12:06:23 -08:00 |
|
ishaan-jaff
|
2a35ff88a7
|
(fix) proxy server LiteLLM warning
|
2023-11-21 08:50:31 -08:00 |
|
ishaan-jaff
|
8969c5f769
|
(docs) proxy starting config.yaml
|
2023-11-21 08:36:34 -08:00 |
|
ishaan-jaff
|
6898840218
|
(fix) proxy server - raise 500 on exceptions
|
2023-11-21 08:36:34 -08:00 |
|
Krrish Dholakia
|
846a824c47
|
fix(proxy_server.py): run ollama serve when ollama in config.yaml
|
2023-11-21 08:35:04 -08:00 |
|
Krrish Dholakia
|
33e47dae8e
|
feat(proxy_server.py): enable model aliases
|
2023-11-20 16:51:04 -08:00 |
|
Krrish Dholakia
|
1976d0f7d6
|
fix(routing.py): update token usage on streaming
|
2023-11-20 14:19:25 -08:00 |
|
Krrish Dholakia
|
d8079907b9
|
fix(proxy_server.py): fix linting errors
|
2023-11-18 17:35:16 -08:00 |
|
Krrish Dholakia
|
eefa66e8f0
|
docs(simple_proxy.md): adding token based auth to docs
|
2023-11-18 17:34:11 -08:00 |
|
ishaan-jaff
|
73ce9aceb2
|
(fix) linting
|
2023-11-18 17:05:46 -08:00 |
|
Krrish Dholakia
|
7a669a36d2
|
fix(proxy_server.py): handle initializing prisma / db connection just once
|
2023-11-18 16:45:19 -08:00 |
|
Krrish Dholakia
|
c02794d3ff
|
feat(proxy_server.py): enable token based authentication for server endpoints
|
2023-11-18 16:13:53 -08:00 |
|
Krrish Dholakia
|
8ae855e008
|
fix(proxy_server.py): allow user to connect their proxy to a postgres db
|
2023-11-18 15:57:38 -08:00 |
|
ishaan-jaff
|
e006cbbc73
|
(docs) update readme proxy server
|
2023-11-17 17:40:44 -08:00 |
|
ishaan-jaff
|
d1af0af7bf
|
(docs) load balancer
|
2023-11-17 17:25:46 -08:00 |
|
ishaan-jaff
|
42432bedaa
|
(docs) add example load balancer
|
2023-11-17 17:25:12 -08:00 |
|
Krrish Dholakia
|
0ab6b2451d
|
fix(acompletion): support client side timeouts + raise exceptions correctly for async calls
|
2023-11-17 15:39:47 -08:00 |
|
ishaan-jaff
|
c162f8b4b0
|
(docs) test proxy
|
2023-11-17 10:19:12 -08:00 |
|
ishaan-jaff
|
b8c64f16cd
|
(chore) linting fix
|
2023-11-16 09:57:40 -08:00 |
|
ishaan-jaff
|
d9fb4d494c
|
(feat) raise exception on proxy server
|
2023-11-16 09:51:17 -08:00 |
|
ishaan-jaff
|
60c1ef7929
|
(chore) fix config yaml
|
2023-11-15 17:12:28 -08:00 |
|
ishaan-jaff
|
9493aeff68
|
(docs) add example configs to proxy
|
2023-11-15 16:41:26 -08:00 |
|
ishaan-jaff
|
a18fe5a1fd
|
(fix) proxy raise exceptions from litellm.completion()
|
2023-11-15 16:26:04 -08:00 |
|
ishaan-jaff
|
e698f8218f
|
(feat) proxy - show loaded config for users
|
2023-11-15 16:26:04 -08:00 |
|
ishaan-jaff
|
5625fe3d81
|
(docs) using langfuse with litellm
|
2023-11-15 13:53:12 -08:00 |
|
ishaan-jaff
|
11872e0d45
|
(fix) proxy safe access router model names
|
2023-11-15 13:44:17 -08:00 |
|
Krrish Dholakia
|
8c104e9c6a
|
fix(azure.py-+-proxy_server.py): fix function calling response object + support router on proxy
|
2023-11-15 13:15:16 -08:00 |
|
ishaan-jaff
|
8f8b35eb34
|
add ollama/zephyr to config
|
2023-11-15 13:04:34 -08:00 |
|
Oliver Smith
|
4c980cb161
|
need to re-attempt backoff and yaml imports if the first import attempt fails. not sure which import is missing from requirements
|
2023-11-15 00:10:22 -08:00 |
|
Krrish Dholakia
|
1738341dcb
|
fix(main.py): misrouting ollama models to nlp cloud
|
2023-11-14 18:55:08 -08:00 |
|
ishaan-jaff
|
255c0fab1d
|
(docs) passing azure_ad_token to config
|
2023-11-14 15:27:59 -08:00 |
|
Krrish Dholakia
|
7ef8611952
|
fix(simple_proxy.md): enable setting a master key to protect proxy endpoints
|
2023-11-14 12:44:33 -08:00 |
|
ishaan-jaff
|
1207910522
|
(fix) proxy cli maintain back comp with openai < 1.00
|
2023-11-13 11:06:59 -08:00 |
|
ishaan-jaff
|
e125414611
|
(fix) proxy cli compatible with openai v1.0.0
|
2023-11-13 10:58:20 -08:00 |
|