Author | Commit | Message | Date
ishaan-jaff | 7ec9dbc94e | (test) add test for testing queuing | 2023-11-21 16:59:33 -08:00
ishaan-jaff | 0f80ae46a0 | (test) move test file dir | 2023-11-21 16:59:33 -08:00
Krrish Dholakia | 7fb3a71b47 | refactor(proxy_server.py): using celery workers instead of rq for concurrency | 2023-11-21 16:31:56 -08:00
ishaan-jaff | cfafdb3463 | (feat)proxy: readon config per request | 2023-11-21 16:26:05 -08:00
ishaan-jaff | c3ff9cd433 | (feat) proxy: add config col to prisma config | 2023-11-21 16:22:26 -08:00
ishaan-jaff | 0c5cfe5d1e | (docs) proxy queue config yaml | 2023-11-21 16:22:00 -08:00
Krrish Dholakia | 5c3ea2a97e | refactor(rq_worker.py): put rq worker behind function call (prevent default import) | 2023-11-21 13:51:42 -08:00
Krrish Dholakia | 68c955409d | refactor(proxy_server.py): refactoring background rq worker | 2023-11-21 13:47:09 -08:00
Krrish Dholakia | 7fb25ef2dd | fix(proxy_server.py): defaulting status to queued | 2023-11-21 12:40:30 -08:00
Krrish Dholakia | 8a77885c1c | refactor(proxy_server.py): experimental endpoint - defined job status | 2023-11-21 12:37:00 -08:00
Krrish Dholakia | 0b0e931e21 | refactor(proxy_server.py): refactoring experimental endpoint names | 2023-11-21 12:12:15 -08:00
Krrish Dholakia | b8e62f3d0c | feat(proxy_server.py): EXPERIMENTAL: adding queuing endpoints to openai proxy server | 2023-11-21 12:06:23 -08:00
ishaan-jaff | 05f9ff6351 | (fix) proxy server LiteLLM warning | 2023-11-21 08:50:31 -08:00
ishaan-jaff | 8ab13122d3 | (docs) proxy starting config.yaml | 2023-11-21 08:36:34 -08:00
ishaan-jaff | 259f4b977a | (fix) proxy server - raise 500 on exceptions | 2023-11-21 08:36:34 -08:00
Krrish Dholakia | 3e3bb0180d | fix(proxy_server.py): run ollama serve when ollama in config.yaml | 2023-11-21 08:35:04 -08:00
Krrish Dholakia | 2ac804a42f | feat(proxy_server.py): enable model aliases | 2023-11-20 16:51:04 -08:00
Krrish Dholakia | 7472be1529 | fix(routing.py): update token usage on streaming | 2023-11-20 14:19:25 -08:00
Krrish Dholakia | 1812b7ae37 | fix(proxy_server.py): fix linting errors | 2023-11-18 17:35:16 -08:00
Krrish Dholakia | 59b547a65e | docs(simple_proxy.md): adding token based auth to docs | 2023-11-18 17:34:11 -08:00
ishaan-jaff | 2466d26301 | (fix) linting | 2023-11-18 17:05:46 -08:00
Krrish Dholakia | 6a1270ce5a | fix(proxy_server.py): handle initializing prisma / db connection just once | 2023-11-18 16:45:19 -08:00
Krrish Dholakia | 8e599ed4f8 | feat(proxy_server.py): enable token based authentication for server endpoints | 2023-11-18 16:13:53 -08:00
Krrish Dholakia | 229e5ea083 | fix(proxy_server.py): allow user to connect their proxy to a postgres db | 2023-11-18 15:57:38 -08:00
ishaan-jaff | 505a0c354c | (docs) update readme proxy server | 2023-11-17 17:40:44 -08:00
ishaan-jaff | bae94aced0 | (docs) load balancer | 2023-11-17 17:25:46 -08:00
ishaan-jaff | a9bb85489f | (docs) add example load balancer | 2023-11-17 17:25:12 -08:00
Krrish Dholakia | 02ed97d0b2 | fix(acompletion): support client side timeouts + raise exceptions correctly for async calls | 2023-11-17 15:39:47 -08:00
ishaan-jaff | 160d8f9c55 | (docs) test proxy | 2023-11-17 10:19:12 -08:00
ishaan-jaff | 900593355b | (chore) linting fix | 2023-11-16 09:57:40 -08:00
ishaan-jaff | 082acfcc53 | (feat) raise exception on proxy server | 2023-11-16 09:51:17 -08:00
ishaan-jaff | 396f0023dd | (chore) fix config yaml | 2023-11-15 17:12:28 -08:00
ishaan-jaff | ff498be65c | (docs) add example configs to proxy | 2023-11-15 16:41:26 -08:00
ishaan-jaff | b1deaa1246 | (fix) proxy raise exceptions from litellm.completion() | 2023-11-15 16:26:04 -08:00
ishaan-jaff | f3827bc6da | (feat) proxy - show loaded config for users | 2023-11-15 16:26:04 -08:00
ishaan-jaff | 7d3b653a99 | (docs) using langfuse with litellm | 2023-11-15 13:53:12 -08:00
ishaan-jaff | b10e7b7973 | (fix) proxy safe access router model names | 2023-11-15 13:44:17 -08:00
Krrish Dholakia | e5929f2f7e | fix(azure.py-+-proxy_server.py): fix function calling response object + support router on proxy | 2023-11-15 13:15:16 -08:00
ishaan-jaff | 24cc1f620d | add ollama/zephyr to config | 2023-11-15 13:04:34 -08:00
Oliver Smith | 4e4e0a2251 | need to re-attempt backoff and yaml imports if the first import attempt fails. not sure which import is missing from requirements | 2023-11-15 00:10:22 -08:00
Krrish Dholakia | 5d58bb9cd0 | fix(main.py): misrouting ollama models to nlp cloud | 2023-11-14 18:55:08 -08:00
ishaan-jaff | 9f1d050967 | (docs) passing azure_ad_token to config | 2023-11-14 15:27:59 -08:00
Krrish Dholakia | 1283f345dc | fix(simple_proxy.md): enable setting a master key to protect proxy endpoints | 2023-11-14 12:44:33 -08:00
ishaan-jaff | 8ae511b178 | (fix) proxy cli maintain back comp with openai < 1.00 | 2023-11-13 11:06:59 -08:00
ishaan-jaff | aec5ad0f66 | (fix) proxy cli compatible with openai v1.0.0 | 2023-11-13 10:58:20 -08:00
ishaan-jaff | 91e6ac36c3 | (fix) proxy_server convert chunk to dict() | 2023-11-13 10:58:20 -08:00
ishaan-jaff | cdd40ae45f | (fix) proxy cli use openai v1.0.0 | 2023-11-13 10:08:48 -08:00
ishaan-jaff | 523fa7c81c | (fix) proxy + docs: use openai.chat.completions.create instead of openai.ChatCompletions | 2023-11-13 08:24:26 -08:00
Krrish Dholakia | 4b74ddcb17 | refactor: fixing linting issues | 2023-11-11 18:52:28 -08:00
ishaan-jaff | a861dd69c9 | (fix) proxy raise exception when config path does not exist | 2023-11-11 12:36:22 -08:00