ishaan-jaff
|
9570636474
|
(Feat) update config.yaml example on proxy
|
2023-11-23 10:54:30 -08:00 |
|
ishaan-jaff
|
b9f0316032
|
(feat) proxy: caching - show redis settings when initializing
|
2023-11-23 10:52:50 -08:00 |
|
Krish Dholakia
|
31bb24e9c1
|
Merge pull request #881 from Manouchehri/lambda-1
Add AWS Lambda Support
|
2023-11-23 10:38:34 -08:00 |
|
ishaan-jaff
|
9648a8594b
|
(feat) proxy: add curl command test + read cache config
|
2023-11-23 10:31:04 -08:00 |
|
David Manouchehri
|
ed5b075080
|
Add mangum.
|
2023-11-23 00:04:47 -05:00 |
|
Krrish Dholakia
|
2df4791ae9
|
fix: fix linting errors
|
2023-11-22 19:59:25 -08:00 |
|
ishaan-jaff
|
52e2ac0106
|
(test) proxy test exception mapping
|
2023-11-22 16:22:05 -08:00 |
|
Krrish Dholakia
|
310b24a436
|
feat(proxy_server): add /v1/embeddings endpoint
n
|
2023-11-22 14:03:27 -08:00 |
|
Krrish Dholakia
|
bd87e30058
|
test(test_caching.py): cleaning up tests
|
2023-11-22 13:43:48 -08:00 |
|
Krrish Dholakia
|
e495a8a9c2
|
fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints
|
2023-11-22 13:22:58 -08:00 |
|
ishaan-jaff
|
cfd30bb152
|
(feat) proxy server add /routes to see available routes
|
2023-11-22 13:20:21 -08:00 |
|
Krrish Dholakia
|
57e894ad5e
|
fix(proxy_server.py): If master key is set, only master key can be used to generate new keys
|
2023-11-22 10:18:28 -08:00 |
|
Krrish Dholakia
|
2a681e578c
|
fix(proxy_server): fix linting issues
|
2023-11-22 08:47:59 -08:00 |
|
ishaan-jaff
|
12c2d1411a
|
(test) load test with api.litellm.ai
|
2023-11-21 21:07:27 -08:00 |
|
ishaan-jaff
|
bd5c89aab9
|
(test) load test proxy
|
2023-11-21 21:04:46 -08:00 |
|
ishaan-jaff
|
4ccee2e1a6
|
(test) load test q
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
67b7aba40f
|
(test) load test q
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
97a8177dfc
|
(fix) caching config
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
4db51c6eae
|
(fix) cache configs
|
2023-11-21 20:48:56 -08:00 |
|
Krrish Dholakia
|
e96bb8868d
|
fix(proxy_server.py): fix /models endpoint
|
2023-11-21 20:15:43 -08:00 |
|
Krrish Dholakia
|
9a0a24259c
|
fix(celery_app.py): add retries to worker
|
2023-11-21 20:07:16 -08:00 |
|
ishaan-jaff
|
5bf5bdf73a
|
(test) test q
|
2023-11-21 19:45:46 -08:00 |
|
ishaan-jaff
|
fd3462fb4f
|
(fix) proxy server set model list through headers
|
2023-11-21 19:33:48 -08:00 |
|
Krrish Dholakia
|
550ddb4a6b
|
docs(routing.md): update routing docs
|
2023-11-21 19:32:50 -08:00 |
|
ishaan-jaff
|
466e5c2a86
|
(test) test q
|
2023-11-21 18:15:00 -08:00 |
|
ishaan-jaff
|
e8b2c64255
|
(test) refactor test
|
2023-11-21 18:02:49 -08:00 |
|
Krrish Dholakia
|
9205f70b0f
|
docs(routing.md): add queueing to docs
|
2023-11-21 18:01:02 -08:00 |
|
ishaan-jaff
|
70af870005
|
(fix) explicitly run prisma generate
|
2023-11-21 17:42:42 -08:00 |
|
ishaan-jaff
|
3f442d85a4
|
(fix) prisma
|
2023-11-21 17:38:34 -08:00 |
|
ishaan-jaff
|
e4b7040500
|
(fix) prisma always installed on deploys
|
2023-11-21 17:27:07 -08:00 |
|
ishaan-jaff
|
5e62b8fdce
|
(docs) example hosted litellm yaml
|
2023-11-21 16:59:33 -08:00 |
|
ishaan-jaff
|
7ec9dbc94e
|
(test) add test for testing queuing
|
2023-11-21 16:59:33 -08:00 |
|
ishaan-jaff
|
0f80ae46a0
|
(test) move test file dir
|
2023-11-21 16:59:33 -08:00 |
|
Krrish Dholakia
|
7fb3a71b47
|
refactor(proxy_server.py): using celery workers instead of rq for concurrency
|
2023-11-21 16:31:56 -08:00 |
|
ishaan-jaff
|
cfafdb3463
|
(feat)proxy: readon config per request
|
2023-11-21 16:26:05 -08:00 |
|
ishaan-jaff
|
c3ff9cd433
|
(feat) proxy: add config col to prisma config
|
2023-11-21 16:22:26 -08:00 |
|
ishaan-jaff
|
0c5cfe5d1e
|
(docs) proxy queue config yaml
|
2023-11-21 16:22:00 -08:00 |
|
Krrish Dholakia
|
5c3ea2a97e
|
refactor(rq_worker.py): put rq worker behind function call (prevent default import)
|
2023-11-21 13:51:42 -08:00 |
|
Krrish Dholakia
|
68c955409d
|
refactor(proxy_server.py): refactoring background rq worker
|
2023-11-21 13:47:09 -08:00 |
|
Krrish Dholakia
|
7fb25ef2dd
|
fix(proxy_server.py): defaulting status to queued
|
2023-11-21 12:40:30 -08:00 |
|
Krrish Dholakia
|
8a77885c1c
|
refactor(proxy_server.py): experimental endpoint - defined job status
|
2023-11-21 12:37:00 -08:00 |
|
Krrish Dholakia
|
0b0e931e21
|
refactor(proxy_server.py): refactoring experimental endpoint names
|
2023-11-21 12:12:15 -08:00 |
|
Krrish Dholakia
|
b8e62f3d0c
|
feat(proxy_server.py): EXPERIMENTAL: adding queuing endpoints to openai proxy server
|
2023-11-21 12:06:23 -08:00 |
|
ishaan-jaff
|
05f9ff6351
|
(fix) proxy server LiteLLM warning
|
2023-11-21 08:50:31 -08:00 |
|
ishaan-jaff
|
8ab13122d3
|
(docs) proxy starting config.yaml
|
2023-11-21 08:36:34 -08:00 |
|
ishaan-jaff
|
259f4b977a
|
(fix) proxy server - raise 500 on exceptions
|
2023-11-21 08:36:34 -08:00 |
|
Krrish Dholakia
|
3e3bb0180d
|
fix(proxy_server.py): run ollama serve when ollama in config.yaml
|
2023-11-21 08:35:04 -08:00 |
|
Krrish Dholakia
|
2ac804a42f
|
feat(proxy_server.py): enable model aliases
|
2023-11-20 16:51:04 -08:00 |
|
Krrish Dholakia
|
7472be1529
|
fix(routing.py): update token usage on streaming
|
2023-11-20 14:19:25 -08:00 |
|
Krrish Dholakia
|
1812b7ae37
|
fix(proxy_server.py): fix linting errors
|
2023-11-18 17:35:16 -08:00 |
|