Commit graph

4686 commits

Author SHA1 Message Date
David Manouchehri
5b6f227170
Fix master key check. 2023-11-24 12:03:30 -05:00
David Manouchehri
3fa3a767b3
Fix OpenAPI auth spec. 2023-11-24 11:59:33 -05:00
ishaan-jaff
0a23af7990 (feat) proxy: cost tracking add track_cost_callback 2023-11-24 05:53:23 -08:00
Kumaran Rajendhiran
01fad94485 Fail gracefully if ollama is already being served 2023-11-24 16:52:55 +05:30
Krrish Dholakia
c9e37b8fb8 fix(proxy_server.py): fix linting errors 2023-11-23 21:42:39 -08:00
Krrish Dholakia
9a44433844 feat(proxy_server.py): /key/delete endpoint 2023-11-23 21:37:53 -08:00
ishaan-jaff
486037ff8b (fix) proxy: /embeddings 2023-11-23 21:16:51 -08:00
ishaan-jaff
31c5b4572b (fix) proxy: prisma.schema 2023-11-23 20:11:42 -08:00
Krrish Dholakia
c273d6f0d6 fix(router.py): add support for context window fallbacks on router 2023-11-23 16:43:02 -08:00
ishaan-jaff
9b53ea4b0f (feat) proxy: cost tracking per completion request 2023-11-23 16:08:59 -08:00
Krrish Dholakia
8c4e8d6c62 feat(proxy_server.py): add in-memory caching for user api keys 2023-11-23 13:21:45 -08:00
ishaan-jaff
8291f239a4 (feat) proxy server: add spend column 2023-11-23 11:46:59 -08:00
Krrish Dholakia
61fc76a8c4 fix(router.py): fix caching for tracking cooldowns + usage 2023-11-23 11:13:32 -08:00
ishaan-jaff
9d3d10da4c (Feat) update config.yaml example on proxy 2023-11-23 10:54:30 -08:00
ishaan-jaff
1dcab433b4 (feat) proxy: caching - show redis settings when initializing 2023-11-23 10:52:50 -08:00
Krish Dholakia
da12c13c5a
Merge pull request #881 from Manouchehri/lambda-1
Add AWS Lambda Support
2023-11-23 10:38:34 -08:00
ishaan-jaff
ae3eeb16ab (feat) proxy: add curl command test + read cache config 2023-11-23 10:31:04 -08:00
David Manouchehri
60ca43b6d9
Add mangum. 2023-11-23 00:04:47 -05:00
Krrish Dholakia
2f93c0155a fix: fix linting errors 2023-11-22 19:59:25 -08:00
ishaan-jaff
cb41b14cc2 (test) proxy test exception mapping 2023-11-22 16:22:05 -08:00
Krrish Dholakia
448ec0a571 feat(proxy_server): add /v1/embeddings endpoint
n
2023-11-22 14:03:27 -08:00
Krrish Dholakia
b0801f61e6 test(test_caching.py): cleaning up tests 2023-11-22 13:43:48 -08:00
Krrish Dholakia
78582e158a fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints 2023-11-22 13:22:58 -08:00
ishaan-jaff
ba73224a3a (feat) proxy server add /routes to see available routes 2023-11-22 13:20:21 -08:00
Krrish Dholakia
604ad41eac fix(proxy_server.py): If master key is set, only master key can be used to generate new keys 2023-11-22 10:18:28 -08:00
Krrish Dholakia
e7bb4a0cbd fix(proxy_server): fix linting issues 2023-11-22 08:47:59 -08:00
ishaan-jaff
2f3e13e43b (test) load test with api.litellm.ai 2023-11-21 21:07:27 -08:00
ishaan-jaff
b770ff2404 (test) load test proxy 2023-11-21 21:04:46 -08:00
ishaan-jaff
d1ad84c26d (test) load test q 2023-11-21 20:48:56 -08:00
ishaan-jaff
359f542c10 (test) load test q 2023-11-21 20:48:56 -08:00
ishaan-jaff
6aa8b41fb3 (fix) caching config 2023-11-21 20:48:56 -08:00
ishaan-jaff
3c30705b76 (fix) cache configs 2023-11-21 20:48:56 -08:00
Krrish Dholakia
904def6119 fix(proxy_server.py): fix /models endpoint 2023-11-21 20:15:43 -08:00
Krrish Dholakia
e5fa4eb314 fix(celery_app.py): add retries to worker 2023-11-21 20:07:16 -08:00
ishaan-jaff
580e6dc166 (test) test q 2023-11-21 19:45:46 -08:00
ishaan-jaff
363d74cf7d (fix) proxy server set model list through headers 2023-11-21 19:33:48 -08:00
Krrish Dholakia
826f56a6a0 docs(routing.md): update routing docs 2023-11-21 19:32:50 -08:00
ishaan-jaff
fd3895878d (test) test q 2023-11-21 18:15:00 -08:00
ishaan-jaff
b6be6ef574 (test) refactor test 2023-11-21 18:02:49 -08:00
Krrish Dholakia
9d97082eed docs(routing.md): add queueing to docs 2023-11-21 18:01:02 -08:00
ishaan-jaff
a2681e353f (fix) explicitly run prisma generate 2023-11-21 17:42:42 -08:00
ishaan-jaff
99b2a1bc8a (fix) prisma 2023-11-21 17:38:34 -08:00
ishaan-jaff
410c059d52 (fix) prisma always installed on deploys 2023-11-21 17:27:07 -08:00
ishaan-jaff
213b345a43 (docs) example hosted litellm yaml 2023-11-21 16:59:33 -08:00
ishaan-jaff
e5810a177c (test) add test for testing queuing 2023-11-21 16:59:33 -08:00
ishaan-jaff
a5ecf8d7c1 (test) move test file dir 2023-11-21 16:59:33 -08:00
Krrish Dholakia
b16646e584 refactor(proxy_server.py): using celery workers instead of rq for concurrency 2023-11-21 16:31:56 -08:00
ishaan-jaff
8c98a2c899 (feat)proxy: readon config per request 2023-11-21 16:26:05 -08:00
ishaan-jaff
6117bcb19f (feat) proxy: add config col to prisma config 2023-11-21 16:22:26 -08:00
ishaan-jaff
5835e6ed04 (docs) proxy queue config yaml 2023-11-21 16:22:00 -08:00