Commit graph

228 commits

Author SHA1 Message Date
Krrish Dholakia
c273d6f0d6 fix(router.py): add support for context window fallbacks on router 2023-11-23 16:43:02 -08:00
ishaan-jaff
9b53ea4b0f (feat) proxy: cost tracking per completion request 2023-11-23 16:08:59 -08:00
Krrish Dholakia
8c4e8d6c62 feat(proxy_server.py): add in-memory caching for user api keys 2023-11-23 13:21:45 -08:00
ishaan-jaff
8291f239a4 (feat) proxy server: add spend column 2023-11-23 11:46:59 -08:00
Krrish Dholakia
61fc76a8c4 fix(router.py): fix caching for tracking cooldowns + usage 2023-11-23 11:13:32 -08:00
ishaan-jaff
9d3d10da4c (Feat) update config.yaml example on proxy 2023-11-23 10:54:30 -08:00
ishaan-jaff
1dcab433b4 (feat) proxy: caching - show redis settings when initializing 2023-11-23 10:52:50 -08:00
Krish Dholakia
da12c13c5a Merge pull request #881 from Manouchehri/lambda-1: Add AWS Lambda Support 2023-11-23 10:38:34 -08:00
ishaan-jaff
ae3eeb16ab (feat) proxy: add curl command test + read cache config 2023-11-23 10:31:04 -08:00
David Manouchehri
60ca43b6d9 Add mangum. 2023-11-23 00:04:47 -05:00
Krrish Dholakia
2f93c0155a fix: fix linting errors 2023-11-22 19:59:25 -08:00
ishaan-jaff
cb41b14cc2 (test) proxy test exception mapping 2023-11-22 16:22:05 -08:00
Krrish Dholakia
448ec0a571 feat(proxy_server): add /v1/embeddings endpoint 2023-11-22 14:03:27 -08:00
Krrish Dholakia
b0801f61e6 test(test_caching.py): cleaning up tests 2023-11-22 13:43:48 -08:00
Krrish Dholakia
78582e158a fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints 2023-11-22 13:22:58 -08:00
ishaan-jaff
ba73224a3a (feat) proxy server add /routes to see available routes 2023-11-22 13:20:21 -08:00
Krrish Dholakia
604ad41eac fix(proxy_server.py): If master key is set, only master key can be used to generate new keys 2023-11-22 10:18:28 -08:00
Krrish Dholakia
e7bb4a0cbd fix(proxy_server): fix linting issues 2023-11-22 08:47:59 -08:00
ishaan-jaff
2f3e13e43b (test) load test with api.litellm.ai 2023-11-21 21:07:27 -08:00
ishaan-jaff
b770ff2404 (test) load test proxy 2023-11-21 21:04:46 -08:00
ishaan-jaff
d1ad84c26d (test) load test q 2023-11-21 20:48:56 -08:00
ishaan-jaff
359f542c10 (test) load test q 2023-11-21 20:48:56 -08:00
ishaan-jaff
6aa8b41fb3 (fix) caching config 2023-11-21 20:48:56 -08:00
ishaan-jaff
3c30705b76 (fix) cache configs 2023-11-21 20:48:56 -08:00
Krrish Dholakia
904def6119 fix(proxy_server.py): fix /models endpoint 2023-11-21 20:15:43 -08:00
Krrish Dholakia
e5fa4eb314 fix(celery_app.py): add retries to worker 2023-11-21 20:07:16 -08:00
ishaan-jaff
580e6dc166 (test) test q 2023-11-21 19:45:46 -08:00
ishaan-jaff
363d74cf7d (fix) proxy server set model list through headers 2023-11-21 19:33:48 -08:00
Krrish Dholakia
826f56a6a0 docs(routing.md): update routing docs 2023-11-21 19:32:50 -08:00
ishaan-jaff
fd3895878d (test) test q 2023-11-21 18:15:00 -08:00
ishaan-jaff
b6be6ef574 (test) refactor test 2023-11-21 18:02:49 -08:00
Krrish Dholakia
9d97082eed docs(routing.md): add queueing to docs 2023-11-21 18:01:02 -08:00
ishaan-jaff
a2681e353f (fix) explicitly run prisma generate 2023-11-21 17:42:42 -08:00
ishaan-jaff
99b2a1bc8a (fix) prisma 2023-11-21 17:38:34 -08:00
ishaan-jaff
410c059d52 (fix) prisma always installed on deploys 2023-11-21 17:27:07 -08:00
ishaan-jaff
213b345a43 (docs) example hosted litellm yaml 2023-11-21 16:59:33 -08:00
ishaan-jaff
e5810a177c (test) add test for testing queuing 2023-11-21 16:59:33 -08:00
ishaan-jaff
a5ecf8d7c1 (test) move test file dir 2023-11-21 16:59:33 -08:00
Krrish Dholakia
b16646e584 refactor(proxy_server.py): using celery workers instead of rq for concurrency 2023-11-21 16:31:56 -08:00
ishaan-jaff
8c98a2c899 (feat)proxy: readon config per request 2023-11-21 16:26:05 -08:00
ishaan-jaff
6117bcb19f (feat) proxy: add config col to prisma config 2023-11-21 16:22:26 -08:00
ishaan-jaff
5835e6ed04 (docs) proxy queue config yaml 2023-11-21 16:22:00 -08:00
Krrish Dholakia
70f3159a2a refactor(rq_worker.py): put rq worker behind function call (prevent default import) 2023-11-21 13:51:42 -08:00
Krrish Dholakia
a1f6b9b531 refactor(proxy_server.py): refactoring background rq worker 2023-11-21 13:47:09 -08:00
Krrish Dholakia
015c7de549 fix(proxy_server.py): defaulting status to queued 2023-11-21 12:40:30 -08:00
Krrish Dholakia
331fe86c8f refactor(proxy_server.py): experimental endpoint - defined job status 2023-11-21 12:37:00 -08:00
Krrish Dholakia
d0f228e53f refactor(proxy_server.py): refactoring experimental endpoint names 2023-11-21 12:12:15 -08:00
Krrish Dholakia
c6a4744947 feat(proxy_server.py): EXPERIMENTAL: adding queuing endpoints to openai proxy server 2023-11-21 12:06:23 -08:00
ishaan-jaff
2a35ff88a7 (fix) proxy server LiteLLM warning 2023-11-21 08:50:31 -08:00
ishaan-jaff
8969c5f769 (docs) proxy starting config.yaml 2023-11-21 08:36:34 -08:00