Commit graph

4483 commits

Author SHA1 Message Date
Kumaran Rajendhiran
9fb31448a9 Fail gracefully if ollama is already being served 2023-11-24 16:52:55 +05:30
Krrish Dholakia
daa45b4031 fix(proxy_server.py): fix linting errors 2023-11-23 21:42:39 -08:00
Krrish Dholakia
8030a9b8d1 feat(proxy_server.py): /key/delete endpoint 2023-11-23 21:37:53 -08:00
ishaan-jaff
7ccfbde6fd (fix) proxy: /embeddings 2023-11-23 21:16:51 -08:00
ishaan-jaff
d8c7417647 (fix) proxy: prisma.schema 2023-11-23 20:11:42 -08:00
Krrish Dholakia
e4deb09eb6 fix(router.py): add support for context window fallbacks on router 2023-11-23 16:43:02 -08:00
ishaan-jaff
c64aad7335 (feat) proxy: cost tracking per completion request 2023-11-23 16:08:59 -08:00
Krrish Dholakia
7610b1f0af feat(proxy_server.py): add in-memory caching for user api keys 2023-11-23 13:21:45 -08:00
ishaan-jaff
4ade4d4e8a (feat) proxy server: add spend column 2023-11-23 11:46:59 -08:00
Krrish Dholakia
0e3064ac8c fix(router.py): fix caching for tracking cooldowns + usage 2023-11-23 11:13:32 -08:00
ishaan-jaff
9570636474 (Feat) update config.yaml example on proxy 2023-11-23 10:54:30 -08:00
ishaan-jaff
b9f0316032 (feat) proxy: caching - show redis settings when initializing 2023-11-23 10:52:50 -08:00
Krish Dholakia
31bb24e9c1 Merge pull request #881 from Manouchehri/lambda-1 (Add AWS Lambda Support) 2023-11-23 10:38:34 -08:00
ishaan-jaff
9648a8594b (feat) proxy: add curl command test + read cache config 2023-11-23 10:31:04 -08:00
David Manouchehri
ed5b075080 Add mangum. 2023-11-23 00:04:47 -05:00
Krrish Dholakia
2df4791ae9 fix: fix linting errors 2023-11-22 19:59:25 -08:00
ishaan-jaff
52e2ac0106 (test) proxy test exception mapping 2023-11-22 16:22:05 -08:00
Krrish Dholakia
310b24a436 feat(proxy_server): add /v1/embeddings endpoint 2023-11-22 14:03:27 -08:00
Krrish Dholakia
bd87e30058 test(test_caching.py): cleaning up tests 2023-11-22 13:43:48 -08:00
Krrish Dholakia
e495a8a9c2 fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints 2023-11-22 13:22:58 -08:00
ishaan-jaff
cfd30bb152 (feat) proxy server add /routes to see available routes 2023-11-22 13:20:21 -08:00
Krrish Dholakia
57e894ad5e fix(proxy_server.py): If master key is set, only master key can be used to generate new keys 2023-11-22 10:18:28 -08:00
Krrish Dholakia
2a681e578c fix(proxy_server): fix linting issues 2023-11-22 08:47:59 -08:00
ishaan-jaff
12c2d1411a (test) load test with api.litellm.ai 2023-11-21 21:07:27 -08:00
ishaan-jaff
bd5c89aab9 (test) load test proxy 2023-11-21 21:04:46 -08:00
ishaan-jaff
4ccee2e1a6 (test) load test q 2023-11-21 20:48:56 -08:00
ishaan-jaff
67b7aba40f (test) load test q 2023-11-21 20:48:56 -08:00
ishaan-jaff
97a8177dfc (fix) caching config 2023-11-21 20:48:56 -08:00
ishaan-jaff
4db51c6eae (fix) cache configs 2023-11-21 20:48:56 -08:00
Krrish Dholakia
e96bb8868d fix(proxy_server.py): fix /models endpoint 2023-11-21 20:15:43 -08:00
Krrish Dholakia
9a0a24259c fix(celery_app.py): add retries to worker 2023-11-21 20:07:16 -08:00
ishaan-jaff
5bf5bdf73a (test) test q 2023-11-21 19:45:46 -08:00
ishaan-jaff
fd3462fb4f (fix) proxy server set model list through headers 2023-11-21 19:33:48 -08:00
Krrish Dholakia
550ddb4a6b docs(routing.md): update routing docs 2023-11-21 19:32:50 -08:00
ishaan-jaff
466e5c2a86 (test) test q 2023-11-21 18:15:00 -08:00
ishaan-jaff
e8b2c64255 (test) refactor test 2023-11-21 18:02:49 -08:00
Krrish Dholakia
9205f70b0f docs(routing.md): add queueing to docs 2023-11-21 18:01:02 -08:00
ishaan-jaff
70af870005 (fix) explicitly run prisma generate 2023-11-21 17:42:42 -08:00
ishaan-jaff
3f442d85a4 (fix) prisma 2023-11-21 17:38:34 -08:00
ishaan-jaff
e4b7040500 (fix) prisma always installed on deploys 2023-11-21 17:27:07 -08:00
ishaan-jaff
5e62b8fdce (docs) example hosted litellm yaml 2023-11-21 16:59:33 -08:00
ishaan-jaff
7ec9dbc94e (test) add test for testing queuing 2023-11-21 16:59:33 -08:00
ishaan-jaff
0f80ae46a0 (test) move test file dir 2023-11-21 16:59:33 -08:00
Krrish Dholakia
7fb3a71b47 refactor(proxy_server.py): using celery workers instead of rq for concurrency 2023-11-21 16:31:56 -08:00
ishaan-jaff
cfafdb3463 (feat)proxy: readon config per request 2023-11-21 16:26:05 -08:00
ishaan-jaff
c3ff9cd433 (feat) proxy: add config col to prisma config 2023-11-21 16:22:26 -08:00
ishaan-jaff
0c5cfe5d1e (docs) proxy queue config yaml 2023-11-21 16:22:00 -08:00
Krrish Dholakia
5c3ea2a97e refactor(rq_worker.py): put rq worker behind function call (prevent default import) 2023-11-21 13:51:42 -08:00
Krrish Dholakia
68c955409d refactor(proxy_server.py): refactoring background rq worker 2023-11-21 13:47:09 -08:00
Krrish Dholakia
7fb25ef2dd fix(proxy_server.py): defaulting status to queued 2023-11-21 12:40:30 -08:00