Commit graph

3838 commits

Author SHA1 Message Date
Krrish Dholakia
604ad41eac fix(proxy_server.py): If master key is set, only master key can be used to generate new keys 2023-11-22 10:18:28 -08:00
Krrish Dholakia
10fe16c965 fix(utils.py): add param mapping for perplexity, anyscale, deepinfra 2023-11-22 10:04:27 -08:00
Krrish Dholakia
e7bb4a0cbd fix(proxy_server): fix linting issues 2023-11-22 08:47:59 -08:00
ishaan-jaff
7a4be44805 (docs) request q 2023-11-22 08:10:30 -08:00
Krrish Dholakia
a4406e1784 bump: version 1.3.4 → 1.4.0 2023-11-21 21:19:27 -08:00
Krrish Dholakia
76f46902ed feat(router.py): adding latency-based routing strategy 2023-11-21 21:19:27 -08:00
ishaan-jaff
2f3e13e43b (test) load test with api.litellm.ai 2023-11-21 21:07:27 -08:00
ishaan-jaff
b770ff2404 (test) load test proxy 2023-11-21 21:04:46 -08:00
ishaan-jaff
d1ad84c26d (test) load test q 2023-11-21 20:48:56 -08:00
ishaan-jaff
359f542c10 (test) load test q 2023-11-21 20:48:56 -08:00
ishaan-jaff
6aa8b41fb3 (fix) caching config 2023-11-21 20:48:56 -08:00
ishaan-jaff
3c30705b76 (fix) cache configs 2023-11-21 20:48:56 -08:00
Krrish Dholakia
d7f292c108 bump: version 1.3.3 → 1.3.4 2023-11-21 20:16:00 -08:00
Krrish Dholakia
904def6119 fix(proxy_server.py): fix /models endpoint 2023-11-21 20:15:43 -08:00
Krrish Dholakia
e5fa4eb314 fix(celery_app.py): add retries to worker 2023-11-21 20:07:16 -08:00
Krrish Dholakia
381fdcd37b fix(utils.py): add response ms for async calls 2023-11-21 19:59:00 -08:00
ishaan-jaff
898f15af2b (docs) update routing with api.litellm.ai 2023-11-21 19:54:29 -08:00
ishaan-jaff
580e6dc166 (test) test q 2023-11-21 19:45:46 -08:00
ishaan-jaff
363d74cf7d (fix) proxy server set model list through headers 2023-11-21 19:33:48 -08:00
Krrish Dholakia
826f56a6a0 docs(routing.md): update routing docs 2023-11-21 19:32:50 -08:00
ishaan-jaff
fd3895878d (test) test q 2023-11-21 18:15:00 -08:00
ishaan-jaff
f91bc03f4c (fix) add psutil req.txt 2023-11-21 18:15:00 -08:00
Krish Dholakia
3cb25d5681 Update model_prices_and_context_window.json 2023-11-21 18:04:02 -08:00
ishaan-jaff
b6be6ef574 (test) refactor test 2023-11-21 18:02:49 -08:00
Krrish Dholakia
9d97082eed docs(routing.md): add queueing to docs 2023-11-21 18:01:02 -08:00
ishaan-jaff
a2681e353f (fix) explicitly run prisma generate 2023-11-21 17:42:42 -08:00
ishaan-jaff
99b2a1bc8a (fix) prisma 2023-11-21 17:38:34 -08:00
ishaan-jaff
90b8fd8ac5 (fix) add celery to req 2023-11-21 17:30:47 -08:00
ishaan-jaff
410c059d52 (fix) prisma always installed on deploys 2023-11-21 17:27:07 -08:00
ishaan-jaff
dcc60ae31a (docs) routing 2023-11-21 17:22:06 -08:00
ishaan-jaff
8b8e5890e3 (docs) routing 2023-11-21 17:20:24 -08:00
ishaan-jaff
4cfaf7ff95 (fix) dockerfile 2023-11-21 17:20:14 -08:00
ishaan-jaff
fbf9cab5b9 (fix) dockerfile 2023-11-21 17:14:34 -08:00
ishaan-jaff
53558cc8b3 (feat) update req.txt 2023-11-21 17:08:03 -08:00
ishaan-jaff
38c1ee2349 (feat) update dockerfile 2023-11-21 17:07:53 -08:00
ishaan-jaff
213b345a43 (docs) example hosted litellm yaml 2023-11-21 16:59:33 -08:00
ishaan-jaff
e5810a177c (test) add test for testing queuing 2023-11-21 16:59:33 -08:00
ishaan-jaff
a5ecf8d7c1 (test) move test file dir 2023-11-21 16:59:33 -08:00
ishaan-jaff
da90d01d04 (docs) proxy 2023-11-21 16:59:33 -08:00
Krrish Dholakia
b16646e584 refactor(proxy_server.py): using celery workers instead of rq for concurrency 2023-11-21 16:31:56 -08:00
ishaan-jaff
8c98a2c899 (feat)proxy: readon config per request 2023-11-21 16:26:05 -08:00
ishaan-jaff
6117bcb19f (feat) proxy: add config col to prisma config 2023-11-21 16:22:26 -08:00
ishaan-jaff
5835e6ed04 (docs) proxy queue config yaml 2023-11-21 16:22:00 -08:00
ishaan-jaff
ab75654e0d (docs) queue server 2023-11-21 14:54:03 -08:00
ishaan-jaff
93ec24710b (docs) exception mapping 2023-11-21 14:09:51 -08:00
ishaan-jaff
8093a4fc6f (test) load test queing 2023-11-21 14:05:04 -08:00
ishaan-jaff
34886a99cc (docs) fix exception mapping 2023-11-21 14:05:04 -08:00
Krish Dholakia
e4f1e2b138 Merge pull request #845 from canada4663/upstream-main: Added support for multiple embeddings via Bedrock 2023-11-21 14:00:06 -08:00
Krrish Dholakia
70f3159a2a refactor(rq_worker.py): put rq worker behind function call (prevent default import) 2023-11-21 13:51:42 -08:00
Krrish Dholakia
a1f6b9b531 refactor(proxy_server.py): refactoring background rq worker 2023-11-21 13:47:09 -08:00