Commit graph

3524 commits

Author SHA1 Message Date
Krrish Dholakia
e5fa4eb314 fix(celery_app.py): add retries to worker 2023-11-21 20:07:16 -08:00
Krrish Dholakia
381fdcd37b fix(utils.py): add response ms for async calls 2023-11-21 19:59:00 -08:00
ishaan-jaff
898f15af2b (docs) update routing with api.litellm.ai 2023-11-21 19:54:29 -08:00
ishaan-jaff
580e6dc166 (test) test q 2023-11-21 19:45:46 -08:00
ishaan-jaff
363d74cf7d (fix) proxy server set model list through headers 2023-11-21 19:33:48 -08:00
Krrish Dholakia
826f56a6a0 docs(routing.md): update routing docs 2023-11-21 19:32:50 -08:00
ishaan-jaff
fd3895878d (test) test q 2023-11-21 18:15:00 -08:00
ishaan-jaff
f91bc03f4c (fix) add psutil req.txt 2023-11-21 18:15:00 -08:00
Krish Dholakia
3cb25d5681 Update model_prices_and_context_window.json 2023-11-21 18:04:02 -08:00
ishaan-jaff
b6be6ef574 (test) refactor test 2023-11-21 18:02:49 -08:00
Krrish Dholakia
9d97082eed docs(routing.md): add queueing to docs 2023-11-21 18:01:02 -08:00
ishaan-jaff
a2681e353f (fix) explicitly run prisma generate 2023-11-21 17:42:42 -08:00
ishaan-jaff
99b2a1bc8a (fix) prisma 2023-11-21 17:38:34 -08:00
ishaan-jaff
90b8fd8ac5 (fix) add celery to req 2023-11-21 17:30:47 -08:00
ishaan-jaff
410c059d52 (fix) prisma always installed on deploys 2023-11-21 17:27:07 -08:00
ishaan-jaff
dcc60ae31a (docs) routing 2023-11-21 17:22:06 -08:00
ishaan-jaff
8b8e5890e3 (docs) routing 2023-11-21 17:20:24 -08:00
ishaan-jaff
4cfaf7ff95 (fix) dockerfile 2023-11-21 17:20:14 -08:00
ishaan-jaff
fbf9cab5b9 (fix) dockerfile 2023-11-21 17:14:34 -08:00
ishaan-jaff
53558cc8b3 (feat) update req.txt 2023-11-21 17:08:03 -08:00
ishaan-jaff
38c1ee2349 (feat) update dockerfile 2023-11-21 17:07:53 -08:00
ishaan-jaff
213b345a43 (docs) example hosted litellm yaml 2023-11-21 16:59:33 -08:00
ishaan-jaff
e5810a177c (test) add test for testing queuing 2023-11-21 16:59:33 -08:00
ishaan-jaff
a5ecf8d7c1 (test) move test file dir 2023-11-21 16:59:33 -08:00
ishaan-jaff
da90d01d04 (docs) proxy 2023-11-21 16:59:33 -08:00
Krrish Dholakia
b16646e584 refactor(proxy_server.py): using celery workers instead of rq for concurrency 2023-11-21 16:31:56 -08:00
ishaan-jaff
8c98a2c899 (feat)proxy: readon config per request 2023-11-21 16:26:05 -08:00
ishaan-jaff
6117bcb19f (feat) proxy: add config col to prisma config 2023-11-21 16:22:26 -08:00
ishaan-jaff
5835e6ed04 (docs) proxy queue config yaml 2023-11-21 16:22:00 -08:00
ishaan-jaff
ab75654e0d (docs) queue server 2023-11-21 14:54:03 -08:00
ishaan-jaff
93ec24710b (docs) exception mapping 2023-11-21 14:09:51 -08:00
ishaan-jaff
8093a4fc6f (test) load test queing 2023-11-21 14:05:04 -08:00
ishaan-jaff
34886a99cc (docs) fix exception mapping 2023-11-21 14:05:04 -08:00
Krish Dholakia
e4f1e2b138 Merge pull request #845 from canada4663/upstream-main: Added support for multiple embeddings via Bedrock 2023-11-21 14:00:06 -08:00
Krrish Dholakia
70f3159a2a refactor(rq_worker.py): put rq worker behind function call (prevent default import) 2023-11-21 13:51:42 -08:00
Krrish Dholakia
a1f6b9b531 refactor(proxy_server.py): refactoring background rq worker 2023-11-21 13:47:09 -08:00
ishaan-jaff
7c79d10e9f (test) catch streaming + timeout exceptions 2023-11-21 13:46:21 -08:00
ishaan-jaff
a2e2889d88 (docs) catch streaming exceptions 2023-11-21 13:45:54 -08:00
ishaan-jaff
f18a64f6a4 (docs) catching timeout exceptions 2023-11-21 13:39:59 -08:00
ishaan-jaff
9d6569791f (test) timeout error 2023-11-21 13:37:12 -08:00
ishaan-jaff
589cdce8d3 (fix) request_timeout mapped to timeout for openai 2023-11-21 13:22:21 -08:00
ishaan-jaff
20b129b7ad (docs) update timeout variable name 2023-11-21 13:14:55 -08:00
ishaan-jaff
b98d9199f6 (test) load test queing 2023-11-21 12:51:58 -08:00
Krrish Dholakia
015c7de549 fix(proxy_server.py): defaulting status to queued 2023-11-21 12:40:30 -08:00
ishaan-jaff
b2c35bbd50 (test) load test proxy 2023-11-21 12:38:46 -08:00
Krrish Dholakia
331fe86c8f refactor(proxy_server.py): experimental endpoint - defined job status 2023-11-21 12:37:00 -08:00
ishaan-jaff
4e06b4a26f (test) add load test queuing 2023-11-21 12:14:20 -08:00
ishaan-jaff
78f90cd0d0 (test) load test litelm proxy 2023-11-21 12:14:20 -08:00
Krrish Dholakia
d0f228e53f refactor(proxy_server.py): refactoring experimental endpoint names 2023-11-21 12:12:15 -08:00
Krrish Dholakia
c6a4744947 feat(proxy_server.py): EXPERIMENTAL: adding queuing endpoints to openai proxy server 2023-11-21 12:06:23 -08:00