Krrish Dholakia
|
e5fa4eb314
|
fix(celery_app.py): add retries to worker
|
2023-11-21 20:07:16 -08:00 |
|
Krrish Dholakia
|
381fdcd37b
|
fix(utils.py): add response ms for async calls
|
2023-11-21 19:59:00 -08:00 |
|
ishaan-jaff
|
898f15af2b
|
(docs) update routing with api.litellm.ai
|
2023-11-21 19:54:29 -08:00 |
|
ishaan-jaff
|
580e6dc166
|
(test) test q
|
2023-11-21 19:45:46 -08:00 |
|
ishaan-jaff
|
363d74cf7d
|
(fix) proxy server set model list through headers
|
2023-11-21 19:33:48 -08:00 |
|
Krrish Dholakia
|
826f56a6a0
|
docs(routing.md): update routing docs
|
2023-11-21 19:32:50 -08:00 |
|
ishaan-jaff
|
fd3895878d
|
(test) test q
|
2023-11-21 18:15:00 -08:00 |
|
ishaan-jaff
|
f91bc03f4c
|
(fix) add psutil req.txt
|
2023-11-21 18:15:00 -08:00 |
|
Krish Dholakia
|
3cb25d5681
|
Update model_prices_and_context_window.json
|
2023-11-21 18:04:02 -08:00 |
|
ishaan-jaff
|
b6be6ef574
|
(test) refactor test
|
2023-11-21 18:02:49 -08:00 |
|
Krrish Dholakia
|
9d97082eed
|
docs(routing.md): add queueing to docs
|
2023-11-21 18:01:02 -08:00 |
|
ishaan-jaff
|
a2681e353f
|
(fix) explicitly run prisma generate
|
2023-11-21 17:42:42 -08:00 |
|
ishaan-jaff
|
99b2a1bc8a
|
(fix) prisma
|
2023-11-21 17:38:34 -08:00 |
|
ishaan-jaff
|
90b8fd8ac5
|
(fix) add celery to req
|
2023-11-21 17:30:47 -08:00 |
|
ishaan-jaff
|
410c059d52
|
(fix) prisma always installed on deploys
|
2023-11-21 17:27:07 -08:00 |
|
ishaan-jaff
|
dcc60ae31a
|
(docs) routing
|
2023-11-21 17:22:06 -08:00 |
|
ishaan-jaff
|
8b8e5890e3
|
(docs) routing
|
2023-11-21 17:20:24 -08:00 |
|
ishaan-jaff
|
4cfaf7ff95
|
(fix) dockerfile
|
2023-11-21 17:20:14 -08:00 |
|
ishaan-jaff
|
fbf9cab5b9
|
(fix) dockerfile
|
2023-11-21 17:14:34 -08:00 |
|
ishaan-jaff
|
53558cc8b3
|
(feat) update req.txt
|
2023-11-21 17:08:03 -08:00 |
|
ishaan-jaff
|
38c1ee2349
|
(feat) update dockerfile
|
2023-11-21 17:07:53 -08:00 |
|
ishaan-jaff
|
213b345a43
|
(docs) example hosted litellm yaml
|
2023-11-21 16:59:33 -08:00 |
|
ishaan-jaff
|
e5810a177c
|
(test) add test for testing queuing
|
2023-11-21 16:59:33 -08:00 |
|
ishaan-jaff
|
a5ecf8d7c1
|
(test) move test file dir
|
2023-11-21 16:59:33 -08:00 |
|
ishaan-jaff
|
da90d01d04
|
(docs) proxy
|
2023-11-21 16:59:33 -08:00 |
|
Krrish Dholakia
|
b16646e584
|
refactor(proxy_server.py): using celery workers instead of rq for concurrency
|
2023-11-21 16:31:56 -08:00 |
|
ishaan-jaff
|
8c98a2c899
|
(feat)proxy: readon config per request
|
2023-11-21 16:26:05 -08:00 |
|
ishaan-jaff
|
6117bcb19f
|
(feat) proxy: add config col to prisma config
|
2023-11-21 16:22:26 -08:00 |
|
ishaan-jaff
|
5835e6ed04
|
(docs) proxy queue config yaml
|
2023-11-21 16:22:00 -08:00 |
|
ishaan-jaff
|
ab75654e0d
|
(docs) queue server
|
2023-11-21 14:54:03 -08:00 |
|
ishaan-jaff
|
93ec24710b
|
(docs) exception mapping
|
2023-11-21 14:09:51 -08:00 |
|
ishaan-jaff
|
8093a4fc6f
|
(test) load test queing
|
2023-11-21 14:05:04 -08:00 |
|
ishaan-jaff
|
34886a99cc
|
(docs) fix exception mapping
|
2023-11-21 14:05:04 -08:00 |
|
Krish Dholakia
|
e4f1e2b138
|
Merge pull request #845 from canada4663/upstream-main
Added support for multiple embeddings via Bedrock
|
2023-11-21 14:00:06 -08:00 |
|
Krrish Dholakia
|
70f3159a2a
|
refactor(rq_worker.py): put rq worker behind function call (prevent default import)
|
2023-11-21 13:51:42 -08:00 |
|
Krrish Dholakia
|
a1f6b9b531
|
refactor(proxy_server.py): refactoring background rq worker
|
2023-11-21 13:47:09 -08:00 |
|
ishaan-jaff
|
7c79d10e9f
|
(test) catch streaming + timeout exceptions
|
2023-11-21 13:46:21 -08:00 |
|
ishaan-jaff
|
a2e2889d88
|
(docs) catch streaming exceptions
|
2023-11-21 13:45:54 -08:00 |
|
ishaan-jaff
|
f18a64f6a4
|
(docs) catching timeout exceptions
|
2023-11-21 13:39:59 -08:00 |
|
ishaan-jaff
|
9d6569791f
|
(test) timeout error
|
2023-11-21 13:37:12 -08:00 |
|
ishaan-jaff
|
589cdce8d3
|
(fix) request_timeout mapped to timeout for openai
|
2023-11-21 13:22:21 -08:00 |
|
ishaan-jaff
|
20b129b7ad
|
(docs) update timeout variable name
|
2023-11-21 13:14:55 -08:00 |
|
ishaan-jaff
|
b98d9199f6
|
(test) load test queing
|
2023-11-21 12:51:58 -08:00 |
|
Krrish Dholakia
|
015c7de549
|
fix(proxy_server.py): defaulting status to queued
|
2023-11-21 12:40:30 -08:00 |
|
ishaan-jaff
|
b2c35bbd50
|
(test) load test proxy
|
2023-11-21 12:38:46 -08:00 |
|
Krrish Dholakia
|
331fe86c8f
|
refactor(proxy_server.py): experimental endpoint - defined job status
|
2023-11-21 12:37:00 -08:00 |
|
ishaan-jaff
|
4e06b4a26f
|
(test) add load test queuing
|
2023-11-21 12:14:20 -08:00 |
|
ishaan-jaff
|
78f90cd0d0
|
(test) load test litelm proxy
|
2023-11-21 12:14:20 -08:00 |
|
Krrish Dholakia
|
d0f228e53f
|
refactor(proxy_server.py): refactoring experimental endpoint names
|
2023-11-21 12:12:15 -08:00 |
|
Krrish Dholakia
|
c6a4744947
|
feat(proxy_server.py): EXPERIMENTAL: adding queuing endpoints to openai proxy server
|
2023-11-21 12:06:23 -08:00 |
|