Kumaran Rajendhiran
|
9fb31448a9
|
Fail gracefully if ollama is already being served
|
2023-11-24 16:52:55 +05:30 |
|
Krrish Dholakia
|
daa45b4031
|
fix(proxy_server.py): fix linting errors
|
2023-11-23 21:42:39 -08:00 |
|
Krrish Dholakia
|
8030a9b8d1
|
feat(proxy_server.py): /key/delete endpoint
|
2023-11-23 21:37:53 -08:00 |
|
ishaan-jaff
|
7ccfbde6fd
|
(fix) proxy: /embeddings
|
2023-11-23 21:16:51 -08:00 |
|
ishaan-jaff
|
d8c7417647
|
(fix) proxy: prisma.schema
|
2023-11-23 20:11:42 -08:00 |
|
Krrish Dholakia
|
e4deb09eb6
|
fix(router.py): add support for context window fallbacks on router
|
2023-11-23 16:43:02 -08:00 |
|
ishaan-jaff
|
c64aad7335
|
(feat) proxy: cost tracking per completion request
|
2023-11-23 16:08:59 -08:00 |
|
Krrish Dholakia
|
7610b1f0af
|
feat(proxy_server.py): add in-memory caching for user api keys
|
2023-11-23 13:21:45 -08:00 |
|
ishaan-jaff
|
4ade4d4e8a
|
(feat) proxy server: add spend column
|
2023-11-23 11:46:59 -08:00 |
|
Krrish Dholakia
|
0e3064ac8c
|
fix(router.py): fix caching for tracking cooldowns + usage
|
2023-11-23 11:13:32 -08:00 |
|
ishaan-jaff
|
9570636474
|
(Feat) update config.yaml example on proxy
|
2023-11-23 10:54:30 -08:00 |
|
ishaan-jaff
|
b9f0316032
|
(feat) proxy: caching - show redis settings when initializing
|
2023-11-23 10:52:50 -08:00 |
|
Krish Dholakia
|
31bb24e9c1
|
Merge pull request #881 from Manouchehri/lambda-1
Add AWS Lambda Support
|
2023-11-23 10:38:34 -08:00 |
|
ishaan-jaff
|
9648a8594b
|
(feat) proxy: add curl command test + read cache config
|
2023-11-23 10:31:04 -08:00 |
|
David Manouchehri
|
ed5b075080
|
Add mangum.
|
2023-11-23 00:04:47 -05:00 |
|
Krrish Dholakia
|
2df4791ae9
|
fix: fix linting errors
|
2023-11-22 19:59:25 -08:00 |
|
ishaan-jaff
|
52e2ac0106
|
(test) proxy test exception mapping
|
2023-11-22 16:22:05 -08:00 |
|
Krrish Dholakia
|
310b24a436
|
feat(proxy_server): add /v1/embeddings endpoint
|
2023-11-22 14:03:27 -08:00 |
|
Krrish Dholakia
|
bd87e30058
|
test(test_caching.py): cleaning up tests
|
2023-11-22 13:43:48 -08:00 |
|
Krrish Dholakia
|
e495a8a9c2
|
fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints
|
2023-11-22 13:22:58 -08:00 |
|
ishaan-jaff
|
cfd30bb152
|
(feat) proxy server add /routes to see available routes
|
2023-11-22 13:20:21 -08:00 |
|
Krrish Dholakia
|
57e894ad5e
|
fix(proxy_server.py): If master key is set, only master key can be used to generate new keys
|
2023-11-22 10:18:28 -08:00 |
|
Krrish Dholakia
|
2a681e578c
|
fix(proxy_server): fix linting issues
|
2023-11-22 08:47:59 -08:00 |
|
ishaan-jaff
|
12c2d1411a
|
(test) load test with api.litellm.ai
|
2023-11-21 21:07:27 -08:00 |
|
ishaan-jaff
|
bd5c89aab9
|
(test) load test proxy
|
2023-11-21 21:04:46 -08:00 |
|
ishaan-jaff
|
4ccee2e1a6
|
(test) load test q
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
67b7aba40f
|
(test) load test q
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
97a8177dfc
|
(fix) caching config
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
4db51c6eae
|
(fix) cache configs
|
2023-11-21 20:48:56 -08:00 |
|
Krrish Dholakia
|
e96bb8868d
|
fix(proxy_server.py): fix /models endpoint
|
2023-11-21 20:15:43 -08:00 |
|
Krrish Dholakia
|
9a0a24259c
|
fix(celery_app.py): add retries to worker
|
2023-11-21 20:07:16 -08:00 |
|
ishaan-jaff
|
5bf5bdf73a
|
(test) test q
|
2023-11-21 19:45:46 -08:00 |
|
ishaan-jaff
|
fd3462fb4f
|
(fix) proxy server set model list through headers
|
2023-11-21 19:33:48 -08:00 |
|
Krrish Dholakia
|
550ddb4a6b
|
docs(routing.md): update routing docs
|
2023-11-21 19:32:50 -08:00 |
|
ishaan-jaff
|
466e5c2a86
|
(test) test q
|
2023-11-21 18:15:00 -08:00 |
|
ishaan-jaff
|
e8b2c64255
|
(test) refactor test
|
2023-11-21 18:02:49 -08:00 |
|
Krrish Dholakia
|
9205f70b0f
|
docs(routing.md): add queueing to docs
|
2023-11-21 18:01:02 -08:00 |
|
ishaan-jaff
|
70af870005
|
(fix) explicitly run prisma generate
|
2023-11-21 17:42:42 -08:00 |
|
ishaan-jaff
|
3f442d85a4
|
(fix) prisma
|
2023-11-21 17:38:34 -08:00 |
|
ishaan-jaff
|
e4b7040500
|
(fix) prisma always installed on deploys
|
2023-11-21 17:27:07 -08:00 |
|
ishaan-jaff
|
5e62b8fdce
|
(docs) example hosted litellm yaml
|
2023-11-21 16:59:33 -08:00 |
|
ishaan-jaff
|
7ec9dbc94e
|
(test) add test for testing queuing
|
2023-11-21 16:59:33 -08:00 |
|
ishaan-jaff
|
0f80ae46a0
|
(test) move test file dir
|
2023-11-21 16:59:33 -08:00 |
|
Krrish Dholakia
|
7fb3a71b47
|
refactor(proxy_server.py): using celery workers instead of rq for concurrency
|
2023-11-21 16:31:56 -08:00 |
|
ishaan-jaff
|
cfafdb3463
|
(feat)proxy: readon config per request
|
2023-11-21 16:26:05 -08:00 |
|
ishaan-jaff
|
c3ff9cd433
|
(feat) proxy: add config col to prisma config
|
2023-11-21 16:22:26 -08:00 |
|
ishaan-jaff
|
0c5cfe5d1e
|
(docs) proxy queue config yaml
|
2023-11-21 16:22:00 -08:00 |
|
Krrish Dholakia
|
5c3ea2a97e
|
refactor(rq_worker.py): put rq worker behind function call (prevent default import)
|
2023-11-21 13:51:42 -08:00 |
|
Krrish Dholakia
|
68c955409d
|
refactor(proxy_server.py): refactoring background rq worker
|
2023-11-21 13:47:09 -08:00 |
|
Krrish Dholakia
|
7fb25ef2dd
|
fix(proxy_server.py): defaulting status to queued
|
2023-11-21 12:40:30 -08:00 |
|