Krrish Dholakia
|
c273d6f0d6
|
fix(router.py): add support for context window fallbacks on router
|
2023-11-23 16:43:02 -08:00 |
|
ishaan-jaff
|
9b53ea4b0f
|
(feat) proxy: cost tracking per completion request
|
2023-11-23 16:08:59 -08:00 |
|
Krrish Dholakia
|
8c4e8d6c62
|
feat(proxy_server.py): add in-memory caching for user api keys
|
2023-11-23 13:21:45 -08:00 |
|
ishaan-jaff
|
8291f239a4
|
(feat) proxy server: add spend column
|
2023-11-23 11:46:59 -08:00 |
|
Krrish Dholakia
|
61fc76a8c4
|
fix(router.py): fix caching for tracking cooldowns + usage
|
2023-11-23 11:13:32 -08:00 |
|
ishaan-jaff
|
9d3d10da4c
|
(Feat) update config.yaml example on proxy
|
2023-11-23 10:54:30 -08:00 |
|
ishaan-jaff
|
1dcab433b4
|
(feat) proxy: caching - show redis settings when initializing
|
2023-11-23 10:52:50 -08:00 |
|
Krish Dholakia
|
da12c13c5a
|
Merge pull request #881 from Manouchehri/lambda-1
Add AWS Lambda Support
|
2023-11-23 10:38:34 -08:00 |
|
ishaan-jaff
|
ae3eeb16ab
|
(feat) proxy: add curl command test + read cache config
|
2023-11-23 10:31:04 -08:00 |
|
David Manouchehri
|
60ca43b6d9
|
Add mangum.
|
2023-11-23 00:04:47 -05:00 |
|
Krrish Dholakia
|
2f93c0155a
|
fix: fix linting errors
|
2023-11-22 19:59:25 -08:00 |
|
ishaan-jaff
|
cb41b14cc2
|
(test) proxy test exception mapping
|
2023-11-22 16:22:05 -08:00 |
|
Krrish Dholakia
|
448ec0a571
|
feat(proxy_server): add /v1/embeddings endpoint
n
|
2023-11-22 14:03:27 -08:00 |
|
Krrish Dholakia
|
b0801f61e6
|
test(test_caching.py): cleaning up tests
|
2023-11-22 13:43:48 -08:00 |
|
Krrish Dholakia
|
78582e158a
|
fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints
|
2023-11-22 13:22:58 -08:00 |
|
ishaan-jaff
|
ba73224a3a
|
(feat) proxy server add /routes to see available routes
|
2023-11-22 13:20:21 -08:00 |
|
Krrish Dholakia
|
604ad41eac
|
fix(proxy_server.py): If master key is set, only master key can be used to generate new keys
|
2023-11-22 10:18:28 -08:00 |
|
Krrish Dholakia
|
e7bb4a0cbd
|
fix(proxy_server): fix linting issues
|
2023-11-22 08:47:59 -08:00 |
|
ishaan-jaff
|
2f3e13e43b
|
(test) load test with api.litellm.ai
|
2023-11-21 21:07:27 -08:00 |
|
ishaan-jaff
|
b770ff2404
|
(test) load test proxy
|
2023-11-21 21:04:46 -08:00 |
|
ishaan-jaff
|
d1ad84c26d
|
(test) load test q
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
359f542c10
|
(test) load test q
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
6aa8b41fb3
|
(fix) caching config
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
3c30705b76
|
(fix) cache configs
|
2023-11-21 20:48:56 -08:00 |
|
Krrish Dholakia
|
904def6119
|
fix(proxy_server.py): fix /models endpoint
|
2023-11-21 20:15:43 -08:00 |
|
Krrish Dholakia
|
e5fa4eb314
|
fix(celery_app.py): add retries to worker
|
2023-11-21 20:07:16 -08:00 |
|
ishaan-jaff
|
580e6dc166
|
(test) test q
|
2023-11-21 19:45:46 -08:00 |
|
ishaan-jaff
|
363d74cf7d
|
(fix) proxy server set model list through headers
|
2023-11-21 19:33:48 -08:00 |
|
Krrish Dholakia
|
826f56a6a0
|
docs(routing.md): update routing docs
|
2023-11-21 19:32:50 -08:00 |
|
ishaan-jaff
|
fd3895878d
|
(test) test q
|
2023-11-21 18:15:00 -08:00 |
|
ishaan-jaff
|
b6be6ef574
|
(test) refactor test
|
2023-11-21 18:02:49 -08:00 |
|
Krrish Dholakia
|
9d97082eed
|
docs(routing.md): add queueing to docs
|
2023-11-21 18:01:02 -08:00 |
|
ishaan-jaff
|
a2681e353f
|
(fix) explicitly run prisma generate
|
2023-11-21 17:42:42 -08:00 |
|
ishaan-jaff
|
99b2a1bc8a
|
(fix) prisma
|
2023-11-21 17:38:34 -08:00 |
|
ishaan-jaff
|
410c059d52
|
(fix) prisma always installed on deploys
|
2023-11-21 17:27:07 -08:00 |
|
ishaan-jaff
|
213b345a43
|
(docs) example hosted litellm yaml
|
2023-11-21 16:59:33 -08:00 |
|
ishaan-jaff
|
e5810a177c
|
(test) add test for testing queuing
|
2023-11-21 16:59:33 -08:00 |
|
ishaan-jaff
|
a5ecf8d7c1
|
(test) move test file dir
|
2023-11-21 16:59:33 -08:00 |
|
Krrish Dholakia
|
b16646e584
|
refactor(proxy_server.py): using celery workers instead of rq for concurrency
|
2023-11-21 16:31:56 -08:00 |
|
ishaan-jaff
|
8c98a2c899
|
(feat)proxy: readon config per request
|
2023-11-21 16:26:05 -08:00 |
|
ishaan-jaff
|
6117bcb19f
|
(feat) proxy: add config col to prisma config
|
2023-11-21 16:22:26 -08:00 |
|
ishaan-jaff
|
5835e6ed04
|
(docs) proxy queue config yaml
|
2023-11-21 16:22:00 -08:00 |
|
Krrish Dholakia
|
70f3159a2a
|
refactor(rq_worker.py): put rq worker behind function call (prevent default import)
|
2023-11-21 13:51:42 -08:00 |
|
Krrish Dholakia
|
a1f6b9b531
|
refactor(proxy_server.py): refactoring background rq worker
|
2023-11-21 13:47:09 -08:00 |
|
Krrish Dholakia
|
015c7de549
|
fix(proxy_server.py): defaulting status to queued
|
2023-11-21 12:40:30 -08:00 |
|
Krrish Dholakia
|
331fe86c8f
|
refactor(proxy_server.py): experimental endpoint - defined job status
|
2023-11-21 12:37:00 -08:00 |
|
Krrish Dholakia
|
d0f228e53f
|
refactor(proxy_server.py): refactoring experimental endpoint names
|
2023-11-21 12:12:15 -08:00 |
|
Krrish Dholakia
|
c6a4744947
|
feat(proxy_server.py): EXPERIMENTAL: adding queuing endpoints to openai proxy server
|
2023-11-21 12:06:23 -08:00 |
|
ishaan-jaff
|
2a35ff88a7
|
(fix) proxy server LiteLLM warning
|
2023-11-21 08:50:31 -08:00 |
|
ishaan-jaff
|
8969c5f769
|
(docs) proxy starting config.yaml
|
2023-11-21 08:36:34 -08:00 |
|