Commit graph

1812 commits

Author SHA1 Message Date
ishaan-jaff
da7c07574f (test) cost calc on azure 2023-11-23 13:50:09 -08:00
Krrish Dholakia
c074023e14 fix: fix linting issues 2023-11-23 13:47:43 -08:00
Krrish Dholakia
4f183dc6a0 fix(utils.py): support reading api keys dynamically from the os environment 2023-11-23 13:41:56 -08:00
Krrish Dholakia
87aa36a2ec fix(caching.py): fix linting issues 2023-11-23 13:21:45 -08:00
Krrish Dholakia
8c4e8d6c62 feat(proxy_server.py): add in-memory caching for user api keys 2023-11-23 13:21:45 -08:00
ishaan-jaff
8291f239a4 (feat) proxy server: add spend column 2023-11-23 11:46:59 -08:00
ishaan-jaff
2c50ea94c8 (test) embedding: azurea - dd response_ms 2023-11-23 11:15:52 -08:00
Krrish Dholakia
61fc76a8c4 fix(router.py): fix caching for tracking cooldowns + usage 2023-11-23 11:13:32 -08:00
ishaan-jaff
9d3d10da4c (Feat) update config.yaml example on proxy 2023-11-23 10:54:30 -08:00
ishaan-jaff
1dcab433b4 (feat) proxy: caching - show redis settings when initializing 2023-11-23 10:52:50 -08:00
Krish Dholakia
da12c13c5a
Merge pull request #881 from Manouchehri/lambda-1
Add AWS Lambda Support
2023-11-23 10:38:34 -08:00
ishaan-jaff
ae3eeb16ab (feat) proxy: add curl command test + read cache config 2023-11-23 10:31:04 -08:00
ishaan-jaff
fbd2ab4c71 (test) caching ensure we always test add/get caching redis 2023-11-23 08:27:44 -08:00
Krish Dholakia
6ba4eeb961
Merge pull request #885 from Codium-ai/bugfix/hf_timeout
Do not timeout when calling HF through acomplete
2023-11-23 07:48:59 -08:00
Ori Kotek
e74ac03169 Do not timeout when calling HF through acomplete 2023-11-23 15:56:59 +02:00
maqsoodshaik
0f89c3375a this commit fixes #883 2023-11-23 12:45:38 +01:00
David Manouchehri
60ca43b6d9 Add mangum. 2023-11-23 00:04:47 -05:00
ishaan-jaff
db146bc40a (test) router with fallback deployments 2023-11-22 20:52:56 -08:00
ishaan-jaff
8ebc1b974c (chore) run ci/cd again 2023-11-22 20:34:14 -08:00
David Manouchehri
4d9aa7a3f7 Add symlink and only copy in source dir to stay under 50MB compressed limit for Lambdas. 2023-11-22 23:07:33 -05:00
Krrish Dholakia
2f93c0155a fix: fix linting errors 2023-11-22 19:59:25 -08:00
Krrish Dholakia
5d5ca9f7ef fix(router.py): add support for cooldowns with redis 2023-11-22 19:54:22 -08:00
ishaan-jaff
cb41b14cc2 (test) proxy test exception mapping 2023-11-22 16:22:05 -08:00
ishaan-jaff
4260e0c1f0 (fix) linting error 2023-11-22 16:22:05 -08:00
Krrish Dholakia
3e76d4b422 feat(router.py): add server cooldown logic 2023-11-22 15:59:48 -08:00
ishaan-jaff
5abd566b7c (feat) embedding() support for timeouts 2023-11-22 14:25:55 -08:00
ishaan-jaff
c38782521c (test)timeout error on openai embedding 2023-11-22 14:25:55 -08:00
ishaan-jaff
40e88eec4b (test)timeout errors 2023-11-22 14:25:55 -08:00
ishaan-jaff
3059f30672 (test) verify azure response have expected keys 2023-11-22 14:25:55 -08:00
ishaan-jaff
4247df02c7 (fix) Azure - only use ad_token when api_key is None 2023-11-22 14:25:55 -08:00
ishaan-jaff
b3bca98561 (feat) embedding() remove junk params 2023-11-22 14:25:55 -08:00
Krrish Dholakia
0b4e10e068 test(test_embedding.py): fix the embedding test 2023-11-22 14:09:45 -08:00
Krrish Dholakia
448ec0a571 feat(proxy_server): add /v1/embeddings endpoint 2023-11-22 14:03:27 -08:00
ishaan-jaff
40dd38508f (test) embedding stricter testing 2023-11-22 13:50:45 -08:00
ishaan-jaff
e8ff4d5eca (feat) clean out junk params from litellm embedding 2023-11-22 13:50:45 -08:00
Krrish Dholakia
b0801f61e6 test(test_caching.py): cleaning up tests 2023-11-22 13:43:48 -08:00
Krrish Dholakia
78582e158a fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints 2023-11-22 13:22:58 -08:00
ishaan-jaff
ba73224a3a (feat) proxy server add /routes to see available routes 2023-11-22 13:20:21 -08:00
Krrish Dholakia
604ad41eac fix(proxy_server.py): If master key is set, only master key can be used to generate new keys 2023-11-22 10:18:28 -08:00
Krrish Dholakia
10fe16c965 fix(utils.py): add param mapping for perplexity, anyscale, deepinfra 2023-11-22 10:04:27 -08:00
Krrish Dholakia
e7bb4a0cbd fix(proxy_server): fix linting issues 2023-11-22 08:47:59 -08:00
Krrish Dholakia
76f46902ed feat(router.py): adding latency-based routing strategy 2023-11-21 21:19:27 -08:00
ishaan-jaff
2f3e13e43b (test) load test with api.litellm.ai 2023-11-21 21:07:27 -08:00
ishaan-jaff
b770ff2404 (test) load test proxy 2023-11-21 21:04:46 -08:00
ishaan-jaff
d1ad84c26d (test) load test q 2023-11-21 20:48:56 -08:00
ishaan-jaff
359f542c10 (test) load test q 2023-11-21 20:48:56 -08:00
ishaan-jaff
6aa8b41fb3 (fix) caching config 2023-11-21 20:48:56 -08:00
ishaan-jaff
3c30705b76 (fix) cache configs 2023-11-21 20:48:56 -08:00
Krrish Dholakia
904def6119 fix(proxy_server.py): fix /models endpoint 2023-11-21 20:15:43 -08:00
Krrish Dholakia
e5fa4eb314 fix(celery_app.py): add retries to worker 2023-11-21 20:07:16 -08:00