Commit graph

3572 commits

Author SHA1 Message Date
Krish Dholakia
dbd5f2b3cc
Update README.md 2023-11-23 10:03:14 -08:00
ishaan-jaff
fbd2ab4c71 (test) caching ensure we always test add/get caching redis 2023-11-23 08:27:44 -08:00
ishaan-jaff
b15b723567 (docs) proxy server: add caching 2023-11-23 08:08:12 -08:00
ishaan-jaff
44e867499f (docs) proxy 2023-11-23 07:55:12 -08:00
Krish Dholakia
6ba4eeb961
Merge pull request #885 from Codium-ai/bugfix/hf_timeout
Do not timeout when calling HF through acomplete
2023-11-23 07:48:59 -08:00
Krish Dholakia
c51bfc686b
Merge pull request #884 from maqsoodshaik/main
this commit fixes #883
2023-11-23 07:47:28 -08:00
Ori Kotek
e74ac03169
Do not timeout when calling HF through acomplete 2023-11-23 15:56:59 +02:00
maqsoodshaik
0f89c3375a this commit fixes #883 2023-11-23 12:45:38 +01:00
ishaan-jaff
1af7575c86 (docs) rename reliability -> Fallbacks, num retries 2023-11-22 20:55:53 -08:00
ishaan-jaff
db146bc40a (test) router with fallback deployments 2023-11-22 20:52:56 -08:00
Ishaan Jaff
629415c91a
Merge pull request #880 from Manouchehri/patch-1
(docs) Fix missing `-r` in pip command
2023-11-22 20:45:25 -08:00
David Manouchehri
94b1d09973
(docs) Fix missing -r in pip command 2023-11-22 23:41:16 -05:00
ishaan-jaff
8ebc1b974c (chore) run ci/cd again 2023-11-22 20:34:14 -08:00
Krrish Dholakia
2f93c0155a fix: fix linting errors 2023-11-22 19:59:25 -08:00
Krrish Dholakia
5d5ca9f7ef fix(router.py): add support for cooldowns with redis 2023-11-22 19:54:22 -08:00
ishaan-jaff
cb41b14cc2 (test) proxy test exception mapping 2023-11-22 16:22:05 -08:00
ishaan-jaff
4260e0c1f0 (fix) linting error 2023-11-22 16:22:05 -08:00
Krrish Dholakia
a45be1d16a bump: version 1.4.0 → 1.5.0 2023-11-22 15:59:57 -08:00
Krrish Dholakia
3e76d4b422 feat(router.py): add server cooldown logic 2023-11-22 15:59:48 -08:00
ishaan-jaff
4ece219ec5 (docs) simple proxy 2023-11-22 15:01:26 -08:00
ishaan-jaff
d0f11e7a13 (docs) input params for litellm.embedding() 2023-11-22 14:40:52 -08:00
ishaan-jaff
5abd566b7c (feat) embedding() support for timeouts 2023-11-22 14:25:55 -08:00
ishaan-jaff
c38782521c (test)timeout error on openai embedding 2023-11-22 14:25:55 -08:00
ishaan-jaff
40e88eec4b (test)timeout errors 2023-11-22 14:25:55 -08:00
ishaan-jaff
3059f30672 (test) verify azure response have expected keys 2023-11-22 14:25:55 -08:00
ishaan-jaff
4247df02c7 (fix) Azure - only use ad_token when api_key is None 2023-11-22 14:25:55 -08:00
ishaan-jaff
b3bca98561 (feat) embedding() remove junk params 2023-11-22 14:25:55 -08:00
Krrish Dholakia
0b4e10e068 test(test_embedding.py): fix the embedding test 2023-11-22 14:09:45 -08:00
Krrish Dholakia
448ec0a571 feat(proxy_server): add /v1/embeddings endpoint
n
2023-11-22 14:03:27 -08:00
ishaan-jaff
40dd38508f (test) embedding stricter testing 2023-11-22 13:50:45 -08:00
ishaan-jaff
e8ff4d5eca (feat) clean out junk params from litellm embedding 2023-11-22 13:50:45 -08:00
Krrish Dholakia
b0801f61e6 test(test_caching.py): cleaning up tests 2023-11-22 13:43:48 -08:00
Krrish Dholakia
78582e158a fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints 2023-11-22 13:22:58 -08:00
ishaan-jaff
ba73224a3a (feat) proxy server add /routes to see available routes 2023-11-22 13:20:21 -08:00
Krrish Dholakia
604ad41eac fix(proxy_server.py): If master key is set, only master key can be used to generate new keys 2023-11-22 10:18:28 -08:00
Krrish Dholakia
10fe16c965 fix(utils.py): add param mapping for perplexity, anyscale, deepinfra
n

n
2023-11-22 10:04:27 -08:00
Krrish Dholakia
e7bb4a0cbd fix(proxy_server): fix linting issues 2023-11-22 08:47:59 -08:00
ishaan-jaff
7a4be44805 (docs) request q 2023-11-22 08:10:30 -08:00
Krrish Dholakia
a4406e1784 bump: version 1.3.4 → 1.4.0 2023-11-21 21:19:27 -08:00
Krrish Dholakia
76f46902ed feat(router.py): adding latency-based routing strategy 2023-11-21 21:19:27 -08:00
ishaan-jaff
2f3e13e43b (test) load test with api.litellm.ai 2023-11-21 21:07:27 -08:00
ishaan-jaff
b770ff2404 (test) load test proxy 2023-11-21 21:04:46 -08:00
ishaan-jaff
d1ad84c26d (test) load test q 2023-11-21 20:48:56 -08:00
ishaan-jaff
359f542c10 (test) load test q 2023-11-21 20:48:56 -08:00
ishaan-jaff
6aa8b41fb3 (fix) caching config 2023-11-21 20:48:56 -08:00
ishaan-jaff
3c30705b76 (fix) cache configs 2023-11-21 20:48:56 -08:00
Krrish Dholakia
d7f292c108 bump: version 1.3.3 → 1.3.4 2023-11-21 20:16:00 -08:00
Krrish Dholakia
904def6119 fix(proxy_server.py): fix /models endpoint 2023-11-21 20:15:43 -08:00
Krrish Dholakia
e5fa4eb314 fix(celery_app.py): add retries to worker 2023-11-21 20:07:16 -08:00
Krrish Dholakia
381fdcd37b fix(utils.py): add response ms for async calls 2023-11-21 19:59:00 -08:00