Krish Dholakia
|
dbd5f2b3cc
|
Update README.md
|
2023-11-23 10:03:14 -08:00 |
|
ishaan-jaff
|
fbd2ab4c71
|
(test) caching ensure we always test add/get caching redis
|
2023-11-23 08:27:44 -08:00 |
|
ishaan-jaff
|
b15b723567
|
(docs) proxy server: add caching
|
2023-11-23 08:08:12 -08:00 |
|
ishaan-jaff
|
44e867499f
|
(docs) proxy
|
2023-11-23 07:55:12 -08:00 |
|
Krish Dholakia
|
6ba4eeb961
|
Merge pull request #885 from Codium-ai/bugfix/hf_timeout
Do not timeout when calling HF through acomplete
|
2023-11-23 07:48:59 -08:00 |
|
Krish Dholakia
|
c51bfc686b
|
Merge pull request #884 from maqsoodshaik/main
this commit fixes #883
|
2023-11-23 07:47:28 -08:00 |
|
Ori Kotek
|
e74ac03169
|
Do not timeout when calling HF through acomplete
|
2023-11-23 15:56:59 +02:00 |
|
maqsoodshaik
|
0f89c3375a
|
this commit fixes #883
|
2023-11-23 12:45:38 +01:00 |
|
ishaan-jaff
|
1af7575c86
|
(docs) rename reliability -> Fallbacks, num retries
|
2023-11-22 20:55:53 -08:00 |
|
ishaan-jaff
|
db146bc40a
|
(test) router with fallback deployments
|
2023-11-22 20:52:56 -08:00 |
|
Ishaan Jaff
|
629415c91a
|
Merge pull request #880 from Manouchehri/patch-1
(docs) Fix missing `-r` in pip command
|
2023-11-22 20:45:25 -08:00 |
|
David Manouchehri
|
94b1d09973
|
(docs) Fix missing -r in pip command
|
2023-11-22 23:41:16 -05:00 |
|
ishaan-jaff
|
8ebc1b974c
|
(chore) run ci/cd again
|
2023-11-22 20:34:14 -08:00 |
|
Krrish Dholakia
|
2f93c0155a
|
fix: fix linting errors
|
2023-11-22 19:59:25 -08:00 |
|
Krrish Dholakia
|
5d5ca9f7ef
|
fix(router.py): add support for cooldowns with redis
|
2023-11-22 19:54:22 -08:00 |
|
ishaan-jaff
|
cb41b14cc2
|
(test) proxy test exception mapping
|
2023-11-22 16:22:05 -08:00 |
|
ishaan-jaff
|
4260e0c1f0
|
(fix) linting error
|
2023-11-22 16:22:05 -08:00 |
|
Krrish Dholakia
|
a45be1d16a
|
bump: version 1.4.0 → 1.5.0
|
2023-11-22 15:59:57 -08:00 |
|
Krrish Dholakia
|
3e76d4b422
|
feat(router.py): add server cooldown logic
|
2023-11-22 15:59:48 -08:00 |
|
ishaan-jaff
|
4ece219ec5
|
(docs) simple proxy
|
2023-11-22 15:01:26 -08:00 |
|
ishaan-jaff
|
d0f11e7a13
|
(docs) input params for litellm.embedding()
|
2023-11-22 14:40:52 -08:00 |
|
ishaan-jaff
|
5abd566b7c
|
(feat) embedding() support for timeouts
|
2023-11-22 14:25:55 -08:00 |
|
ishaan-jaff
|
c38782521c
|
(test)timeout error on openai embedding
|
2023-11-22 14:25:55 -08:00 |
|
ishaan-jaff
|
40e88eec4b
|
(test)timeout errors
|
2023-11-22 14:25:55 -08:00 |
|
ishaan-jaff
|
3059f30672
|
(test) verify azure response have expected keys
|
2023-11-22 14:25:55 -08:00 |
|
ishaan-jaff
|
4247df02c7
|
(fix) Azure - only use ad_token when api_key is None
|
2023-11-22 14:25:55 -08:00 |
|
ishaan-jaff
|
b3bca98561
|
(feat) embedding() remove junk params
|
2023-11-22 14:25:55 -08:00 |
|
Krrish Dholakia
|
0b4e10e068
|
test(test_embedding.py): fix the embedding test
|
2023-11-22 14:09:45 -08:00 |
|
Krrish Dholakia
|
448ec0a571
|
feat(proxy_server): add /v1/embeddings endpoint
n
|
2023-11-22 14:03:27 -08:00 |
|
ishaan-jaff
|
40dd38508f
|
(test) embedding stricter testing
|
2023-11-22 13:50:45 -08:00 |
|
ishaan-jaff
|
e8ff4d5eca
|
(feat) clean out junk params from litellm embedding
|
2023-11-22 13:50:45 -08:00 |
|
Krrish Dholakia
|
b0801f61e6
|
test(test_caching.py): cleaning up tests
|
2023-11-22 13:43:48 -08:00 |
|
Krrish Dholakia
|
78582e158a
|
fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints
|
2023-11-22 13:22:58 -08:00 |
|
ishaan-jaff
|
ba73224a3a
|
(feat) proxy server add /routes to see available routes
|
2023-11-22 13:20:21 -08:00 |
|
Krrish Dholakia
|
604ad41eac
|
fix(proxy_server.py): If master key is set, only master key can be used to generate new keys
|
2023-11-22 10:18:28 -08:00 |
|
Krrish Dholakia
|
10fe16c965
|
fix(utils.py): add param mapping for perplexity, anyscale, deepinfra
n
n
|
2023-11-22 10:04:27 -08:00 |
|
Krrish Dholakia
|
e7bb4a0cbd
|
fix(proxy_server): fix linting issues
|
2023-11-22 08:47:59 -08:00 |
|
ishaan-jaff
|
7a4be44805
|
(docs) request q
|
2023-11-22 08:10:30 -08:00 |
|
Krrish Dholakia
|
a4406e1784
|
bump: version 1.3.4 → 1.4.0
|
2023-11-21 21:19:27 -08:00 |
|
Krrish Dholakia
|
76f46902ed
|
feat(router.py): adding latency-based routing strategy
|
2023-11-21 21:19:27 -08:00 |
|
ishaan-jaff
|
2f3e13e43b
|
(test) load test with api.litellm.ai
|
2023-11-21 21:07:27 -08:00 |
|
ishaan-jaff
|
b770ff2404
|
(test) load test proxy
|
2023-11-21 21:04:46 -08:00 |
|
ishaan-jaff
|
d1ad84c26d
|
(test) load test q
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
359f542c10
|
(test) load test q
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
6aa8b41fb3
|
(fix) caching config
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
3c30705b76
|
(fix) cache configs
|
2023-11-21 20:48:56 -08:00 |
|
Krrish Dholakia
|
d7f292c108
|
bump: version 1.3.3 → 1.3.4
|
2023-11-21 20:16:00 -08:00 |
|
Krrish Dholakia
|
904def6119
|
fix(proxy_server.py): fix /models endpoint
|
2023-11-21 20:15:43 -08:00 |
|
Krrish Dholakia
|
e5fa4eb314
|
fix(celery_app.py): add retries to worker
|
2023-11-21 20:07:16 -08:00 |
|
Krrish Dholakia
|
381fdcd37b
|
fix(utils.py): add response ms for async calls
|
2023-11-21 19:59:00 -08:00 |
|