ishaan-jaff
|
da7c07574f
|
(test) cost calc on azure
|
2023-11-23 13:50:09 -08:00 |
|
Krrish Dholakia
|
c074023e14
|
fix: fix linting issues
|
2023-11-23 13:47:43 -08:00 |
|
Krrish Dholakia
|
4f183dc6a0
|
fix(utils.py): support reading api keys dynamically from the os environment
|
2023-11-23 13:41:56 -08:00 |
|
Krrish Dholakia
|
87aa36a2ec
|
fix(caching.py): fix linting issues
|
2023-11-23 13:21:45 -08:00 |
|
Krrish Dholakia
|
8c4e8d6c62
|
feat(proxy_server.py): add in-memory caching for user api keys
|
2023-11-23 13:21:45 -08:00 |
|
ishaan-jaff
|
8291f239a4
|
(feat) proxy server: add spend column
|
2023-11-23 11:46:59 -08:00 |
|
ishaan-jaff
|
2c50ea94c8
|
(test) embedding: azurea - dd response_ms
|
2023-11-23 11:15:52 -08:00 |
|
Krrish Dholakia
|
61fc76a8c4
|
fix(router.py): fix caching for tracking cooldowns + usage
|
2023-11-23 11:13:32 -08:00 |
|
ishaan-jaff
|
9d3d10da4c
|
(Feat) update config.yaml example on proxy
|
2023-11-23 10:54:30 -08:00 |
|
ishaan-jaff
|
1dcab433b4
|
(feat) proxy: caching - show redis settings when initializing
|
2023-11-23 10:52:50 -08:00 |
|
Krish Dholakia
|
da12c13c5a
|
Merge pull request #881 from Manouchehri/lambda-1
Add AWS Lambda Support
|
2023-11-23 10:38:34 -08:00 |
|
ishaan-jaff
|
ae3eeb16ab
|
(feat) proxy: add curl command test + read cache config
|
2023-11-23 10:31:04 -08:00 |
|
ishaan-jaff
|
fbd2ab4c71
|
(test) caching ensure we always test add/get caching redis
|
2023-11-23 08:27:44 -08:00 |
|
Krish Dholakia
|
6ba4eeb961
|
Merge pull request #885 from Codium-ai/bugfix/hf_timeout
Do not timeout when calling HF through acomplete
|
2023-11-23 07:48:59 -08:00 |
|
Ori Kotek
|
e74ac03169
|
Do not timeout when calling HF through acomplete
|
2023-11-23 15:56:59 +02:00 |
|
maqsoodshaik
|
0f89c3375a
|
this commit fixes #883
|
2023-11-23 12:45:38 +01:00 |
|
David Manouchehri
|
60ca43b6d9
|
Add mangum.
|
2023-11-23 00:04:47 -05:00 |
|
ishaan-jaff
|
db146bc40a
|
(test) router with fallback deployments
|
2023-11-22 20:52:56 -08:00 |
|
ishaan-jaff
|
8ebc1b974c
|
(chore) run ci/cd again
|
2023-11-22 20:34:14 -08:00 |
|
David Manouchehri
|
4d9aa7a3f7
|
Add symlink and only copy in source dir to stay under 50MB compressed limit for Lambdas.
|
2023-11-22 23:07:33 -05:00 |
|
Krrish Dholakia
|
2f93c0155a
|
fix: fix linting errors
|
2023-11-22 19:59:25 -08:00 |
|
Krrish Dholakia
|
5d5ca9f7ef
|
fix(router.py): add support for cooldowns with redis
|
2023-11-22 19:54:22 -08:00 |
|
ishaan-jaff
|
cb41b14cc2
|
(test) proxy test exception mapping
|
2023-11-22 16:22:05 -08:00 |
|
ishaan-jaff
|
4260e0c1f0
|
(fix) linting error
|
2023-11-22 16:22:05 -08:00 |
|
Krrish Dholakia
|
3e76d4b422
|
feat(router.py): add server cooldown logic
|
2023-11-22 15:59:48 -08:00 |
|
ishaan-jaff
|
5abd566b7c
|
(feat) embedding() support for timeouts
|
2023-11-22 14:25:55 -08:00 |
|
ishaan-jaff
|
c38782521c
|
(test)timeout error on openai embedding
|
2023-11-22 14:25:55 -08:00 |
|
ishaan-jaff
|
40e88eec4b
|
(test)timeout errors
|
2023-11-22 14:25:55 -08:00 |
|
ishaan-jaff
|
3059f30672
|
(test) verify azure response have expected keys
|
2023-11-22 14:25:55 -08:00 |
|
ishaan-jaff
|
4247df02c7
|
(fix) Azure - only use ad_token when api_key is None
|
2023-11-22 14:25:55 -08:00 |
|
ishaan-jaff
|
b3bca98561
|
(feat) embedding() remove junk params
|
2023-11-22 14:25:55 -08:00 |
|
Krrish Dholakia
|
0b4e10e068
|
test(test_embedding.py): fix the embedding test
|
2023-11-22 14:09:45 -08:00 |
|
Krrish Dholakia
|
448ec0a571
|
feat(proxy_server): add /v1/embeddings endpoint
n
|
2023-11-22 14:03:27 -08:00 |
|
ishaan-jaff
|
40dd38508f
|
(test) embedding stricter testing
|
2023-11-22 13:50:45 -08:00 |
|
ishaan-jaff
|
e8ff4d5eca
|
(feat) clean out junk params from litellm embedding
|
2023-11-22 13:50:45 -08:00 |
|
Krrish Dholakia
|
b0801f61e6
|
test(test_caching.py): cleaning up tests
|
2023-11-22 13:43:48 -08:00 |
|
Krrish Dholakia
|
78582e158a
|
fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints
|
2023-11-22 13:22:58 -08:00 |
|
ishaan-jaff
|
ba73224a3a
|
(feat) proxy server add /routes to see available routes
|
2023-11-22 13:20:21 -08:00 |
|
Krrish Dholakia
|
604ad41eac
|
fix(proxy_server.py): If master key is set, only master key can be used to generate new keys
|
2023-11-22 10:18:28 -08:00 |
|
Krrish Dholakia
|
10fe16c965
|
fix(utils.py): add param mapping for perplexity, anyscale, deepinfra
n
n
|
2023-11-22 10:04:27 -08:00 |
|
Krrish Dholakia
|
e7bb4a0cbd
|
fix(proxy_server): fix linting issues
|
2023-11-22 08:47:59 -08:00 |
|
Krrish Dholakia
|
76f46902ed
|
feat(router.py): adding latency-based routing strategy
|
2023-11-21 21:19:27 -08:00 |
|
ishaan-jaff
|
2f3e13e43b
|
(test) load test with api.litellm.ai
|
2023-11-21 21:07:27 -08:00 |
|
ishaan-jaff
|
b770ff2404
|
(test) load test proxy
|
2023-11-21 21:04:46 -08:00 |
|
ishaan-jaff
|
d1ad84c26d
|
(test) load test q
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
359f542c10
|
(test) load test q
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
6aa8b41fb3
|
(fix) caching config
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
3c30705b76
|
(fix) cache configs
|
2023-11-21 20:48:56 -08:00 |
|
Krrish Dholakia
|
904def6119
|
fix(proxy_server.py): fix /models endpoint
|
2023-11-21 20:15:43 -08:00 |
|
Krrish Dholakia
|
e5fa4eb314
|
fix(celery_app.py): add retries to worker
|
2023-11-21 20:07:16 -08:00 |
|