Commit graph

1823 commits

Author SHA1 Message Date
ishaan-jaff
43ae3b94fa (fix) debugging: viewing raw POST request 2023-11-23 16:28:19 -08:00
Krrish Dholakia
afac42e93a fix(router.py): enable async completions with model fallbacks 2023-11-23 16:15:57 -08:00
ishaan-jaff
9b53ea4b0f (feat) proxy: cost tracking per completion request 2023-11-23 16:08:59 -08:00
ishaan-jaff
1365092c89 (test) router: ensure errors are raised 2023-11-23 16:08:59 -08:00
ishaan-jaff
37dedeee04 (fix) debugging: POST request 2023-11-23 16:08:59 -08:00
ishaan-jaff
e8b844abae (fix) azure: better debugging 2023-11-23 16:08:59 -08:00
ishaan-jaff
525e01824a (test) ensure exception raised when incorrect key passed 2023-11-23 16:08:59 -08:00
Krrish Dholakia
8ac03e492f fix(router.py): enable fallbacks for sync completions 2023-11-23 16:06:46 -08:00
ishaan-jaff
bb00595429 (feat) cost: azure gpt + testing 2023-11-23 14:20:48 -08:00
ishaan-jaff
90a9116a17 (test) strict test ft:gpt-3.5 cost calc 2023-11-23 14:11:01 -08:00
ishaan-jaff
46fc8be4c8 (feat) cost tracking ft:gpt-3.5-turbo 2023-11-23 13:58:59 -08:00
ishaan-jaff
da7c07574f (test) cost calc on azure 2023-11-23 13:50:09 -08:00
Krrish Dholakia
c074023e14 fix: fix linting issues 2023-11-23 13:47:43 -08:00
Krrish Dholakia
4f183dc6a0 fix(utils.py): support reading api keys dynamically from the os environment 2023-11-23 13:41:56 -08:00
Krrish Dholakia
87aa36a2ec fix(caching.py): fix linting issues 2023-11-23 13:21:45 -08:00
Krrish Dholakia
8c4e8d6c62 feat(proxy_server.py): add in-memory caching for user api keys 2023-11-23 13:21:45 -08:00
ishaan-jaff
8291f239a4 (feat) proxy server: add spend column 2023-11-23 11:46:59 -08:00
ishaan-jaff
2c50ea94c8 (test) embedding: azurea - dd response_ms 2023-11-23 11:15:52 -08:00
Krrish Dholakia
61fc76a8c4 fix(router.py): fix caching for tracking cooldowns + usage 2023-11-23 11:13:32 -08:00
ishaan-jaff
9d3d10da4c (Feat) update config.yaml example on proxy 2023-11-23 10:54:30 -08:00
ishaan-jaff
1dcab433b4 (feat) proxy: caching - show redis settings when initializing 2023-11-23 10:52:50 -08:00
Krish Dholakia
da12c13c5a
Merge pull request #881 from Manouchehri/lambda-1
Add AWS Lambda Support
2023-11-23 10:38:34 -08:00
ishaan-jaff
ae3eeb16ab (feat) proxy: add curl command test + read cache config 2023-11-23 10:31:04 -08:00
ishaan-jaff
fbd2ab4c71 (test) caching ensure we always test add/get caching redis 2023-11-23 08:27:44 -08:00
Krish Dholakia
6ba4eeb961
Merge pull request #885 from Codium-ai/bugfix/hf_timeout
Do not timeout when calling HF through acomplete
2023-11-23 07:48:59 -08:00
Ori Kotek
e74ac03169
Do not timeout when calling HF through acomplete 2023-11-23 15:56:59 +02:00
maqsoodshaik
0f89c3375a this commit fixes #883 2023-11-23 12:45:38 +01:00
David Manouchehri
60ca43b6d9
Add mangum. 2023-11-23 00:04:47 -05:00
ishaan-jaff
db146bc40a (test) router with fallback deployments 2023-11-22 20:52:56 -08:00
ishaan-jaff
8ebc1b974c (chore) run ci/cd again 2023-11-22 20:34:14 -08:00
David Manouchehri
4d9aa7a3f7
Add symlink and only copy in source dir to stay under 50MB compressed limit for Lambdas. 2023-11-22 23:07:33 -05:00
Krrish Dholakia
2f93c0155a fix: fix linting errors 2023-11-22 19:59:25 -08:00
Krrish Dholakia
5d5ca9f7ef fix(router.py): add support for cooldowns with redis 2023-11-22 19:54:22 -08:00
ishaan-jaff
cb41b14cc2 (test) proxy test exception mapping 2023-11-22 16:22:05 -08:00
ishaan-jaff
4260e0c1f0 (fix) linting error 2023-11-22 16:22:05 -08:00
Krrish Dholakia
3e76d4b422 feat(router.py): add server cooldown logic 2023-11-22 15:59:48 -08:00
ishaan-jaff
5abd566b7c (feat) embedding() support for timeouts 2023-11-22 14:25:55 -08:00
ishaan-jaff
c38782521c (test)timeout error on openai embedding 2023-11-22 14:25:55 -08:00
ishaan-jaff
40e88eec4b (test)timeout errors 2023-11-22 14:25:55 -08:00
ishaan-jaff
3059f30672 (test) verify azure response have expected keys 2023-11-22 14:25:55 -08:00
ishaan-jaff
4247df02c7 (fix) Azure - only use ad_token when api_key is None 2023-11-22 14:25:55 -08:00
ishaan-jaff
b3bca98561 (feat) embedding() remove junk params 2023-11-22 14:25:55 -08:00
Krrish Dholakia
0b4e10e068 test(test_embedding.py): fix the embedding test 2023-11-22 14:09:45 -08:00
Krrish Dholakia
448ec0a571 feat(proxy_server): add /v1/embeddings endpoint
n
2023-11-22 14:03:27 -08:00
ishaan-jaff
40dd38508f (test) embedding stricter testing 2023-11-22 13:50:45 -08:00
ishaan-jaff
e8ff4d5eca (feat) clean out junk params from litellm embedding 2023-11-22 13:50:45 -08:00
Krrish Dholakia
b0801f61e6 test(test_caching.py): cleaning up tests 2023-11-22 13:43:48 -08:00
Krrish Dholakia
78582e158a fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints 2023-11-22 13:22:58 -08:00
ishaan-jaff
ba73224a3a (feat) proxy server add /routes to see available routes 2023-11-22 13:20:21 -08:00
Krrish Dholakia
604ad41eac fix(proxy_server.py): If master key is set, only master key can be used to generate new keys 2023-11-22 10:18:28 -08:00