133 commits

Author SHA1 Message Date
Krish Dholakia
be3c7b401e
LiteLLM Minor fixes + improvements (08/03/2024) (#5488)
* fix(internal_user_endpoints.py): set budget_reset_at for /user/update

* fix(vertex_and_google_ai_studio_gemini.py): handle accumulated json

Fixes https://github.com/BerriAI/litellm/issues/5479

* fix(vertex_ai_and_gemini.py): fix assistant message function call when content is not None

Fixes https://github.com/BerriAI/litellm/issues/5490

* fix(proxy_server.py): generic state uuid for okta sso

* fix(lago.py): improve debug logs

Debugging for https://github.com/BerriAI/litellm/issues/5477

* docs(bedrock.md): add bedrock cross-region inferencing to docs

* fix(azure.py): return azure response headers on aembedding call

* feat(azure.py): return azure response headers for `/audio/transcription`

* fix(types/utils.py): standardize deepseek / anthropic prompt caching usage information

Closes https://github.com/BerriAI/litellm/issues/5285

* docs(usage.md): add docs on litellm usage object

* test(test_completion.py): mark flaky test
2024-09-03 21:21:34 -07:00
Simon S. Viloria
7778fa0146
fix response_format={'type': 'json_object'} not working for Azure models (#5468) 2024-09-01 13:31:13 -07:00
Krrish Dholakia
63adb3f940 fix(azure.py): fix raw response dump 2024-08-27 13:44:38 -07:00
Krrish Dholakia
18731cf42b fix: fix linting errors 2024-08-27 12:14:23 -07:00
Krrish Dholakia
756a828c15 fix(azure.py): add response header coverage for azure models 2024-08-24 15:12:51 -07:00
Krish Dholakia
f458f565af
Merge pull request #5292 from OgnjenFrancuski/main
Update SSL verification
2024-08-23 20:42:35 -07:00
Krrish Dholakia
d7d3eee349 feat(azure.py): support health checking azure deployments
Fixes https://github.com/BerriAI/litellm/issues/5279
2024-08-22 16:11:14 -07:00
Krrish Dholakia
80675b22bd feat(batches): add azure openai batches endpoint support
Closes https://github.com/BerriAI/litellm/issues/5073
2024-08-22 14:46:51 -07:00
Krrish Dholakia
a4f9ba4a88 fix(azure.py): fix optional param elif statement 2024-08-20 12:24:51 -07:00
Krish Dholakia
ad1e327208
Merge pull request #5284 from BerriAI/litellm_fix_azure_api_version
feat(azure.py): support dynamic azure api versions
2024-08-20 11:46:55 -07:00
Ognjen Francuski
9492277fea Fix using sync 'litellm.client_session' for async calls in azure.py 2024-08-20 10:29:37 +02:00
Krrish Dholakia
49416e121c feat(azure.py): support dynamic api versions
Closes https://github.com/BerriAI/litellm/issues/5228
2024-08-19 12:17:43 -07:00
Krrish Dholakia
7ec7c9970b feat(azure.py): support 'json_schema' for older models
Converts the json schema input to a tool call, allows the call to still work on older azure models
2024-08-17 16:31:13 -07:00
Krrish Dholakia
96d4466a8f fix(azure.py): return response headers acompletion + acompletion w/ streaming
Fixes https://github.com/BerriAI/litellm/issues/2440#issuecomment-2283259187
2024-08-12 11:29:49 -07:00
Krrish Dholakia
fe2aa706e8 refactor(openai/azure.py): move to returning openai/azure response headers by default
Allows token tracking to work more reliably across multiple azure/openai deployments
2024-08-02 09:42:08 -07:00
Ishaan Jaff
f2443996d8 feat support audio health checks for azure 2024-07-25 17:30:15 -07:00
Krrish Dholakia
381347082d fix(azure.py): improve error handling for azure image gen responses 2024-07-08 16:39:05 -07:00
Krrish Dholakia
298505c47c fix(whisper---handle-openai/azure-vtt-response-format): Fixes https://github.com/BerriAI/litellm/issues/4595 2024-07-08 09:10:40 -07:00
Krish Dholakia
8661da1980
Merge branch 'main' into litellm_fix_httpx_transport 2024-07-06 19:12:06 -07:00
Krrish Dholakia
ac00639301 fix(azure.py): fix httpx call for dall e 3 2024-07-02 18:30:21 -07:00
Krrish Dholakia
0647278a69 refactor: remove custom transport logic
Not needed after azure dall-e-2 refactor
2024-07-02 17:35:27 -07:00
Krrish Dholakia
cf5334fe8a refactor(azure.py): refactor sync azure calls to httpx 2024-07-02 17:06:48 -07:00
Krrish Dholakia
589c1c6280 refactor(azure.py): replaces the custom transport logic for just using our httpx client
Done to fix all the http/https proxy issues people are facing with proxy.
2024-07-02 15:32:53 -07:00
Ishaan Jaff
568245b5c0 feat - set response headers in azure requests 2024-07-01 20:12:39 -07:00
Ishaan Jaff
140f7fe254 return azure response headers 2024-07-01 17:09:06 -07:00
Ishaan Jaff
1980a07f32 fix test custom callback router 2024-06-28 16:54:28 -07:00
Krish Dholakia
1223b2b111
Merge pull request #4449 from BerriAI/litellm_azure_tts
feat(azure.py): azure tts support
2024-06-27 21:33:38 -07:00
Ishaan Jaff
57ba0a46b7 azure - log post api call 2024-06-27 17:38:03 -07:00
Krrish Dholakia
c14cc35e52 feat(azure.py): azure tts support
Closes https://github.com/BerriAI/litellm/issues/4002
2024-06-27 16:59:25 -07:00
Ishaan Jaff
b7bca0af6c fix - reuse client initialized on proxy config 2024-06-26 16:16:58 -07:00
Krrish Dholakia
09f4eb7617 docs(reliable_completions.md): improve headers for easier searching 2024-06-26 08:09:31 -07:00
Krrish Dholakia
aa6b56f057 fix(azure.py): handle asyncio.CancelledError 2024-06-18 20:14:27 -07:00
David Manouchehri
d950373768
feat(azure.py): Add Azure AD cred caching for OIDC flow. 2024-06-11 15:54:44 +00:00
David Manouchehri
a31fa5fbc8
feat(util.py/azure.py): Add OIDC support when running in Azure Kubernetes Service (AKS). 2024-06-11 15:54:34 +00:00
Krrish Dholakia
b360ab4c89 fix(azure.py): support dynamic drop params 2024-06-05 09:03:10 -07:00
Krrish Dholakia
60ec3228ed fix(azure.py): fix logprobs naming error
fixing string value
2024-06-05 08:54:53 -07:00
Krrish Dholakia
650ea6d0c3 feat(assistants/main.py): support arun_thread_stream 2024-06-04 16:47:51 -07:00
Krrish Dholakia
f3d78532f9 feat(assistants/main.py): add assistants api streaming support 2024-06-04 16:30:35 -07:00
Krish Dholakia
5ee3b0f30f
Merge pull request #3996 from BerriAI/litellm_azure_assistants_api_support
feat(assistants/main.py): Azure Assistants API support
2024-06-03 21:05:03 -07:00
Krrish Dholakia
2e8f081a86 fix(main.py): cast to string only if var is not None 2024-06-03 19:25:59 -07:00
Krrish Dholakia
7163bce37b feat(assistants/main.py): Closes https://github.com/BerriAI/litellm/issues/3993 2024-06-03 18:47:05 -07:00
Krrish Dholakia
9ef83126d7 fix(utils.py): correctly instrument passing through api version in optional param check 2024-06-01 19:31:52 -07:00
Krrish Dholakia
7efac4d36c fix(azure.py): support dropping 'tool_choice=required' for older azure API versions
Closes https://github.com/BerriAI/litellm/issues/3876
2024-06-01 18:44:50 -07:00
Krish Dholakia
86d0c0ae4e
Merge pull request #3582 from BerriAI/litellm_explicit_region_name_setting
feat(router.py): allow setting model_region in litellm_params
2024-05-11 11:36:22 -07:00
Krrish Dholakia
6714854bb7 feat(router.py): support region routing for bedrock, vertex ai, watsonx 2024-05-11 11:04:00 -07:00
Krish Dholakia
40063798bd
Merge pull request #3507 from Manouchehri/oidc-3505-part-1
Initial OIDC support (Google/GitHub/CircleCI -> Amazon Bedrock & Azure OpenAI)
2024-05-11 09:25:17 -07:00
Krish Dholakia
8ad979cdfe
Merge branch 'main' into litellm_region_based_routing 2024-05-08 22:19:51 -07:00
Krrish Dholakia
3d18897d69 feat(router.py): enable filtering model group by 'allowed_model_region' 2024-05-08 22:10:17 -07:00
Krrish Dholakia
6575143460 feat(proxy_server.py): return litellm version in response headers 2024-05-08 16:00:08 -07:00
David Manouchehri
cb49fb004d
fix(azure.py): Correct invalid .get to a .post for OIDC 2024-05-07 20:01:46 +00:00