Commit graph

1072 commits

Author SHA1 Message Date
Ishaan Jaff
80e95b4ccf add mock testing for vertex tts 2024-08-23 18:18:37 -07:00
Ishaan Jaff
8fada93fff docs on using vertex tts 2024-08-23 17:57:49 -07:00
Ishaan Jaff
755a0514f6 fix linting 2024-08-23 16:05:31 -07:00
Ishaan Jaff
c3987745fe fix linting errors 2024-08-23 15:44:31 -07:00
Krish Dholakia
76b3db334b
Merge branch 'main' into litellm_azure_batch_apis 2024-08-22 19:07:54 -07:00
Ishaan Jaff
228252b92d
Merge branch 'main' into litellm_allow_using_azure_ad_token_auth 2024-08-22 18:21:24 -07:00
Krrish Dholakia
d7d3eee349 feat(azure.py): support health checking azure deployments
Fixes https://github.com/BerriAI/litellm/issues/5279
2024-08-22 16:11:14 -07:00
Ishaan Jaff
08fa3f346a add new litellm params for client_id, tenant_id etc 2024-08-22 11:37:30 -07:00
Ishaan Jaff
8f657b40f5 use azure_ad_token_provider to init clients 2024-08-22 11:03:49 -07:00
Krrish Dholakia
70bf8bd4f4 feat(factory.py): enable 'user_continue_message' for interweaving user/assistant messages when provider requires it
allows bedrock to be used with autogen
2024-08-22 11:03:33 -07:00
Krrish Dholakia
11bfc1dca7 fix(cohere_chat.py): support passing 'extra_headers'
Fixes https://github.com/BerriAI/litellm/issues/4709
2024-08-22 10:17:36 -07:00
Krrish Dholakia
f36e7e0754 fix(ollama_chat.py): fix passing assistant message with tool call param
Fixes https://github.com/BerriAI/litellm/issues/5319
2024-08-22 10:00:03 -07:00
Ishaan Jaff
35781ab8d5 add multi modal vtx embedding 2024-08-21 15:05:59 -07:00
Ishaan Jaff
7e3dc83c0d add initial support for multimodal_embedding vertex 2024-08-21 14:29:05 -07:00
Krish Dholakia
409306b266
Merge branch 'main' into litellm_fix_azure_api_version 2024-08-20 11:40:53 -07:00
Krrish Dholakia
89791d9285 fix(main.py): response_format typing for acompletion
Fixes https://github.com/BerriAI/litellm/issues/5239
2024-08-20 08:14:14 -07:00
Krrish Dholakia
49416e121c feat(azure.py): support dynamic api versions
Closes https://github.com/BerriAI/litellm/issues/5228
2024-08-19 12:17:43 -07:00
Krish Dholakia
a8dd2b6910
Merge pull request #5244 from BerriAI/litellm_better_error_logging_sentry
refactor: replace .error() with .exception() logging for better debugging on sentry
2024-08-16 19:16:20 -07:00
Krrish Dholakia
7fce6b0163 fix(health_check.py): return 'missing mode' error message, if error with health check, and mode is missing 2024-08-16 17:24:29 -07:00
Krrish Dholakia
61f4b71ef7 refactor: replace .error() with .exception() logging for better debugging on sentry 2024-08-16 09:22:47 -07:00
Ishaan Jaff
df4ea8fba6 refactor sagemaker to be async 2024-08-15 18:18:02 -07:00
Krrish Dholakia
583a3b330d fix(utils.py): support calling openai models via azure_ai/ 2024-08-14 13:41:04 -07:00
Krrish Dholakia
068ee12c30 fix(main.py): safely fail stream_chunk_builder calls 2024-08-10 10:22:26 -07:00
Krrish Dholakia
a858cc4d0c docs(main.py): clarify 'num_retries' usage 2024-08-09 16:57:06 -07:00
Krrish Dholakia
ba7b070883 fix(utils.py): set max_retries = num_retries, if given 2024-08-09 16:54:54 -07:00
Ishaan Jaff
e734568b5a fix cohere / cohere_chat when timeout is None 2024-08-09 12:10:02 -07:00
Krish Dholakia
2e434d56e3
Merge pull request #5079 from BerriAI/litellm_add_pydantic_model_support
feat(utils.py): support passing response_format as pydantic model
2024-08-07 14:43:05 -07:00
Krish Dholakia
93d048b1dc
Merge branch 'main' into litellm_anthropic_streaming_tool_call_fix 2024-08-07 14:33:30 -07:00
Krrish Dholakia
c0ef2e9dd0 fix(main.py): fix linting error for python3.8 2024-08-07 13:21:35 -07:00
Krish Dholakia
3605e873a1
Merge branch 'main' into litellm_add_pydantic_model_support 2024-08-07 13:07:46 -07:00
Krrish Dholakia
4919cc4d25 fix(anthropic.py): handle scenario where anthropic returns invalid json string for tool call while streaming
Fixes https://github.com/BerriAI/litellm/issues/5063
2024-08-07 09:24:11 -07:00
Ishaan Jaff
dc3cdf3ed8 fix use extra headers for open router 2024-08-07 08:15:05 -07:00
Krrish Dholakia
9cf3d5f568 feat(utils.py): support passing response_format as pydantic model
Related issue - https://github.com/BerriAI/litellm/issues/5074
2024-08-06 18:16:07 -07:00
Krrish Dholakia
7bf1b4d661 fix(main.py): log hidden params for text completion calls 2024-08-05 21:26:48 -07:00
Krrish Dholakia
3c4c78a71f feat(caching.py): enable caching on provider-specific optional params
Closes https://github.com/BerriAI/litellm/issues/5049
2024-08-05 11:18:59 -07:00
Krish Dholakia
bca71019ad
Merge branch 'main' into litellm_anthropic_api_streaming 2024-08-03 21:16:50 -07:00
Krrish Dholakia
ac6c39c283 feat(anthropic_adapter.py): support streaming requests for /v1/messages endpoint
Fixes https://github.com/BerriAI/litellm/issues/5011
2024-08-03 20:16:19 -07:00
Joe Cheng
b7be609d6e Use correct key name 2024-08-03 11:58:46 -07:00
Joe Cheng
33f4411f17 Fix tool call coalescing
The previous code seemed to assume that the tool call index property
started at 0, but Anthropic sometimes returns them starting at 1.
This was causing an extra null-ish tool call to be materialized.
2024-08-02 13:05:23 -07:00
Joe Cheng
90dd60fa71 fix(main.py): Handle bedrock tool calling in stream_chunk_builder
Fixes #5022.

The streaming chunks from Anthropic seem to violate an assumption
that is implicit in the stream_chunk_builder implementation: that
only tool_calls OR function_calls OR content will appear in a
streamed response. The repro in #5022 shows that you can get
content followed by tool calls.

These changes properly handle these combinations by building
separate lists of each type of chunk (note that in theory a chunk
could appear in multiple lists, e.g. both delta.tool_calls and
delta.content being present on one chunk).
2024-08-02 12:41:13 -07:00
Krish Dholakia
25ac9c2d75
Merge branch 'main' into litellm_fix_streaming_usage_calc 2024-08-01 21:29:04 -07:00
Krrish Dholakia
6e163d3c8a fix(vertex_ai_partner.py): add /chat/completion codestral support
Closes https://github.com/BerriAI/litellm/issues/4984
2024-08-01 18:06:40 -07:00
Krrish Dholakia
c6eabe0253 fix(main.py): fix linting error 2024-08-01 17:33:29 -07:00
Krrish Dholakia
010d5ed81d feat(vertex_ai_partner.py): add vertex ai codestral FIM support
Closes https://github.com/BerriAI/litellm/issues/4984
2024-08-01 17:10:27 -07:00
Krrish Dholakia
246b3227a9 fix(vertex_ai_partner.py): add /chat/completion codestral support
Closes https://github.com/BerriAI/litellm/issues/4984
2024-08-01 16:12:05 -07:00
Krrish Dholakia
ca0a0bed46 fix(utils.py): fix anthropic streaming usage calculation
Fixes https://github.com/BerriAI/litellm/issues/4965
2024-08-01 14:45:54 -07:00
Krish Dholakia
653aefde40
Merge branch 'main' into litellm_async_cohere_calls 2024-07-30 15:35:20 -07:00
Krrish Dholakia
9b2eb1702b fix(cohere.py): support async cohere embedding calls 2024-07-30 14:49:07 -07:00
Krrish Dholakia
99dc7d2e97 fix(main.py): fix linting error 2024-07-30 13:55:04 -07:00
Krrish Dholakia
69afbc6091 feat(huggingface_restapi.py): Support multiple hf embedding types + async hf embeddings
Closes https://github.com/BerriAI/litellm/issues/3261
2024-07-30 13:32:03 -07:00