1336 commits

Author SHA1 Message Date
Krrish Dholakia
7df94100e8 build(model_prices_and_context_window.json): add model pricing for vertex ai llama 3.1 api 2024-07-23 17:36:07 -07:00
Krrish Dholakia
83ef52e180 feat(vertex_ai_llama.py): vertex ai llama3.1 api support
Initial working commit for vertex ai llama 3.1 api support
2024-07-23 17:07:30 -07:00
Ishaan Jaff
aba600a892 fix triton linting 2024-07-23 11:03:34 -07:00
Ishaan Jaff
1355932bf4
Merge pull request #3905 from giritatavarty-8451/litellm_triton_chatcompletion_support
Litellm triton chatcompletion support - Resubmit of #3895
2024-07-23 10:30:26 -07:00
Krrish Dholakia
f4a388f217 fix(openai.py): check if error body is a dictionary before indexing in 2024-07-22 18:12:04 -07:00
Krrish Dholakia
2f65c950fe fix(vertex_httpx.py): Change non-blocking vertex error to warning
Fixes https://github.com/BerriAI/litellm/issues/4825
2024-07-22 14:00:33 -07:00
Krish Dholakia
227f55f370
Revert "Fix: use Bedrock region from environment variables before other region definitions" 2024-07-22 07:40:24 -07:00
Ishaan Jaff
b901a65572 fix make_sync_openai_audio_transcriptions_request 2024-07-20 20:03:12 -07:00
Krish Dholakia
bb6b2c6872
Merge pull request #4613 from petermuller/main
Fix: use Bedrock region from environment variables before other region definitions
2024-07-20 19:19:05 -07:00
Ishaan Jaff
66c73e4425 fix merge conflicts 2024-07-20 19:14:20 -07:00
Ishaan Jaff
f6225623e9
Merge branch 'main' into litellm_return-response_headers 2024-07-20 19:05:56 -07:00
Ishaan Jaff
82764d2cec fix make_sync_openai_audio_transcriptions_request 2024-07-20 18:17:21 -07:00
Ishaan Jaff
5e4d291244 rename to _response_headers 2024-07-20 17:31:16 -07:00
Ishaan Jaff
5e52f50a82 return response headers 2024-07-20 15:26:44 -07:00
Krrish Dholakia
a27454b8e3 fix(openai.py): support completion, streaming, async_streaming 2024-07-20 15:23:42 -07:00
Krrish Dholakia
86c9e05c10 fix(openai.py): drop invalid params if drop_params: true for azure ai
Fixes https://github.com/BerriAI/litellm/issues/4800
2024-07-20 15:08:15 -07:00
Ishaan Jaff
3427838ce5 openai - return response headers 2024-07-20 15:04:27 -07:00
Ishaan Jaff
64dbe07593 openai return response headers 2024-07-20 14:07:41 -07:00
Krish Dholakia
156d445597
Merge pull request #4796 from BerriAI/litellm_refactor_requests_factory
fix(factory.py): refactor factory to use httpx client
2024-07-19 21:07:41 -07:00
Krrish Dholakia
335cf65452 fix(anthropic.py): fix streaming client 2024-07-19 18:55:00 -07:00
Krrish Dholakia
757dedd4c8 fix(factory.py): refactor factory to use httpx client 2024-07-19 15:35:05 -07:00
Krrish Dholakia
4c4f032a75 fix(factory.py): refactor factory to use httpx client 2024-07-19 15:32:01 -07:00
Krish Dholakia
f973bd8109
Merge branch 'main' into litellm_ai21_jamba 2024-07-19 15:11:55 -07:00
Sophia Loris
d779253949 resolve merge conflicts 2024-07-19 09:45:53 -05:00
Sophia Loris
d5c65c6be2 Add support for Triton streaming & triton async completions 2024-07-19 09:35:27 -05:00
Krish Dholakia
2ad1c0455a
Merge pull request #4773 from titusz/patch-1
Add missing `num_gpu` ollama configuration parameter
2024-07-18 22:47:39 -07:00
Krrish Dholakia
c56456be64 fix(anthropic.py): revert client to requests library 2024-07-18 22:45:41 -07:00
Krrish Dholakia
f2401d6d5e feat(vertex_ai_anthropic.py): support response_schema for vertex ai anthropic calls
allows passing response_schema for anthropic calls. supports schema validation.
2024-07-18 22:40:35 -07:00
Krish Dholakia
967964a51c
Merge branch 'main' into litellm_anthropic_response_schema_support 2024-07-18 20:40:16 -07:00
Krrish Dholakia
96471c145e fix(bedrock_httpx.py): support jamba streaming 2024-07-18 19:36:50 -07:00
Krrish Dholakia
cece76c4ee feat(bedrock_httpx.py): add ai21 jamba instruct as converse model
initial commit for adding ai21 jamba instruct support through bedrock converse
2024-07-18 18:24:06 -07:00
Krrish Dholakia
e3ca2789df fix(bedrock_httpx.py): cache response stream shape
Fixes https://github.com/BerriAI/litellm/issues/4774
2024-07-18 17:41:31 -07:00
Krrish Dholakia
30d6c1d171 fix(factory.py): handle escape characters in image url
prevents mimetype errors in vertex ai calls
2024-07-18 17:36:58 -07:00
Krrish Dholakia
4d963ab789 feat(vertex_ai_anthropic.py): support response_schema for vertex ai anthropic calls
allows passing response_schema for anthropic calls. supports schema validation.
2024-07-18 16:57:38 -07:00
Titusz
fcef2c4580
Add missing num_gpu ollama configuration parameter 2024-07-18 17:51:56 +02:00
Krrish Dholakia
73bfc299f7 fix(factory.py): check if empty text block passed in (anthropic_messages_pt) 2024-07-17 11:27:22 -07:00
Krish Dholakia
606466d5fc
Merge pull request #4746 from BerriAI/litellm_anthropic_translation
fix(factory.py): use stronger typing for anthropic translation
2024-07-16 22:15:51 -07:00
Krrish Dholakia
bc9f5eb628 fix(factory.py): handle content not being set 2024-07-16 21:51:30 -07:00
Ishaan Jaff
ee33a80486 fix remove index from tool calls cohere error 2024-07-16 21:49:45 -07:00
Ishaan Jaff
b04d20d367 fix linting error 2024-07-16 21:21:50 -07:00
Ishaan Jaff
c2f73764cb
Merge pull request #4747 from BerriAI/litellm_fix_anthrpic_exception_format
[Fix + Test] anthropic raise litellm.AuthenticationError when no Anthropic API Key provided
2024-07-16 21:19:08 -07:00
Ishaan Jaff
012c8ab132
Merge pull request #4736 from davidschuler-8451/fix/triton-embedding-shape
fix: enables batch embedding support for triton
2024-07-16 21:17:35 -07:00
Ishaan Jaff
dc968a7213 anthropic - raise Authentication error when no api key provided 2024-07-16 20:42:43 -07:00
Krrish Dholakia
9771edb302 fix(factory.py): use stronger typing for anthropic translation
Fixes https://github.com/BerriAI/litellm/issues/4738
2024-07-16 19:58:27 -07:00
davidschuler-8451
35b733f14d feat: enables batch embedding support for triton 2024-07-16 13:31:59 -04:00
Krrish Dholakia
155ee7e99c fix(factory.py): allow converting pdf url to base64 2024-07-15 21:11:53 -07:00
Krrish Dholakia
023f10cf1c fix(vertex_httpx.py): return grounding metadata 2024-07-15 19:43:37 -07:00
Krrish Dholakia
82ca7af6df fix(vertex_httpx.py): google search grounding fix 2024-07-14 08:06:17 -07:00
Ishaan Jaff
03933de775 fix exception raised in factory.py 2024-07-13 09:54:32 -07:00
Krrish Dholakia
0decc36bed fix(factory.py): handle message content being a list instead of string
Fixes https://github.com/BerriAI/litellm/issues/4679
2024-07-12 19:00:39 -07:00