179 commits

Author SHA1 Message Date
Krish Dholakia
559a6ad826 fix(google_ai_studio): working context caching (#5421)
* fix(google_ai_studio): working context caching

* feat(vertex_ai_context_caching.py): support async cache check calls

* fix(vertex_and_google_ai_studio_gemini.py): fix setting headers

* fix(vertex_ai_parter_models): fix import

* fix(vertex_and_google_ai_studio_gemini.py): fix input

* test(test_amazing_vertex_completion.py): fix test
2024-08-29 07:00:30 -07:00
Krrish Dholakia
fd1cca207f test(test_amazing_vertex_completion.py): fix test 2024-08-28 21:12:39 -07:00
Krish Dholakia
a857f4a8ee Merge branch 'main' into litellm_main_staging 2024-08-28 18:05:27 -07:00
Ishaan Jaff
2f3d400148 fix vertex ai test 2024-08-28 15:30:39 -07:00
Ishaan Jaff
068114859a fix failing vertex test 2024-08-28 14:46:57 -07:00
Krrish Dholakia
4ce59f1a96 test(test_amazing_vertex_completion.py): update test to not pick experimental gemini models 2024-08-28 14:28:06 -07:00
Krish Dholakia
d928220ed2 Merge pull request #5393 from BerriAI/litellm_gemini_embedding_support
feat(vertex_ai_and_google_ai_studio): Support Google AI Studio Embedding Endpoint
2024-08-28 13:46:28 -07:00
Ishaan Jaff
1a106a4f06 mark vertex tests as flaky 2024-08-28 08:32:33 -07:00
Ishaan Jaff
b373320569 mark test as flaky 2024-08-28 08:24:04 -07:00
Krrish Dholakia
d29a7087f1 feat(vertex_ai_and_google_ai_studio): Support Google AI Studio Embeddings endpoint
Closes https://github.com/BerriAI/litellm/issues/5385
2024-08-27 16:53:11 -07:00
Ishaan Jaff
33a3a01949 add mock test for ai21 2024-08-27 14:42:13 -07:00
Ishaan Jaff
093f7f15fd add test for test_partner_models_httpx_ai21 2024-08-27 13:36:47 -07:00
Krrish Dholakia
5aad9d2db7 fix: fix imports 2024-08-26 22:19:01 -07:00
Krrish Dholakia
0eea01dae9 feat(vertex_ai_context_caching.py): check gemini cache, if key already exists 2024-08-26 22:19:01 -07:00
Krrish Dholakia
b0cc1df2d6 feat(vertex_ai_context_caching.py): support making context caching calls to vertex ai in a normal chat completion call (anthropic caching format)
Closes https://github.com/BerriAI/litellm/issues/5213
2024-08-26 22:19:01 -07:00
Ishaan Jaff
07a45fc844 add test for test_completion_fine_tuned_model 2024-08-26 13:26:56 -07:00
Krish Dholakia
f27abe0462 Merge branch 'main' into litellm_vertex_migration 2024-08-24 18:24:19 -07:00
Ishaan Jaff
4fe22ec493 fix test_vertexai_multimodal_embedding use magicMock requests 2024-08-22 09:56:24 -07:00
Krrish Dholakia
25820680eb test: fix test 2024-08-21 22:30:41 -07:00
Krrish Dholakia
5e707091ee test(test_amazing_vertex_completion.py): handle vertex api instability 2024-08-21 17:09:17 -07:00
Ishaan Jaff
dd524a4f50 Merge pull request #5326 from BerriAI/litellm_Add_vertex_multimodal_embedding
[Feat] add vertex multimodal embedding support
2024-08-21 17:06:43 -07:00
Ishaan Jaff
2c7f5a0d27 fix test test_vertexai_multimodal_embedding 2024-08-21 15:25:13 -07:00
Krrish Dholakia
d995fff7a2 fix(vertex_httpx.py): fix json schema call to pass in response_mime_type=="application/json" 2024-08-21 15:24:04 -07:00
Ishaan Jaff
0e9efb3669 feat add multimodal embeddings on vertex 2024-08-21 15:19:48 -07:00
Ishaan Jaff
35781ab8d5 add multi modal vtx embedding 2024-08-21 15:05:59 -07:00
Krish Dholakia
a8dd2b6910 Merge pull request #5244 from BerriAI/litellm_better_error_logging_sentry
refactor: replace .error() with .exception() logging for better debugging on sentry
2024-08-16 19:16:20 -07:00
Ishaan Jaff
31f9453618 skip InternalServerError on vertex test 2024-08-16 14:51:51 -07:00
Krrish Dholakia
61f4b71ef7 refactor: replace .error() with .exception() logging for better debugging on sentry 2024-08-16 09:22:47 -07:00
Ishaan Jaff
7b6110bbb3 add tests to make sure correct vertex ai route is used 2024-08-08 11:28:19 -07:00
Krrish Dholakia
b4392d2b67 test(test_amazing_vertex_completion.py): fix test for json schema validation in openai schema 2024-08-07 18:39:20 -07:00
Krrish Dholakia
5dfde2ee0b feat: Translate openai 'response_format' json_schema to 'response_schema' for vertex ai + google ai studio
Closes https://github.com/BerriAI/litellm/issues/5074
2024-08-06 19:06:14 -07:00
Krrish Dholakia
aab09f5b86 feat(vertex_httpx.py): Support gemini 'response_schema' param
Fixes https://github.com/BerriAI/litellm/pull/3366#issuecomment-2269183338
2024-08-05 07:56:44 -07:00
Krrish Dholakia
246b3227a9 fix(vertex_ai_partner.py): add /chat/completion codestral support
Closes https://github.com/BerriAI/litellm/issues/4984
2024-08-01 16:12:05 -07:00
Krrish Dholakia
1ffd96b348 fix(google.py): fix cost tracking for vertex ai mistral models 2024-08-01 15:22:32 -07:00
Ishaan Jaff
4e7d9d2bb1 fix test_completion_function_plus_pdf 2024-07-31 13:53:06 -07:00
Krrish Dholakia
185a6857f9 fix(utils.py): fix cost tracking for vertex ai partner models 2024-07-30 14:20:52 -07:00
Krish Dholakia
b854d2100c Merge branch 'main' into litellm_vertex_migration 2024-07-27 20:25:12 -07:00
Krrish Dholakia
6d5aedc48d feat(databricks.py): support vertex mistral cost tracking 2024-07-27 20:22:35 -07:00
Krrish Dholakia
5b71421a7b feat(vertex_ai_partner.py): initial working commit for calling vertex ai mistral
Closes https://github.com/BerriAI/litellm/issues/4874
2024-07-27 12:54:14 -07:00
Krrish Dholakia
ce210ddaf6 fix(vertex_ai_llama3.py): Fix llama3 streaming issue
Closes https://github.com/BerriAI/litellm/issues/4885
2024-07-25 22:30:55 -07:00
Krrish Dholakia
2f773d9cb6 fix(litellm_cost_calc/google.py): support meta llama vertex ai cost tracking 2024-07-25 22:12:07 -07:00
Krrish Dholakia
83ef52e180 feat(vertex_ai_llama.py): vertex ai llama3.1 api support
Initial working commit for vertex ai llama 3.1 api support
2024-07-23 17:07:30 -07:00
Krish Dholakia
967964a51c Merge branch 'main' into litellm_anthropic_response_schema_support 2024-07-18 20:40:16 -07:00
Ishaan Jaff
81c77f33b8 fix test amazing vertex medlm 2024-07-18 18:16:00 -07:00
Krrish Dholakia
4d963ab789 feat(vertex_ai_anthropic.py): support response_schema for vertex ai anthropic calls
allows passing response_schema for anthropic calls. supports schema validation.
2024-07-18 16:57:38 -07:00
Krrish Dholakia
155ee7e99c fix(factory.py): allow converting pdf url to base64 2024-07-15 21:11:53 -07:00
Krrish Dholakia
023f10cf1c fix(vertex_httpx.py): return grounding metadata 2024-07-15 19:43:37 -07:00
Krrish Dholakia
82ca7af6df fix(vertex_httpx.py): google search grounding fix 2024-07-14 08:06:17 -07:00
Krrish Dholakia
a784f7d8df fix(vertex_httpx.py): fix gtool handling 2024-07-09 08:01:49 -07:00
Krrish Dholakia
7541478459 feat(vertex_httpx.py): add support for gemini 'grounding'
Adds support for https://cloud.google.com/vertex-ai/generative-ai/docs/model-reference/grounding#rest
2024-07-08 21:37:07 -07:00