Commit graph

18313 commits

Author SHA1 Message Date
yujonglee
d87e2f92e9 update canary 2024-08-30 21:10:57 -04:00
Ishaan Jaff
3a72197e77
Merge pull request #5455 from BerriAI/litellm_vtx_add_input_type_mapping
[Feat] Vertex embeddings - map `input_type` to `text_type`
2024-08-30 17:03:04 -07:00
Ishaan Jaff
745bf3245a
Merge pull request #5458 from BerriAI/litellm_fix_allow_passthrough_as_llm_api
[Fix-Proxy-Auth] allow pass through routes as LLM API routes
2024-08-30 17:02:42 -07:00
Ishaan Jaff
06857d108d fix /spend logs call 2024-08-30 17:02:24 -07:00
Ishaan Jaff
2c86a62474 fix vertex ai test 2024-08-30 16:50:23 -07:00
Ishaan Jaff
f43060e8df mark as async 2024-08-30 16:40:41 -07:00
Ishaan Jaff
11776dcda5 bump: version 1.44.12 → 1.44.13 2024-08-30 16:37:14 -07:00
Ishaan Jaff
414d2dcb52 call spend logs endpoint 2024-08-30 16:35:07 -07:00
Ishaan Jaff
c86d1cb391 fix tests 2024-08-30 16:29:32 -07:00
Ishaan Jaff
f3f85f6141 add test for vertex basic pass throgh 2024-08-30 16:26:00 -07:00
Ishaan Jaff
8ed0ffea54 fix use existing custom_auth.py 2024-08-30 16:22:28 -07:00
Ishaan Jaff
36fda54970 allow pass through routes as LLM API routes 2024-08-30 16:08:44 -07:00
Ishaan Jaff
9f18aa4588 fix test_vertexai_embedding_embedding_latest_input_type 2024-08-30 15:57:44 -07:00
Ishaan Jaff
f50374e81d use helper class for pass through success handler 2024-08-30 15:52:47 -07:00
Ishaan Jaff
e1e1e2e566 add example custom 2024-08-30 15:46:45 -07:00
Ishaan Jaff
69dea613ca ci/cd run again 2024-08-30 14:03:00 -07:00
Ishaan Jaff
9f87481ef0 docs add task type for vertex ai 2024-08-30 13:31:24 -07:00
Ishaan Jaff
541033ec7e add VertexAITextEmbeddingConfig 2024-08-30 12:53:43 -07:00
Ishaan Jaff
bc45114c53
Merge pull request #5454 from BerriAI/litellm_refactor
[Refactor] Refactor vertex text to speech to be in vertex directory
2024-08-30 12:28:14 -07:00
Ishaan Jaff
518aa639fa fix map input_type to task_type for vertex ai 2024-08-30 12:09:07 -07:00
Ishaan Jaff
7f303db955
Merge pull request #5450 from BerriAI/litellm_load_config_from_gcs
[Feat-Proxy] Load config.yaml from GCS Bucket
2024-08-30 12:08:54 -07:00
Ishaan Jaff
281b183c7c
Merge pull request #5452 from BerriAI/litellm_allow_qdrant_api_key_to_be_optional
[Fix-Proxy] - Allow Qdrant API Key to be optional
2024-08-30 12:07:57 -07:00
Ishaan Jaff
5056e1e400
Merge pull request #5451 from BerriAI/litellm_fix_vertex
[Fix-Proxy] Vertex SDK pass through - pass all relevant vertex creds
2024-08-30 11:45:16 -07:00
Ishaan Jaff
570a5a2825 fix dir structure for tts 2024-08-30 11:44:23 -07:00
Ishaan Jaff
15296b4fb7 fix allow qdrant api key to be optional 2024-08-30 11:13:23 -07:00
Ishaan Jaff
50014aa78e vertex forward all headers from vertex 2024-08-30 11:05:23 -07:00
Ishaan Jaff
fcdfaac84b
Merge pull request #5419 from Manouchehri/gemini-1.5-pro-exp-0827-pricing
(models): Add gemini-1.5-pro-exp-0827 pricing.
2024-08-30 10:50:10 -07:00
Ishaan Jaff
a842723bb0
Merge pull request #5439 from BerriAI/litellm_use_correct_ai21_pricing
[Fix] Use correct Vertex AI AI21 Cost tracking
2024-08-30 10:49:42 -07:00
Ishaan Jaff
6109501274 update doc 2024-08-30 10:47:40 -07:00
Ishaan Jaff
c781fbcc55 doc using gcs bucket config.yaml 2024-08-30 10:44:41 -07:00
Ishaan Jaff
7d746064ab add gcs bucket base 2024-08-30 10:41:39 -07:00
Ishaan Jaff
051ac50fca use helper to get_config_file_contents_from_gcs 2024-08-30 10:26:42 -07:00
Ishaan Jaff
1bd2b2fc92
Merge pull request #5449 from BerriAI/litellm_Fix_vertex_multimodal
[Fix-Proxy] Allow running /health checks on vertex multimodal embedding requests
2024-08-30 10:21:42 -07:00
Ishaan Jaff
a6273a29fe add test for test_vertexai_multimodal_embedding_text_input 2024-08-30 09:19:48 -07:00
Krish Dholakia
8d6a0bdc81
- merge - fix TypeError: 'CompletionUsage' object is not subscriptable #5441 (#5448)
* fix TypeError: 'CompletionUsage' object is not subscriptable (#5441)

* test(test_team_logging.py): mark flaky test

---------

Co-authored-by: yafei lee <yafei@dao42.com>
2024-08-30 08:54:42 -07:00
JooHo Kim
5b1d9712c5
chore: Clarify support-related Exceptions in utils.py (#5447)
Improved the clarity of Exceptions in supports_system_messages, supports_response_schema, supports_function_calling, and supports_parallel_function_calling. Previously, it was difficult to determine the cause of Exception logs due to vague messaging. Each case now includes a more specific and appropriate Exception message.
2024-08-30 08:29:05 -07:00
Krrish Dholakia
7f1531006c docs(routing.md): add weight-based shuffling to docs 2024-08-30 08:24:12 -07:00
Krrish Dholakia
94db4ec830 test: mark flaky tests 2024-08-30 07:53:04 -07:00
Ishaan Jaff
fdaaea9b0d add tests to check ai21 models cost is calculated correct 2024-08-30 07:30:16 -07:00
Ishaan Jaff
a9ca183021
Merge pull request #5438 from BerriAI/litellm_show_error_types_swagger
[Feat-Proxy] Show all exceptioons types on swagger for LiteLLM Proxy
2024-08-30 07:21:23 -07:00
Ishaan Jaff
ef7835f7f3
Merge pull request #5442 from kiriloman/main
[Pricing] Add pricing for Openai ft:gpt-4o
2024-08-30 07:19:30 -07:00
Kyrylo Yefimenko
a100b01b90 Add pricing for Openai ft:gpt-4o 2024-08-30 08:16:24 +01:00
Krrish Dholakia
856ed40a07 bump: version 1.44.11 → 1.44.12 2024-08-29 22:41:10 -07:00
Krish Dholakia
dd7b008161
fix: Minor LiteLLM Fixes + Improvements (29/08/2024) (#5436)
* fix(model_checks.py): support returning wildcard models on `/v1/models`

Fixes https://github.com/BerriAI/litellm/issues/4903

* fix(bedrock_httpx.py): support calling bedrock via api_base

Closes https://github.com/BerriAI/litellm/pull/4587

* fix(litellm_logging.py): only leave last 4 char of gemini key unmasked

Fixes https://github.com/BerriAI/litellm/issues/5433

* feat(router.py): support setting 'weight' param for models on router

Closes https://github.com/BerriAI/litellm/issues/5410

* test(test_bedrock_completion.py): add unit test for custom api base

* fix(model_checks.py): handle no "/" in model
2024-08-29 22:40:25 -07:00
Ishaan Jaff
f70b7575d2 update docs 2024-08-29 21:00:10 -07:00
Ishaan Jaff
6bd6c956a5 use correct vtx ai21 pricing 2024-08-29 19:04:05 -07:00
Ishaan Jaff
26c03c9c8b add pricing for vertex ai 21 2024-08-29 19:03:38 -07:00
Ishaan Jaff
ad88c7d0a8 show all error types on swagger 2024-08-29 18:50:41 -07:00
Ishaan Jaff
ef47b2bc87 mark test_cost_tracking_with_caching as flaky 2024-08-29 17:44:21 -07:00
Ishaan Jaff
d57f5a955e bump: version 1.44.10 → 1.44.11 2024-08-29 17:37:01 -07:00