litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-25 02:34:29 +00:00

Author	SHA1	Message	Date
Krish Dholakia	a7db0df043	Gemini-2.5-flash improvements (#10198 ) * fix(vertex_and_google_ai_studio_gemini.py): allow thinking budget = 0 Fixes https://github.com/BerriAI/litellm/issues/10121 * fix(vertex_and_google_ai_studio_gemini.py): handle nuance in counting exclusive vs. inclusive tokens Addresses https://github.com/BerriAI/litellm/pull/10141#discussion_r2052272035	2025-04-21 22:48:00 -07:00
Ishaan Jaff	104e4cb1bc	[Feat] Add infinity embedding support (contributor pr) (#10196 ) * Feature - infinity support for #8764 (#10009) * Added support for infinity embeddings * Added test cases * Fixed tests and api base * Updated docs and tests * Removed unused import * Updated signature * Added support for infinity embeddings * Added test cases * Fixed tests and api base * Updated docs and tests * Removed unused import * Updated signature * Updated validate params --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> * fix InfinityEmbeddingConfig --------- Co-authored-by: Prathamesh Saraf <pratamesh1867@gmail.com>	2025-04-21 20:01:29 -07:00
Krish Dholakia	55a17730fb	fix(transformation.py): pass back in gemini thinking content to api (#10173 ) Ensures thinking content always returned	2025-04-19 18:03:05 -07:00
Krish Dholakia	5c929317cd	fix(triton/completion/transformation.py): remove bad_words / stop wor… (#10163 ) * fix(triton/completion/transformation.py): remove bad_words / stop words from triton call parameter 'bad_words' has invalid type. It should be either 'int', 'bool', or 'string'. * fix(proxy_track_cost_callback.py): add debug logging for track cost callback error	2025-04-19 11:23:37 -07:00
Krish Dholakia	f08a4e3c06	Support 'file' message type for VLLM video url's + Anthropic redacted message thinking support (#10129 ) * feat(hosted_vllm/chat/transformation.py): support calling vllm video url with openai 'file' message type allows switching between gemini/vllm easily * [WIP] redacted thinking tests (#9044) * WIP: redacted thinking tests * test: add test for redacted thinking in assistant message --------- Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com> * fix(anthropic/chat/transformation.py): support redacted thinking block on anthropic completion Fixes https://github.com/BerriAI/litellm/issues/9058 * fix(anthropic/chat/handler.py): transform anthropic redacted messages on streaming Fixes https://github.com/BerriAI/litellm/issues/9058 * fix(bedrock/): support redacted text on streaming + non-streaming Fixes https://github.com/BerriAI/litellm/issues/9058 * feat(litellm_proxy/chat/transformation.py): support 'reasoning_effort' param for proxy allows using reasoning effort with thinking models on proxy * test: update tests * fix(utils.py): fix linting error * fix: fix linting errors * fix: fix linting errors * fix: fix linting error * fix: fix linting errors * fix(anthropic/chat/transformation.py): fix returning citations in chat completion --------- Co-authored-by: Johann Miller <22018973+johannkm@users.noreply.github.com>	2025-04-19 11:16:37 -07:00
Krish Dholakia	2508ca71cb	Handle fireworks ai tool calling response (#10130 ) * feat(fireworks_ai/chat): handle tool calling with fireworks ai correctly Fixes https://github.com/BerriAI/litellm/issues/7209 * fix(utils.py): handle none type in message * fix: fix model name in test * fix(utils.py): fix validate check for openai messages * fix: fix model returned * fix(main.py): fix text completion routing * test: update testing * test: skip test - cohere having RBAC issues	2025-04-19 09:37:45 -07:00
Krish Dholakia	36308a31be	Gemini-2.5-flash - support reasoning cost calc + return reasoning content (#10141 ) * build(model_prices_and_context_window.json): add vertex ai gemini-2.5-flash pricing * build(model_prices_and_context_window.json): add gemini reasoning token pricing * fix(vertex_and_google_ai_studio_gemini.py): support counting thinking tokens for gemini allows accurate cost calc * fix(utils.py): add reasoning token cost calc to generic cost calc ensures gemini-2.5-flash cost calculation is accurate * build(model_prices_and_context_window.json): mark gemini-2.5-flash as 'supports_reasoning' * feat(gemini/): support 'thinking' + 'reasoning_effort' params + new unit tests allow controlling thinking effort for gemini-2.5-flash models * test: update unit testing * feat(vertex_and_google_ai_studio_gemini.py): return reasoning content if given in gemini response * test: update model name * fix: fix ruff check * test(test_spend_management_endpoints.py): update tests to be less sensitive to new keys / updates to usage object * fix(vertex_and_google_ai_studio_gemini.py): fix translation	2025-04-19 09:20:52 -07:00
Krish Dholakia	1ea046cc61	test: update tests to new deployment model (#10142 ) * test: update tests to new deployment model * test: update model name * test: skip cohere rbac issue test * test: update test - replace gpt-4o model	2025-04-18 14:22:12 -07:00
Krish Dholakia	fdfa1108a6	Add property ordering for vertex ai schema (#9828 ) + Fix combining multiple tool calls (#10040 ) * fix #9783: Retain schema field ordering for google gemini and vertex (#9828) * test: update test * refactor(groq.py): initial commit migrating groq to base_llm_http_handler * fix(streaming_chunk_builder_utils.py): fix how tool content is combined Fixes https://github.com/BerriAI/litellm/issues/10034 * fix(vertex_ai/common_utils.py): prevent infinite loop in helper function * fix(groq/chat/transformation.py): handle groq streaming errors correctly * fix(groq/chat/transformation.py): handle max_retries --------- Co-authored-by: Adrian Lyjak <adrian@chatmeter.com>	2025-04-15 22:29:25 -07:00
Krish Dholakia	8faf56922c	Fix azure tenant id check from env var + response_format check on api_version 2025+ (#9993 ) * fix(azure/common_utils.py): check for azure tenant id, client id, client secret in env var Fixes https://github.com/BerriAI/litellm/issues/9598#issuecomment-2801966027 * fix(azure/gpt_transformation.py): fix passing response_format to azure when api year = 2025 Fixes https://github.com/BerriAI/litellm/issues/9703 * test: monkeypatch azure api version in test * test: update testing * test: fix test * test: update test * docs(config_settings.md): document env vars	2025-04-14 22:02:35 -07:00
Krish Dholakia	069aee9f70	fix(transformation.py): correctly translate 'thinking' param for lite… (#9904 ) All checks were successful Helm unit test / unit-test (push) Successful in 21s Details Read Version from pyproject.toml / read-version (push) Successful in 40s Details * fix(transformation.py): correctly translate 'thinking' param for litellm_proxy/ route Fixes https://github.com/BerriAI/litellm/issues/9892 * test: update test	2025-04-11 23:25:13 -07:00
Ishaan Jaff	57bc03b30b	[Feat] Add reasoning_effort support for `xai/grok-3-mini-beta` model family (#9932 ) * add BaseReasoningEffortTests * BaseReasoningLLMTests * fix test rename * docs update thinking / reasoning content docs	2025-04-11 19:17:09 -07:00
Ishaan Jaff	f9ce754817	[Feat] Add litellm.supports_reasoning() util to track if an llm supports reasoning (#9923 ) * add supports_reasoning for xai models * add "supports_reasoning": true for o1 series models * add supports_reasoning util * add litellm.supports_reasoning * add supports reasoning for claude 3-7 models * add deepseek as supports reasoning * test_supports_reasoning * add supports reasoning to model group info * add supports_reasoning * docs supports reasoning * fix supports_reasoning test * "supports_reasoning": false, * fix test * supports_reasoning	2025-04-11 17:56:04 -07:00
Krish Dholakia	78879c68a9	Revert avglogprobs change + Add azure/gpt-4o-realtime-audio cost tracking (#9893 ) * test: initial commit fixing gemini logprobs Fixes https://github.com/BerriAI/litellm/issues/9888 * fix(vertex_and_google_ai_studio.py): Revert avglogprobs change Fixes https://github.com/BerriAI/litellm/issues/8890 * build(model_prices_and_context_window.json): add gpt-4o-realtime-preview cost to model cost map Fixes https://github.com/BerriAI/litellm/issues/9814 * test: add cost calculation unit testing * test: fix test * test: update test	2025-04-10 21:23:55 -07:00
Krish Dholakia	6ba3c4a4f8	VertexAI non-jsonl file storage support (#9781 ) * test: add initial e2e test * fix(vertex_ai/files): initial commit adding sync file create support * refactor: initial commit of vertex ai non-jsonl files reaching gcp endpoint * fix(vertex_ai/files/transformation.py): initial working commit of non-jsonl file call reaching backend endpoint * fix(vertex_ai/files/transformation.py): working e2e non-jsonl file upload * test: working e2e jsonl call * test: unit testing for jsonl file creation * fix(vertex_ai/transformation.py): reset file pointer after read allow multiple reads on same file object * fix: fix linting errors * fix: fix ruff linting errors * fix: fix import * fix: fix linting error * fix: fix linting error * fix(vertex_ai/files/transformation.py): fix linting error * test: update test * test: update tests * fix: fix linting errors * fix: fix test * fix: fix linting error	2025-04-09 14:01:48 -07:00
qvalentin	93532e00db	feat: add enterpriseWebSearch tool for vertex-ai (#9856 )	2025-04-09 13:17:48 -07:00
Krish Dholakia	ac9f03beae	Allow passing `thinking` param to litellm proxy via client sdk + Code QA Refactor on get_optional_params (get correct values) (#9386 ) * fix(litellm_proxy/chat/transformation.py): support 'thinking' param Fixes https://github.com/BerriAI/litellm/issues/9380 * feat(azure/gpt_transformation.py): add azure audio model support Closes https://github.com/BerriAI/litellm/issues/6305 * fix(utils.py): use provider_config in common functions * fix(utils.py): add missing provider configs to get_chat_provider_config * test: fix test * fix: fix path * feat(utils.py): make bedrock invoke nova config baseconfig compatible * fix: fix linting errors * fix(azure_ai/transformation.py): remove buggy optional param filtering for azure ai Removes incorrect check for support tool choice when calling azure ai - prevented calling models with response_format unless on litell model cost map * fix(amazon_cohere_transformation.py): fix bedrock invoke cohere transformation to inherit from coherechatconfig * test: fix azure ai tool choice mapping * fix: fix model cost map to add 'supports_tool_choice' to cohere models * fix(get_supported_openai_params.py): check if custom llm provider in llm providers * fix(get_supported_openai_params.py): fix llm provider in list check * fix: fix ruff check errors * fix: support defs when calling bedrock nova * fix(factory.py): fix test	2025-04-07 21:04:11 -07:00
Krish Dholakia	fcf17d114f	Litellm dev 04 05 2025 p2 (#9774 ) * test: move test to just checking async * fix(transformation.py): handle function call with no schema * fix(utils.py): handle pydantic base model in message tool calls Fix https://github.com/BerriAI/litellm/issues/9321 * fix(vertex_and_google_ai_studio.py): handle tools=[] Fixes https://github.com/BerriAI/litellm/issues/9080 * test: remove max token restriction * test: fix basic test * fix(get_supported_openai_params.py): fix check * fix(converse_transformation.py): support fake streaming for meta.llama3-3-70b-instruct-v1:0 * fix: fix test * fix: parse out empty dictionary on dbrx streaming + tool calls * fix(handle-'strict'-param-when-calling-fireworks-ai): fireworks ai does not support 'strict' param * fix: fix ruff check ' * fix: handle no strict in function * fix: revert bedrock change - handle in separate PR	2025-04-07 21:02:52 -07:00
Krish Dholakia	8d338aee78	fix(databricks/chat/transformation.py): remove reasoning_effort from request (#9811 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 16s Details Helm unit test / unit-test (push) Successful in 27s Details Fixes https://github.com/BerriAI/litellm/issues/9700#issuecomment-2784431995	2025-04-07 19:43:19 -07:00
Krish Dholakia	8e3c7b2de0	fix(vertex_ai.py): move to only passing in accepted keys by vertex ai response schema (#8992 ) * fix(vertex_ai.py): common_utils.py move to only passing in accepted keys by vertex ai prevent json schema compatible keys like $id, and $comment from causing vertex ai openapi calls to fail * fix(test_vertex.py): add testing to ensure only accepted schema params passed in * fix(common_utils.py): fix linting error * test: update test * test: accept function	2025-04-07 18:07:01 -07:00
Krish Dholakia	9a60cd9deb	fix(gemini/transformation.py): handle file_data being passed in (#9786 )	2025-04-07 16:32:08 -07:00
Krish Dholakia	34bdf36eab	Add inference providers support for Hugging Face (#8258 ) (#9738 ) (#9773 ) * Add inference providers support for Hugging Face (#8258) * add first version of inference providers for huggingface * temporarily skipping tests * Add documentation * Fix titles * remove max_retries from params and clean up * add suggestions * use llm http handler * update doc * add suggestions * run formatters * add tests * revert * revert * rename file * set maxsize for lru cache * fix embeddings * fix inference url * fix tests following breaking change in main * use ChatCompletionRequest * fix tests and lint * [Hugging Face] Remove outdated chat completion tests and fix embedding tests (#9749) * remove or fix tests * fix link in doc * fix(config_settings.md): document hf api key --------- Co-authored-by: célina <hanouticelina@gmail.com>	2025-04-05 10:50:15 -07:00
Krrish Dholakia	af9db827fc	fix(databricks/chat/transformation.py): handle empty headers case	2025-04-05 08:33:56 -07:00
Krish Dholakia	5099aac1a5	Add DBRX Anthropic w/ thinking + response_format support (#9744 ) * feat(databricks/chat/): add anthropic w/ reasoning content support via databricks Allows user to call claude-3-7-sonnet with thinking via databricks * refactor: refactor choices transformation + add unit testing * fix(databricks/chat/transformation.py): support thinking blocks on databricks response streaming * feat(databricks/chat/transformation.py): support response_format for claude models * fix(databricks/chat/transformation.py): correctly handle response_format={"type": "text"} * feat(databricks/chat/transformation.py): support 'reasoning_effort' param mapping for anthropic * fix: fix ruff errors * fix: fix linting error * test: update test * fix(databricks/chat/transformation.py): handle json mode output parsing * fix(databricks/chat/transformation.py): handle json mode on streaming * test: update test * test: update dbrx testing * test: update testing * fix(base_model_iterator.py): handle non-json chunk * test: update tests * fix: fix ruff check * fix: fix databricks config import * fix: handle _tool = none * test: skip invalid test	2025-04-04 22:13:32 -07:00
Krish Dholakia	af42e5855f	Gemini image generation output support (#9646 ) * fix(gemini/transformation.py): make GET request to get uri details, if cannot be inferred * fix: fix linting errors * Revert "fix: fix linting errors" This reverts commit `926a5a527f`. * fix(gemini/transformation.py): modalities param support Partially resolves https://github.com/BerriAI/litellm/issues/9237 * feat(google_ai_studio/): add image generation support Closes https://github.com/BerriAI/litellm/issues/9237 * fix: fix types * fix: fix ruff check	2025-04-04 20:37:48 -07:00
Krish Dholakia	90a4dfab3c	fix(xai/chat/transformation.py): filter out 'name' param for xai non-… (#9761 ) * fix(xai/chat/transformation.py): filter out 'name' param for xai non-user roles Fixes https://github.com/BerriAI/litellm/issues/9720 * test fix test_hf_chat_template --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2025-04-04 20:37:08 -07:00
Krrish Dholakia	b5851769fc	fix: fix import	2025-04-04 20:26:11 -07:00
Ishaan Jaff	5c2bc796b1	test fix test_hf_chat_template	2025-04-04 16:45:20 -07:00
Krrish Dholakia	ad90871ad6	fix(factory.py): don't pass cache control if not set bedrock invoke does not support this	2025-04-04 12:37:34 -07:00
Albert Örwall	bd5a8d582b	Fix prompt caching for Anthropic tool calls (#9706 ) * Add prompt cache support to Anhtropic tool calls * Fix linting issue and add test	2025-04-03 20:19:21 -07:00
Ishaan Jaff	ef6bf02ac4	test_nova_optional_params_tool_choice All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 17s Details Helm unit test / unit-test (push) Successful in 23s Details	2025-04-03 12:27:21 -07:00
Krish Dholakia	6dda1ba6dd	LiteLLM Minor Fixes & Improvements (04/02/2025) (#9725 ) * Add date picker to usage tab + Add reasoning_content token tracking across all providers on streaming (#9722) * feat(new_usage.tsx): add date picker for new usage tab allow user to look back on their usage data * feat(anthropic/chat/transformation.py): report reasoning tokens in completion token details allows usage tracking on how many reasoning tokens are actually being used * feat(streaming_chunk_builder.py): return reasoning_tokens in anthropic/openai streaming response allows tracking reasoning_token usage across providers * Fix update team metadata + fix bulk adding models on Ui (#9721) * fix(handle_add_model_submit.tsx): fix bulk adding models * fix(team_info.tsx): fix team metadata update Fixes https://github.com/BerriAI/litellm/issues/9689 * (v0) Unified file id - allow calling multiple providers with same file id (#9718) * feat(files_endpoints.py): initial commit adding 'target_model_names' support allow developer to specify all the models they want to call with the file * feat(files_endpoints.py): return unified files endpoint * test(test_files_endpoints.py): add validation test - if invalid purpose submitted * feat: more updates * feat: initial working commit of unified file id translation * fix: additional fixes * fix(router.py): remove model replace logic in jsonl on acreate_file enables file upload to work for chat completion requests as well * fix(files_endpoints.py): remove whitespace around model name * fix(azure/handler.py): return acreate_file with correct response type * fix: fix linting errors * test: fix mock test to run on github actions * fix: fix ruff errors * fix: fix file too large error * fix(utils.py): remove redundant var * test: modify test to work on github actions * test: update tests * test: more debug logs to understand ci/cd issue * test: fix test for respx * test: skip mock respx test fails on ci/cd - not clear why * fix: fix ruff check * fix: fix test * fix(model_connection_test.tsx): fix linting error * test: update unit tests	2025-04-03 11:48:52 -07:00
Ishaan Jaff	e0dbd328be	test_bedrock_nova_json.py All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 17s Details Helm unit test / unit-test (push) Successful in 23s Details	2025-04-03 08:37:59 -07:00
Ishaan Jaff	acf920a41a	Merge branch 'main' into litellm_fix_azure_o_series	2025-04-02 20:58:52 -07:00
Ishaan Jaff	d4a20d4fb8	test azure o series	2025-04-02 09:46:45 -07:00
Ishaan Jaff	83e4c34e0a	test fix get_base_completion_call_args	2025-04-02 09:18:56 -07:00
Krish Dholakia	053b0e741f	Add Google AI Studio `/v1/files` upload API support (#9645 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 16s Details Helm unit test / unit-test (push) Successful in 23s Details * test: fix import for test * fix: fix bad error string * docs: cleanup files docs * fix(files/main.py): cleanup error string * style: initial commit with a provider/config pattern for files api google ai studio files api onboarding * fix: test * feat(gemini/files/transformation.py): support gemini files api response transformation * fix(gemini/files/transformation.py): return file id as gemini uri allows id to be passed in to chat completion request, just like openai * feat(llm_http_handler.py): support async route for files api on llm_http_handler * fix: fix linting errors * fix: fix model info check * fix: fix ruff errors * fix: fix linting errors * Revert "fix: fix linting errors" This reverts commit `926a5a527f`. * fix: fix linting errors * test: fix test * test: fix tests	2025-04-02 08:56:58 -07:00
Ishaan Jaff	443b8ab93a	test_azure_o1_series_response_format_extra_params	2025-04-02 07:01:08 -07:00
Krish Dholakia	23051d89dd	fix(streaming_handler.py): fix completion start time tracking (#9688 ) * fix(streaming_handler.py): fix completion start time tracking Fixes https://github.com/BerriAI/litellm/issues/9210 * feat(anthropic/chat/transformation.py): map openai 'reasoning_effort' to anthropic 'thinking' param Fixes https://github.com/BerriAI/litellm/issues/9022 * feat: map 'reasoning_effort' to 'thinking' param across bedrock + vertex Closes https://github.com/BerriAI/litellm/issues/9022#issuecomment-2705260808	2025-04-01 22:00:56 -07:00
Ishaan Jaff	4b99f833bb	test_cohere_request_body_with_allowed_params	2025-04-01 21:30:24 -07:00
Ishaan Jaff	f7129e5e59	fix _apply_openai_param_overrides	2025-04-01 21:17:59 -07:00
Ishaan Jaff	c454dbec30	get_supported_openai_params for o-1 series models	2025-04-01 19:03:50 -07:00
Krish Dholakia	5ad2fbcba6	Openrouter streaming fixes + Anthropic 'file' message support (#9667 ) * fix(openrouter/transformation.py): Handle error in openrouter stream Fixes https://github.com/Aider-AI/aider/issues/3550 * test(test_openrouter_chat_transformation.py): add unit tests * feat(anthropic/chat/transformation.py): add openai 'file' message content type support Closes https://github.com/BerriAI/litellm/issues/9463 * fix(factory.py): add bedrock converse support for openai 'file' message content type Closes https://github.com/BerriAI/litellm/issues/9463	2025-03-31 21:22:59 -07:00
Krrish Dholakia	b37cb642c1	test: fix b64 str	2025-03-31 17:10:06 -07:00
Krish Dholakia	5c107c64dd	Add gemini audio input support + handle special tokens in sagemaker response (#9640 ) * fix(internal_user_endpoints.py): cleanup unused variables on beta endpoint no team/org split on daily user endpoint * build(model_prices_and_context_window.json): gemini-2.0-flash supports audio input * feat(gemini/transformation.py): support passing audio input to gemini * test: fix test * fix(gemini/transformation.py): support audio input as a url enables passing google cloud bucket urls * fix(gemini/transformation.py): support explicitly passing format of file * fix(gemini/transformation.py): expand support for inferred file types from url * fix(sagemaker/completion/transformation.py): fix special token error when counting sagemaker tokens * test: fix import	2025-03-29 19:23:09 -07:00
Krish Dholakia	5ac61a7572	Add bedrock latency optimized inference support (#9623 ) * fix(converse_transformation.py): add performanceConfig param support on bedrock Closes https://github.com/BerriAI/litellm/issues/7606 * fix(converse_transformation.py): refactor to use more flexible single getter for params which are separate config blocks * test(test_main.py): add e2e mock test for bedrock performance config * build(model_prices_and_context_window.json): add versioned multimodal embedding * refactor(multimodal_embeddings/): migrate to config pattern * feat(vertex_ai/multimodalembeddings): calculate usage for multimodal embedding calls Enables cost calculation for multimodal embeddings * feat(vertex_ai/multimodalembeddings): get usage object for embedding calls ensures accurate cost tracking for vertexai multimodal embedding calls * fix(embedding_handler.py): remove unused imports * fix: fix linting errors * fix: handle response api usage calculation * test(test_vertex_ai_multimodal_embedding_transformation.py): update tests * test: mark flaky test * feat(vertex_ai/multimodal_embeddings/transformation.py): support text+image+video input * docs(vertex.md): document sending text + image to vertex multimodal embeddings * test: remove incorrect file * fix(multimodal_embeddings/transformation.py): fix linting error * style: remove unused import	2025-03-29 00:23:09 -07:00
Krish Dholakia	222898d727	Fix anthropic thinking + response_format (#9594 ) * fix(anthropic/chat/transformation.py): Don't set tool choice on response_format conversion when thinking is enabled Not allowed by Anthropic Fixes https://github.com/BerriAI/litellm/issues/8901 * refactor: move test to base anthropic chat tests ensures consistent behaviour across vertex/anthropic/bedrock * fix(anthropic/chat/transformation.py): if thinking token is specified and max tokens is not - ensure max token to anthropic is higher than thinking tokens * feat(converse_transformation.py): correctly handle thinking + response format on Bedrock Converse Fixes https://github.com/BerriAI/litellm/issues/8901 * fix(converse_transformation.py): correctly handle adding max tokens * test: handle service unavailable error	2025-03-28 15:57:40 -07:00
Krish Dholakia	0865e52db3	fix(proxy_server.py): get master key from environment, if not set in … (#9617 ) * fix(proxy_server.py): get master key from environment, if not set in general settings or general settings not set at all * test: mark flaky test * test(test_proxy_server.py): mock prisma client * ci: add new github workflow for testing just the mock tests * fix: fix linting error * ci(conftest.py): add conftest.py to isolate proxy tests * build(pyproject.toml): add respx to dev dependencies * build(pyproject.toml): add prisma to dev dependencies * test: fix mock prompt management tests to use a mock anthropic key * ci(test-litellm.yml): parallelize mock testing make it run faster * build(pyproject.toml): add hypercorn as dev dep * build(pyproject.toml): separate proxy vs. core dev dependencies make it easier for non-proxy contributors to run tests locally - e.g. no need to install hypercorn * ci(test-litellm.yml): pin python version * test(test_rerank.py): move test - cannot be mocked, requires aws credentials for e2e testing * ci: add thank you message to ci * test: add mock env var to test * test: add autouse to tests * test: test mock env vars for e2e tests	2025-03-28 12:32:04 -07:00
Krish Dholakia	c0845fec1f	Add OpenAI gpt-4o-transcribe support (#9517 ) * refactor: introduce new transformation config for gpt-4o-transcribe models * refactor: expose new transformation configs for audio transcription * ci: fix config yml * feat(openai/transcriptions): support provider config transformation on openai audio transcriptions allows gpt-4o and whisper audio transformation to work as expected * refactor: migrate fireworks ai + deepgram to new transform request pattern * feat(openai/): working support for gpt-4o-audio-transcribe * build(model_prices_and_context_window.json): add gpt-4o-transcribe to model cost map * build(model_prices_and_context_window.json): specify what endpoints are supported for `/audio/transcriptions` * fix(get_supported_openai_params.py): fix return * refactor(deepgram/): migrate unit test to deepgram handler * refactor: cleanup unused imports * fix(get_supported_openai_params.py): fix linting error * test: update test	2025-03-26 23:10:25 -07:00
Krish Dholakia	6fd18651d1	Support `litellm.api_base` for vertex_ai + gemini/ across completion, embedding, image_generation (#9516 ) All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 19s Details Helm unit test / unit-test (push) Successful in 20s Details * test(tests): add unit testing for litellm_proxy integration * fix(cost_calculator.py): fix tracking cost in sdk when calling proxy * fix(main.py): respect litellm.api_base on `vertex_ai/` and `gemini/` routes * fix(main.py): consistently support custom api base across gemini + vertexai on embedding + completion * feat(vertex_ai/): test * fix: fix linting error * test: set api base as None before starting loadtest	2025-03-25 23:46:20 -07:00

1 2 3 4 5 ...

300 commits