litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-26 11:14:04 +00:00

Author	SHA1	Message	Date
Krish Dholakia	6eb2346fd6	QA: ensure all bedrock regional models have same `supported_` as base + Anthropic nested pydantic object support (#7844 ) * build: ensure all regional bedrock models have same supported values as base bedrock model prevents drift * test(base_llm_unit_tests.py): add testing for nested pydantic objects * fix(test_utils.py): add test_get_potential_model_names * fix(anthropic/chat/transformation.py): support nested pydantic objects Fixes https://github.com/BerriAI/litellm/issues/7755	2025-01-17 19:49:12 -08:00
Ishaan Jaff	2e91ab52ec	_handle_tool_call_message linting	2025-01-16 22:34:16 -08:00
Ishaan Jaff	b6d1ab6152	test_completion_mistral_api_mistral_large_function_call	2025-01-16 22:27:48 -08:00
Ishaan Jaff	e89c3bad5f	sec fix minor (#7810 )	2025-01-16 22:03:28 -08:00
Ishaan Jaff	559006643d	(fix) IBM Watsonx using ZenApiKey (#7821 ) * ibm watsonx fix * test ZenAPIKey * fix zenapikey	2025-01-16 22:02:36 -08:00
Krish Dholakia	543655adc7	Litellm dev 01 14 2025 p2 (#7772 ) * feat(pass_through_endpoints.py): fix anthropic end user cost tracking * fix(anthropic/chat/transformation.py): use returned provider model for anthropic handles anthropic `-latest` tag in request body throwing cost calculation errors ensures we can be accurate in our model cost tracking * feat(model_prices_and_context_window.json): add gemini-2.0-flash-thinking-exp pricing * test: update test to use assumption that user_api_key_dict can get anthropic user id * test: fix test * fix: fix test * fix(anthropic_pass_through.py): uncomment previous anthropic end-user cost tracking code block can't guarantee user api key dict always has end user id - too many code paths * fix(user_api_key_auth.py): this allows end user id from request body to always be read and set in auth object * fix(auth_check.py): fix linting error * test: fix auth check * fix(auth_utils.py): fix get end user id to handle metadata = None	2025-01-15 21:34:50 -08:00
Krish Dholakia	26c1b86f4e	Litellm dev 01 2025 p4 (#7776 ) * fix(gemini/): support gemini 'frequency_penalty' and 'presence_penalty' Closes https://github.com/BerriAI/litellm/issues/7748 * feat(proxy_server.py): new env var to disable prisma health check on startup * test: fix test	2025-01-14 21:49:25 -08:00
Krish Dholakia	142662a504	build(pyproject.toml): bump uvicorn depedency requirement (#7773 ) * build(pyproject.toml): bump uvicorn depedency requirement Fixes https://github.com/BerriAI/litellm/issues/7768 * fix(anthropic/chat/transformation.py): fix is_vertex_request check to actually use optional param passed in Fixes https://github.com/BerriAI/litellm/issues/6898#issuecomment-2590860695 * fix(o1_transformation.py): fix azure o1 'is_o1_model' check to just check for o1 in model string https://github.com/BerriAI/litellm/issues/7743 * test: load vertex creds	2025-01-14 21:47:11 -08:00
Ishaan Jaff	6a4e8c33b3	(fix) `BaseAWSLLM` - cache IAM role credentials when used (#7775 ) * fix base aws llm * fix auth with aws role * test aws base llm * fix base aws llm init * run ci/cd again * fix get_credentials * ci/cd run again * _auth_with_aws_role	2025-01-14 20:16:22 -08:00
Krish Dholakia	d7a13ad561	Support temporary budget increases on keys (#7754 ) * fix(gpt_transformation.py): fix response_format translation check for 4o models Fixes https://github.com/BerriAI/litellm/issues/7616 * feat(key_management_endpoints.py): support 'temp_budget_increase' and 'temp_budget_expiry' fields Allow proxy admin to grant temporary budget increases to keys * fix(proxy/_types.py): enforce temp_budget_increase and temp_budget_expiry are always passed together * feat(user_api_key_auth.py): initial working temp budget increase logic ensures key budget exceeded error checks for temp budget in key metadata * feat(proxy_server.py): return the key max budget and key spend in the response headers Allows clientside user to know their remaining limits * test: add unit testing for new proxy utils Ensures new key budget is correctly handled * docs(temporary_budget_increase.md): add doc on temporary budget increase * fix(utils.py): remove 3.5 from response_format check for now not all azure 3.5 models support response_format * fix(user_api_key_auth.py): return valid user api key auth object on all paths	2025-01-14 17:03:11 -08:00
Krish Dholakia	01e2e26bd1	add azure o1 pricing (#7715 ) * build(model_prices_and_context_window.json): add azure o1 pricing Closes https://github.com/BerriAI/litellm/issues/7712 * refactor: replace regex with string method for whitespace check in stop-sequences handling (#7713) * Allows overriding keep_alive time in ollama (#7079) * Allows overriding keep_alive time in ollama * Also adds to ollama_chat * Adds some info on the docs about this parameter * fix: together ai warning (#7688) Co-authored-by: Carl Senze <carl.senze@aleph-alpha.com> * fix(proxy_server.py): handle config containing thread locked objects when using get_config_state * fix(proxy_server.py): add exception to debug * build(model_prices_and_context_window.json): update 'supports_vision' for azure o1 --------- Co-authored-by: Wolfram Ravenwolf <52386626+WolframRavenwolf@users.noreply.github.com> Co-authored-by: Regis David Souza Mesquita <github@rdsm.dev> Co-authored-by: Carl <45709281+capsenz@users.noreply.github.com> Co-authored-by: Carl Senze <carl.senze@aleph-alpha.com>	2025-01-12 18:15:35 -08:00
Krish Dholakia	8ee79dd5d9	[BETA] Add OpenAI `/images/variations` + Topaz API support (#7700 ) * feat(main.py): initial commit for `/image/variations` endpoint support * refactor(base_llm/): introduce new base llm base config for image variation endpoints * refactor(openai/image_variations/transformation.py): implement openai image variation transformation handler * fix: test * feat(openai/): working openai `/image/variation` endpoint calls via sdk * feat(topaz/): topaz sync image variation call support Addresses https://github.com/BerriAI/litellm/issues/7593 ' * fix(topaz/transformation.py): fix linting errors * fix(openai/image_variations/handler.py): fix passing json data * fix(main.py): image_variation/ support async image variation route - `aimage_variation` * fix(test_get_model_info.py): fix test * fix: cleanup unused imports * feat(openai/): add async `/image/variations` endpoint support * feat(topaz/): support async `/image/variations` calls * fix: test * fix(utils.py): fix get_model_info_helper for no model info w/ provider config handles situation where model info is not known but provider config exists * test(test_router_fallbacks.py): mark flaky test * fix: fix unused imports * test: bump otel load test perf threshold - accounts for current load tests hitting same server	2025-01-11 23:27:46 -08:00
Krish Dholakia	953c021aa7	Litellm dev 01 10 2025 p3 (#7682 ) * feat(langfuse.py): log the used prompt when prompt management used * test: fix test * docs(self_serve.md): add doc on restricting personal key creation on ui * feat(s3.py): support s3 logging with team alias prefixes (if available) New preview feature * fix(main.py): remove old if block - simplify to just await if coroutine returned fixes lm_studio async embedding error * fix(langfuse.py): handle get prompt check	2025-01-10 21:56:42 -08:00
Krish Dholakia	75c3ddfc9e	fix(vertex_ai/gemini/transformation.py): handle 'http://' in gemini p… (#7660 ) * fix(vertex_ai/gemini/transformation.py): handle 'http://' in gemini process url * refactor(router.py): refactor '_prompt_management_factory' to use logging obj get_chat_completion logic deduplicates code * fix(litellm_logging.py): update 'get_chat_completion_prompt' to update logging object messages * docs(prompt_management.md): update prompt management to be in beta given feedback - this still needs to be revised (e.g. passing in user message, not ignoring) * refactor(prompt_management_base.py): introduce base class for prompt management allows consistent behaviour across prompt management integrations * feat(prompt_management_base.py): support adding client message to template message + refactor langfuse prompt management to use prompt management base * fix(litellm_logging.py): log prompt id + prompt variables to langfuse if set allows tracking what prompt was used for what purpose * feat(litellm_logging.py): log prompt management metadata in standard logging payload + use in langfuse allows logging prompt id / prompt variables to langfuse * test: fix test * fix(router.py): cleanup unused imports * fix: fix linting error * fix: fix trace param typing * fix: fix linting errors * fix: fix code qa check	2025-01-10 07:31:59 -08:00
Ishaan Jaff	19cac744f8	(Feat - Batches API) add support for retrieving vertex api batch jobs (#7661 ) * add _async_retrieve_batch * fix aretrieve_batch * fix _get_batch_id_from_vertex_ai_batch_response * fix batches docs	2025-01-09 18:35:03 -08:00
Ishaan Jaff	7d680d411d	(proxy - RPS) - Get 2K RPS at 4 instances, minor fix `aiohttp_openai/` (#7659 ) * speed up transform_response * use 2 workers * undo changes to uvicorn * ci/cd run again	2025-01-09 17:24:18 -08:00
Krish Dholakia	6d8cfeaf14	LiteLLM Minor Fixes & Improvements (01/08/2025) - p2 (#7643 ) * fix(streaming_chunk_builder_utils.py): add test for groq tool calling + streaming + combine chunks Addresses https://github.com/BerriAI/litellm/issues/7621 * fix(streaming_utils.py): fix modelresponseiterator for openai like chunk parser ensures chunk parser uses the correct tool call id when translating the chunk Fixes https://github.com/BerriAI/litellm/issues/7621 * build(model_hub.tsx): display cost pricing on model hub * build(model_hub.tsx): show cost per token pricing + complete model information * fix(types/utils.py): fix usage object handling	2025-01-08 19:45:19 -08:00
Krish Dholakia	b769b826d0	Litellm dev 01 07 2025 p2 (#7622 ) * build(ui/): update ui * fix: drop unsupported non-whitespace characters for real when calling… (#7484) * fix: drop unsupported non-whitespace characters for real when calling anthropic with stop sequences * test: add parameterized test for _map_stop_sequences method in AnthropicConfig --------- Co-authored-by: Wolfram Ravenwolf <52386626+WolframRavenwolf@users.noreply.github.com>	2025-01-08 16:56:39 -08:00
Krish Dholakia	3d0a044f7c	Litellm dev 01 01 2025 p2 (#7615 ) * fix(utils.py): prevent double logging when passing 'fallbacks=' to .completion() Fixes https://github.com/BerriAI/litellm/issues/7477 * fix(utils.py): fix vertex anthropic check * fix(utils.py): ensure supported params is always set Fixes https://github.com/BerriAI/litellm/issues/7470 * test(test_optional_params.py): add unit testing to prevent mistranslation Fixes https://github.com/BerriAI/litellm/issues/7470 * fix: fix linting error * test: cleanup	2025-01-07 21:40:33 -08:00
Ishaan Jaff	5d3ae6da62	`aiohttp_openai/` fixes - allow using `aiohttp_openai/gpt-4o` (#7598 ) * fixes for get_complete_url * update aiohttp tests * fix event loop for aiohtto * ci/cd run again * test_aiohttp_openai	2025-01-06 21:39:11 -08:00
Krish Dholakia	0af875c7df	Litellm dev 01 06 2025 p2 (#7597 ) * test(test_amazing_vertex_completion.py): fix test * test: initial working code gecko test * fix(vertex_ai_non_gemini.py): support vertex ai code gecko fake streaming Fixes https://github.com/BerriAI/litellm/issues/7360 * test(test_get_model_info.py): add test for getting custom provider model info Covers https://github.com/BerriAI/litellm/issues/7575 * fix(utils.py): fix get_provider_model_info check Handle custom llm provider scenario Fixes https://github.com/ BerriAI/litellm/issues/7575	2025-01-06 21:04:49 -08:00
Ishaan Jaff	7ac1c0c1b2	(perf) - fixes for aiohttp handler to hit 1K RPS (#7590 ) * fix getting aiohttp sesson * fix _get_async_client_session	2025-01-06 15:41:39 -08:00
Krish Dholakia	8a5e74d519	fix(groq/chat/transformation.py): fix groq response_format transformation (#7565 ) Fixes https://github.com/BerriAI/litellm/issues/4804	2025-01-04 19:39:04 -08:00
Krish Dholakia	db82b3bb2a	feat(router.py): support request prioritization for text completion c… (#7540 ) * feat(router.py): support request prioritization for text completion calls * fix(internal_user_endpoints.py): fix sql query to return all keys, including null team id keys on `/user/info` Fixes https://github.com/BerriAI/litellm/issues/7485 * fix: fix linting errors * fix: fix linting error * test(test_router_helper_utils.py): add direct test for '_schedule_factory' Fixes code qa test	2025-01-03 19:35:44 -08:00
Krish Dholakia	fb1272b46b	Support checking provider-specific `/models` endpoints for available models based on key (#7538 ) * test(test_utils.py): initial test for valid models Addresses https://github.com/BerriAI/litellm/issues/7525 * fix: test * feat(fireworks_ai/transformation.py): support retrieving valid models from fireworks ai endpoint * refactor(fireworks_ai/): support checking model info on `/v1/models` route * docs(set_keys.md): update docs to clarify check llm provider api usage * fix(watsonx/common_utils.py): support 'WATSONX_ZENAPIKEY' for iam auth * fix(watsonx): read in watsonx token from env var * fix: fix linting errors * fix(utils.py): fix provider config check * style: cleanup unused imports	2025-01-03 19:29:59 -08:00
Krish Dholakia	5d4ab0a123	Revert "fix: add missing parameters order, limit, before, and after in get_as…" (#7542 ) This reverts commit `4b0505dffd`.	2025-01-03 16:32:12 -08:00
Ishaan Jaff	dc10cebde7	fix _make_common_async_call	2025-01-03 15:15:38 -08:00
Ishaan Jaff	6e2f712b06	(fix) `aiohttp_openai/` route - get to 1K RPS on single instance (#7539 ) * ClientSession * re use client_session * _init_client_session * fix aiohttp	2025-01-03 15:12:17 -08:00
Jean Carlo de Souza	08056b9aef	fix: add missing parameters order, limit, before, and after in get_assistants method for openai (#7537 ) - Ensured that `before` and `after` parameters are only passed when provided to avoid AttributeError. - Implemented safe access using default values for `before` and `after` to prevent missing attribute issues. - Added consistent handling of `order` and `limit` to improve flexibility and robustness in API calls.	2025-01-03 14:41:54 -08:00
Ishaan Jaff	6ffdd5c250	fix unused imports	2025-01-02 22:28:22 -08:00
Ishaan Jaff	3a454ee2ce	(perf) use `aiohttp` for `custom_openai` (#7514 ) * use aiohttp handler * BaseLLMAIOHTTPHandler * use CustomOpenAIChatConfig * CustomOpenAIChatConfig * CustomOpenAIChatConfig * fix linting * AiohttpOpenAIChatConfig * fix order * aiohttp_openai	2025-01-02 22:15:17 -08:00
Ishaan Jaff	2d57581307	Revert "(fix) GCS bucket logger - apply `truncate_standard_logging_payload_co…" (#7515 ) This reverts commit `26a37c50c9`.	2025-01-02 22:01:02 -08:00
Krish Dholakia	52f1e692fe	Litellm dev 01 02 2025 p2 (#7512 ) * feat(deepgram/transformation.py): support reading in deepgram api base from env var * fix(litellm_logging.py): make skipping log message a .info easier to see * docs(logging.md): add doc on turn off all tracking/logging for a request	2025-01-02 21:57:51 -08:00
Ishaan Jaff	3317619357	(fix) GCS bucket logger - apply `truncate_standard_logging_payload_content` to `standard_logging_payload` and ensure GCS flushes queue on fails (#7500 ) * use truncate_standard_logging_payload_content * update truncate_standard_logging_payload_content * update dd logger * update gcs async_send_batch * fix code check * test_datadog_payload_content_truncation * fix code quality	2025-01-01 20:21:01 -08:00
Krish Dholakia	b0f570ee16	Litellm dev 12 30 2024 p2 (#7495 ) * test(azure_openai_o1.py): initial commit with testing for azure openai o1 preview model * fix(base_llm_unit_tests.py): handle azure o1 preview response format tests skip as o1 on azure doesn't support tool calling yet * fix: initial commit of azure o1 handler using openai caller simplifies calling + allows fake streaming logic alr. implemented for openai to just work * feat(azure/o1_handler.py): fake o1 streaming for azure o1 models azure does not currently support streaming for o1 * feat(o1_transformation.py): support overriding 'should_fake_stream' on azure/o1 via 'supports_native_streaming' param on model info enables user to toggle on when azure allows o1 streaming without needing to bump versions * style(router.py): remove 'give feedback/get help' messaging when router is used Prevents noisy messaging Closes https://github.com/BerriAI/litellm/issues/5942 * fix(types/utils.py): handle none logprobs Fixes https://github.com/BerriAI/litellm/issues/328 * fix(exception_mapping_utils.py): fix error str unbound error * refactor(azure_ai/): move to openai_like chat completion handler allows for easy swapping of api base url's (e.g. ai.services.com) Fixes https://github.com/BerriAI/litellm/issues/7275 * refactor(azure_ai/): move to base llm http handler * fix(azure_ai/): handle differing api endpoints * fix(azure_ai/): make sure all unit tests are passing * fix: fix linting errors * fix: fix linting errors * fix: fix linting error * fix: fix linting errors * fix(azure_ai/transformation.py): handle extra body param * fix(azure_ai/transformation.py): fix max retries param handling * fix: fix test * test(test_azure_o1.py): fix test * fix(llm_http_handler.py): support handling azure ai unprocessable entity error * fix(llm_http_handler.py): handle sync invalid param error for azure ai * fix(azure_ai/): streaming support with base_llm_http_handler * fix(llm_http_handler.py): working sync stream calls with unprocessable entity handling for azure ai * fix: fix linting errors * fix(llm_http_handler.py): fix linting error * fix(azure_ai/): handle cohere tool call invalid index param error	2025-01-01 18:57:29 -08:00
Ishaan Jaff	0cbecbe185	(Feat) - LiteLLM Use `UsernamePasswordCredential` for Azure OpenAI (#7496 ) * add get_azure_ad_token_from_username_password * docs azure use username / password for auth * update doc * get_azure_ad_token_from_username_password * test test_get_azure_ad_token_from_username_password	2025-01-01 14:11:27 -08:00
Ishaan Jaff	0b4d529af8	(feat) POST `/fine_tuning/jobs` support passing vertex specific hyper params (#7490 ) * update convert_openai_request_to_vertex * test_create_vertex_fine_tune_jobs_mocked * fix order of methods * update LiteLLMFineTuningJobCreate * update OpenAIFineTuningHyperparameters * update vertex hyper params in response * _transform_openai_hyperparameters_to_vertex_hyperparameters * supervised_tuning_spec["hyperParameters"] fix * fix mapping for ft params testing * docs fine tuning apis * fix test_convert_basic_openai_request_to_vertex_request * update hyperparams for create fine tuning * fix linting * test_create_vertex_fine_tune_jobs_mocked_with_hyperparameters * run ci/cd again * test_convert_basic_openai_request_to_vertex_request	2025-01-01 07:44:48 -08:00
Sven Seeberg	9af6ba0a02	fix ollama embedding model response #7451 (#7473 )	2024-12-31 23:24:08 -08:00
Ishaan Jaff	29f16d4443	(fix) `v1/fine_tuning/jobs` with VertexAI (#7487 ) * update convert_openai_request_to_vertex * test_create_vertex_fine_tune_jobs_mocked	2024-12-31 15:09:56 -08:00
Krish Dholakia	0178e75cd9	Litellm dev 12 30 2024 p1 (#7480 ) * test(azure_openai_o1.py): initial commit with testing for azure openai o1 preview model * fix(base_llm_unit_tests.py): handle azure o1 preview response format tests skip as o1 on azure doesn't support tool calling yet * fix: initial commit of azure o1 handler using openai caller simplifies calling + allows fake streaming logic alr. implemented for openai to just work * feat(azure/o1_handler.py): fake o1 streaming for azure o1 models azure does not currently support streaming for o1 * feat(o1_transformation.py): support overriding 'should_fake_stream' on azure/o1 via 'supports_native_streaming' param on model info enables user to toggle on when azure allows o1 streaming without needing to bump versions * style(router.py): remove 'give feedback/get help' messaging when router is used Prevents noisy messaging Closes https://github.com/BerriAI/litellm/issues/5942 * test: fix azure o1 test * test: fix tests * fix: fix test	2024-12-30 21:52:52 -08:00
Krish Dholakia	75b5754376	Litellm dev 12 28 2024 p1 (#7463 ) * refactor(utils.py): migrate amazon titan config to base config * refactor(utils.py): refactor bedrock meta invoke model translation to use base config * refactor(utils.py): move bedrock ai21 to base config * refactor(utils.py): move bedrock cohere to base config * refactor(utils.py): move bedrock mistral to use base config * refactor(utils.py): move all provider optional param translations to using a config * docs(clientside_auth.md): clarify how to pass vertex region to litellm proxy * fix(utils.py): handle scenario where custom llm provider is none / empty * fix: fix get config * test(test_otel_load_tests.py): widen perf margin * fix(utils.py): fix get provider config check to handle custom llm's * fix(utils.py): fix check	2024-12-28 20:26:00 -08:00
Krish Dholakia	9150722a00	Litellm dev 12 28 2024 p2 (#7458 ) * docs(sidebar.js): docs for support model access groups for wildcard routes * feat(key_management_endpoints.py): add check if user is premium_user when adding model access group for wildcard route * refactor(docs/): make control model access a root-level doc in proxy sidebar easier to discover how to control model access on litellm * docs: more cleanup * feat(fireworks_ai/): add document inlining support Enables user to call non-vision models with images/pdfs/etc. * test(test_fireworks_ai_translation.py): add unit testing for fireworks ai transform inline helper util * docs(docs/): add document inlining details to fireworks ai docs * feat(fireworks_ai/): allow user to dynamically disable auto add transform inline allows client-side disabling of this feature for proxy users * feat(fireworks_ai/): return 'supports_vision' and 'supports_pdf_input' true on all fireworks ai models now true as fireworks ai supports document inlining * test: fix tests * fix(router.py): add unit testing for _is_model_access_group_for_wildcard_route	2024-12-28 19:38:06 -08:00
Krish Dholakia	ebc28b1921	Litellm dev 12 28 2024 p3 (#7464 ) * feat(deepgram/): initial e2e support for deepgram stt Uses deepgram's `/listen` endpoint to transcribe speech to text Closes https://github.com/BerriAI/litellm/issues/4875 * fix: fix linting errors * test: fix test	2024-12-28 19:18:58 -08:00
Ishaan Jaff	71f5e83ed3	(Refactor) - Re use litellm.completion/litellm.embedding etc for health checks (#7455 ) * add mode: realtime * add _realtime_health_check * test_realtime_health_check * azure _realtime_health_check * _realtime_health_check * Realtime Models * fix code quality * delete OAI / Azure custom health check code * simplest version of ahealth check * update tests * working health check post refactor * working aspeech health check * fix realtime health checks * test_audio_transcription_health_check * use get_audio_file_for_health_check * test_text_completion_health_check * ahealth_check * simplify health check code * update ahealth_check * fix import * fix unused imports * fix ahealth_check * fix local testing * test_async_realtime_health_check	2024-12-28 18:38:54 -08:00
Ishaan Jaff	4825e3eee5	(Bug Fix) Add health check support for realtime models (#7453 ) * add mode: realtime * add _realtime_health_check * test_realtime_health_check * azure _realtime_health_check * _realtime_health_check * Realtime Models * fix code quality	2024-12-28 18:15:00 -08:00
Krish Dholakia	f832eb18ec	Litellm dev 12 27 2024 p2 1 (#7449 ) * fix(azure_ai/transformation.py): route ai.services.azure calls to the azure provider route requires token to be passed in as 'api-key' Closes https://github.com/BerriAI/litellm/issues/7275 * fix(key_management_endpoints.py): enforce user is member of team, if team_id set and team_id exists in team table * fix(key_management_endpoints.py): handle assigned_user_id = none * feat(create_key_button.tsx): allow assigning keys to other users allows proxy admin to easily assign other people keys * build(create_key_button.tsx): fix error message display don't swallow the error message for key creation failure * build(create_key_button.tsx): allow proxy admin to edit team id * build(create_key_button.tsx): allow proxy admin to assign keys to other users * build(edit_user.tsx): clarify how 'user budgets' are applied * test: remove dup test * fix(key_management_endpoints.py): don't raise error if team not in db ' * test: fix test	2024-12-27 20:02:32 -08:00
Ishaan Jaff	72c65672ac	(Feat) - new endpoint `GET /v1/fine_tuning/jobs/{fine_tuning_job_id:path}` (#7427 ) * init commit ft jobs logging * add ft logging * add logging for FineTuningJob * simple FT Job create test * simplify Azure fine tuning to use all methods in OAI ft * update doc string * add aretrieve_fine_tuning_job * re use from litellm.proxy.utils import handle_exception_on_proxy * fix naming * add /fine_tuning/jobs/{fine_tuning_job_id:path} * remove unused imports * update func signature * run ci/cd again * ci/cd run again * fix code qulity * ci/cd run again	2024-12-27 17:01:14 -08:00
Krish Dholakia	7af1f8a0c7	Litellm dev 12 25 2025 p2 (#7420 ) * test: add new test image embedding to base llm unit tests Addresses https://github.com/BerriAI/litellm/issues/6515 * fix(bedrock/embed/multimodal-embeddings): strip data prefix from image urls for bedrock multimodal embeddings Fix https://github.com/BerriAI/litellm/issues/6515 * feat: initial commit for fireworks ai audio transcription support Relevant issue: https://github.com/BerriAI/litellm/issues/7134 * test: initial fireworks ai test * feat(fireworks_ai/): implemented fireworks ai audio transcription config * fix(utils.py): register fireworks ai audio transcription config, in config manager * fix(utils.py): add fireworks ai param translation to 'get_optional_params_transcription' * refactor(fireworks_ai/): define text completion route with model name handling moves model name handling to specific fireworks routes, as required by their api * refactor(fireworks_ai/chat): define transform_Request - allows fixing model if accounts/ is missing * fix: fix linting errors * fix: fix linting errors * fix: fix linting errors * fix: fix linting errors * fix(handler.py): fix linting errors * fix(main.py): fix tgai text completion route * refactor(together_ai/completion): refactors together ai text completion route to just use provider transform request * refactor: move test_fine_tuning_api out of local_testing reduces local testing ci/cd time	2024-12-25 18:35:34 -08:00
Krish Dholakia	b961f96b35	Litellm dev 12 25 2024 p1 (#7411 ) * test(test_watsonx.py): e2e unit test for watsonx custom header covers https://github.com/BerriAI/litellm/issues/7408 * fix(common_utils.py): handle auth token already present in headers (watsonx + openai-like base handler) Fixes https://github.com/BerriAI/litellm/issues/7408 * fix(watsonx/chat): fix chat route Fixes https://github.com/BerriAI/litellm/issues/7408 * fix(huggingface/chat/handler.py): fix huggingface async completion calls * Correct handling of max_retries=0 to disable AzureOpenAI retries (#7379) * test: fix test --------- Co-authored-by: Minh Duc <phamminhduc0711@gmail.com>	2024-12-25 17:36:30 -08:00
Krish Dholakia	f929a1f309	Litellm dev 12 24 2024 p4 (#7407 ) * fix(invoke_handler.py): fix mock response iterator to handle tool calling returns tool call if returned by model response * fix(prometheus.py): add new 'tokens_by_tag' metric on prometheus allows tracking 'token usage' by task * feat(prometheus.py): add input + output token tracking by tag * feat(prometheus.py): add tag based deployment failure tracking allows admin to track failure by use-case	2024-12-24 20:24:06 -08:00

1 2 3 4 5 ...

1904 commits