litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-27 03:34:10 +00:00

Author	SHA1	Message	Date
Ishaan Jaff	7a69e93b33	fix merge conflicts	2024-12-11 01:08:43 -08:00
Ishaan Jaff	a3f8b88228	fix - handle merge conflicts	2024-12-11 01:06:40 -08:00
Krrish Dholakia	fd97b9d966	build: Squashed commit of https://github.com/BerriAI/litellm/pull/7170 Closes https://github.com/BerriAI/litellm/pull/7170	2024-12-11 01:03:57 -08:00
Ishaan Jaff	e02c4b8e9d	add enforce_llms_folder_style (#7175 )	2024-12-11 01:01:49 -08:00
Krrish Dholakia	6493eaf2ee	build: Squashed commit of https://github.com/BerriAI/litellm/pull/7165 Closes https://github.com/BerriAI/litellm/pull/7165	2024-12-11 01:00:33 -08:00
Ishaan Jaff	b79db3616c	(Refactor) Code Quality improvement - rename `text_completion_codestral.py` -> `codestral/completion/` (#7172 ) * rename files * fix codestral fim organization * fix CodestralTextCompletionConfig * fix import CodestralTextCompletion * fix BaseLLM * fix imports * fix CodestralTextCompletionConfig * fix imports CodestralTextCompletion	2024-12-11 00:55:47 -08:00
Ishaan Jaff	3afd7be40d	Code Quality Improvement - move `aleph_alpha` to deprecated_providers (#7168 ) * move aleph alpha to deprecated providers * fix import location * fix aleph_alpha * pytest skip * undo change to test file	2024-12-11 00:50:40 -08:00
Ishaan Jaff	e09d3761d8	Code Quality Improvement - use `vertex_ai/` as folder name for vertexAI (#7166 ) * fix rename vertex ai * run ci/cd again	2024-12-11 00:32:41 -08:00
Ishaan Jaff	26918487d6	(Refactor) Code Quality improvement - remove `/prompt_templates/` , `base_aws_llm.py` from `/llms` folder (#7164 ) * fix move base_aws_llm * fix import * update enforce llms folder style * move prompt_templates * update prompt_templates location * fix imports * fix imports * fix imports * fix imports * fix checks	2024-12-11 00:02:46 -08:00
dependabot[bot]	05731f698b	build(deps): bump nanoid from 3.3.7 to 3.3.8 in /docs/my-website (#7159 ) Bumps [nanoid](https://github.com/ai/nanoid) from 3.3.7 to 3.3.8. - [Release notes](https://github.com/ai/nanoid/releases) - [Changelog](https://github.com/ai/nanoid/blob/main/CHANGELOG.md) - [Commits](https://github.com/ai/nanoid/compare/3.3.7...3.3.8) --- updated-dependencies: - dependency-name: nanoid dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-12-10 23:51:05 -08:00
Ishaan Jaff	76a00247ea	Code Quality Improvement - remove `tokenizers/` from /llms (#7163 ) * move tokenizers out of /llms * use updated tokenizers location * fix test_google_secret_manager_read_in_memory	2024-12-10 23:50:15 -08:00
Krish Dholakia	93000bd8d3	Litellm merge pr (#7161 ) * build: merge branch * test: fix openai naming * fix(main.py): fix openai renaming * style: ignore function length for config factory * fix(sagemaker/): fix routing logic * fix: fix imports * fix: fix override	2024-12-10 22:49:26 -08:00
Krish Dholakia	cd9b92b402	Litellm vllm refactor (#7158 ) * refactor(vllm/): move vllm to use base llm config * test: mark flaky test	2024-12-10 21:48:35 -08:00
Krish Dholakia	e9fbefca5d	Litellm ollama refactor (#7162 ) * refactor(ollama/): refactor ollama `/api/generate` to use base llm config Addresses https://github.com/andrewyng/aisuite/issues/113#issuecomment-2512369132 * test: skip unresponsive test * test(test_secret_manager.py): mark flaky test * test: fix google sm test * fix: fix init.py	2024-12-10 21:45:35 -08:00
Krish Dholakia	6c6834dde7	Revert "LiteLLM Common Base LLM Config (pt.4): Move Ollama to Base LLM Config…" (#7160 ) This reverts commit `40a22eb4c6`.	2024-12-10 21:44:54 -08:00
Ishaan Jaff	91581bc2db	Code Quality Improvement - remove `file_apis`, `fine_tuning_apis` from `/llms` (#7156 ) * remove files_apis from /llms * fix imports * move fine tuning api from /llms * fix importing fine tuning handlers * fix imports	2024-12-10 21:44:25 -08:00
Krish Dholakia	71eaedac6f	LiteLLM Common Base LLM Config (pt.4): Move Ollama to Base LLM Config (#7157 ) * refactor(ollama/): refactor ollama `/api/generate` to use base llm config Addresses https://github.com/andrewyng/aisuite/issues/113#issuecomment-2512369132 * test: skip unresponsive test * test(test_secret_manager.py): mark flaky test * test: fix google sm test	2024-12-10 21:39:28 -08:00
Ishaan Jaff	d912e562ac	remove symlink (#7155 )	2024-12-10 21:04:21 -08:00
Ishaan Jaff	0cecff016e	fix import	2024-12-10 20:26:16 -08:00
Ishaan Jaff	5ad57dd54b	rename `llms/OpenAI/` -> `llms/openai/` (#7154 ) * rename OpenAI -> openai * fix file rename * fix rename changes * fix organization of openai/transcription * fix import OA fine tuning API * fix openai ft handler * fix handler import	2024-12-10 20:14:07 -08:00
Krish Dholakia	61afdab228	refactor(sagemaker/): separate chat + completion routes + make them b… (#7151 ) * refactor(sagemaker/): separate chat + completion routes + make them both use base llm config Addresses https://github.com/andrewyng/aisuite/issues/113#issuecomment-2512369132 * fix(main.py): pass hf model name + custom prompt dict to litellm params	2024-12-10 19:40:05 -08:00
Krish Dholakia	df12f87a64	LiteLLM Common Base LLM Config (pt.3): Move all OAI compatible providers to base llm config (#7148 ) * refactor(fireworks_ai/): inherit from openai like base config refactors fireworks ai to use a common config * test: fix import in test * refactor(watsonx/): refactor watsonx to use llm base config refactors chat + completion routes to base config path * fix: fix linting error * refactor: inherit base llm config for oai compatible routes * test: fix test * test: fix test	2024-12-10 17:12:42 -08:00
Krish Dholakia	4eeaaeeacd	refactor(fireworks_ai/): inherit from openai like base config (#7146 ) * refactor(fireworks_ai/): inherit from openai like base config refactors fireworks ai to use a common config * test: fix import in test * refactor(watsonx/): refactor watsonx to use llm base config refactors chat + completion routes to base config path * fix: fix linting error * test: fix test * fix: fix test	2024-12-10 16:15:19 -08:00
Ishaan Jaff	6a9225fac2	(Refactor) Code Quality improvement - stop redefining LiteLLMBase (#7147 ) * fix stop redefining LiteLLMBase * use better name for base pydantic obj	2024-12-10 15:49:01 -08:00
Krish Dholakia	97d70d2441	docs: document code quality (#7149 ) * docs: document code quality * build(readme.md): cleanup	2024-12-10 15:44:59 -08:00
Ishaan Jaff	0df4dc51de	(Refactor) Code Quality improvement - Use Common base handler for `anthropic_text/` (#7143 ) * add anthropic text provider * add ANTHROPIC_TEXT to LlmProviders * fix anthropic text implementation * working anthropic text claude-2 * test_acompletion_claude2_stream * add param mapping for anthropic text * fix unused imports * fix anthropic completion handler.py	2024-12-10 12:23:58 -08:00
Ishaan Jaff	1b377d5229	(Refactor) Code Quality improvement - Use Common base handler for Cohere /generate API (#7122 ) * use validate_environment in common utils * use transform request / response for cohere * remove unused file * use cohere base_llm_http_handler * working cohere generate api on llm http handler * streaming cohere generate api * fix get_model_response_iterator * fix streaming handler * fix get_model_response_iterator * test_cohere_generate_api_completion * fix linting error * fix testing cohere raising error * fix get_model_response_iterator type * add testing cohere generate api	2024-12-10 10:44:42 -08:00
Ishaan Jaff	9c2316b7ec	(Refactor) Code Quality improvement - Use Common base handler for `cloudflare/` provider (#7127 ) * add get_complete_url to base config * cloudflare - refactor to following existing pattern * migrate cloudflare chat completions to base llm http handler * fix unused import * fix fake stream in cloudflare * fix cloudflare transformation * fix naming for BaseModelResponseIterator * add async cloudflare streaming test * test cloudflare * add handler.py * add handler.py in cohere handler.py	2024-12-10 10:12:22 -08:00
Ishaan Jaff	28ff38e35d	(Refactor) Code Quality improvement - Use Common base handler for `clarifai/` (#7125 ) * use base_llm_http_handler for clarifai * fix clarifai completion * handle faking streaming base llm http handler * add fake streaming for clarifai * add FakeStreamResponseIterator for base model iterator * fix get_model_response_iterator * fix base model iterator * fix base model iterator * add support for faking sync streams clarfiai * add fake streaming for clarifai * remove unused code * fix import * fix llm http handler * test_async_completion_clarifai * fix clarifai tests * fix linting	2024-12-09 21:04:48 -08:00
Ishaan Jaff	c5e0407703	(Refactor) Code Quality improvement - use Common base handler for Cohere (#7117 ) * fix use new format for Cohere config * fix base llm http handler * Litellm code qa common config (#7116) * feat(base_llm): initial commit for common base config class Addresses code qa critique https://github.com/andrewyng/aisuite/issues/113#issuecomment-2512369132 * feat(base_llm/): add transform request/response abstract methods to base config class --------- Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com> * use base transform helpers * use base_llm_http_handler for cohere * working cohere using base llm handler * add async cohere chat completion support on base handler * fix completion code * working sync cohere stream * add async support cohere_chat * fix types get_model_response_iterator * async / sync tests cohere * feat cohere using base llm class * fix linting errors * fix _abc error * add cohere params to transformation * remove old cohere file * fix type error * fix merge conflicts * fix cohere merge conflicts * fix linting error * fix litellm.llms.custom_httpx.http_handler.HTTPHandler.post * fix passing cohere specific params --------- Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com>	2024-12-09 17:45:29 -08:00
Krish Dholakia	501885d653	Litellm code qa common config (#7113 ) * feat(base_llm): initial commit for common base config class Addresses code qa critique https://github.com/andrewyng/aisuite/issues/113#issuecomment-2512369132 * feat(base_llm/): add transform request/response abstract methods to base config class * feat(cohere-+-clarifai): refactor integrations to use common base config class * fix: fix linting errors * refactor(anthropic/): move anthropic + vertex anthropic to use base config * test: fix xai test * test: fix tests * fix: fix linting errors * test: comment out WIP test * fix(transformation.py): fix is pdf used check * fix: fix linting error	2024-12-09 15:58:25 -08:00
Krrish Dholakia	d8e6e5b89a	bump: version 1.54.0 → 1.54.1	2024-12-09 08:54:40 -08:00
Krish Dholakia	70c4e1b4d2	Litellm dev 12 07 2024 (#7086 ) * fix(main.py): support passing max retries to azure/openai embedding integrations Fixes https://github.com/BerriAI/litellm/issues/7003 * feat(team_endpoints.py): allow updating team model aliases Closes https://github.com/BerriAI/litellm/issues/6956 * feat(router.py): allow specifying model id as fallback - skips any cooldown check Allows a default model to be checked if all models in cooldown s/o @micahjsmith * docs(reliability.md): add fallback to specific model to docs * fix(utils.py): new 'is_prompt_caching_valid_prompt' helper util Allows user to identify if messages/tools have prompt caching Related issue: https://github.com/BerriAI/litellm/issues/6784 * feat(router.py): store model id for prompt caching valid prompt Allows routing to that model id on subsequent requests * fix(router.py): only cache if prompt is valid prompt caching prompt prevents storing unnecessary items in cache * feat(router.py): support routing prompt caching enabled models to previous deployments Closes https://github.com/BerriAI/litellm/issues/6784 * test: fix linting errors * feat(databricks/): convert basemodel to dict and exclude none values allow passing pydantic message to databricks * fix(utils.py): ensure all chat completion messages are dict * (feat) Track `custom_llm_provider` in LiteLLMSpendLogs (#7081) * add custom_llm_provider to SpendLogsPayload * add custom_llm_provider to SpendLogs * add custom llm provider to SpendLogs payload * test_spend_logs_payload * Add MLflow to the side bar (#7031) Signed-off-by: B-Step62 <yuki.watanabe@databricks.com> * (bug fix) SpendLogs update DB catch all possible DB errors for retrying (#7082) * catch DB_CONNECTION_ERROR_TYPES * fix DB retry mechanism for SpendLog updates * use DB_CONNECTION_ERROR_TYPES in auth checks * fix exp back off for writing SpendLogs * use _raise_failed_update_spend_exception to ensure errors print as NON blocking * test_update_spend_logs_multiple_batches_with_failure * (Feat) Add StructuredOutputs support for Fireworks.AI (#7085) * fix model cost map fireworks ai "supports_response_schema": true, * fix supports_response_schema * fix map openai params fireworks ai * test_map_response_format * test_map_response_format * added deepinfra/Meta-Llama-3.1-405B-Instruct (#7084) * bump: version 1.53.9 → 1.54.0 * fix deepinfra * litellm db fixes LiteLLM_UserTable (#7089) * ci/cd queue new release * fix llama-3.3-70b-versatile * refactor - use consistent file naming convention `AI21/` -> `ai21` (#7090) * fix refactor - use consistent file naming convention * ci/cd run again * fix naming structure * fix use consistent naming (#7092) --------- Signed-off-by: B-Step62 <yuki.watanabe@databricks.com> Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> Co-authored-by: Yuki Watanabe <31463517+B-Step62@users.noreply.github.com> Co-authored-by: ali sayyah <ali.sayyah2@gmail.com>	2024-12-08 00:30:33 -08:00
Ishaan Jaff	664d82ca9e	fix use consistent naming (#7092 )	2024-12-07 22:01:00 -08:00
Ishaan Jaff	249506065e	refactor - use consistent file naming convention `AI21/` -> `ai21` (#7090 ) * fix refactor - use consistent file naming convention * ci/cd run again * fix naming structure	2024-12-07 21:46:34 -08:00
Ishaan Jaff	2a35de0868	fix llama-3.3-70b-versatile	2024-12-07 20:19:02 -08:00
Ishaan Jaff	f34bde2eca	ci/cd queue new release	2024-12-07 19:09:57 -08:00
Ishaan Jaff	92a8f09655	litellm db fixes LiteLLM_UserTable (#7089 )	2024-12-07 19:08:37 -08:00
Ishaan Jaff	cdec1259b0	fix deepinfra	2024-12-07 19:07:38 -08:00
Ishaan Jaff	c35c3a6334	bump: version 1.53.9 → 1.54.0	2024-12-07 19:00:53 -08:00
ali sayyah	bf2b66e74a	added deepinfra/Meta-Llama-3.1-405B-Instruct (#7084 )	2024-12-07 18:58:28 -08:00
Ishaan Jaff	19597c77ba	(Feat) Add StructuredOutputs support for Fireworks.AI (#7085 ) * fix model cost map fireworks ai "supports_response_schema": true, * fix supports_response_schema * fix map openai params fireworks ai * test_map_response_format * test_map_response_format	2024-12-07 18:44:41 -08:00
Ishaan Jaff	b78eb6654d	(bug fix) SpendLogs update DB catch all possible DB errors for retrying (#7082 ) * catch DB_CONNECTION_ERROR_TYPES * fix DB retry mechanism for SpendLog updates * use DB_CONNECTION_ERROR_TYPES in auth checks * fix exp back off for writing SpendLogs * use _raise_failed_update_spend_exception to ensure errors print as NON blocking * test_update_spend_logs_multiple_batches_with_failure	2024-12-07 15:59:53 -08:00
Yuki Watanabe	6ec920d0b4	Add MLflow to the side bar (#7031 ) Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>	2024-12-07 14:30:32 -08:00
Ishaan Jaff	ed9ebf3489	(feat) Track `custom_llm_provider` in LiteLLMSpendLogs (#7081 ) * add custom_llm_provider to SpendLogsPayload * add custom_llm_provider to SpendLogs * add custom llm provider to SpendLogs payload * test_spend_logs_payload	2024-12-07 13:40:22 -08:00
Krrish Dholakia	37a0b0bb7b	bump: version 1.53.8 → 1.53.9	2024-12-06 23:10:41 -08:00
Krish Dholakia	20e8dc35e1	feat(langfuse/): support langfuse prompt management (#7073 ) * feat(langfuse/): support langfuse prompt management Initial working commit for langfuse prompt management support Closes https://github.com/BerriAI/litellm/issues/6269 * test: update test * fix(litellm_logging.py): suppress linting error	2024-12-06 23:10:22 -08:00
Krish Dholakia	df3da2e5d2	Litellm dev 12 06 2024 (#7067 ) * fix(edit_budget_modal.tsx): call `/budget/update` endpoint instead of `/budget/new` allows updating existing budget on ui * fix(user_api_key_auth.py): support cost tracking for end user via jwt field * fix(presidio.py): support pii masking on sync logging callbacks enables masking before logging to langfuse * feat(utils.py): support retry policy logic inside '.completion()' Fixes https://github.com/BerriAI/litellm/issues/6623 * fix(utils.py): support retry by retry policy on async logic as well * fix(handle_jwt.py): set leeway default leeway value * test: fix test to handle jwt audience claim	2024-12-06 22:44:18 -08:00
Ishaan Jaff	f564981556	bump: version 1.53.7 → 1.53.8	2024-12-06 21:32:52 -08:00
Ishaan Jaff	ce1e4b1d5e	(feat) Allow enabling logging message / response for specific virtual keys (#7071 ) * redact_message_input_output_from_logging * initialize_standard_callback_dynamic_params * allow dynamically opting out of redaction * test_redact_msgs_from_logs_with_dynamic_params * fix AddTeamCallback * _get_turn_off_message_logging_from_dynamic_params * test_global_redaction_with_dynamic_params * test_dynamic_turn_off_message_logging * docs Disable/Enable Message redaction * fix doe qual check * _get_turn_off_message_logging_from_dynamic_params	2024-12-06 21:25:36 -08:00

... 2 3 4 5 6 ...

18754 commits