litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-26 19:24:27 +00:00

History

Krish Dholakia 1a4910f6c0 fix(health.md): add rerank model health check information (#7295 ) * fix(health.md): add rerank model health check information * build(model_prices_and_context_window.json): add gemini 2.0 for google ai studio - pricing + commercial rate limits * build(model_prices_and_context_window.json): add gemini-2.0 supports audio output = true * docs(team_model_add.md): clarify allowing teams to add models is an enterprise feature * fix(o1_transformation.py): add support for 'n', 'response_format' and 'stop' params for o1 and 'stream_options' param for o1-mini * build(model_prices_and_context_window.json): add 'supports_system_message' to supporting openai models needed as o1-preview, and o1-mini models don't support 'system message * fix(o1_transformation.py): translate system message based on if o1 model supports it * fix(o1_transformation.py): return 'stream' param support if o1-mini/o1-preview o1 currently doesn't support streaming, but the other model versions do Fixes https://github.com/BerriAI/litellm/issues/7292 * fix(o1_transformation.py): return tool calling/response_format in supported params if model map says so Fixes https://github.com/BerriAI/litellm/issues/7292 * fix: fix linting errors * fix: update '_transform_messages' * fix(o1_transformation.py): fix provider passed for supported param checks * test(base_llm_unit_tests.py): skip test if api takes >5s to respond * fix(utils.py): return false in 'supports_factory' if can't find value * fix(o1_transformation.py): always return stream + stream_options as supported params + handle stream options being passed in for azure o1 * feat(openai.py): support stream faking natively in openai handler Allows o1 calls to be faked for just the "o1" model, allows native streaming for o1-mini, o1-preview Fixes https://github.com/BerriAI/litellm/issues/7292 * fix(openai.py): use inference param instead of original optional param		2024-12-18 19:18:10 -08:00
..
audio_utils	fix import error	2024-09-05 10:09:44 -07:00
llm_cost_calc	LiteLLM Minor Fixes & Improvements (12/16/2024) - p1 (#7263 )	2024-12-17 15:33:36 -08:00
llm_response_utils	LiteLLM Minor Fixes & Improvements (12/16/2024) - p1 (#7263 )	2024-12-17 15:33:36 -08:00
prompt_templates	fix(health.md): add rerank model health check information (#7295 )	2024-12-18 19:18:10 -08:00
tokenizers	Code Quality Improvement - remove `tokenizers/` from /llms (#7163 )	2024-12-10 23:50:15 -08:00
asyncify.py	build(config.yml): bump anyio version	2024-08-27 07:37:06 -07:00
core_helpers.py	Litellm dev 11 07 2024 (#6649 )	2024-11-08 19:34:22 +05:30
default_encoding.py	Code Quality Improvement - remove `tokenizers/` from /llms (#7163 )	2024-12-10 23:50:15 -08:00
duration_parser.py	(QOL improvement) Provider budget routing - allow using 1s, 1d, 1mo, 2mo etc (#6885 )	2024-11-23 16:59:46 -08:00
exception_mapping_utils.py	Litellm dev 12 13 2024 p1 (#7219 )	2024-12-13 19:01:28 -08:00
get_llm_provider_logic.py	(fix) unable to pass input_type parameter to Voyage AI embedding mode (#7276 )	2024-12-17 19:23:49 -08:00
get_supported_openai_params.py	(fix) unable to pass input_type parameter to Voyage AI embedding mode (#7276 )	2024-12-17 19:23:49 -08:00
json_validation_rule.py	feat(vertex_ai_anthropic.py): support response_schema for vertex ai anthropic calls	2024-07-18 16:57:38 -07:00
litellm_logging.py	LiteLLM Minor Fixes & Improvements (12/16/2024) - p1 (#7263 )	2024-12-17 15:33:36 -08:00
llm_request_utils.py	Litellm ruff linting enforcement (#5992 )	2024-10-01 19:44:20 -04:00
logging_utils.py	(refactor) use helper function `_assemble_complete_response_from_streaming_chunks` to assemble complete responses in caching and logging callbacks (#6220 )	2024-10-15 12:45:12 +05:30
mock_functions.py	test(router_code_coverage.py): check if all router functions are dire… (#6186 )	2024-10-14 22:44:00 -07:00
README.md	(QOL improvement) Provider budget routing - allow using 1s, 1d, 1mo, 2mo etc (#6885 )	2024-11-23 16:59:46 -08:00
realtime_streaming.py	Litellm dev 10 22 2024 (#6384 )	2024-10-22 21:18:54 -07:00
redact_messages.py	(feat) Allow enabling logging message / response for specific virtual keys (#7071 )	2024-12-06 21:25:36 -08:00
response_header_helpers.py	fix(utils.py): guarantee openai-compatible headers always exist in response	2024-09-28 21:08:15 -07:00
rules.py	Litellm dev 11 07 2024 (#6649 )	2024-11-08 19:34:22 +05:30
streaming_chunk_builder_utils.py	LiteLLM Minor Fixes & Improvements (12/05/2024) (#7051 )	2024-12-06 14:29:53 -08:00
streaming_handler.py	LiteLLM Minor Fixes & Improvements (12/16/2024) - p1 (#7263 )	2024-12-17 15:33:36 -08:00
token_counter.py	fix: Support WebP image format and avoid token calculation error (#7182 )	2024-12-12 14:32:39 -08:00

README.md

Folder Contents

This folder contains general-purpose utilities that are used in multiple places in the codebase.

Core files:

streaming_handler.py: The core streaming logic + streaming related helper utils
core_helpers.py: code used in types/ - e.g. map_finish_reason.
exception_mapping_utils.py: utils for mapping exceptions to openai-compatible error types.
default_encoding.py: code for loading the default encoding (tiktoken)
get_llm_provider_logic.py: code for inferring the LLM provider from a given model name.
duration_parser.py: code for parsing durations - e.g. "1d", "1mo", "10s"