litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-26 19:24:27 +00:00


Krish Dholakia b0f570ee16 Litellm dev 12 30 2024 p2 (#7495)	2025-01-01 18:57:29 -08:00

* test(azure_openai_o1.py): initial commit with testing for azure openai o1 preview model
* fix(base_llm_unit_tests.py): handle azure o1 preview response format tests
  skip as o1 on azure doesn't support tool calling yet
* fix: initial commit of azure o1 handler using openai caller
  simplifies calling + allows fake streaming logic alr. implemented for openai to just work
* feat(azure/o1_handler.py): fake o1 streaming for azure o1 models
  azure does not currently support streaming for o1
* feat(o1_transformation.py): support overriding 'should_fake_stream' on azure/o1 via 'supports_native_streaming' param on model info
  enables user to toggle on when azure allows o1 streaming without needing to bump versions
* style(router.py): remove 'give feedback/get help' messaging when router is used
  Prevents noisy messaging
  Closes https://github.com/BerriAI/litellm/issues/5942
* fix(types/utils.py): handle none logprobs
  Fixes https://github.com/BerriAI/litellm/issues/328
* fix(exception_mapping_utils.py): fix error str unbound error
* refactor(azure_ai/): move to openai_like chat completion handler
  allows for easy swapping of api base url's (e.g. ai.services.com)
  Fixes https://github.com/BerriAI/litellm/issues/7275
* refactor(azure_ai/): move to base llm http handler
* fix(azure_ai/): handle differing api endpoints
* fix(azure_ai/): make sure all unit tests are passing
* fix: fix linting errors
* fix: fix linting errors
* fix: fix linting error
* fix: fix linting errors
* fix(azure_ai/transformation.py): handle extra body param
* fix(azure_ai/transformation.py): fix max retries param handling
* fix: fix test
* test(test_azure_o1.py): fix test
* fix(llm_http_handler.py): support handling azure ai unprocessable entity error
* fix(llm_http_handler.py): handle sync invalid param error for azure ai
* fix(azure_ai/): streaming support with base_llm_http_handler
* fix(llm_http_handler.py): working sync stream calls with unprocessable entity handling for azure ai
* fix: fix linting errors
* fix(llm_http_handler.py): fix linting error
* fix(azure_ai/): handle cohere tool call invalid index param error
audio_utils	(Refactor) - Re use litellm.completion/litellm.embedding etc for health checks (#7455 )	2024-12-28 18:38:54 -08:00
llm_cost_calc	LiteLLM Minor Fixes & Improvements (12/23/2024) - p3 (#7394 )	2024-12-23 22:02:52 -08:00
llm_response_utils	LiteLLM Minor Fixes & Improvements (12/16/2024) - p1 (#7263 )	2024-12-17 15:33:36 -08:00
prompt_templates	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
specialty_caches	Fix team-based logging to langfuse + allow custom tokenizer on `/token_counter` endpoint (#7493 )	2024-12-31 23:18:41 -08:00
tokenizers	Code Quality Improvement - remove `tokenizers/` from /llms (#7163 )	2024-12-10 23:50:15 -08:00
asyncify.py	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
core_helpers.py	Litellm dev 12 30 2024 p2 (#7495 )	2025-01-01 18:57:29 -08:00
default_encoding.py	Code Quality Improvement - remove `tokenizers/` from /llms (#7163 )	2024-12-10 23:50:15 -08:00
duration_parser.py	(QOL improvement) Provider budget routing - allow using 1s, 1d, 1mo, 2mo etc (#6885 )	2024-11-23 16:59:46 -08:00
exception_mapping_utils.py	Litellm dev 12 30 2024 p2 (#7495 )	2025-01-01 18:57:29 -08:00
get_llm_provider_logic.py	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
get_supported_openai_params.py	Litellm dev 12 28 2024 p3 (#7464 )	2024-12-28 19:18:58 -08:00
health_check_utils.py	(Refactor) - Re use litellm.completion/litellm.embedding etc for health checks (#7455 )	2024-12-28 18:38:54 -08:00
initialize_dynamic_callback_params.py	Fix team-based logging to langfuse + allow custom tokenizer on `/token_counter` endpoint (#7493 )	2024-12-31 23:18:41 -08:00
json_validation_rule.py	feat(vertex_ai_anthropic.py): support response_schema for vertex ai anthropic calls	2024-07-18 16:57:38 -07:00
litellm_logging.py	(Feat) - Add PagerDuty Alerting Integration (#7478 )	2025-01-01 07:12:51 -08:00
llm_request_utils.py	Litellm ruff linting enforcement (#5992 )	2024-10-01 19:44:20 -04:00
logging_utils.py	Complete 'requests' library removal (#7350 )	2024-12-22 07:21:25 -08:00
mock_functions.py	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
README.md	(QOL improvement) Provider budget routing - allow using 1s, 1d, 1mo, 2mo etc (#6885 )	2024-11-23 16:59:46 -08:00
realtime_streaming.py	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
redact_messages.py	(feat) Allow enabling logging message / response for specific virtual keys (#7071 )	2024-12-06 21:25:36 -08:00
response_header_helpers.py	fix(utils.py): guarantee openai-compatible headers always exist in response	2024-09-28 21:08:15 -07:00
rules.py	Litellm dev 11 07 2024 (#6649 )	2024-11-08 19:34:22 +05:30
streaming_chunk_builder_utils.py	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
streaming_handler.py	Complete 'requests' library removal (#7350 )	2024-12-22 07:21:25 -08:00
token_counter.py	fix: Support WebP image format and avoid token calculation error (#7182 )	2024-12-12 14:32:39 -08:00

README.md

Folder Contents

This folder contains general-purpose utilities that are used in multiple places in the codebase.

Core files:

streaming_handler.py: the core streaming logic and streaming-related helper utilities.
core_helpers.py: helpers used in types/, e.g. map_finish_reason.
exception_mapping_utils.py: utilities for mapping exceptions to OpenAI-compatible error types.
default_encoding.py: code for loading the default encoding (tiktoken).
get_llm_provider_logic.py: code for inferring the LLM provider from a given model name.
duration_parser.py: code for parsing durations, e.g. "1d", "1mo", "10s".
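
To make the duration format concrete, here is a minimal sketch of how strings such as "10s", "1d", and "1mo" could be converted to seconds. It is not the code in duration_parser.py: the function name `parse_duration_to_seconds`, the unit table, and the choice to treat a month as a fixed 30 days are all assumptions made for illustration, and the real module may handle months and other units differently.

```python
import re

# Hypothetical unit table (not taken from litellm).
# Assumption: a month is treated as a fixed 30 days.
_UNIT_SECONDS = {
    "s": 1,
    "m": 60,
    "h": 60 * 60,
    "d": 24 * 60 * 60,
    "mo": 30 * 24 * 60 * 60,
}

# "mo" is listed before "m" so the two-letter unit is tried first.
_DURATION_RE = re.compile(r"^(\d+)(mo|s|m|h|d)$")


def parse_duration_to_seconds(duration: str) -> int:
    """Convert a string like '10s', '1d' or '2mo' into a number of seconds."""
    match = _DURATION_RE.match(duration.strip().lower())
    if match is None:
        raise ValueError(f"Unsupported duration: {duration!r}")
    value, unit = match.groups()
    return int(value) * _UNIT_SECONDS[unit]


if __name__ == "__main__":
    for example in ("10s", "1d", "1mo"):
        print(example, parse_duration_to_seconds(example))
```

A fixed 30-day month keeps the sketch short; a budget that must reset on calendar boundaries would need date arithmetic instead of a constant multiplier.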