litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-27 11:43:54 +00:00

History

Krish Dholakia a9038087cb Litellm dev 11 07 2024 (#6649 ) * fix(streaming_handler.py): save finish_reasons which might show up mid-stream (store last received one) Fixes https://github.com/BerriAI/litellm/issues/6104 * refactor: add readme to litellm_core_utils/ make it easier to navigate * fix(team_endpoints.py): return team id + object for invalid team in `/team/list` * fix(streaming_handler.py): remove import * fix(pattern_match_deployments.py): default to user input if unable to map based on wildcards (#6646) * fix(pattern_match_deployments.py): default to user input if unable to… (#6632) * fix(pattern_match_deployments.py): default to user input if unable to map based on wildcards * test: fix test * test: reset test name * test: update conftest to reload proxy server module between tests * ci(config.yml): move langfuse out of local_testing reduce ci/cd time * ci(config.yml): cleanup langfuse ci/cd tests * fix: update test to not use global proxy_server app module * ci: move caching to a separate test pipeline speed up ci pipeline * test: update conftest to check if proxy_server attr exists before reloading * build(conftest.py): don't block on inability to reload proxy_server * ci(config.yml): update caching unit test filter to work on 'cache' keyword as well * fix(encrypt_decrypt_utils.py): use function to get salt key * test: mark flaky test * test: handle anthropic overloaded errors * refactor: create separate ci/cd pipeline for proxy unit tests make ci/cd faster * ci(config.yml): add litellm_proxy_unit_testing to build_and_test jobs * ci(config.yml): generate prisma binaries for proxy unit tests * test: readd vertex_key.json * ci(config.yml): remove `-s` from proxy_unit_test cmd speed up test * ci: remove any 'debug' logging flag speed up ci pipeline * test: fix test * test(test_braintrust.py): rerun * test: add delay for braintrust test * chore: comment for maritalk (#6607) * Update gpt-4o-2024-08-06, and o1-preview, o1-mini models in model cost map (#6654) * Adding supports_response_schema to gpt-4o-2024-08-06 models * o1 models do not support vision --------- Co-authored-by: Emerson Gomes <emerson.gomes@thalesgroup.com> * (QOL improvement) add unit testing for all static_methods in litellm_logging.py (#6640) * add unit testing for standard logging payload * unit testing for static methods in litellm_logging * add code coverage check for litellm_logging * litellm_logging_code_coverage * test_get_final_response_obj * fix validate_redacted_message_span_attributes * test validate_redacted_message_span_attributes * (feat) log error class, function_name on prometheus service failure hook + only log DB related failures on DB service hook (#6650) * log error on prometheus service failure hook * use a more accurate function name for wrapper that handles logging db metrics * fix log_db_metrics * test_log_db_metrics_failure_error_types * fix linting * fix auth checks * Update several Azure AI models in model cost map (#6655) * Adding Azure Phi 3/3.5 models to model cost map * Update gpt-4o-mini models * Adding missing Azure Mistral models to model cost map * Adding Azure Llama3.2 models to model cost map * Fix Gemini-1.5-flash pricing * Fix Gemini-1.5-flash output pricing * Fix Gemini-1.5-pro prices * Fix Gemini-1.5-flash output prices * Correct gemini-1.5-pro prices * Correction on Vertex Llama3.2 entry --------- Co-authored-by: Emerson Gomes <emerson.gomes@thalesgroup.com> * fix(streaming_handler.py): fix linting error * test: remove duplicate test causes gemini ratelimit error --------- Co-authored-by: nobuo kawasaki <nobu007@users.noreply.github.com> Co-authored-by: Emerson Gomes <emerson.gomes@gmail.com> Co-authored-by: Emerson Gomes <emerson.gomes@thalesgroup.com> Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>		2024-11-08 19:34:22 +05:30
..
audio_utils	fix import error	2024-09-05 10:09:44 -07:00
llm_cost_calc	LiteLLM Minor Fixes & Improvements (10/09/2024) (#6139 )	2024-10-10 00:42:11 -07:00
llm_response_utils	(fix) litellm.text_completion raises a non-blocking error on simple usage (#6546 )	2024-11-04 15:47:48 -08:00
asyncify.py	build(config.yml): bump anyio version	2024-08-27 07:37:06 -07:00
core_helpers.py	Litellm dev 11 07 2024 (#6649 )	2024-11-08 19:34:22 +05:30
default_encoding.py	Litellm dev 11 07 2024 (#6649 )	2024-11-08 19:34:22 +05:30
exception_mapping_utils.py	LiteLLM Minor Fixes & Improvements (11/04/2024) (#6572 )	2024-11-06 17:53:46 +05:30
get_llm_provider_logic.py	chore: comment for maritalk (#6607 )	2024-11-07 12:20:12 -08:00
get_supported_openai_params.py	LiteLLM Minor Fixes & Improvements (11/04/2024) (#6572 )	2024-11-06 17:53:46 +05:30
json_validation_rule.py	feat(vertex_ai_anthropic.py): support response_schema for vertex ai anthropic calls	2024-07-18 16:57:38 -07:00
litellm_logging.py	(QOL improvement) add unit testing for all static_methods in litellm_logging.py (#6640 )	2024-11-07 16:26:53 -08:00
llm_request_utils.py	Litellm ruff linting enforcement (#5992 )	2024-10-01 19:44:20 -04:00
logging_utils.py	(refactor) use helper function `_assemble_complete_response_from_streaming_chunks` to assemble complete responses in caching and logging callbacks (#6220 )	2024-10-15 12:45:12 +05:30
mock_functions.py	test(router_code_coverage.py): check if all router functions are dire… (#6186 )	2024-10-14 22:44:00 -07:00
README.md	Litellm dev 11 07 2024 (#6649 )	2024-11-08 19:34:22 +05:30
realtime_streaming.py	Litellm dev 10 22 2024 (#6384 )	2024-10-22 21:18:54 -07:00
redact_messages.py	LiteLLM Minor Fixes & Improvements (10/04/2024) (#6064 )	2024-10-04 21:28:53 -04:00
response_header_helpers.py	fix(utils.py): guarantee openai-compatible headers always exist in response	2024-09-28 21:08:15 -07:00
rules.py	Litellm dev 11 07 2024 (#6649 )	2024-11-08 19:34:22 +05:30
streaming_chunk_builder_utils.py	LiteLLM Minor Fixes & Improvements (11/05/2024) (#6590 )	2024-11-07 04:17:05 +05:30
streaming_handler.py	Litellm dev 11 07 2024 (#6649 )	2024-11-08 19:34:22 +05:30
token_counter.py	fix(token_counter.py): New `get_modified_max_tokens' helper func	2024-06-27 15:38:09 -07:00

README.md

Folder Contents

This folder contains general-purpose utilities that are used in multiple places in the codebase.

Core files:

streaming_handler.py: The core streaming logic + streaming related helper utils
core_helpers.py: code used in types/ - e.g. map_finish_reason.
exception_mapping_utils.py: utils for mapping exceptions to openai-compatible error types.
default_encoding.py: code for loading the default encoding (tiktoken)
get_llm_provider_logic.py: code for inferring the LLM provider from a given model name.