litellm-mirror/litellm
Krish Dholakia e33543ae4f Litellm dev 02 13 2025 p2 (#8525)
* fix(azure/chat/gpt_transformation.py): add 'prediction' as a supported azure param

Closes https://github.com/BerriAI/litellm/issues/8500

* build(model_prices_and_context_window.json): add new 'gemini-2.0-pro-exp-02-05' model

* style: clean up invalid json trailing comma

* feat(utils.py): support passing 'tokenizer_config' to register_prompt_template

enables passing complete tokenizer config of model to litellm

Allows calling deepseek on bedrock with the correct prompt template

* fix(utils.py): fix register_prompt_template for custom model names

* test(test_prompt_factory.py): fix test

* test(test_completion.py): add e2e test for bedrock invoke deepseek ft model

* feat(base_invoke_transformation.py): support hf_model_name param for bedrock invoke calls

enables proxy admin to set base model for ft bedrock deepseek model

* feat(bedrock/invoke): support deepseek_r1 route for bedrock

makes it easy to apply the right chat template to that call

* feat(constants.py): store deepseek r1 chat template - allow user to get correct response from deepseek r1 without extra work

* test(test_completion.py): add e2e mock test for bedrock deepseek

* docs(bedrock.md): document new deepseek_r1 route for bedrock

allows us to use the right config

* fix(exception_mapping_utils.py): catch read operation timeout
2025-02-13 20:28:42 -08:00
adapters (code quality) run ruff rule to ban unused imports (#7313) 2024-12-19 12:33:42 -08:00
assistants Revert "fix: add missing parameters order, limit, before, and after in get_as…" (#7542) 2025-01-03 16:32:12 -08:00
batch_completion (code quality) run ruff rule to ban unused imports (#7313) 2024-12-19 12:33:42 -08:00
batches (Feat - Batches API) add support for retrieving vertex api batch jobs (#7661) 2025-01-09 18:35:03 -08:00
caching (Redis Cluster) - Fixes for using redis cluster + pipeline (#8442) 2025-02-12 18:01:32 -08:00
files (code quality) run ruff rule to ban unused imports (#7313) 2024-12-19 12:33:42 -08:00
fine_tuning (feat) POST /fine_tuning/jobs support passing vertex specific hyper params (#7490) 2025-01-01 07:44:48 -08:00
integrations Added custom_attributes to additional_keys which can be sent to athina (#8518) 2025-02-13 13:19:24 -08:00
litellm_core_utils Litellm dev 02 13 2025 p2 (#8525) 2025-02-13 20:28:42 -08:00
llms Litellm dev 02 13 2025 p2 (#8525) 2025-02-13 20:28:42 -08:00
proxy fix(utils.py): fix vertex ai optional param handling (#8477) 2025-02-13 19:58:50 -08:00
realtime_api (Refactor) - Re use litellm.completion/litellm.embedding etc for health checks (#7455) 2024-12-28 18:38:54 -08:00
rerank_api (feat) /batches - track user_api_key_alias, user_api_key_team_alias etc for /batch requests (#7401) 2024-12-24 17:44:28 -08:00
router_strategy Litellm dev 01 30 2025 p2 (#8134) 2025-01-30 22:18:53 -08:00
router_utils Add attempted-retries and timeout values to response headers + more testing (#7926) 2025-01-22 22:19:44 -08:00
secret_managers fix: add default credential for azure (#7095) (#7891) 2025-01-21 09:01:49 -08:00
types Improved wildcard route handling on /models and /model_group/info (#8473) 2025-02-11 19:37:43 -08:00
__init__.py Litellm dev 02 13 2025 p2 (#8525) 2025-02-13 20:28:42 -08:00
_logging.py (sdk perf fix) - only print args passed to litellm when debugging mode is on (#7708) 2025-01-11 22:56:20 -08:00
_redis.py (Redis Cluster) - Fixes for using redis cluster + pipeline (#8442) 2025-02-12 18:01:32 -08:00
_service_logger.py fix svc logger (#7727) 2025-01-12 22:00:25 -08:00
_version.py Litellm ruff linting enforcement (#5992) 2024-10-01 19:44:20 -04:00
budget_manager.py (code quality) run ruff rule to ban unused imports (#7313) 2024-12-19 12:33:42 -08:00
constants.py Litellm dev 02 13 2025 p2 (#8525) 2025-02-13 20:28:42 -08:00
cost.json
cost_calculator.py Fix custom pricing - separate provider info from model info (#7990) 2025-01-25 21:49:28 -08:00
exceptions.py LiteLLM Minor Fixes & Improvements (12/27/2024) - p1 (#7448) 2024-12-27 19:04:39 -08:00
main.py Litellm dev 02 13 2025 p2 (#8525) 2025-02-13 20:28:42 -08:00
model_prices_and_context_window_backup.json Litellm dev 02 13 2025 p2 (#8525) 2025-02-13 20:28:42 -08:00
py.typed feature - Types for mypy - #360 2024-05-30 14:14:41 -04:00
router.py fix(router.py): add more deployment timeout debug information for tim… (#8523) 2025-02-13 17:10:22 -08:00
scheduler.py (refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208) 2024-10-14 16:34:01 +05:30
timeout.py Litellm ruff linting enforcement (#5992) 2024-10-01 19:44:20 -04:00
utils.py Litellm dev 02 13 2025 p2 (#8525) 2025-02-13 20:28:42 -08:00