litellm-mirror/tests/litellm/proxy
Krish Dholakia 36308a31be
Gemini-2.5-flash - support reasoning cost calc + return reasoning content (#10141)
* build(model_prices_and_context_window.json): add vertex ai gemini-2.5-flash pricing

* build(model_prices_and_context_window.json): add gemini reasoning token pricing

* fix(vertex_and_google_ai_studio_gemini.py): support counting thinking tokens for gemini

allows accurate cost calc

* fix(utils.py): add reasoning token cost calc to generic cost calc

ensures gemini-2.5-flash cost calculation is accurate

* build(model_prices_and_context_window.json): mark gemini-2.5-flash as 'supports_reasoning'

* feat(gemini/): support 'thinking' + 'reasoning_effort' params + new unit tests

allow controlling thinking effort for gemini-2.5-flash models

* test: update unit testing

* feat(vertex_and_google_ai_studio_gemini.py): return reasoning content if given in gemini response

* test: update model name

* fix: fix ruff check

* test(test_spend_management_endpoints.py): update tests to be less sensitive to new keys / updates to usage object

* fix(vertex_and_google_ai_studio_gemini.py): fix translation
2025-04-19 09:20:52 -07:00
auth Litellm UI qa 04 12 2025 p1 (#9955) 2025-04-12 19:30:48 -07:00
common_utils fix(proxy_server.py): Fix "Circular reference detected" error when max_parallel_requests = 0 (#9671) 2025-03-31 22:06:02 -07:00
db [Feat - Cost Tracking improvement] Track prompt caching metrics in DailyUserSpendTransactions (#10029) 2025-04-15 21:40:57 -07:00
experimental/mcp_server test tool registry 2025-03-20 22:03:56 -07:00
health_endpoints test_db_health_readiness_check_with_prisma_error 2025-03-26 20:56:39 -07:00
hooks Litellm UI qa 04 12 2025 p1 (#9955) 2025-04-12 19:30:48 -07:00
management_endpoints test: update testing imports 2025-04-19 09:13:16 -07:00
middleware test_no_auth_metrics_when_disabled 2025-04-04 21:02:29 -07:00
openai_files_endpoint Litellm add managed files db (#9930) 2025-04-12 08:24:46 -07:00
pass_through_endpoints Add new /vertex_ai/discovery route - enables calling AgentBuilder API routes (#10084) 2025-04-16 21:45:51 -07:00
spend_tracking Gemini-2.5-flash - support reasoning cost calc + return reasoning content (#10141) 2025-04-19 09:20:52 -07:00
types_utils test fix post call rules (#9826) 2025-04-08 13:55:37 -07:00
ui_crud_endpoints [Feat - UI] - Allow setting Default Team setting when LiteLLM SSO auto creates teams (#9918) 2025-04-11 14:07:10 -07:00
test_caching_routes.py (bug fix) - Fix Cache Health Check for Redis when redis_version is float (#8979) 2025-03-04 21:26:18 -08:00
test_common_request_processing.py Add litellm call id passing to Aim guardrails on pre and post-hooks calls (#10021) 2025-04-16 07:41:28 -07:00
test_litellm_pre_call_utils.py test_get_enforced_params 2025-04-01 08:41:53 -07:00
test_proxy_cli.py (Bug fix) - running litellm proxy on Windows (#8735) 2025-02-25 15:19:19 -08:00
test_proxy_server.py [Security fix - CVE-2025-0330] - Leakage of Langfuse API keys in team exception handling (#9830) 2025-04-08 13:55:20 -07:00
test_route_llm_request.py fix(route_llm_request.py): move to using common router, even for clie… (#8966) 2025-03-03 22:57:08 -08:00