Commit graph

16000 commits

Author SHA1 Message Date
Ishaan Jaff
4c375d13d6 fix test_drop_params_parallel_tool_calls 2024-08-07 18:16:41 -07:00
Krrish Dholakia
400653992c feat(router.py): allow using .acompletion() for request prioritization
allows /chat/completion endpoint to work for request prioritization calls
2024-08-07 16:43:12 -07:00
Ishaan Jaff
e585dfba92 docs prom 2024-08-07 16:03:11 -07:00
Krish Dholakia
b6df998ac3 Merge pull request #5100 from sammcj/patch-1
Add deepseek-coder-v2(-lite), mistral-large, codegeex4 to ollama
2024-08-07 15:56:02 -07:00
Sam
8459046585 Add deepseek-coder-v2(-lite), mistral-large, codegeex4 to ollama 2024-08-08 10:51:59 +12:00
Ishaan Jaff
89050dc4f4 ci/cd run again 2024-08-07 15:44:54 -07:00
Ishaan Jaff
4543dbb2e2 fix installing openai on ci/cd 2024-08-07 15:44:36 -07:00
Ishaan Jaff
2c3e068435 fix test_team_update_redis 2024-08-07 15:37:02 -07:00
Ishaan Jaff
be0984009b bump: version 1.43.1 → 1.43.2 2024-08-07 15:26:55 -07:00
Ishaan Jaff
aab91a7314 fix - someone resolved a merge conflict badly 2024-08-07 15:23:15 -07:00
Ishaan Jaff
9cd437135b fix getting provider_specific_deployment 2024-08-07 15:20:59 -07:00
Ishaan Jaff
04b201efed Merge pull request #5098 from BerriAI/litellm_provider_wildcard_routing
[Feat-Router + Proxy] Add provider wildcard routing
2024-08-07 14:51:42 -07:00
Ishaan Jaff
a367f97eb2 docs provider specific wildcard routing 2024-08-07 14:49:45 -07:00
Krish Dholakia
2e434d56e3 Merge pull request #5079 from BerriAI/litellm_add_pydantic_model_support
feat(utils.py): support passing response_format as pydantic model
2024-08-07 14:43:05 -07:00
Krish Dholakia
1bf1185506 Merge pull request #5095 from Mogith-P-N/main
Clarifai : Removed model name casing issue
2024-08-07 14:42:26 -07:00
Ishaan Jaff
f1ffa82062 fix use provider specific routing 2024-08-07 14:37:20 -07:00
Krish Dholakia
4640e925a1 Merge pull request #5091 from BerriAI/litellm_anthropic_streaming_tool_call_fix
fix(anthropic.py): handle anthropic returning empty argument string (invalid json str) for tool call while streaming
2024-08-07 14:33:42 -07:00
Krish Dholakia
93d048b1dc Merge branch 'main' into litellm_anthropic_streaming_tool_call_fix 2024-08-07 14:33:30 -07:00
Ishaan Jaff
218ba0f470 support provider wildcard routing 2024-08-07 14:20:22 -07:00
Ishaan Jaff
d42949cb4a test_router_provider_wildcard_routing 2024-08-07 14:12:40 -07:00
Ishaan Jaff
5d7a1b2ec6 router use provider specific wildcard routing 2024-08-07 14:12:10 -07:00
Ishaan Jaff
3249e295cb test provider wildcard routing 2024-08-07 13:52:00 -07:00
Ishaan Jaff
18305b23f4 add + test provider specific routing 2024-08-07 13:49:46 -07:00
Krrish Dholakia
55ad5bb7c6 fix(config.yml): fix build and test 2024-08-07 13:23:04 -07:00
Krrish Dholakia
c0ef2e9dd0 fix(main.py): fix linting error for python3.8 2024-08-07 13:21:35 -07:00
Krrish Dholakia
37dc359efb fix(utils.py): fix linting error for python3.8 2024-08-07 13:14:29 -07:00
Ishaan Jaff
0b98959e6d gemini test skip internal server error 2024-08-07 13:12:45 -07:00
Krrish Dholakia
a2e792d32e test: update build requirements 2024-08-07 13:09:49 -07:00
Krish Dholakia
3605e873a1 Merge branch 'main' into litellm_add_pydantic_model_support 2024-08-07 13:07:46 -07:00
Krrish Dholakia
f0f900d69e fix(router.py): add reason for fallback failure to client-side exception string
make it easier to debug why a fallback failed to occur
2024-08-07 13:02:47 -07:00
Ishaan Jaff
1bf36cd7a4 docs prom metrics 2024-08-07 12:50:03 -07:00
Ishaan Jaff
61ccd5354b docs prometheus 2024-08-07 12:47:06 -07:00
Ishaan Jaff
958e0fdfab show warning about prometheus moving to enterprise 2024-08-07 12:46:26 -07:00
Ishaan Jaff
80acc7ecfe Merge pull request #5088 from BerriAI/litellm_init_prom_svc_logger
[Fix] Init Prometheus Service Logger when it's None
2024-08-07 12:40:08 -07:00
Ishaan Jaff
3f9d37c596 Merge pull request #5092 from BerriAI/litellm_prom_metrics_llms
[Feat] - Prometheus Metrics to monitor a model health
2024-08-07 12:39:38 -07:00
Ishaan Jaff
72aebe5e59 docs link to enterprise pricing 2024-08-07 12:10:47 -07:00
Ishaan Jaff
8d1f051d8c docs prometheus 2024-08-07 11:37:05 -07:00
Ishaan Jaff
579f09b334 fix logging cool down deployment 2024-08-07 11:27:05 -07:00
Krrish Dholakia
ff386f6b60 fix(utils.py): support deepseek tool calling
Fixes https://github.com/BerriAI/litellm/issues/5081
2024-08-07 11:14:05 -07:00
Krrish Dholakia
2f402b1cdd fix(vertex_ai_partner.py): default vertex ai llama3.1 api to use all openai params
Poor vertex docs - not clear what can/can't work

Fixes https://github.com/BerriAI/litellm/issues/5090
2024-08-07 11:08:54 -07:00
Ishaan Jaff
d1e519afd1 use router_cooldown_handler 2024-08-07 10:40:55 -07:00
Ishaan Jaff
99ecde7744 allow setting outage metrics 2024-08-07 10:36:18 -07:00
Krrish Dholakia
3646e3e3a4 test(test_completion.py): handle internal server error in test 2024-08-07 10:21:37 -07:00
Krrish Dholakia
3cf9148a4a test: add vertex claude to streaming valid json str test 2024-08-07 10:18:17 -07:00
Mogith P N
e66aa1487d Clarifai : Fixed model name 2024-08-07 16:59:33 +00:00
Ishaan Jaff
9a8fa9f047 emit deployment_partial_outage on prometheus 2024-08-07 09:56:01 -07:00
Krrish Dholakia
2ccb5a48b7 fix(bedrock_httpx.py): handle empty arguments returned during tool calling streaming 2024-08-07 09:54:50 -07:00
Ishaan Jaff
0304513b67 rename to set_llm_deployment_success_metrics 2024-08-07 09:46:08 -07:00
Ishaan Jaff
13d1caf94a add set_remaining_tokens_requests_metric 2024-08-07 09:43:35 -07:00
Krrish Dholakia
4919cc4d25 fix(anthropic.py): handle scenario where anthropic returns invalid json string for tool call while streaming
Fixes https://github.com/BerriAI/litellm/issues/5063
2024-08-07 09:24:11 -07:00