Commit graph

1830 commits

Author SHA1 Message Date
Krrish Dholakia
4536ed6f6e feat(slack_alerting.py): refactor region outage alerting to do model based alerting instead
Unable to extract azure region from api base, makes sense to start with model alerting and then move to region
2024-05-24 19:10:33 -07:00
Krrish Dholakia
7368406c24 fix(slack_alerting.py): support region based outage alerting 2024-05-24 16:59:16 -07:00
Krish Dholakia
0f195d6b94 Merge pull request #3780 from nirga/traceloop-failures
Log errors in Traceloop Integration
2024-05-24 14:23:26 -07:00
Nir Gazit
43c30a4489 fix(traceloop): log errors 2024-05-24 22:05:31 +03:00
Ishaan Jaff
2b85d0faf9 feat - include litellm exception type when raising exception 2024-05-24 10:45:37 -07:00
Krish Dholakia
baa53d94f0 Merge pull request #3812 from afbarbaro/main
Fix issue with delta being None when Deferred / Async Content Filter is enabled on Azure OpenAI
2024-05-24 10:05:08 -07:00
Andres Barbaro
8dd4838d96 Fix issue with delta being None when Deferred / Async Content Filter is enabled on Azure OpenAI 2024-05-23 22:42:42 -05:00
Krrish Dholakia
c50074a0b7 feat(ui/model_dashboard.tsx): add databricks models via admin ui 2024-05-23 20:28:54 -07:00
Krish Dholakia
edb349a9ab Merge pull request #3808 from BerriAI/litellm_databricks_api
feat(databricks.py): adds databricks support - completion, async, streaming
2024-05-23 19:23:19 -07:00
Krrish Dholakia
e3c5e004c5 feat(databricks.py): add embedding model support 2024-05-23 18:22:03 -07:00
Krrish Dholakia
143a44823a feat(databricks.py): adds databricks support - completion, async, streaming
Closes https://github.com/BerriAI/litellm/issues/2160
2024-05-23 16:29:46 -07:00
Ishaan Jaff
bff4227f6a feat - add pricing for vertex_ai image gen 2024-05-23 16:27:08 -07:00
Krish Dholakia
10e1b43751 Merge branch 'main' into litellm_filter_invalid_params 2024-05-21 20:42:21 -07:00
Krrish Dholakia
c989b92801 feat(router.py): Fixes https://github.com/BerriAI/litellm/issues/3769 2024-05-21 17:24:51 -07:00
Krrish Dholakia
413be6d805 fix(utils.py): filter out hf eos token
Closes https://github.com/BerriAI/litellm/issues/3757
2024-05-21 14:31:54 -07:00
alisalim17
fe0e600062 Revert "Revert "Logfire Integration""
This reverts commit b04a8d878a.
2024-05-21 11:07:40 +04:00
Krish Dholakia
db77e41833 Merge pull request #3740 from BerriAI/litellm_return_rejected_response
feat(proxy_server.py): allow admin to return rejected response as string to user
2024-05-20 17:48:21 -07:00
Ishaan Jaff
7e6c9274fc Merge branch 'main' into litellm_standardize_slack_exception_msg_format 2024-05-20 16:39:41 -07:00
Ishaan Jaff
dc55a57d8a Merge pull request #3716 from BerriAI/litellm_set_cooldown_time_based_on_exception_header
[Feat] Router/ Proxy - set cooldown_time based on Azure exception headers
2024-05-20 16:34:12 -07:00
Ishaan Jaff
233828e16f fix - standardize slack alerting format 2024-05-20 16:26:11 -07:00
Ishaan Jaff
28d1bde250 Merge pull request #3739 from BerriAI/litellm_add_imagen_support
[FEAT] Async VertexAI Image Generation
2024-05-20 14:14:43 -07:00
Ishaan Jaff
883a9eb69a add parameter mapping with vertex ai 2024-05-20 13:28:20 -07:00
Krrish Dholakia
d4d4550bb6 fix(proxy_server.py): fixes for making rejected responses work with streaming 2024-05-20 12:32:19 -07:00
Ishaan Jaff
8ea3482676 fix python3.8 error 2024-05-20 10:55:10 -07:00
Ishaan Jaff
76a1444621 add ImageObject 2024-05-20 10:45:37 -07:00
Krrish Dholakia
45fedb83c6 feat(proxy_server.py): allow admin to return rejected response as string to user
Closes https://github.com/BerriAI/litellm/issues/3671
2024-05-20 10:30:23 -07:00
Krrish Dholakia
7da925cada fix(utils.py): drop response_format if 'drop_params=True' for gpt-4 2024-05-18 13:02:48 -07:00
Krrish Dholakia
ab747c8fe9 fix(utils.py): fix replicate completion cost calculation 2024-05-17 22:18:57 -07:00
Ishaan Jaff
cdfa9c9232 fix - cooldown based on exception header 2024-05-17 18:52:45 -07:00
Krish Dholakia
60615f46c4 Merge branch 'main' into litellm_bedrock_anthropic_fix 2024-05-17 17:47:32 -07:00
Krrish Dholakia
b20f4f65b4 fix(bedrock_httpx.py): raise better timeout exception 2024-05-17 17:16:36 -07:00
Krrish Dholakia
54b4e24427 fix(utils.py): exception map bedrock error 2024-05-17 16:18:25 -07:00
Krrish Dholakia
9ab2389b7e feat(proxy_server.py): enable custom branding + routes on openapi docs
Allows user to add their branding + show only openai routes on docs
2024-05-17 15:21:29 -07:00
Krrish Dholakia
963b207473 fix(utils.py): support openrouter function calling 2024-05-17 08:02:24 -07:00
Mikkel Gravgaard
13b7dba006 Merge branch 'main' into patch-1 2024-05-17 10:26:14 +02:00
Krrish Dholakia
13e4196e3e fix(bedrock_httpx.py): add async support for bedrock amazon, meta, mistral models 2024-05-16 22:39:25 -07:00
Krrish Dholakia
8409b39f0d fix(bedrock_httpx.py): move bedrock ai21 calls to being async 2024-05-16 22:21:30 -07:00
Krrish Dholakia
118fc4ffac fix(bedrock_httpx.py): move anthropic bedrock calls to httpx
Fixing https://github.com/BerriAI/litellm/issues/2921
2024-05-16 21:51:55 -07:00
lj
eee9be353a Removed config dict type definition 2024-05-17 10:39:00 +08:00
Ishaan Jaff
b722dfd0ce Merge pull request #3705 from BerriAI/litellm_add_cost_tracking_for_ft_models
[FEAT] add cost tracking for Fine Tuned OpenAI `ft:davinci-002` and `ft:babbage-002`
2024-05-16 17:37:35 -07:00
Ishaan Jaff
b9cda67bb8 fix add cost tracking for OpenAI ft models 2024-05-16 17:31:19 -07:00
Krrish Dholakia
e41897808d fix(replicate.py): move replicate calls to being completely async
Closes https://github.com/BerriAI/litellm/issues/3128
2024-05-16 17:24:08 -07:00
Krrish Dholakia
782b44818c fix(utils.py): allow passing in custom pricing to completion_cost as params 2024-05-16 16:24:44 -07:00
Krrish Dholakia
dc00b4a7ed fix(utils.py): update completion_cost docstring 2024-05-16 15:47:40 -07:00
Krrish Dholakia
1f99fa7411 fix(utils.py): fix logging level of error message 2024-05-16 15:44:08 -07:00
Ishaan Jaff
3a52073b48 Merge pull request #3682 from BerriAI/litellm_token_counter_endpoint
[Feat] `token_counter` endpoint
2024-05-16 13:39:23 -07:00
Ishaan Jaff
a3763c8608 feat - try using hf tokenizer 2024-05-16 10:59:29 -07:00
Krrish Dholakia
d167a9ea99 feat(lago.py): adding support for usage-based billing with lago
Closes https://github.com/BerriAI/litellm/issues/3639
2024-05-16 10:54:18 -07:00
lj
3620c6fc1a Update model config in utils.py 2024-05-16 16:39:37 +08:00
Ishaan Jaff
c6e91daad7 Merge pull request #3543 from kmheckel/main
Updated Ollama cost models to include LLaMa3 and Mistral/Mixtral Instruct series
2024-05-15 20:50:50 -07:00