Commit graph

1472 commits

Author SHA1 Message Date
Krish Dholakia
c14584722e
Merge pull request #3808 from BerriAI/litellm_databricks_api
feat(databricks.py): adds databricks support - completion, async, streaming
2024-05-23 19:23:19 -07:00
Krrish Dholakia
43353c28b3 feat(databricks.py): add embedding model support 2024-05-23 18:22:03 -07:00
Krrish Dholakia
d2229dcd21 feat(databricks.py): adds databricks support - completion, async, streaming
Closes https://github.com/BerriAI/litellm/issues/2160
2024-05-23 16:29:46 -07:00
Ishaan Jaff
e0af091b5e feat - add prixing for vertex_ai image gen 2024-05-23 16:27:08 -07:00
Krish Dholakia
beb61702c3
Merge branch 'main' into litellm_filter_invalid_params 2024-05-21 20:42:21 -07:00
Krrish Dholakia
988970f4c2 feat(router.py): Fixes https://github.com/BerriAI/litellm/issues/3769 2024-05-21 17:24:51 -07:00
Krrish Dholakia
62e5559e0a fix(utils.py): filter out hf eos token
Closes https://github.com/BerriAI/litellm/issues/3757
2024-05-21 14:31:54 -07:00
alisalim17
01bb26bbba Revert "Revert "Logfire Integration""
This reverts commit b04a8d878a.
2024-05-21 11:07:40 +04:00
Krish Dholakia
c6bb6e325b
Merge pull request #3740 from BerriAI/litellm_return_rejected_response
feat(proxy_server.py): allow admin to return rejected response as string to user
2024-05-20 17:48:21 -07:00
Ishaan Jaff
8413fdf4c7
Merge branch 'main' into litellm_standardize_slack_exception_msg_format 2024-05-20 16:39:41 -07:00
Ishaan Jaff
25fae37efc
Merge pull request #3716 from BerriAI/litellm_set_cooldown_time_based_on_exception_header
[Feat] Router/ Proxy - set cooldown_time based on Azure exception headers
2024-05-20 16:34:12 -07:00
Ishaan Jaff
8263d15ca3 fix - standardize slack alerting format 2024-05-20 16:26:11 -07:00
Ishaan Jaff
91a89eb4ed
Merge pull request #3739 from BerriAI/litellm_add_imagen_support
[FEAT] Async VertexAI Image Generation
2024-05-20 14:14:43 -07:00
Ishaan Jaff
518db13982 add parameter mapping with vertex ai 2024-05-20 13:28:20 -07:00
Krrish Dholakia
b41f30ca60 fix(proxy_server.py): fixes for making rejected responses work with streaming 2024-05-20 12:32:19 -07:00
Ishaan Jaff
655478e8dc fix python3.8 error 2024-05-20 10:55:10 -07:00
Ishaan Jaff
2519879e67 add ImageObject 2024-05-20 10:45:37 -07:00
Krrish Dholakia
372323c38a feat(proxy_server.py): allow admin to return rejected response as string to user
Closes https://github.com/BerriAI/litellm/issues/3671
2024-05-20 10:30:23 -07:00
Krrish Dholakia
12942c39db fix(utils.py): drop response_format if 'drop_params=True' for gpt-4 2024-05-18 13:02:48 -07:00
Krrish Dholakia
1cecdc4690 fix(utils.py): fix replicate completion cost calculation 2024-05-17 22:18:57 -07:00
Ishaan Jaff
f7a1675337 fix - cooldown based on exception header 2024-05-17 18:52:45 -07:00
Krish Dholakia
3a06fe2818
Merge branch 'main' into litellm_bedrock_anthropic_fix 2024-05-17 17:47:32 -07:00
Krrish Dholakia
4682802444 fix(bedrock_httpx.py): raise better timeout exception 2024-05-17 17:16:36 -07:00
Krrish Dholakia
b486bb1f8b fix(utils.py): exception map bedrock error 2024-05-17 16:18:25 -07:00
Krrish Dholakia
c0d62e94ae feat(proxy_server.py): enable custom branding + routes on openapi docs
Allows user to add their branding + show only openai routes on docs
2024-05-17 15:21:29 -07:00
Krrish Dholakia
26aefc7cd6 fix(utils.py): support openrouter function calling 2024-05-17 08:02:24 -07:00
Mikkel Gravgaard
80ef0f86d1
Merge branch 'main' into patch-1 2024-05-17 10:26:14 +02:00
Krrish Dholakia
92c2e2af6a fix(bedrock_httpx.py): add async support for bedrock amazon, meta, mistral models 2024-05-16 22:39:25 -07:00
Krrish Dholakia
0293f7766a fix(bedrock_httpx.py): move bedrock ai21 calls to being async 2024-05-16 22:21:30 -07:00
Krrish Dholakia
180bc46ca4 fix(bedrock_httpx.py): move anthropic bedrock calls to httpx
Fixing https://github.com/BerriAI/litellm/issues/2921
2024-05-16 21:51:55 -07:00
lj
f3d0f003fb
Removed config dict type definition 2024-05-17 10:39:00 +08:00
Ishaan Jaff
4952e244d9
Merge pull request #3705 from BerriAI/litellm_add_cost_tracking_for_ft_models
[FEAT] add cost tracking for Fine Tuned OpenAI `ft:davinci-002` and `ft:babbage-002`
2024-05-16 17:37:35 -07:00
Ishaan Jaff
28ae8284df fix add cost tracking for OpenAI ft models 2024-05-16 17:31:19 -07:00
Krrish Dholakia
709373b15c fix(replicate.py): move replicate calls to being completely async
Closes https://github.com/BerriAI/litellm/issues/3128
2024-05-16 17:24:08 -07:00
Krrish Dholakia
a2a5884df1 fix(utils.py): allow passing in custom pricing to completion_cost as params 2024-05-16 16:24:44 -07:00
Krrish Dholakia
bc23365acc fix(utils.py): update completion_cost docstring 2024-05-16 15:47:40 -07:00
Krrish Dholakia
ce4dffb7cb fix(utils.py): fix logging level of error message 2024-05-16 15:44:08 -07:00
Ishaan Jaff
0a816b2c45
Merge pull request #3682 from BerriAI/litellm_token_counter_endpoint
[Feat] `token_counter` endpoint
2024-05-16 13:39:23 -07:00
Ishaan Jaff
22ba5fa186 feat - try using hf tokenizer 2024-05-16 10:59:29 -07:00
Krrish Dholakia
e273e66618 feat(lago.py): adding support for usage-based billing with lago
Closes https://github.com/BerriAI/litellm/issues/3639
2024-05-16 10:54:18 -07:00
lj
6a60bfbd97
Update model config in utils.py 2024-05-16 16:39:37 +08:00
Ishaan Jaff
881812d5de
Merge pull request #3543 from kmheckel/main
Updated Ollama cost models to include LLaMa3 and Mistral/Mixtral Instruct series
2024-05-15 20:50:50 -07:00
Ishaan Jaff
7aac76b485
Merge pull request #3662 from BerriAI/litellm_feat_predibase_exceptions
[Fix] Mask API Keys from Predibase AuthenticationErrors
2024-05-15 20:45:40 -07:00
Krish Dholakia
25e4b34574
Merge pull request #3660 from BerriAI/litellm_proxy_ui_general_settings
feat(proxy_server.py): Enabling Admin to control general settings on proxy ui
2024-05-15 20:36:42 -07:00
Krrish Dholakia
594ca947c8 fix(parallel_request_limiter.py): fix max parallel request limiter on retries 2024-05-15 20:16:11 -07:00
Ishaan Jaff
765d43d241 fix utils.py 2024-05-15 19:54:52 -07:00
Ishaan Jaff
b88ad61490 feat - predibase exceptions 2024-05-15 16:52:33 -07:00
Ishaan Jaff
59e18f23e0 fix - show litellm_debug_info 2024-05-15 13:07:04 -07:00
Krrish Dholakia
54587db402 fix(alerting.py): fix datetime comparison logic 2024-05-14 22:10:09 -07:00
Krrish Dholakia
a1dd341ca1 fix(utils.py): default claude-3 to tiktoken (0.8s faster than hf tokenizer) 2024-05-14 18:37:14 -07:00