Commit graph

1512 commits

Author SHA1 Message Date
Ishaan Jaff
25fae37efc
Merge pull request #3716 from BerriAI/litellm_set_cooldown_time_based_on_exception_header
[Feat] Router/ Proxy - set cooldown_time based on Azure exception headers
2024-05-20 16:34:12 -07:00
Ishaan Jaff
8263d15ca3 fix - standardize slack alerting format 2024-05-20 16:26:11 -07:00
Ishaan Jaff
91a89eb4ed
Merge pull request #3739 from BerriAI/litellm_add_imagen_support
[FEAT] Async VertexAI Image Generation
2024-05-20 14:14:43 -07:00
Ishaan Jaff
518db13982 add parameter mapping with vertex ai 2024-05-20 13:28:20 -07:00
Krrish Dholakia
b41f30ca60 fix(proxy_server.py): fixes for making rejected responses work with streaming 2024-05-20 12:32:19 -07:00
Ishaan Jaff
655478e8dc fix python3.8 error 2024-05-20 10:55:10 -07:00
Ishaan Jaff
2519879e67 add ImageObject 2024-05-20 10:45:37 -07:00
Krrish Dholakia
372323c38a feat(proxy_server.py): allow admin to return rejected response as string to user
Closes https://github.com/BerriAI/litellm/issues/3671
2024-05-20 10:30:23 -07:00
Krrish Dholakia
12942c39db fix(utils.py): drop response_format if 'drop_params=True' for gpt-4 2024-05-18 13:02:48 -07:00
Krrish Dholakia
1cecdc4690 fix(utils.py): fix replicate completion cost calculation 2024-05-17 22:18:57 -07:00
Ishaan Jaff
f7a1675337 fix - cooldown based on exception header 2024-05-17 18:52:45 -07:00
Krish Dholakia
3a06fe2818
Merge branch 'main' into litellm_bedrock_anthropic_fix 2024-05-17 17:47:32 -07:00
Krrish Dholakia
4682802444 fix(bedrock_httpx.py): raise better timeout exception 2024-05-17 17:16:36 -07:00
Krrish Dholakia
b486bb1f8b fix(utils.py): exception map bedrock error 2024-05-17 16:18:25 -07:00
Krrish Dholakia
c0d62e94ae feat(proxy_server.py): enable custom branding + routes on openapi docs
Allows user to add their branding + show only openai routes on docs
2024-05-17 15:21:29 -07:00
Krrish Dholakia
26aefc7cd6 fix(utils.py): support openrouter function calling 2024-05-17 08:02:24 -07:00
Mikkel Gravgaard
80ef0f86d1
Merge branch 'main' into patch-1 2024-05-17 10:26:14 +02:00
Krrish Dholakia
92c2e2af6a fix(bedrock_httpx.py): add async support for bedrock amazon, meta, mistral models 2024-05-16 22:39:25 -07:00
Krrish Dholakia
0293f7766a fix(bedrock_httpx.py): move bedrock ai21 calls to being async 2024-05-16 22:21:30 -07:00
Krrish Dholakia
180bc46ca4 fix(bedrock_httpx.py): move anthropic bedrock calls to httpx
Fixing https://github.com/BerriAI/litellm/issues/2921
2024-05-16 21:51:55 -07:00
lj
f3d0f003fb
Removed config dict type definition 2024-05-17 10:39:00 +08:00
Ishaan Jaff
4952e244d9
Merge pull request #3705 from BerriAI/litellm_add_cost_tracking_for_ft_models
[FEAT] add cost tracking for Fine Tuned OpenAI `ft:davinci-002` and `ft:babbage-002`
2024-05-16 17:37:35 -07:00
Ishaan Jaff
28ae8284df fix add cost tracking for OpenAI ft models 2024-05-16 17:31:19 -07:00
Krrish Dholakia
709373b15c fix(replicate.py): move replicate calls to being completely async
Closes https://github.com/BerriAI/litellm/issues/3128
2024-05-16 17:24:08 -07:00
Krrish Dholakia
a2a5884df1 fix(utils.py): allow passing in custom pricing to completion_cost as params 2024-05-16 16:24:44 -07:00
Krrish Dholakia
bc23365acc fix(utils.py): update completion_cost docstring 2024-05-16 15:47:40 -07:00
Krrish Dholakia
ce4dffb7cb fix(utils.py): fix logging level of error message 2024-05-16 15:44:08 -07:00
Ishaan Jaff
0a816b2c45
Merge pull request #3682 from BerriAI/litellm_token_counter_endpoint
[Feat] `token_counter` endpoint
2024-05-16 13:39:23 -07:00
Ishaan Jaff
22ba5fa186 feat - try using hf tokenizer 2024-05-16 10:59:29 -07:00
Krrish Dholakia
e273e66618 feat(lago.py): adding support for usage-based billing with lago
Closes https://github.com/BerriAI/litellm/issues/3639
2024-05-16 10:54:18 -07:00
lj
6a60bfbd97
Update model config in utils.py 2024-05-16 16:39:37 +08:00
Ishaan Jaff
881812d5de
Merge pull request #3543 from kmheckel/main
Updated Ollama cost models to include LLaMa3 and Mistral/Mixtral Instruct series
2024-05-15 20:50:50 -07:00
Ishaan Jaff
7aac76b485
Merge pull request #3662 from BerriAI/litellm_feat_predibase_exceptions
[Fix] Mask API Keys from Predibase AuthenticationErrors
2024-05-15 20:45:40 -07:00
Krish Dholakia
25e4b34574
Merge pull request #3660 from BerriAI/litellm_proxy_ui_general_settings
feat(proxy_server.py): Enabling Admin to control general settings on proxy ui
2024-05-15 20:36:42 -07:00
Krrish Dholakia
594ca947c8 fix(parallel_request_limiter.py): fix max parallel request limiter on retries 2024-05-15 20:16:11 -07:00
Ishaan Jaff
765d43d241 fix utils.py 2024-05-15 19:54:52 -07:00
Ishaan Jaff
b88ad61490 feat - predibase exceptions 2024-05-15 16:52:33 -07:00
Ishaan Jaff
59e18f23e0 fix - show litellm_debug_info 2024-05-15 13:07:04 -07:00
Krrish Dholakia
54587db402 fix(alerting.py): fix datetime comparison logic 2024-05-14 22:10:09 -07:00
Krrish Dholakia
a1dd341ca1 fix(utils.py): default claude-3 to tiktoken (0.8s faster than hf tokenizer) 2024-05-14 18:37:14 -07:00
Krish Dholakia
b04a8d878a
Revert "Logfire Integration" 2024-05-14 17:38:47 -07:00
Krrish Dholakia
dd0b4b8644 fix(utils.py): fix pydantic v1 error 2024-05-14 17:17:20 -07:00
Krrish Dholakia
2b41f09268 fix(utils.py): add lru-cache logic to _select_tokenizer
speed up tokenizer load times
2024-05-14 16:39:50 -07:00
alisalim17
765c382b2a Merge remote-tracking branch 'upstream/main' 2024-05-14 22:32:57 +04:00
Krrish Dholakia
f41b2a09a4 fix(utils.py): fix python 3.8 linting error 2024-05-14 11:25:36 -07:00
alisalim17
ed7c9e4e87 refactor: logging class to use continue instead of break for streaming logging 2024-05-14 21:21:21 +04:00
alisalim17
68637997c7 chore: fix typo 2024-05-14 21:18:00 +04:00
alisalim17
18bf68298f Merge remote-tracking branch 'upstream/main' 2024-05-14 18:42:20 +04:00
Krish Dholakia
2c867ea9a5
Merge pull request #3600 from msabramo/msabramo/fix-pydantic-warnings
Update pydantic code to fix warnings
2024-05-13 22:00:39 -07:00
Krish Dholakia
5b390b6512
Merge pull request #3602 from msabramo/msabramo/fix_pkg_resources_warning
Fix `pkg_resources` warning
2024-05-13 21:59:52 -07:00