litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-26 11:14:04 +00:00

Author	SHA1	Message	Date
Ishaan Jaff	69ea7d57fb	feat - show openai params on model hub ui	2024-05-27 08:49:51 -07:00
Krrish Dholakia	8e9a3fef81	feat(proxy_server.py): expose new `/model_group/info` endpoint returns model-group level info on supported params, max tokens, pricing, etc.	2024-05-26 14:07:35 -07:00
Krrish Dholakia	cd34d00d80	fix(router.py): fix pre call check only check if response_format supported by model, if pre-call check enabled	2024-05-24 20:09:15 -07:00
Krrish Dholakia	4536ed6f6e	feat(slack_alerting.py): refactor region outage alerting to do model based alerting instead. Unable to extract azure region from api base, makes sense to start with model alerting and then move to region	2024-05-24 19:10:33 -07:00
Ishaan Jaff	84f8ead4a1	fix test_filter_invalid_params_pre_call_check	2024-05-23 21:16:32 -07:00
Krrish Dholakia	c50074a0b7	feat(ui/model_dashboard.tsx): add databricks models via admin ui	2024-05-23 20:28:54 -07:00
Krrish Dholakia	c989b92801	feat(router.py): Fixes https://github.com/BerriAI/litellm/issues/3769	2024-05-21 17:24:51 -07:00
Krish Dholakia	c0e43a7296	Merge pull request #3412 from sumanth13131/usage-based-routing-ttl-on-cache usage-based-routing-ttl-on-cache	2024-05-21 07:58:41 -07:00
Ishaan Jaff	ef9372ce00	fix add doc string for abatch_completion_one_model_multiple_requests	2024-05-20 17:51:08 -07:00
Ishaan Jaff	13c787f9b5	feat - add abatch_completion_one_model_multiple_requests	2024-05-20 17:47:25 -07:00
Ishaan Jaff	7e6c9274fc	Merge branch 'main' into litellm_standardize_slack_exception_msg_format	2024-05-20 16:39:41 -07:00
Ishaan Jaff	2ccef68c2d	fix - standardize format of exceptions occurring on slack alerts	2024-05-20 16:29:16 -07:00
Ishaan Jaff	3c4bf52509	feat - read cooldown time from exception header	2024-05-17 18:50:33 -07:00
David Manouchehri	61ef93a14a	Fix(router.py): Kill a bug that forced Azure OpenAI to have an API key, even though we can use OIDC instead.	2024-05-17 00:37:56 +00:00
Ishaan Jaff	5ba4f5b4f1	feat - include model name in cool down alerts	2024-05-16 12:52:15 -07:00
Ishaan Jaff	48c92b1612	fix - router show better client side errors	2024-05-16 09:01:27 -07:00
Krrish Dholakia	7d71e41992	fix(router.py): fix validation error for default fallback	2024-05-15 13:23:00 -07:00
Krrish Dholakia	5c33145ee6	fix(router.py): add validation for how router fallbacks are setup prevent user errors	2024-05-15 10:44:16 -07:00
Ishaan Jaff	2d08d766ed	feat - router use _is_cooldown_required	2024-05-15 10:03:55 -07:00
Ishaan Jaff	543909a200	feat - don't cooldown deployment on BadRequestError	2024-05-15 09:03:27 -07:00
Krrish Dholakia	cb758fbfad	fix(router.py): error string fix	2024-05-14 11:20:57 -07:00
Krrish Dholakia	b054f39bab	fix(init.py): set 'default_fallbacks' as a litellm_setting	2024-05-14 11:15:53 -07:00
sumanth	4bbd9c866c	addressed comments	2024-05-14 10:05:19 +05:30
Krrish Dholakia	55b62f3334	fix(router.py): fix typing	2024-05-13 18:06:10 -07:00
Krrish Dholakia	6f20389bd5	feat(router.py): enable default fallbacks allow user to define a generic list of fallbacks, in case a new deployment is bad Closes https://github.com/BerriAI/litellm/issues/3623	2024-05-13 17:49:56 -07:00
Krrish Dholakia	044177d5ff	fix(router.py): overloads fix	2024-05-13 17:04:04 -07:00
Krrish Dholakia	684e4e8c89	fix(router.py): overloads for better router.acompletion typing	2024-05-13 14:27:16 -07:00
Krrish Dholakia	f162835937	fix(router.py): give an 'info' log when fallbacks work successfully	2024-05-13 10:17:32 -07:00
Krrish Dholakia	56b6efae50	fix(slack_alerting.py): don't fire spam alerts when backend api call fails	2024-05-13 10:04:43 -07:00
Krrish Dholakia	8575cdf562	fix(router.py): fix error message to return if pre-call-checks + allowed model region	2024-05-13 09:04:38 -07:00
Krish Dholakia	784ae85ba0	Merge branch 'main' into litellm_bedrock_command_r_support	2024-05-11 21:24:42 -07:00
Ishaan Jaff	f862539282	fix get healthy deployments	2024-05-11 19:46:35 -07:00
Ishaan Jaff	8f2e61dccc	fix - test router fallbacks	2024-05-11 19:13:22 -07:00
Ishaan Jaff	ffdf68d7e8	fix - _time_to_sleep_before_retry	2024-05-11 19:08:10 -07:00
Ishaan Jaff	a3b4074c22	unify sync and async logic for retries	2024-05-11 18:17:04 -07:00
Ishaan Jaff	04bb2bf9f2	fix _time_to_sleep_before_retry	2024-05-11 18:05:12 -07:00
Ishaan Jaff	c57ddf0537	fix _time_to_sleep_before_retry logic	2024-05-11 18:00:02 -07:00
Ishaan Jaff	9ca793fffd	router - clean up should_retry_this_error	2024-05-11 17:30:21 -07:00
Ishaan Jaff	6a967b3267	retry logic on router	2024-05-11 17:04:19 -07:00
Krrish Dholakia	bd0c3a81cb	fix(bedrock_httpx.py): working async bedrock command r calls	2024-05-11 16:45:20 -07:00
Ishaan Jaff	b71f35de72	Merge pull request #3585 from BerriAI/litellm_router_batch_comp [Litellm Proxy + litellm.Router] - Pass the same message/prompt to N models	2024-05-11 13:51:45 -07:00
Ishaan Jaff	6704b32e44	feat - router async batch acompletion	2024-05-11 13:08:16 -07:00
Krish Dholakia	7f64c61275	Merge pull request #3582 from BerriAI/litellm_explicit_region_name_setting feat(router.py): allow setting model_region in litellm_params	2024-05-11 11:36:22 -07:00
Krrish Dholakia	691c185ff8	feat(router.py): support region routing for bedrock, vertex ai, watsonx	2024-05-11 11:04:00 -07:00
Krrish Dholakia	2ed155b4d4	feat(router.py): allow setting model_region in litellm_params Closes https://github.com/BerriAI/litellm/issues/3580	2024-05-11 10:18:08 -07:00
Krish Dholakia	997ef2e480	Merge pull request #3507 from Manouchehri/oidc-3505-part-1 Initial OIDC support (Google/GitHub/CircleCI -> Amazon Bedrock & Azure OpenAI)	2024-05-11 09:25:17 -07:00
Krish Dholakia	1510f3a37a	Merge pull request #3576 from BerriAI/litellm_langfuse_fix fix(langfuse.py): fix logging user_id in trace param on new trace creation	2024-05-10 19:27:34 -07:00
Krrish Dholakia	3a98b6b8df	fix(langfuse.py): fix logging user_id in trace param on new trace creation Closes https://github.com/BerriAI/litellm/issues/3560	2024-05-10 18:25:07 -07:00
Ishaan Jaff	58acc76352	fix auto inferring region	2024-05-10 16:08:05 -07:00
Ishaan Jaff	2848c0ff2b	fix AUTO_INFER_REGION	2024-05-10 14:08:38 -07:00

412 commits