litellm

Author	SHA1	Message	Date
Krish Dholakia	1aa567f3b5	Merge pull request #3571 from BerriAI/litellm_hf_classifier_support Huggingface classifier support	2024-05-10 17:54:27 -07:00
Krish Dholakia	69068f9577	Merge pull request #3572 from powerhouseofthecell/feature/enforce-unique-key-and-team-aliases enforce unique key and team aliases in the ui	2024-05-10 17:53:26 -07:00
Ishaan Jaff	2c0c9e1fa4	Merge pull request #3573 from BerriAI/litellm_team_based_failure_callback [Feat] Use Team based callbacks with litellm.failure_callbacks	2024-05-10 17:53:15 -07:00
Ishaan Jaff	1d25be0ca8	fix langfuse logger re-initialized on all failure callbacks	2024-05-10 17:48:44 -07:00
Ishaan Jaff	a4695c3010	test - using langfuse as a failure callback	2024-05-10 17:37:32 -07:00
Ishaan Jaff	4584989a31	fix - langfuse copy metadata	2024-05-10 17:33:29 -07:00
Ishaan Jaff	ce8523808b	fix langfuse failure logging	2024-05-10 17:02:38 -07:00
Krrish Dholakia	9e9f5d41d9	fix(proxy_server.py): check + get end-user obj even for master key calls fixes issue where region-based routing wasn't working for end-users if master key was given	2024-05-10 16:54:51 -07:00
Ishaan Jaff	db0db5c62c	Merge pull request #3570 from BerriAI/litellm_test_model_openai_client [Test] Proxy - uses the same OpenAI Client after 1 min	2024-05-10 16:54:45 -07:00
Ishaan Jaff	e3848abdfe	Merge pull request #3569 from BerriAI/litellm_fix_bug_upsert_deployments [Fix] Upsert deployment bug	2024-05-10 16:53:59 -07:00
Ishaan Jaff	92b86056cf	fix langfuse team based logging tests	2024-05-10 16:39:49 -07:00
Ishaan Jaff	53f9d8280f	fix - support dynamic failure callbacks	2024-05-10 16:37:01 -07:00
Ishaan Jaff	1a8e853817	(ci/cd) run again	2024-05-10 16:19:03 -07:00
Ishaan Jaff	b6e0f00ed8	fix - using failure callbacks with team based logging	2024-05-10 16:18:13 -07:00
Ishaan Jaff	7d96272d52	fix auto inferring region	2024-05-10 16:08:05 -07:00
Krrish Dholakia	30d2df8940	docs(enterprise.md): add aws marketplace notice on docs	2024-05-10 15:54:29 -07:00
Krrish Dholakia	6a400a6200	test: fix test	2024-05-10 15:49:20 -07:00
Nick Wong	759ff3f750	added code to enforce unique key and team aliases in the ui	2024-05-10 15:42:07 -07:00
Krrish Dholakia	500995696a	test: fix linting	2024-05-10 14:42:06 -07:00
Krrish Dholakia	d4d175030f	docs(huggingface.md): add text-classification to huggingface docs	2024-05-10 14:39:14 -07:00
Krrish Dholakia	50be25d11a	test(test_optional_params.py): fix optional params	2024-05-10 14:08:47 -07:00
Ishaan Jaff	c744851d13	fix AUTO_INFER_REGION	2024-05-10 14:08:38 -07:00
Krrish Dholakia	c17f221b89	test(test_completion.py): reintegrate testing for huggingface tgi + non-tgi	2024-05-10 14:07:01 -07:00
Ishaan Jaff	9bbb13c373	fix bug upsert_deployment	2024-05-10 13:54:52 -07:00
Ishaan Jaff	933f8ed16b	fix - proxy_server.py	2024-05-10 13:47:35 -07:00
Ishaan Jaff	414af64343	test - OpenAI client is re-used for Azure, OpenAI	2024-05-10 13:43:19 -07:00
Ishaan Jaff	5c69515a13	fix - upsert_deployment logic	2024-05-10 13:41:51 -07:00
Ishaan Jaff	547976448f	fix feature flag logic	2024-05-10 12:50:46 -07:00
Ishaan Jaff	75d6658bbc	fix - explain why behind feature flag	2024-05-10 12:39:19 -07:00
Ishaan Jaff	6fd6490d63	fix hide - _auto_infer_region behind a feature flag	2024-05-10 12:38:06 -07:00
Ishaan Jaff	9d3f01c6ae	fix - router add model logic	2024-05-10 12:32:16 -07:00
Krrish Dholakia	781d5888c3	docs(predibase.md): add support for predibase to docs	2024-05-10 10:58:35 -07:00
Krish Dholakia	8a35354dd6	Merge pull request #3378 from duckboy81/patch-1 Expand access for other jwt algorithms	2024-05-10 10:07:36 -07:00
Krrish Dholakia	cdec7a414f	test(test_router_fallbacks.py): fix test	2024-05-10 09:58:40 -07:00
Krrish Dholakia	40e19a838c	bump: version 1.37.1 → 1.37.2	2024-05-10 08:40:31 -07:00
Krrish Dholakia	f9a0364bff	bump: version 1.37.0 → 1.37.1	2024-05-10 08:34:01 -07:00
Krrish Dholakia	9a31f3d3d9	fix(main.py): support env var 'VERTEX_PROJECT' and 'VERTEX_LOCATION'	2024-05-10 07:57:56 -07:00
Rajan Paneru	65b07bcb8c	Preserving the Pydantic Message Object Following statement replaces the Pydantic Message Object and initialize it with the dict model_response["choices"][0]["message"] = response_json["message"] We need to make sure message is always litellm.Message object As a fix, based on the code of ollama.py file, i am updating just the content intead of entire object for both sync and async functions	2024-05-10 22:12:32 +09:30
Rajan Paneru	8eb842dcf5	revered the patch so that the fix can be applied in the main place	2024-05-10 22:04:44 +09:30
Antonio Loison	c0c244006f	deps: remove diskcache from dependencies and add install in docs	2024-05-10 12:34:05 +02:00
Antonio Loison	9c1d312fdd	docs: add disk cache doc and update cache arguments	2024-05-10 12:17:03 +02:00
Simon Sanchez Viloria	e1372de9ee	Merge branch 'main' into feature/watsonx-integration	2024-05-10 12:09:09 +02:00
Antonio Loison	79c3d39d67	build(caching.py): move diskcache import inside class and add cache_dir argument to Cache	2024-05-10 12:04:54 +02:00
Simon Sanchez Viloria	d3d82827ed	(test) Add tests for WatsonX completion/acompletion streaming	2024-05-10 11:55:58 +02:00
Simon Sanchez Viloria	170fd11c82	(fix) watsonx.py: Fixed linting errors and make sure stream chunk always return usage	2024-05-10 11:53:33 +02:00
Antonio Loison	c1ba4ec078	chore: add diskcache as extra dependency	2024-05-10 11:19:14 +02:00
Antonio Loison	7ee07cd961	test(test_caching.py): use mock_response in disk cache test	2024-05-10 11:00:18 +02:00
Antonio Loison	9b2dcb2807	chore(pyproject.toml): add diskcache as extra	2024-05-10 10:56:36 +02:00
Antonio Loison	004877c7e5	build(caching.py): add disk option for cache	2024-05-10 10:03:38 +02:00
Antonio Loison	ac27f431a4	test(test_caching.py): add disk cache test when using completion	2024-05-10 10:03:38 +02:00

... 10 11 12 13 14 ...

11876 commits