Commit graph

11876 commits

Author SHA1 Message Date
Krish Dholakia
1aa567f3b5
Merge pull request #3571 from BerriAI/litellm_hf_classifier_support
Huggingface classifier support
2024-05-10 17:54:27 -07:00
Krish Dholakia
69068f9577
Merge pull request #3572 from powerhouseofthecell/feature/enforce-unique-key-and-team-aliases
enforce unique key and team aliases in the ui
2024-05-10 17:53:26 -07:00
Ishaan Jaff
2c0c9e1fa4
Merge pull request #3573 from BerriAI/litellm_team_based_failure_callback
[Feat] Use Team based callbacks with litellm.failure_callbacks
2024-05-10 17:53:15 -07:00
Ishaan Jaff
1d25be0ca8 fix langfuse logger re-initialized on all failure callbacks 2024-05-10 17:48:44 -07:00
Ishaan Jaff
a4695c3010 test - using langfuse as a failure callback 2024-05-10 17:37:32 -07:00
Ishaan Jaff
4584989a31 fix - langfuse copy metadata 2024-05-10 17:33:29 -07:00
Ishaan Jaff
ce8523808b fix langfuse failure logging 2024-05-10 17:02:38 -07:00
Krrish Dholakia
9e9f5d41d9 fix(proxy_server.py): check + get end-user obj even for master key calls
fixes issue where region-based routing wasn't working for end-users if master key was given
2024-05-10 16:54:51 -07:00
Ishaan Jaff
db0db5c62c
Merge pull request #3570 from BerriAI/litellm_test_model_openai_client
[Test] Proxy - uses the same OpenAI Client after 1 min
2024-05-10 16:54:45 -07:00
Ishaan Jaff
e3848abdfe
Merge pull request #3569 from BerriAI/litellm_fix_bug_upsert_deployments
[Fix] Upsert deployment bug
2024-05-10 16:53:59 -07:00
Ishaan Jaff
92b86056cf fix langfuse team based logging tests 2024-05-10 16:39:49 -07:00
Ishaan Jaff
53f9d8280f fix - support dynamic failure callbacks 2024-05-10 16:37:01 -07:00
Ishaan Jaff
1a8e853817 (ci/cd) run again 2024-05-10 16:19:03 -07:00
Ishaan Jaff
b6e0f00ed8 fix - using failure callbacks with team based logging 2024-05-10 16:18:13 -07:00
Ishaan Jaff
7d96272d52 fix auto inferring region 2024-05-10 16:08:05 -07:00
Krrish Dholakia
30d2df8940 docs(enterprise.md): add aws marketplace notice on docs 2024-05-10 15:54:29 -07:00
Krrish Dholakia
6a400a6200 test: fix test 2024-05-10 15:49:20 -07:00
Nick Wong
759ff3f750
added code to enforce unique key and team aliases in the ui 2024-05-10 15:42:07 -07:00
Krrish Dholakia
500995696a test: fix linting 2024-05-10 14:42:06 -07:00
Krrish Dholakia
d4d175030f docs(huggingface.md): add text-classification to huggingface docs 2024-05-10 14:39:14 -07:00
Krrish Dholakia
50be25d11a test(test_optional_params.py): fix optional params 2024-05-10 14:08:47 -07:00
Ishaan Jaff
c744851d13 fix AUTO_INFER_REGION 2024-05-10 14:08:38 -07:00
Krrish Dholakia
c17f221b89 test(test_completion.py): reintegrate testing for huggingface tgi + non-tgi 2024-05-10 14:07:01 -07:00
Ishaan Jaff
9bbb13c373 fix bug upsert_deployment 2024-05-10 13:54:52 -07:00
Ishaan Jaff
933f8ed16b fix - proxy_server.py 2024-05-10 13:47:35 -07:00
Ishaan Jaff
414af64343 test - OpenAI client is re-used for Azure, OpenAI 2024-05-10 13:43:19 -07:00
Ishaan Jaff
5c69515a13 fix - upsert_deployment logic 2024-05-10 13:41:51 -07:00
Ishaan Jaff
547976448f fix feature flag logic 2024-05-10 12:50:46 -07:00
Ishaan Jaff
75d6658bbc fix - explain why behind feature flag 2024-05-10 12:39:19 -07:00
Ishaan Jaff
6fd6490d63 fix hide - _auto_infer_region behind a feature flag 2024-05-10 12:38:06 -07:00
Ishaan Jaff
9d3f01c6ae fix - router add model logic 2024-05-10 12:32:16 -07:00
Krrish Dholakia
781d5888c3 docs(predibase.md): add support for predibase to docs 2024-05-10 10:58:35 -07:00
Krish Dholakia
8a35354dd6
Merge pull request #3378 from duckboy81/patch-1
Expand access for other jwt algorithms
2024-05-10 10:07:36 -07:00
Krrish Dholakia
cdec7a414f test(test_router_fallbacks.py): fix test 2024-05-10 09:58:40 -07:00
Krrish Dholakia
40e19a838c bump: version 1.37.1 → 1.37.2 2024-05-10 08:40:31 -07:00
Krrish Dholakia
f9a0364bff bump: version 1.37.0 → 1.37.1 2024-05-10 08:34:01 -07:00
Krrish Dholakia
9a31f3d3d9 fix(main.py): support env var 'VERTEX_PROJECT' and 'VERTEX_LOCATION' 2024-05-10 07:57:56 -07:00
Rajan Paneru
65b07bcb8c Preserving the Pydantic Message Object
Following statement replaces the Pydantic Message Object and initialize it with the dict
model_response["choices"][0]["message"] = response_json["message"]

We need to make sure message is always litellm.Message object

As a fix, based on the code of ollama.py file, i am updating just the content intead of entire object for both sync and async functions
2024-05-10 22:12:32 +09:30
Rajan Paneru
8eb842dcf5 revered the patch so that the fix can be applied in the main place 2024-05-10 22:04:44 +09:30
Antonio Loison
c0c244006f deps: remove diskcache from dependencies and add install in docs 2024-05-10 12:34:05 +02:00
Antonio Loison
9c1d312fdd docs: add disk cache doc and update cache arguments 2024-05-10 12:17:03 +02:00
Simon Sanchez Viloria
e1372de9ee Merge branch 'main' into feature/watsonx-integration 2024-05-10 12:09:09 +02:00
Antonio Loison
79c3d39d67 build(caching.py): move diskcache import inside class and add cache_dir argument to Cache 2024-05-10 12:04:54 +02:00
Simon Sanchez Viloria
d3d82827ed (test) Add tests for WatsonX completion/acompletion streaming 2024-05-10 11:55:58 +02:00
Simon Sanchez Viloria
170fd11c82 (fix) watsonx.py: Fixed linting errors and make sure stream chunk always return usage 2024-05-10 11:53:33 +02:00
Antonio Loison
c1ba4ec078 chore: add diskcache as extra dependency 2024-05-10 11:19:14 +02:00
Antonio Loison
7ee07cd961 test(test_caching.py): use mock_response in disk cache test 2024-05-10 11:00:18 +02:00
Antonio Loison
9b2dcb2807 chore(pyproject.toml): add diskcache as extra 2024-05-10 10:56:36 +02:00
Antonio Loison
004877c7e5 build(caching.py): add disk option for cache 2024-05-10 10:03:38 +02:00
Antonio Loison
ac27f431a4 test(test_caching.py): add disk cache test when using completion 2024-05-10 10:03:38 +02:00