Commit graph

11603 commits

Author SHA1 Message Date
Krrish Dholakia
d4d175030f docs(huggingface.md): add text-classification to huggingface docs 2024-05-10 14:39:14 -07:00
Krrish Dholakia
50be25d11a test(test_optional_params.py): fix optional params 2024-05-10 14:08:47 -07:00
Ishaan Jaff
c744851d13 fix AUTO_INFER_REGION 2024-05-10 14:08:38 -07:00
Krrish Dholakia
c17f221b89 test(test_completion.py): reintegrate testing for huggingface tgi + non-tgi 2024-05-10 14:07:01 -07:00
Ishaan Jaff
9bbb13c373 fix bug upsert_deployment 2024-05-10 13:54:52 -07:00
Ishaan Jaff
933f8ed16b fix - proxy_server.py 2024-05-10 13:47:35 -07:00
Ishaan Jaff
414af64343 test - OpenAI client is re-used for Azure, OpenAI 2024-05-10 13:43:19 -07:00
Ishaan Jaff
5c69515a13 fix - upsert_deployment logic 2024-05-10 13:41:51 -07:00
Ishaan Jaff
547976448f fix feature flag logic 2024-05-10 12:50:46 -07:00
Ishaan Jaff
75d6658bbc fix - explain why behind feature flag 2024-05-10 12:39:19 -07:00
Ishaan Jaff
6fd6490d63 fix hide - _auto_infer_region behind a feature flag 2024-05-10 12:38:06 -07:00
Ishaan Jaff
9d3f01c6ae fix - router add model logic 2024-05-10 12:32:16 -07:00
Krrish Dholakia
781d5888c3 docs(predibase.md): add support for predibase to docs 2024-05-10 10:58:35 -07:00
Krish Dholakia
8a35354dd6
Merge pull request #3378 from duckboy81/patch-1
Expand access for other jwt algorithms
2024-05-10 10:07:36 -07:00
Krrish Dholakia
cdec7a414f test(test_router_fallbacks.py): fix test 2024-05-10 09:58:40 -07:00
Krrish Dholakia
40e19a838c bump: version 1.37.1 → 1.37.2 2024-05-10 08:40:31 -07:00
Krrish Dholakia
f9a0364bff bump: version 1.37.0 → 1.37.1 2024-05-10 08:34:01 -07:00
Krrish Dholakia
9a31f3d3d9 fix(main.py): support env var 'VERTEX_PROJECT' and 'VERTEX_LOCATION' 2024-05-10 07:57:56 -07:00
Rajan Paneru
65b07bcb8c Preserving the Pydantic Message Object
Following statement replaces the Pydantic Message Object and initialize it with the dict
model_response["choices"][0]["message"] = response_json["message"]

We need to make sure message is always litellm.Message object

As a fix, based on the code of ollama.py file, i am updating just the content intead of entire object for both sync and async functions
2024-05-10 22:12:32 +09:30
Rajan Paneru
8eb842dcf5 revered the patch so that the fix can be applied in the main place 2024-05-10 22:04:44 +09:30
Antonio Loison
c0c244006f deps: remove diskcache from dependencies and add install in docs 2024-05-10 12:34:05 +02:00
Antonio Loison
9c1d312fdd docs: add disk cache doc and update cache arguments 2024-05-10 12:17:03 +02:00
Simon Sanchez Viloria
e1372de9ee Merge branch 'main' into feature/watsonx-integration 2024-05-10 12:09:09 +02:00
Antonio Loison
79c3d39d67 build(caching.py): move diskcache import inside class and add cache_dir argument to Cache 2024-05-10 12:04:54 +02:00
Simon Sanchez Viloria
d3d82827ed (test) Add tests for WatsonX completion/acompletion streaming 2024-05-10 11:55:58 +02:00
Simon Sanchez Viloria
170fd11c82 (fix) watsonx.py: Fixed linting errors and make sure stream chunk always return usage 2024-05-10 11:53:33 +02:00
Antonio Loison
c1ba4ec078 chore: add diskcache as extra dependency 2024-05-10 11:19:14 +02:00
Antonio Loison
7ee07cd961 test(test_caching.py): use mock_response in disk cache test 2024-05-10 11:00:18 +02:00
Antonio Loison
9b2dcb2807 chore(pyproject.toml): add diskcache as extra 2024-05-10 10:56:36 +02:00
Antonio Loison
004877c7e5 build(caching.py): add disk option for cache 2024-05-10 10:03:38 +02:00
Antonio Loison
ac27f431a4 test(test_caching.py): add disk cache test when using completion 2024-05-10 10:03:38 +02:00
Antonio Loison
c1a81d90a6 build(caching.py) - add disk cache object 2024-05-10 10:03:35 +02:00
Krish Dholakia
a671046b45
Merge pull request #3552 from BerriAI/litellm_predibase_support
feat(predibase.py): add support for predibase provider
2024-05-09 22:21:16 -07:00
Krrish Dholakia
714370956f fix(predibase.py): fix async streaming 2024-05-09 22:18:16 -07:00
Krrish Dholakia
76d4290591 fix(predibase.py): fix event loop closed error 2024-05-09 19:07:19 -07:00
Krrish Dholakia
491e177348 fix(predibase.py): fix async completion call 2024-05-09 18:44:19 -07:00
Krrish Dholakia
5a38438c3f docs(customer_routing.md): add region-based routing for specific customers, to docs 2024-05-09 18:40:49 -07:00
Krrish Dholakia
425efc60f4 fix(main.py): fix linting error 2024-05-09 18:12:28 -07:00
Ishaan Jaff
5eb12e30cc
Merge pull request #3547 from BerriAI/litellm_support_stream_options_text_completion
[Feat] support `stream_options` on `litellm.text_completion`
2024-05-09 18:05:58 -07:00
Ishaan Jaff
63bfc12a63
Merge pull request #3555 from CyanideByte/global-ignore-warning
Globally filtering pydantic conflict warnings
2024-05-09 18:04:08 -07:00
Krrish Dholakia
9083d8e490 fix: fix linting errors 2024-05-09 17:55:27 -07:00
CyanideByte
4a7be9163b Globally filtering pydantic conflict warnings 2024-05-09 17:42:19 -07:00
Krrish Dholakia
d7189c21fd feat(predibase.py): support async_completion + streaming (sync + async)
finishes up pr
2024-05-09 17:41:27 -07:00
Krish Dholakia
dab176b7e7
Merge pull request #3551 from powerhouseofthecell/fix/error-on-get-user-role
Fix/error on get user role
2024-05-09 17:40:18 -07:00
Rajan Paneru
c45085b728 Based on the data-type using json
The value of response_obj["choices"][0]["message"] is Message object and dict

Added a conditional to use .json only iff it is Message Object
2024-05-10 10:06:50 +09:30
Krrish Dholakia
186c0ec77b feat(predibase.py): add support for predibase provider
Closes https://github.com/BerriAI/litellm/issues/1253
2024-05-09 16:39:43 -07:00
Nick Wong
d3a228d03b
added changes from upstream
Merge branch 'main' into fix/error-on-get-user-role
2024-05-09 16:14:14 -07:00
Nick Wong
c42f1ce2c6
removed extra default dict return, which causes error if user_role is a string 2024-05-09 16:13:26 -07:00
dependabot[bot]
9bcd93178f
build(deps): bump next from 14.1.0 to 14.1.1 in /ui/litellm-dashboard
Bumps [next](https://github.com/vercel/next.js) from 14.1.0 to 14.1.1.
- [Release notes](https://github.com/vercel/next.js/releases)
- [Changelog](https://github.com/vercel/next.js/blob/canary/release.js)
- [Commits](https://github.com/vercel/next.js/compare/v14.1.0...v14.1.1)

---
updated-dependencies:
- dependency-name: next
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-05-09 22:42:27 +00:00
Krrish Dholakia
43b2050cc2 bump: version 1.36.4 → 1.37.0 2024-05-09 15:41:40 -07:00