Commit graph

863 commits

Author SHA1 Message Date
Ishaan Jaff
d038568be4 ci/cd run again 2024-09-12 08:31:17 -07:00
Ishaan Jaff
ce5182710f ci/cd run again 2024-09-10 19:21:13 -07:00
Krish Dholakia
2d2282101b
LiteLLM Minor Fixes and Improvements (09/09/2024) (#5602)
* fix(main.py): pass default azure api version as alternative in completion call

Fixes api error caused due to api version

Closes https://github.com/BerriAI/litellm/issues/5584

* Fixed gemini-1.5-flash pricing (#5590)

* add /key/list endpoint

* bump: version 1.44.21 → 1.44.22

* docs architecture

* Fixed gemini-1.5-flash pricing

---------

Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>

* fix(bedrock/chat.py): fix converse api stop sequence param mapping

Fixes https://github.com/BerriAI/litellm/issues/5592

* fix(databricks/cost_calculator.py): handle databricks model name changes

Fixes https://github.com/BerriAI/litellm/issues/5597

* fix(azure.py): support azure api version 2024-08-01-preview

Closes https://github.com/BerriAI/litellm/issues/5377

* fix(proxy/_types.py): allow dev keys to call cohere /rerank endpoint

Fixes issue where only admin could call rerank endpoint

* fix(azure.py): check if model is gpt-4o

* fix(proxy/_types.py): support /v1/rerank on non-admin routes as well

* fix(cost_calculator.py): fix split on `/` logic in cost calculator

---------

Co-authored-by: F1bos <44951186+F1bos@users.noreply.github.com>
Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>
2024-09-09 21:56:12 -07:00
Krish Dholakia
4ac66bd843
LiteLLM Minor Fixes and Improvements (09/07/2024) (#5580)
* fix(litellm_logging.py): set completion_start_time_float to end_time_float if none

Fixes https://github.com/BerriAI/litellm/issues/5500

* feat(_init_.py): add new 'openai_text_completion_compatible_providers' list

Fixes https://github.com/BerriAI/litellm/issues/5558

Handles correctly routing fireworks ai calls when done via text completions

* fix: fix linting errors

* fix: fix linting errors

* fix(openai.py): fix exception raised

* fix(openai.py): fix error handling

* fix(_redis.py): allow all supported arguments for redis cluster (#5554)

* Revert "fix(_redis.py): allow all supported arguments for redis cluster (#5554)" (#5583)

This reverts commit f2191ef4cb.

* fix(router.py): return model alias w/ underlying deployment on router.get_model_list()

Fixes https://github.com/BerriAI/litellm/issues/5524#issuecomment-2336410666

* test: handle flaky tests

---------

Co-authored-by: Jonas Dittrich <58814480+Kakadus@users.noreply.github.com>
2024-09-09 18:54:17 -07:00
Ishaan Jaff
b573847dad run ci cd again 2024-09-05 19:07:16 -07:00
Ishaan Jaff
f42a0528db
Merge branch 'main' into litellm_allow_internal_user_view_usage 2024-09-05 16:46:06 -07:00
Ishaan Jaff
30137b0b72 run ci - cd again 2024-09-05 15:42:40 -07:00
Ishaan Jaff
9eb35e47ee run ci/cd on main 2024-09-05 15:39:30 -07:00
Ishaan Jaff
05e02fbe76 run ci/cd again 2024-09-05 15:38:49 -07:00
Ishaan Jaff
004a71b8dc use requirements txt 2024-09-05 15:34:17 -07:00
Ishaan Jaff
42b2566906 run ci/cd agaiin 2024-09-05 15:26:12 -07:00
Ishaan Jaff
5b1d207cf4 run test again 2024-09-05 15:20:21 -07:00
Ishaan Jaff
89649282ee run again 2024-09-05 15:17:17 -07:00
Ishaan Jaff
4e392ebdb2
Merge pull request #5484 from BerriAI/litellm_fix_redis
fix proxy server - always read redis for rate limiting logic
2024-09-04 12:39:15 -07:00
Krrish Dholakia
e7855329e8 style: ci/cd run again 2024-09-04 08:32:35 -07:00
Ishaan Jaff
f5122e0325 ci/cd run again 2024-09-04 07:41:08 -07:00
Krish Dholakia
be3c7b401e
LiteLLM Minor fixes + improvements (08/03/2024) (#5488)
* fix(internal_user_endpoints.py): set budget_reset_at for /user/update

* fix(vertex_and_google_ai_studio_gemini.py): handle accumulated json

Fixes https://github.com/BerriAI/litellm/issues/5479

* fix(vertex_ai_and_gemini.py): fix assistant message function call when content is not None

Fixes https://github.com/BerriAI/litellm/issues/5490

* fix(proxy_server.py): generic state uuid for okta sso

* fix(lago.py): improve debug logs

Debugging for https://github.com/BerriAI/litellm/issues/5477

* docs(bedrock.md): add bedrock cross-region inferencing to docs

* fix(azure.py): return azure response headers on aembedding call

* feat(azure.py): return azure response headers for `/audio/transcription`

* fix(types/utils.py): standardize deepseek / anthropic prompt caching usage information

Closes https://github.com/BerriAI/litellm/issues/5285

* docs(usage.md): add docs on litellm usage object

* test(test_completion.py): mark flaky test
2024-09-03 21:21:34 -07:00
Ishaan Jaff
666dd6aca2
Merge pull request #5478 from BerriAI/litellm_Add_ai21
[Feat] Add AI21 /chat API
2024-09-02 16:20:37 -07:00
Krish Dholakia
f9e6507cd1
LiteLLM Minor Fixes + Improvements (#5474)
* feat(proxy/_types.py): add lago billing to callbacks ui

Closes https://github.com/BerriAI/litellm/issues/5472

* fix(anthropic.py): return anthropic prompt caching information

Fixes https://github.com/BerriAI/litellm/issues/5364

* feat(bedrock/chat.py): support 'json_schema' for bedrock models

Closes https://github.com/BerriAI/litellm/issues/5434

* fix(bedrock/embed/embeddings.py): support async embeddings for amazon titan models

* fix: linting fixes

* fix: handle key errors

* fix(bedrock/chat.py): fix bedrock ai21 streaming object

* feat(bedrock/embed): support bedrock embedding optional params

* fix(databricks.py): fix usage chunk

* fix(internal_user_endpoints.py): apply internal user defaults, if user role updated

Fixes issue where user update wouldn't apply defaults

* feat(slack_alerting.py): provide multiple slack channels for a given alert type

multiple channels might be interested in receiving an alert for a given type

* docs(alerting.md): add multiple channel alerting to docs
2024-09-02 14:29:57 -07:00
Ishaan Jaff
d4f77832c0 docs add docs on supported params 2024-09-02 12:39:07 -07:00
Ishaan Jaff
93650db380 fix linting 2024-09-02 12:15:14 -07:00
Ishaan Jaff
263e283126 add ai21 model test 2024-09-02 12:14:13 -07:00
Ishaan Jaff
ae25c5695f add streaming test for ai21 2024-09-02 11:58:15 -07:00
Ishaan Jaff
0874885f04 add all ai21 params 2024-09-02 11:54:40 -07:00
Ishaan Jaff
0970475a3b test ai21 2024-09-02 11:47:48 -07:00
Ishaan Jaff
69dea613ca ci/cd run again 2024-08-30 14:03:00 -07:00
Ishaan Jaff
f1147696a3 ci/cd run again 2024-08-27 22:59:21 -07:00
Ishaan Jaff
c8fc5c44e1 ci/cd run again 2024-08-27 22:53:18 -07:00
Ishaan Jaff
3966c545b4 fix flaky tests 2024-08-27 22:47:11 -07:00
Krrish Dholakia
640e5d0dc9 test: fix test 2024-08-27 22:44:29 -07:00
Ishaan Jaff
1b04380540 fix palm api is deactivated by google 2024-08-27 21:08:49 -07:00
Krish Dholakia
415abc86c6
Merge pull request #5358 from BerriAI/litellm_fix_retry_after
fix retry after - cooldown individual models based on their specific 'retry-after' header
2024-08-27 11:50:14 -07:00
Krrish Dholakia
18b67a455e test: fix test 2024-08-27 10:46:57 -07:00
Krrish Dholakia
b0f01e5b95 test: fix test 2024-08-27 07:14:14 -07:00
Krrish Dholakia
d13d2e8a62 feat(vertex_httpx.py): support functions param for gemini google ai studio + vertex ai
Closes https://github.com/BerriAI/litellm/issues/5344
2024-08-26 10:59:01 -07:00
Ishaan Jaff
735eb041b9 ci/cd run again 2024-08-26 08:36:58 -07:00
Krrish Dholakia
8c5ff150f6 fix(huggingface_restapi.py): fix tests 2024-08-23 21:40:27 -07:00
Ishaan Jaff
18cd905a1f ci/cd run again 2024-08-22 18:41:10 -07:00
Krrish Dholakia
11bfc1dca7 fix(cohere_chat.py): support passing 'extra_headers'
Fixes https://github.com/BerriAI/litellm/issues/4709
2024-08-22 10:17:36 -07:00
Ishaan Jaff
9dbd1b089b fix tg ai -deprecated model 2024-08-21 12:14:39 -07:00
Krrish Dholakia
8406016096 test(test_completion.py): fix gemini rate limit error 2024-08-21 07:17:27 -07:00
Krrish Dholakia
a6ee1e04fe test(test_completion.py): fix test 2024-08-21 07:01:02 -07:00
Krish Dholakia
ad1e327208
Merge pull request #5284 from BerriAI/litellm_fix_azure_api_version
feat(azure.py): support dynamic azure api versions
2024-08-20 11:46:55 -07:00
Krish Dholakia
02eb6455b2
Merge pull request #5296 from BerriAI/litellm_azure_json_schema_support
feat(azure.py): support 'json_schema' for older models
2024-08-20 11:41:38 -07:00
Krish Dholakia
409306b266
Merge branch 'main' into litellm_fix_azure_api_version 2024-08-20 11:40:53 -07:00
Ishaan Jaff
8e7bcb42a6 ci/cd run again 2024-08-19 21:00:38 -07:00
Krrish Dholakia
49416e121c feat(azure.py): support dynamic api versions
Closes https://github.com/BerriAI/litellm/issues/5228
2024-08-19 12:17:43 -07:00
Krrish Dholakia
663a0c1b83 feat(Support-pass-through-for-bedrock-endpoints): Allows pass-through support for bedrock endpoints 2024-08-17 17:57:43 -07:00
Krrish Dholakia
7ec7c9970b feat(azure.py): support 'json_schema' for older models
Converts the json schema input to a tool call, allows the call to still work on older azure models
2024-08-17 16:31:13 -07:00
Ishaan Jaff
40dc27e72c fix sagemaker tests 2024-08-15 18:34:20 -07:00