Commit graph

17755 commits

Author SHA1 Message Date
dependabot[bot]
e48459389c
Bump send and express in /docs/my-website
Bumps [send](https://github.com/pillarjs/send) and [express](https://github.com/expressjs/express). These dependencies needed to be updated together.

Updates `send` from 0.18.0 to 0.19.0
- [Release notes](https://github.com/pillarjs/send/releases)
- [Changelog](https://github.com/pillarjs/send/blob/master/HISTORY.md)
- [Commits](https://github.com/pillarjs/send/compare/0.18.0...0.19.0)

Updates `express` from 4.19.2 to 4.20.0
- [Release notes](https://github.com/expressjs/express/releases)
- [Changelog](https://github.com/expressjs/express/blob/master/History.md)
- [Commits](https://github.com/expressjs/express/compare/4.19.2...4.20.0)

---
updated-dependencies:
- dependency-name: send
  dependency-type: indirect
- dependency-name: express
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-09-11 02:11:46 +00:00
Ishaan Jaff
a6f9020884 bump: version 1.44.23 → 1.44.24 2024-09-10 19:10:27 -07:00
Ishaan Jaff
c1262addbe
Merge pull request #5623 from BerriAI/litellm_vertex_use_async_for_getting_token
[Feat-Vertex Perf] Use async func to get auth credentials
2024-09-10 18:53:48 -07:00
Ishaan Jaff
9261599ab9 add doc string to vertex llm base 2024-09-10 18:52:43 -07:00
Ishaan Jaff
8d773e66af fix gemini streaming test 2024-09-10 17:50:24 -07:00
Ishaan Jaff
26aa20df73 fix test get token url 2024-09-10 17:38:03 -07:00
Ishaan Jaff
899eaa9566
Merge pull request #5571 from jalammar/cohere-updated-models
Add Cohere refresh models and update pricing
2024-09-10 17:22:51 -07:00
Ishaan Jaff
24474789d7 fix gemini test 2024-09-10 17:20:01 -07:00
Ishaan Jaff
96fa9d46f5 fix case when gemini is used 2024-09-10 17:06:45 -07:00
Ishaan Jaff
1c6f8b1be2 fix vertex use async func to set auth creds 2024-09-10 16:12:18 -07:00
Ishaan Jaff
aed48e3bad
Merge pull request #5622 from BerriAI/litellm_fix_auth_refresh_vertex
[Feat-Perf Improvement Vertex] Only Refresh credentials when token is expired
2024-09-10 15:03:35 -07:00
Ishaan Jaff
2d4be4cf1f
Merge pull request #5620 from BerriAI/litellm_use_helper_to_get_httpx_clients
[Feat-Perf] Use common helper to get async httpx clients for all providers
2024-09-10 15:03:11 -07:00
Ishaan Jaff
3ebff903c3
Merge branch 'main' into litellm_use_helper_to_get_httpx_clients 2024-09-10 15:02:54 -07:00
Ishaan Jaff
478a36ae36 fix bedrock get async client 2024-09-10 14:17:18 -07:00
Ishaan Jaff
26ae86e59b fix types for vertex project id 2024-09-10 14:02:15 -07:00
Ishaan Jaff
f3593aed68
Merge pull request #5619 from BerriAI/litellm_vertex_use_get_httpx_client
[Fix-Perf] Vertex AI cache httpx clients
2024-09-10 13:59:39 -07:00
Ishaan Jaff
68ce46c4b8 fix getting params 2024-09-10 13:54:42 -07:00
Ishaan Jaff
7bddbdd56e fix vertex only refresh auth when required 2024-09-10 13:49:28 -07:00
Ishaan Jaff
72dd21dc47 fix linting error 2024-09-10 13:29:02 -07:00
Ishaan Jaff
7ad3fe464e fix get_async_httpx_client 2024-09-10 13:20:55 -07:00
Ishaan Jaff
d7afeee71c fix test 2024-09-10 13:15:50 -07:00
Ishaan Jaff
dc63a16a6f
Merge pull request #5616 from BerriAI/litellm_fix_regen_keys
[Fix-Proxy] Regenerate keys when no duration is passed
2024-09-10 13:09:57 -07:00
Ishaan Jaff
08f8f9634f use get async httpx client 2024-09-10 13:08:49 -07:00
Ishaan Jaff
0f154abf9e use get_async_httpx_client for logging httpx 2024-09-10 13:03:55 -07:00
Ishaan Jaff
421b857714 pass llm provider when creating async httpx clients 2024-09-10 11:51:42 -07:00
Ishaan Jaff
87bac7c026 fix rps / rpm values on load testing 2024-09-10 11:22:19 -07:00
Jay Alammar
795b29dfc4 Updating Cohere models, prices, and documentation 2024-09-10 13:47:05 -04:00
Ishaan Jaff
a0e4510f53 add enum for all llm providers LlmProviders 2024-09-10 10:44:57 -07:00
Ishaan Jaff
d4b9a1307d rename get_async_httpx_client 2024-09-10 10:38:01 -07:00
Ishaan Jaff
1e8cf9f2a6 fix vertex ai use _get_async_client 2024-09-10 10:33:19 -07:00
Peter Laß
b1ecfe065c
fix #5614 (#5615)
Co-authored-by: Peter Laß <peter.lass@maibornwolff.de>
2024-09-10 09:26:44 -07:00
Ishaan Jaff
39a8bb2bc4 add test test_regenerate_key_ui 2024-09-10 09:12:03 -07:00
Ishaan Jaff
428762542c fix regen keys when no duration is passed 2024-09-10 08:04:18 -07:00
Ishaan Jaff
43cd657ac5
Merge pull request #5603 from BerriAI/litellm_allow_turning_off_message_logging_for_callbacks
[Feat-Proxy] allow turning off message logging for OTEL (callback specific)
2024-09-09 22:00:09 -07:00
Ishaan Jaff
479b12be09
Merge branch 'main' into litellm_allow_turning_off_message_logging_for_callbacks 2024-09-09 21:59:36 -07:00
Krrish Dholakia
2e5583919a bump: version 1.44.22 → 1.44.23 2024-09-09 21:58:27 -07:00
Krish Dholakia
2d2282101b
LiteLLM Minor Fixes and Improvements (09/09/2024) (#5602)
* fix(main.py): pass default azure api version as alternative in completion call

Fixes api error caused due to api version

Closes https://github.com/BerriAI/litellm/issues/5584

* Fixed gemini-1.5-flash pricing (#5590)

* add /key/list endpoint

* bump: version 1.44.21 → 1.44.22

* docs architecture

* Fixed gemini-1.5-flash pricing

---------

Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>

* fix(bedrock/chat.py): fix converse api stop sequence param mapping

Fixes https://github.com/BerriAI/litellm/issues/5592

* fix(databricks/cost_calculator.py): handle databricks model name changes

Fixes https://github.com/BerriAI/litellm/issues/5597

* fix(azure.py): support azure api version 2024-08-01-preview

Closes https://github.com/BerriAI/litellm/issues/5377

* fix(proxy/_types.py): allow dev keys to call cohere /rerank endpoint

Fixes issue where only admin could call rerank endpoint

* fix(azure.py): check if model is gpt-4o

* fix(proxy/_types.py): support /v1/rerank on non-admin routes as well

* fix(cost_calculator.py): fix split on `/` logic in cost calculator

---------

Co-authored-by: F1bos <44951186+F1bos@users.noreply.github.com>
Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>
2024-09-09 21:56:12 -07:00
Krish Dholakia
4ac66bd843
LiteLLM Minor Fixes and Improvements (09/07/2024) (#5580)
* fix(litellm_logging.py): set completion_start_time_float to end_time_float if none

Fixes https://github.com/BerriAI/litellm/issues/5500

* feat(_init_.py): add new 'openai_text_completion_compatible_providers' list

Fixes https://github.com/BerriAI/litellm/issues/5558

Handles correctly routing fireworks ai calls when done via text completions

* fix: fix linting errors

* fix: fix linting errors

* fix(openai.py): fix exception raised

* fix(openai.py): fix error handling

* fix(_redis.py): allow all supported arguments for redis cluster (#5554)

* Revert "fix(_redis.py): allow all supported arguments for redis cluster (#5554)" (#5583)

This reverts commit f2191ef4cb.

* fix(router.py): return model alias w/ underlying deployment on router.get_model_list()

Fixes https://github.com/BerriAI/litellm/issues/5524#issuecomment-2336410666

* test: handle flaky tests

---------

Co-authored-by: Jonas Dittrich <58814480+Kakadus@users.noreply.github.com>
2024-09-09 18:54:17 -07:00
Ishaan Jaff
569f3ddda9 fix test_awesome_otel_with_message_logging_off 2024-09-09 17:59:07 -07:00
Ishaan Jaff
16b6b56c8b fix otel logging test 2024-09-09 17:51:35 -07:00
Ishaan Jaff
c86b333054
Merge pull request #5601 from BerriAI/litellm_tag_routing_fixes
[Feat] Tag Routing - Allow setting default deployments
2024-09-09 17:45:24 -07:00
Ishaan Jaff
a6d3bd0ab7
Merge branch 'main' into litellm_tag_routing_fixes 2024-09-09 17:45:18 -07:00
Ishaan Jaff
407bdf10ce run test in verbose mode 2024-09-09 17:43:11 -07:00
Ishaan Jaff
00f1d7b1ff
Merge pull request #5576 from BerriAI/litellm_set_max_batch_size
[Fix - Otel logger] Set a max queue size of 100 logs for OTEL
2024-09-09 17:39:16 -07:00
Ishaan Jaff
c57683421b
Merge pull request #5606 from BerriAI/litellm_log_failureS_key_based_logging
[Feat-Proxy] Allow using key based logging for success and failure
2024-09-09 17:38:36 -07:00
Ishaan Jaff
e25786ed8e fix test otel message logging off 2024-09-09 17:01:20 -07:00
Ishaan Jaff
949af7be2e fix team based logging doc 2024-09-09 16:49:26 -07:00
Ishaan Jaff
57ebe4649e add test for using success and failure 2024-09-09 16:44:37 -07:00
Elad Segal
da30da9a97
Properly use allowed_fails_policy when it has fields with a value of 0 (#5604) 2024-09-09 16:35:12 -07:00
Ishaan Jaff
bbdcc75c60 fix log failures for key based logging 2024-09-09 16:33:06 -07:00