Commit graph

11479 commits

Author SHA1 Message Date
Ishaan Jaff
245960708d fix - lowest cost routing 2024-05-07 12:49:20 -07:00
Ishaan Jaff
6cb059cce8 fix - use cost-based-routing 2024-05-07 12:48:53 -07:00
Ishaan Jaff
41ffaee821 test - basic lowest cost routing 2024-05-07 12:48:20 -07:00
Jean-Luc Duckworth
d5767e9403 Expanding jwt access to other RS and PS algos. Updated to resolve merge conflicts. 2024-05-07 15:45:07 -04:00
Ishaan Jaff
4c909194c7 docs - lowest - latency routing 2024-05-07 12:43:44 -07:00
Ishaan Jaff
e5e477d7f5 test - lowest cost routing 2024-05-07 12:19:44 -07:00
Ishaan Jaff
1ba4440096 feat add lowest cost router 2024-05-07 12:12:39 -07:00
Ishaan Jaff
31ac43bfdc feat - add lowst cost router 2024-05-07 12:12:09 -07:00
phact
4c64e3da10 locals().copy() 2024-05-07 14:58:35 -04:00
Paul Gauthier
82a4c68e60 Added deepseek completion test 2024-05-07 11:58:05 -07:00
Krrish Dholakia
724660606a fix(slack_alerting.py): fix storing + reading datetime object from cache
this converts the dt object to isoformat before storing, and loads it back to dt when comparing
2024-05-07 11:44:55 -07:00
Paul Gauthier
9162f9c2c5 Added costs & context json 2024-05-07 11:44:55 -07:00
Paul Gauthier
90eb0ea022 Added support for the deepseek api 2024-05-07 11:44:03 -07:00
Krish Dholakia
93e5fb49d3 Merge pull request #3500 from ghaemisr/main
Added support for JWT auth with PEM cert public keys
2024-05-07 11:07:30 -07:00
phact
7c5c9a8152 looks like cohere does support function calling 2024-05-07 13:41:05 -04:00
Sara Ghaemi
86e0dd68c3 updated tests 2024-05-07 13:28:57 -04:00
phact
1b811cd152 unit test and list fix 2024-05-07 13:24:28 -04:00
Sara Ghaemi
7017899d37 updated tests to also check for audience if found 2024-05-07 12:10:47 -04:00
Ishaan Jaff
cf638d54c4 Merge pull request #3494 from Manouchehri/devcontainer-1
Add devcontainer
2024-05-07 08:57:37 -07:00
David Manouchehri
3ee0328b04 feat(bedrock.py): Support using OIDC tokens. 2024-05-07 15:46:54 +00:00
David Manouchehri
4b655d8b33 feat(util.py): Add OIDC support. 2024-05-07 15:46:48 +00:00
Ishaan Jaff
ee1b1fe4f8 Merge pull request #3464 from sepiatone/main
update langchain documentation to reflect refactor
2024-05-07 08:42:23 -07:00
Ishaan Jaff
8b15507add Merge pull request #3471 from paneru-rajan/Issue-3466-update-doc
Synced the doc with the Mistral
2024-05-07 08:41:18 -07:00
Sara Ghaemi
66b2b5fab9 made audience optional and updated docs 2024-05-07 11:37:04 -04:00
Sara Ghaemi
0ca5af8ae8 Merge branch 'BerriAI:main' into main 2024-05-07 11:23:00 -04:00
Sara Ghaemi
1580f993fe Updated JWT handler to support PEM public key 2024-05-07 11:22:17 -04:00
Ishaan Jaff
e91f0382d7 feat - update gemini-1.5-pro pricing 2024-05-07 07:56:52 -07:00
nkvch
06d1aec49e * fix(factory.py): fix conditional statement for merging messages with previous ones 2024-05-07 13:42:42 +02:00
David Manouchehri
a9cae01cf5 Add devcontainer. 2024-05-07 11:33:04 +00:00
sepiatone
1518a7b42d update langchain documentation to reflect refactor 2024-05-07 17:01:55 +05:30
nkvch
389530efb4 * chore(.gitignore): add 'venv' to the list of ignored files/directories
* fix(test_completion.py): fix import order and remove unused imports
* feat(test_completion.py): add test for empty assistant message in completion_claude_3_empty_message()
2024-05-07 12:51:30 +02:00
nkvch
7d7b59ff78 * feat(factory.py): add support for merging consecutive messages of one role when separated with empty message of another role 2024-05-07 12:51:30 +02:00
Krish Dholakia
30003afbf8 Merge pull request #3459 from alexanderepstein/langfuse_improvements
Update support for langfuse metadata
2024-05-06 21:56:29 -07:00
Ishaan Jaff
bfef424b39 fix don't let slack alert block /model/new 2024-05-06 20:47:29 -07:00
Ishaan Jaff
eb84c69ec6 fix - /model/new 2024-05-06 20:45:17 -07:00
Mehmet Bektas
8e9a4fa9eb comment out test method calls, following the pattern 2024-05-06 20:13:11 -07:00
Mehmet Bektas
157d7a7c28 add ollama embeddings unit tests 2024-05-06 20:11:45 -07:00
Krish Dholakia
57b3eaf136 Merge pull request #3491 from BerriAI/litellm_slack_daily_reports
feat(slack_alerting.py): support sending daily reports on deployments
2024-05-06 19:31:56 -07:00
Krish Dholakia
aa62d891a0 Merge branch 'main' into litellm_slack_daily_reports 2024-05-06 19:31:20 -07:00
Krrish Dholakia
69bcde6f6b test(test_router_fallbacks.py): fix test - making it easier to debug 2024-05-06 18:49:36 -07:00
Krrish Dholakia
26c0ed0f2d refactor(proxy_server.py): show ttl's on a top-level enum
Addresses - https://github.com/BerriAI/litellm/issues/2649#issuecomment-2097203372
2024-05-06 18:43:42 -07:00
Krrish Dholakia
72299a6894 fix(slack_alerting.py): allow internal cache to be an optional param 2024-05-06 18:34:09 -07:00
Krrish Dholakia
3a8876b0d5 refactor(slack_alerting.py): trigger new build 2024-05-06 18:29:55 -07:00
Krrish Dholakia
6b9b4f05ba feat(proxy_server.py): schedule slack daily report if enabled
if user enabled daily_reports, send them a slack report every 12 hours
2024-05-06 18:25:48 -07:00
Ishaan Jaff
d94065ca43 fix - num active callbacks 2024-05-06 18:20:27 -07:00
Ishaan Jaff
dd14714b94 Merge pull request #3487 from BerriAI/dependabot/pip/jinja2-3.1.4
build(deps): bump jinja2 from 3.1.3 to 3.1.4
2024-05-06 18:15:50 -07:00
Ishaan Jaff
15b264bd4e fix num callbacks test 2024-05-06 18:13:15 -07:00
Ishaan Jaff
5368382c03 fix num callbacks test 2024-05-06 18:12:45 -07:00
Ishaan Jaff
22347f2f56 bump: version 1.36.0 → 1.36.1 2024-05-06 18:08:40 -07:00
Ishaan Jaff
a382517992 fix - test alerting 2024-05-06 18:07:58 -07:00