Commit graph

1298 commits

Author SHA1 Message Date
Krish Dholakia
b7beab2e39 Merge pull request #3270 from simonsanvil/feature/watsonx-integration
(feat) add IBM watsonx.ai as an llm provider
2024-04-27 05:48:34 -07:00
Krrish Dholakia
6d141f6dac refactor(main.py): trigger new build 2024-04-25 19:51:38 -07:00
Lucca Zenobio
e73978b0d9 merge 2024-04-25 15:00:07 -03:00
Krrish Dholakia
abd35d6b60 refactor(main.py): trigger new build 2024-04-24 22:04:24 -07:00
Krrish Dholakia
b10f03706d fix(utils.py): fix streaming to not return usage dict
Fixes https://github.com/BerriAI/litellm/issues/3237
2024-04-24 08:06:07 -07:00
Simon S. Viloria
79855b372d Merge branch 'BerriAI:main' into feature/watsonx-integration 2024-04-23 12:18:34 +02:00
Simon Sanchez Viloria
572cbef43b feat - watsonx refractoring, removed dependency, and added support for embedding calls 2024-04-23 12:01:13 +02:00
Krrish Dholakia
3b6d204314 fix(vertex_ai.py): fix streaming logic 2024-04-22 19:15:20 -07:00
Krrish Dholakia
0f8cf067ea refactor(main.py): trigger new build 2024-04-22 10:54:35 -07:00
Simon S. Viloria
0c4cf91c79 Merge branch 'BerriAI:main' into feature/watsonx-integration 2024-04-21 10:35:51 +02:00
Krrish Dholakia
79056690f3 fix(main.py): ignore max_parallel_requests as a litellm param 2024-04-20 12:15:04 -07:00
Simon Sanchez Viloria
9b3a1b3f35 Added support for IBM watsonx.ai models 2024-04-20 20:06:46 +02:00
Krrish Dholakia
9dc0871023 fix(ollama_chat.py): accept api key as a param for ollama calls
allows user to call hosted ollama endpoint using bearer token for auth
2024-04-19 13:02:13 -07:00
Krrish Dholakia
41c02028a7 refactor(main.py): trigger new build 2024-04-18 22:17:19 -07:00
Ishaan Jaff
1ba216627a fix - pass kwargs to exception_type 2024-04-18 12:58:30 -07:00
Krrish Dholakia
388ecadd5d refactor(main.py): trigger new build 2024-04-18 07:34:09 -07:00
Krrish Dholakia
6d508468ef fix(vertex_ai.py): accept credentials as a json string 2024-04-16 17:34:25 -07:00
Krrish Dholakia
88c8ef6aa0 refactor(main.py): trigger new build 2024-04-15 18:36:51 -07:00
Krrish Dholakia
8c3c45fbb5 fix(vertex_ai_anthropic.py): set vertex_credentials for vertex ai anthropic calls
allows setting vertex credentials as a json string for vertex ai anthropic calls
2024-04-15 14:16:28 -07:00
Krrish Dholakia
3d645f95a5 fix(main.py): accept vertex service account credentials as json string
allows us to dynamically set vertex ai credentials
2024-04-15 13:28:59 -07:00
Krrish Dholakia
1cd0551a1e fix(anthropic_text.py): add support for async text completion calls 2024-04-15 08:15:00 -07:00
Krrish Dholakia
9004e9d0ab refactor(main.py): trigger new build 2024-04-13 19:35:39 -07:00
Krrish Dholakia
be9e3ed8b5 fix(main.py): automatically infer mode for text completion models 2024-04-12 14:16:21 -07:00
Krrish Dholakia
26e7933bdb refactor(main.py): trigger new build
contains fixes for async batch get
2024-04-10 21:45:06 -07:00
Ishaan Jaff
5e4cb92673 Merge pull request #2923 from BerriAI/litellm_return_better_error_from_health
fix - return stack trace on failing /health checks - first 1000 chars
2024-04-10 17:48:13 -07:00
Krrish Dholakia
13bce835b9 refactor(main.py): trigger new build 2024-04-09 21:15:33 -07:00
Krrish Dholakia
70fd803a6e fix(main.py): handle translating text completion openai to chat completion for async requests
also adds testing for this, to prevent future regressions
2024-04-09 16:47:49 -07:00
Ishaan Jaff
4bb6a9bc65 fix - return stack trace on failing /health checks 2024-04-09 15:12:09 -07:00
Krrish Dholakia
e88c4f6581 fix(main.py): trigger new build 2024-04-08 14:33:38 -07:00
Krrish Dholakia
ffa2efb6df refactor(main.py): trigger new build 2024-04-08 12:19:11 -07:00
Ishaan Jaff
d1d3d932ca Merge pull request #2879 from BerriAI/litellm_async_anthropic_api
[Feat] Async Anthropic API 97.5% lower median latency
2024-04-07 09:56:52 -07:00
Krrish Dholakia
a718041248 refactor(main.py): trigger new build 2024-04-06 19:42:54 -07:00
Krrish Dholakia
a137a3973e refactor(main.py): trigger new build 2024-04-06 18:50:38 -07:00
Ishaan Jaff
1dc5b01e01 fix - use anthropic class for clients 2024-04-06 18:19:28 -07:00
Ishaan Jaff
2f1b55dd70 async streaming for anthropic 2024-04-06 17:34:23 -07:00
Ishaan Jaff
32c3aab34e feat - make anthropic async 2024-04-06 15:50:13 -07:00
Ishaan Jaff
d23d6068ff Merge pull request #2877 from BerriAI/litellm_fix_text_completion
[Feat] Text-Completion-OpenAI - Re-use OpenAI Client
2024-04-06 12:15:52 -07:00
Ishaan Jaff
d8d10c313a feat - re-use openai client for text completion 2024-04-06 11:25:33 -07:00
Krrish Dholakia
650f8853af refactor(main.py): trigger new build 2024-04-06 09:06:53 -07:00
Krrish Dholakia
a1137d26d4 refactor(main.py): trigger new build 2024-04-05 13:42:56 -07:00
Krrish Dholakia
bf8097e961 fix(router.py): fix client init for streaming timeouts 2024-04-05 12:30:15 -07:00
Krish Dholakia
5ea9946925 Merge pull request #2665 from BerriAI/litellm_claude_vertex_ai
[WIP] feat(vertex_ai_anthropic.py): Add support for claude 3 on vertex ai
2024-04-05 07:06:04 -07:00
Krrish Dholakia
a8b4bc747e fix(main.py): trigger new build 2024-04-04 13:36:21 -07:00
Krrish Dholakia
053a06df97 build(config.yml): add missing dep 2024-04-04 10:59:26 -07:00
Krrish Dholakia
f863f30c4f refactor(main.py): trigger new build 2024-04-04 10:54:13 -07:00
Krrish Dholakia
1f56ea6015 build: trigger new build 2024-04-04 10:23:13 -07:00
Krrish Dholakia
b73cd05674 build: trigger new build 2024-04-04 10:20:25 -07:00
Krrish Dholakia
6f64eccafe refactor(main.py): trigger new build 2024-04-03 08:01:26 -07:00
Krrish Dholakia
bc1ee5c838 fix(main.py): support async calls from azure_text 2024-04-03 07:59:32 -07:00
Krrish Dholakia
bd7040969b feat(vertex_ai_anthropic.py): add claude 3 on vertex ai support - working .completions call
.completions() call works
2024-04-02 22:07:39 -07:00