Ishaan Jaff
|
5e4cb92673
|
Merge pull request #2923 from BerriAI/litellm_return_better_error_from_health
fix - return stack trace on failing /health checks - first 1000 chars
|
2024-04-10 17:48:13 -07:00 |
|
Krrish Dholakia
|
13bce835b9
|
refactor(main.py): trigger new build
|
2024-04-09 21:15:33 -07:00 |
|
Krrish Dholakia
|
70fd803a6e
|
fix(main.py): handle translating text completion openai to chat completion for async requests
also adds testing for this, to prevent future regressions
|
2024-04-09 16:47:49 -07:00 |
|
Ishaan Jaff
|
4bb6a9bc65
|
fix - return stack trace on failing /health checks
|
2024-04-09 15:12:09 -07:00 |
|
Krrish Dholakia
|
e88c4f6581
|
fix(main.py): trigger new build
|
2024-04-08 14:33:38 -07:00 |
|
Krrish Dholakia
|
ffa2efb6df
|
refactor(main.py): trigger new build
|
2024-04-08 12:19:11 -07:00 |
|
Ishaan Jaff
|
d1d3d932ca
|
Merge pull request #2879 from BerriAI/litellm_async_anthropic_api
[Feat] Async Anthropic API 97.5% lower median latency
|
2024-04-07 09:56:52 -07:00 |
|
Krrish Dholakia
|
a718041248
|
refactor(main.py): trigger new build
|
2024-04-06 19:42:54 -07:00 |
|
Krrish Dholakia
|
a137a3973e
|
refactor(main.py): trigger new build
|
2024-04-06 18:50:38 -07:00 |
|
Ishaan Jaff
|
1dc5b01e01
|
fix - use anthropic class for clients
|
2024-04-06 18:19:28 -07:00 |
|
Ishaan Jaff
|
2f1b55dd70
|
async streaming for anthropic
|
2024-04-06 17:34:23 -07:00 |
|
Ishaan Jaff
|
32c3aab34e
|
feat - make anthropic async
|
2024-04-06 15:50:13 -07:00 |
|
Ishaan Jaff
|
d23d6068ff
|
Merge pull request #2877 from BerriAI/litellm_fix_text_completion
[Feat] Text-Completion-OpenAI - Re-use OpenAI Client
|
2024-04-06 12:15:52 -07:00 |
|
Ishaan Jaff
|
d8d10c313a
|
feat - re-use openai client for text completion
|
2024-04-06 11:25:33 -07:00 |
|
Krrish Dholakia
|
650f8853af
|
refactor(main.py): trigger new build
|
2024-04-06 09:06:53 -07:00 |
|
Krrish Dholakia
|
a1137d26d4
|
refactor(main.py): trigger new build
|
2024-04-05 13:42:56 -07:00 |
|
Krrish Dholakia
|
bf8097e961
|
fix(router.py): fix client init for streaming timeouts
|
2024-04-05 12:30:15 -07:00 |
|
Krish Dholakia
|
5ea9946925
|
Merge pull request #2665 from BerriAI/litellm_claude_vertex_ai
[WIP] feat(vertex_ai_anthropic.py): Add support for claude 3 on vertex ai
|
2024-04-05 07:06:04 -07:00 |
|
Krrish Dholakia
|
a8b4bc747e
|
fix(main.py): trigger new build
|
2024-04-04 13:36:21 -07:00 |
|
Krrish Dholakia
|
053a06df97
|
build(config.yml): add missing dep
|
2024-04-04 10:59:26 -07:00 |
|
Krrish Dholakia
|
f863f30c4f
|
refactor(main.py): trigger new build
|
2024-04-04 10:54:13 -07:00 |
|
Krrish Dholakia
|
1f56ea6015
|
build: trigger new build
|
2024-04-04 10:23:13 -07:00 |
|
Krrish Dholakia
|
b73cd05674
|
build: trigger new build
|
2024-04-04 10:20:25 -07:00 |
|
Krrish Dholakia
|
6f64eccafe
|
refactor(main.py): trigger new build
|
2024-04-03 08:01:26 -07:00 |
|
Krrish Dholakia
|
bc1ee5c838
|
fix(main.py): support async calls from azure_text
|
2024-04-03 07:59:32 -07:00 |
|
Krrish Dholakia
|
bd7040969b
|
feat(vertex_ai_anthropic.py): add claude 3 on vertex ai support - working .completions call
.completions() call works
|
2024-04-02 22:07:39 -07:00 |
|
Krrish Dholakia
|
ed46af19ec
|
fix(openai.py): return logprobs for text completion calls
|
2024-04-02 14:05:56 -07:00 |
|
Krrish Dholakia
|
fba2ae61d3
|
fix(main.py): fix elif block
|
2024-04-02 09:47:49 -07:00 |
|
Krish Dholakia
|
221cac0ac2
|
Merge pull request #2790 from phact/patch-2
Fix max_tokens type in main.py
|
2024-04-02 09:02:34 -07:00 |
|
Krrish Dholakia
|
812dc7e3cc
|
refactor(main.py): trigger new build
|
2024-04-02 08:51:18 -07:00 |
|
Krrish Dholakia
|
67f62aa53e
|
fix(main.py): support text completion input being a list of strings
addresses - https://github.com/BerriAI/litellm/issues/2792, https://github.com/BerriAI/litellm/issues/2777
|
2024-04-02 08:50:16 -07:00 |
|
Sebastián Estévez
|
5c4823923e
|
Fix max_tokens type in main.py
|
2024-04-02 00:28:08 -04:00 |
|
Krrish Dholakia
|
5546f9f10a
|
fix(main.py): support max retries for transcription calls
|
2024-04-01 18:37:53 -07:00 |
|
Krrish Dholakia
|
5f3b7ba523
|
refactor(main.py): trigger new build
|
2024-04-01 18:03:46 -07:00 |
|
Krrish Dholakia
|
82052689e7
|
refactor(main.py): trigger new build
|
2024-03-30 21:41:14 -07:00 |
|
Krrish Dholakia
|
5c199e4e4e
|
fix(main.py): fix translation to text_completions format for async text completion calls
|
2024-03-30 09:02:51 -07:00 |
|
Krrish Dholakia
|
fb72b79d2e
|
refactor(main.py): trigger new build
|
2024-03-29 09:24:47 -07:00 |
|
Krrish Dholakia
|
62ac3e1de4
|
fix(sagemaker.py): support 'model_id' param for sagemaker
allow passing inference component param to sagemaker in the same format as we handle this for bedrock
|
2024-03-29 08:43:17 -07:00 |
|
Krish Dholakia
|
8f0b4457fe
|
Merge pull request #2720 from onukura/ollama-batch-embedding
Batch embedding for Ollama
|
2024-03-28 14:58:55 -07:00 |
|
Krrish Dholakia
|
4a2abfd659
|
refactor(main.py): trigger new build
|
2024-03-28 14:52:47 -07:00 |
|
onukura
|
1bd60287ba
|
Add a feature to ollama aembedding to accept batch input
|
2024-03-27 21:39:19 +00:00 |
|
Krrish Dholakia
|
71cb12b0f8
|
refactor(main.py): trigger new build
|
2024-03-26 21:18:51 -07:00 |
|
Krish Dholakia
|
4d53b484cb
|
Merge pull request #2675 from onukura/ollama-embedding
Fix Ollama embedding
|
2024-03-26 16:08:28 -07:00 |
|
onukura
|
3423038601
|
Fix ollama api_base to enable remote url
|
2024-03-25 16:26:40 +00:00 |
|
Krrish Dholakia
|
8821b3d243
|
feat(main.py): support router.chat.completions.create
allows using router with instructor
https://github.com/BerriAI/litellm/issues/2673
|
2024-03-25 08:26:28 -07:00 |
|
Krrish Dholakia
|
292cdd81e4
|
fix(router.py): fix pre call check logic
|
2024-03-23 18:56:08 -07:00 |
|
Krrish Dholakia
|
e8fbe9a9a5
|
fix(bedrock.py): support claude 3 function calling when stream=true
https://github.com/BerriAI/litellm/issues/2615
|
2024-03-21 18:39:03 -07:00 |
|
Krrish Dholakia
|
a626f4abfb
|
refactor(main.py): trigger new build
|
2024-03-21 10:56:44 -07:00 |
|
Lucca Zenobio
|
274d936195
|
extra headers
|
2024-03-21 10:43:27 -03:00 |
|
Krrish Dholakia
|
c9f20c8142
|
refactor(main.py): trigger new build
|
2024-03-19 21:05:53 -07:00 |
|