Ishaan Jaff
|
c27640e6e4
|
add /rerank test
|
2024-08-27 17:50:37 -07:00 |
|
Ishaan Jaff
|
5f2f7aa754
|
feat - add rerank on proxy
|
2024-08-27 17:36:40 -07:00 |
|
Ishaan Jaff
|
06529f19df
|
Merge pull request #5392 from BerriAI/litellm_add_native_cohere_rerank
[Feat] Add cohere rerank and together ai rerank
|
2024-08-27 17:29:37 -07:00 |
|
Ishaan Jaff
|
fb5be57bb8
|
v0 add rerank on litellm proxy
|
2024-08-27 17:28:39 -07:00 |
|
Ishaan Jaff
|
37ed201c50
|
fix install on 3.8
|
2024-08-27 17:09:16 -07:00 |
|
Krrish Dholakia
|
5b06ea136c
|
fix(openai.py): fix error re-raising
|
2024-08-27 17:06:25 -07:00 |
|
Ishaan Jaff
|
b3892b871d
|
add async support for rerank
|
2024-08-27 17:02:48 -07:00 |
|
Ishaan Jaff
|
f33dfe0b95
|
add rerank params
|
2024-08-27 16:45:39 -07:00 |
|
Ishaan Jaff
|
255ad865cd
|
add rerank api tests
|
2024-08-27 16:27:55 -07:00 |
|
Ishaan Jaff
|
dc42ad0021
|
add tg ai rerank support
|
2024-08-27 16:25:54 -07:00 |
|
Krrish Dholakia
|
bd3057e495
|
test(test_proxy_exception_mapping): loosen assert
|
2024-08-27 16:14:30 -07:00 |
|
Krrish Dholakia
|
6431af0678
|
fix(bedrock_httpx.py): support 'Auth' header as extra_header
Fixes https://github.com/BerriAI/litellm/issues/5389#issuecomment-2313677977
|
2024-08-27 16:08:54 -07:00 |
|
Ishaan Jaff
|
b8bc185bd5
|
add main cohere ai rerank handler + test
|
2024-08-27 15:57:36 -07:00 |
|
Ishaan Jaff
|
3a82334762
|
add basic cohere rerank
|
2024-08-27 15:55:43 -07:00 |
|
Krrish Dholakia
|
1b2f73c220
|
fix(azure_text.py): fix streaming parsing
|
2024-08-27 15:52:55 -07:00 |
|
Ishaan Jaff
|
6ab8cbc105
|
Merge pull request #5391 from BerriAI/litellm_add_ai21_support
[Feat] Add Vertex AI21 support
|
2024-08-27 15:06:26 -07:00 |
|
Krrish Dholakia
|
d2239e3c52
|
test(test_router_debug_logs.py): simplify test
|
2024-08-27 14:47:43 -07:00 |
|
Ishaan Jaff
|
33a3a01949
|
add mock test for ai21
|
2024-08-27 14:42:13 -07:00 |
|
Krrish Dholakia
|
deff357c92
|
fix(router.py): fix aembedding type hints
Fixes https://github.com/BerriAI/litellm/issues/5383
|
2024-08-27 14:29:18 -07:00 |
|
Krrish Dholakia
|
b91e5d3887
|
fix(openai.py): fix post call error logging for aembedding calls
|
2024-08-27 14:26:06 -07:00 |
|
Ishaan Jaff
|
3dee98dc47
|
add jamba-1.5
|
2024-08-27 14:20:06 -07:00 |
|
Krrish Dholakia
|
d2464052ec
|
docs(bedrock.md): add doc on passing extra headers + custom api endpoints to bedrock
Closes https://github.com/BerriAI/litellm/issues/5389
|
2024-08-27 14:18:41 -07:00 |
|
Ishaan Jaff
|
3c8a19e343
|
use cost per token for jamba
|
2024-08-27 14:18:04 -07:00 |
|
Ishaan Jaff
|
5bfacce548
|
add doc on using jamba-1.5-large
|
2024-08-27 14:12:38 -07:00 |
|
Krrish Dholakia
|
89ac324676
|
docs(anthropic.md): cleanup docs
|
2024-08-27 14:05:39 -07:00 |
|
Ishaan Jaff
|
e9808346a7
|
add jamba-1.5-mini models
|
2024-08-27 14:02:02 -07:00 |
|
Krrish Dholakia
|
d43441ae5d
|
fix(anthropic.py): support setting cache control headers, automatically
Don't require user to manually pass in 'extra_headers' for anthropic cache control usage
|
2024-08-27 13:57:03 -07:00 |
|
Krrish Dholakia
|
63adb3f940
|
fix(azure.py): fix raw response dump
|
2024-08-27 13:44:38 -07:00 |
|
Ishaan Jaff
|
093f7f15fd
|
add test for test_partner_models_httpx_ai21
|
2024-08-27 13:36:47 -07:00 |
|
Ishaan Jaff
|
11c175a215
|
refactor partner models to include ai21
|
2024-08-27 13:35:22 -07:00 |
|
Krrish Dholakia
|
18731cf42b
|
fix: fix linting errors
|
2024-08-27 12:14:23 -07:00 |
|
Krish Dholakia
|
415abc86c6
|
Merge pull request #5358 from BerriAI/litellm_fix_retry_after
fix retry after - cooldown individual models based on their specific 'retry-after' header
|
2024-08-27 11:50:14 -07:00 |
|
Krrish Dholakia
|
18b67a455e
|
test: fix test
|
2024-08-27 10:46:57 -07:00 |
|
Ishaan Jaff
|
0fca960221
|
docs langfuse link
|
2024-08-27 08:14:03 -07:00 |
|
Krrish Dholakia
|
bf81b484c6
|
fix(sagemaker.py): fix streaming logic
|
2024-08-27 08:10:47 -07:00 |
|
Krrish Dholakia
|
c7bbfef846
|
docs(gemini.md): add context caching on google ai studio to docs
|
2024-08-27 08:02:52 -07:00 |
|
Krrish Dholakia
|
cd7dd2a511
|
fix(cooldown_cache.py): fix linting errors
|
2024-08-27 07:40:28 -07:00 |
|
Krrish Dholakia
|
e542475f39
|
build(config.yml): bump anyio version
|
2024-08-27 07:37:06 -07:00 |
|
Krrish Dholakia
|
6bfcb431cd
|
fix(asyncify.py): fix linting errors
|
2024-08-27 07:37:06 -07:00 |
|
Krrish Dholakia
|
8d8c961b50
|
fix(asyncify.py): fix linting errors
|
2024-08-27 07:37:06 -07:00 |
|
Krrish Dholakia
|
2cf149fbad
|
perf(sagemaker.py): asyncify hf prompt template check
leads to 189% improvement in RPS @ 100 users
|
2024-08-27 07:37:06 -07:00 |
|
Krrish Dholakia
|
b0f01e5b95
|
test: fix test
|
2024-08-27 07:14:14 -07:00 |
|
Krish Dholakia
|
ec2357f1d2
|
Merge pull request #5384 from miraclebakelaser/fix/cohere-tool-calls-content-field
fix(factory.py): handle missing 'content' in cohere assistant messages
|
2024-08-27 07:13:28 -07:00 |
|
miraclebakelaser
|
97f714d2b0
|
fix(factory.py): handle missing 'content' in cohere assistant messages
Update cohere_messages_pt_v2 function to check for 'content' existence
|
2024-08-27 19:38:37 +09:00 |
|
Krrish Dholakia
|
3ec8a0d25d
|
fix: fix imports
|
2024-08-26 22:24:30 -07:00 |
|
Krish Dholakia
|
81e62aec16
|
Merge pull request #5381 from BerriAI/litellm_gemini_context_caching
gemini context caching (openai format) support
|
2024-08-26 22:22:23 -07:00 |
|
Krish Dholakia
|
08bd4788dc
|
Merge branch 'main' into litellm_gemini_context_caching
|
2024-08-26 22:22:17 -07:00 |
|
Krrish Dholakia
|
5aad9d2db7
|
fix: fix imports
|
2024-08-26 22:19:01 -07:00 |
|
Krrish Dholakia
|
4868a6cf55
|
fix: fix unbound var
|
2024-08-26 22:19:01 -07:00 |
|
Krrish Dholakia
|
0eea01dae9
|
feat(vertex_ai_context_caching.py): check gemini cache, if key already exists
|
2024-08-26 22:19:01 -07:00 |
|