Commit graph

17027 commits

Author SHA1 Message Date
Ishaan Jaff
c27640e6e4 add /rerank test 2024-08-27 17:50:37 -07:00
Ishaan Jaff
5f2f7aa754 feat - add rerank on proxy 2024-08-27 17:36:40 -07:00
Ishaan Jaff
06529f19df
Merge pull request #5392 from BerriAI/litellm_add_native_cohere_rerank
[Feat] Add cohere rerank and together ai rerank
2024-08-27 17:29:37 -07:00
Ishaan Jaff
fb5be57bb8 v0 add rerank on litellm proxy 2024-08-27 17:28:39 -07:00
Ishaan Jaff
37ed201c50 fix install on 3.8 2024-08-27 17:09:16 -07:00
Krrish Dholakia
5b06ea136c fix(openai.py): fix error re-raising 2024-08-27 17:06:25 -07:00
Ishaan Jaff
b3892b871d add async support for rerank 2024-08-27 17:02:48 -07:00
Ishaan Jaff
f33dfe0b95 add rerank params 2024-08-27 16:45:39 -07:00
Ishaan Jaff
255ad865cd add rerank api tests 2024-08-27 16:27:55 -07:00
Ishaan Jaff
dc42ad0021 add tg ai rerank support 2024-08-27 16:25:54 -07:00
Krrish Dholakia
bd3057e495 test(test_proxy_exception_mapping): loosen assert 2024-08-27 16:14:30 -07:00
Krrish Dholakia
6431af0678 fix(bedrock_httpx.py): support 'Auth' header as extra_header
Fixes https://github.com/BerriAI/litellm/issues/5389#issuecomment-2313677977
2024-08-27 16:08:54 -07:00
Ishaan Jaff
b8bc185bd5 add main cohere ai rerank handler + test 2024-08-27 15:57:36 -07:00
Ishaan Jaff
3a82334762 add basic cohere rerank 2024-08-27 15:55:43 -07:00
Krrish Dholakia
1b2f73c220 fix(azure_text.py): fix streaming parsing 2024-08-27 15:52:55 -07:00
Ishaan Jaff
6ab8cbc105
Merge pull request #5391 from BerriAI/litellm_add_ai21_support
[Feat] Add Vertex  AI21 support
2024-08-27 15:06:26 -07:00
Krrish Dholakia
d2239e3c52 test(test_router_debug_logs.py): simplify test 2024-08-27 14:47:43 -07:00
Ishaan Jaff
33a3a01949 add mock test for ai21 2024-08-27 14:42:13 -07:00
Krrish Dholakia
deff357c92 fix(router.py): fix aembedding type hints
Fixes https://github.com/BerriAI/litellm/issues/5383
2024-08-27 14:29:18 -07:00
Krrish Dholakia
b91e5d3887 fix(openai.py): fix post call error logging for aembedding calls 2024-08-27 14:26:06 -07:00
Ishaan Jaff
3dee98dc47 add jamba-1.5 2024-08-27 14:20:06 -07:00
Krrish Dholakia
d2464052ec docs(bedrock.md): add doc on passing extra headers + custom api endpoints to bedrock
Closes https://github.com/BerriAI/litellm/issues/5389
2024-08-27 14:18:41 -07:00
Ishaan Jaff
3c8a19e343 use cost per token for jamba 2024-08-27 14:18:04 -07:00
Ishaan Jaff
5bfacce548 add doc on using jamba-1.5-large 2024-08-27 14:12:38 -07:00
Krrish Dholakia
89ac324676 docs(anthropic.md): cleanup docs 2024-08-27 14:05:39 -07:00
Ishaan Jaff
e9808346a7 add jamba-1.5-mini models 2024-08-27 14:02:02 -07:00
Krrish Dholakia
d43441ae5d fix(anthropic.py): support setting cache control headers, automatically
Don't require user to manually pass in 'extra_headers' for anthropic cache control usage
2024-08-27 13:57:03 -07:00
Krrish Dholakia
63adb3f940 fix(azure.py): fix raw response dump 2024-08-27 13:44:38 -07:00
Ishaan Jaff
093f7f15fd add test for test_partner_models_httpx_ai21 2024-08-27 13:36:47 -07:00
Ishaan Jaff
11c175a215 refactor partner models to include ai21 2024-08-27 13:35:22 -07:00
Krrish Dholakia
18731cf42b fix: fix linting errors 2024-08-27 12:14:23 -07:00
Krish Dholakia
415abc86c6
Merge pull request #5358 from BerriAI/litellm_fix_retry_after
fix retry after - cooldown individual models based on their specific 'retry-after' header
2024-08-27 11:50:14 -07:00
Krrish Dholakia
18b67a455e test: fix test 2024-08-27 10:46:57 -07:00
Ishaan Jaff
0fca960221 docs langfuse link 2024-08-27 08:14:03 -07:00
Krrish Dholakia
bf81b484c6 fix(sagemaker.py): fix streaming logic 2024-08-27 08:10:47 -07:00
Krrish Dholakia
c7bbfef846 docs(gemini.md): add context caching on google ai studio to docs 2024-08-27 08:02:52 -07:00
Krrish Dholakia
cd7dd2a511 fix(cooldown_cache.py): fix linting errors 2024-08-27 07:40:28 -07:00
Krrish Dholakia
e542475f39 build(config.yml): bump anyio version 2024-08-27 07:37:06 -07:00
Krrish Dholakia
6bfcb431cd fix(asyncify.py): fix linting errors 2024-08-27 07:37:06 -07:00
Krrish Dholakia
8d8c961b50 fix(asyncify.py): fix linting errors 2024-08-27 07:37:06 -07:00
Krrish Dholakia
2cf149fbad perf(sagemaker.py): asyncify hf prompt template check
leads to 189% improvement in RPS @ 100 users
2024-08-27 07:37:06 -07:00
Krrish Dholakia
b0f01e5b95 test: fix test 2024-08-27 07:14:14 -07:00
Krish Dholakia
ec2357f1d2
Merge pull request #5384 from miraclebakelaser/fix/cohere-tool-calls-content-field
fix(factory.py): handle missing 'content' in cohere assistant messages
2024-08-27 07:13:28 -07:00
miraclebakelaser
97f714d2b0 fix(factory.py): handle missing 'content' in cohere assistant messages
Update cohere_messages_pt_v2 function to check for 'content' existence
2024-08-27 19:38:37 +09:00
Krrish Dholakia
3ec8a0d25d fix: fix imports 2024-08-26 22:24:30 -07:00
Krish Dholakia
81e62aec16
Merge pull request #5381 from BerriAI/litellm_gemini_context_caching
gemini context caching (openai format) support
2024-08-26 22:22:23 -07:00
Krish Dholakia
08bd4788dc
Merge branch 'main' into litellm_gemini_context_caching 2024-08-26 22:22:17 -07:00
Krrish Dholakia
5aad9d2db7 fix: fix imports 2024-08-26 22:19:01 -07:00
Krrish Dholakia
4868a6cf55 fix: fix unbound var 2024-08-26 22:19:01 -07:00
Krrish Dholakia
0eea01dae9 feat(vertex_ai_context_caching.py): check gemini cache, if key already exists 2024-08-26 22:19:01 -07:00