litellm

Author	SHA1	Message	Date
Ishaan Jaff	c27640e6e4	add /rerank test	2024-08-27 17:50:37 -07:00
Ishaan Jaff	5f2f7aa754	feat - add rerank on proxy	2024-08-27 17:36:40 -07:00
Ishaan Jaff	06529f19df	Merge pull request #5392 from BerriAI/litellm_add_native_cohere_rerank [Feat] Add cohere rerank and together ai rerank	2024-08-27 17:29:37 -07:00
Ishaan Jaff	fb5be57bb8	v0 add rerank on litellm proxy	2024-08-27 17:28:39 -07:00
Ishaan Jaff	37ed201c50	fix install on 3.8	2024-08-27 17:09:16 -07:00
Krrish Dholakia	5b06ea136c	fix(openai.py): fix error re-raising	2024-08-27 17:06:25 -07:00
Ishaan Jaff	b3892b871d	add async support for rerank	2024-08-27 17:02:48 -07:00
Ishaan Jaff	f33dfe0b95	add rerank params	2024-08-27 16:45:39 -07:00
Ishaan Jaff	255ad865cd	add rerank api tests	2024-08-27 16:27:55 -07:00
Ishaan Jaff	dc42ad0021	add tg ai rerank support	2024-08-27 16:25:54 -07:00
Krrish Dholakia	bd3057e495	test(test_proxy_exception_mapping): loosen assert	2024-08-27 16:14:30 -07:00
Krrish Dholakia	6431af0678	fix(bedrock_httpx.py): support 'Auth' header as extra_header Fixes https://github.com/BerriAI/litellm/issues/5389#issuecomment-2313677977	2024-08-27 16:08:54 -07:00
Ishaan Jaff	b8bc185bd5	add main cohere ai rerank handler + test	2024-08-27 15:57:36 -07:00
Ishaan Jaff	3a82334762	add basic cohere rerank	2024-08-27 15:55:43 -07:00
Krrish Dholakia	1b2f73c220	fix(azure_text.py): fix streaming parsing	2024-08-27 15:52:55 -07:00
Ishaan Jaff	6ab8cbc105	Merge pull request #5391 from BerriAI/litellm_add_ai21_support [Feat] Add Vertex AI21 support	2024-08-27 15:06:26 -07:00
Krrish Dholakia	d2239e3c52	test(test_router_debug_logs.py): simplify test	2024-08-27 14:47:43 -07:00
Ishaan Jaff	33a3a01949	add mock test for ai21	2024-08-27 14:42:13 -07:00
Krrish Dholakia	deff357c92	fix(router.py): fix aembedding type hints Fixes https://github.com/BerriAI/litellm/issues/5383	2024-08-27 14:29:18 -07:00
Krrish Dholakia	b91e5d3887	fix(openai.py): fix post call error logging for aembedding calls	2024-08-27 14:26:06 -07:00
Ishaan Jaff	3dee98dc47	add jamba-1.5	2024-08-27 14:20:06 -07:00
Krrish Dholakia	d2464052ec	docs(bedrock.md): add doc on passing extra headers + custom api endpoints to bedrock Closes https://github.com/BerriAI/litellm/issues/5389	2024-08-27 14:18:41 -07:00
Ishaan Jaff	3c8a19e343	use cost per token for jamba	2024-08-27 14:18:04 -07:00
Ishaan Jaff	5bfacce548	add doc on using jamba-1.5-large	2024-08-27 14:12:38 -07:00
Krrish Dholakia	89ac324676	docs(anthropic.md): cleanup docs	2024-08-27 14:05:39 -07:00
Ishaan Jaff	e9808346a7	add jamba-1.5-mini models	2024-08-27 14:02:02 -07:00
Krrish Dholakia	d43441ae5d	fix(anthropic.py): support setting cache control headers, automatically Don't require user to manually pass in 'extra_headers' for anthropic cache control usage	2024-08-27 13:57:03 -07:00
Krrish Dholakia	63adb3f940	fix(azure.py): fix raw response dump	2024-08-27 13:44:38 -07:00
Ishaan Jaff	093f7f15fd	add test for test_partner_models_httpx_ai21	2024-08-27 13:36:47 -07:00
Ishaan Jaff	11c175a215	refactor partner models to include ai21	2024-08-27 13:35:22 -07:00
Krrish Dholakia	18731cf42b	fix: fix linting errors	2024-08-27 12:14:23 -07:00
Krish Dholakia	415abc86c6	Merge pull request #5358 from BerriAI/litellm_fix_retry_after fix retry after - cooldown individual models based on their specific 'retry-after' header	2024-08-27 11:50:14 -07:00
Krrish Dholakia	18b67a455e	test: fix test	2024-08-27 10:46:57 -07:00
Ishaan Jaff	0fca960221	docs langfuse link	2024-08-27 08:14:03 -07:00
Krrish Dholakia	bf81b484c6	fix(sagemaker.py): fix streaming logic	2024-08-27 08:10:47 -07:00
Krrish Dholakia	c7bbfef846	docs(gemini.md): add context caching on google ai studio to docs	2024-08-27 08:02:52 -07:00
Krrish Dholakia	cd7dd2a511	fix(cooldown_cache.py): fix linting errors	2024-08-27 07:40:28 -07:00
Krrish Dholakia	e542475f39	build(config.yml): bump anyio version	2024-08-27 07:37:06 -07:00
Krrish Dholakia	6bfcb431cd	fix(asyncify.py): fix linting errors	2024-08-27 07:37:06 -07:00
Krrish Dholakia	8d8c961b50	fix(asyncify.py): fix linting errors	2024-08-27 07:37:06 -07:00
Krrish Dholakia	2cf149fbad	perf(sagemaker.py): asyncify hf prompt template check leads to 189% improvement in RPS @ 100 users	2024-08-27 07:37:06 -07:00
Krrish Dholakia	b0f01e5b95	test: fix test	2024-08-27 07:14:14 -07:00
Krish Dholakia	ec2357f1d2	Merge pull request #5384 from miraclebakelaser/fix/cohere-tool-calls-content-field fix(factory.py): handle missing 'content' in cohere assistant messages	2024-08-27 07:13:28 -07:00
miraclebakelaser	97f714d2b0	fix(factory.py): handle missing 'content' in cohere assistant messages Update cohere_messages_pt_v2 function to check for 'content' existence	2024-08-27 19:38:37 +09:00
Krrish Dholakia	3ec8a0d25d	fix: fix imports	2024-08-26 22:24:30 -07:00
Krish Dholakia	81e62aec16	Merge pull request #5381 from BerriAI/litellm_gemini_context_caching gemini context caching (openai format) support	2024-08-26 22:22:23 -07:00
Krish Dholakia	08bd4788dc	Merge branch 'main' into litellm_gemini_context_caching	2024-08-26 22:22:17 -07:00
Krrish Dholakia	5aad9d2db7	fix: fix imports	2024-08-26 22:19:01 -07:00
Krrish Dholakia	4868a6cf55	fix: fix unbound var	2024-08-26 22:19:01 -07:00
Krrish Dholakia	0eea01dae9	feat(vertex_ai_context_caching.py): check gemini cache, if key already exists	2024-08-26 22:19:01 -07:00

17027 commits