Implementation of `litellm.batch_completion`, `litellm.batch_completion_models`, and `litellm.batch_completion_models_all_responses`.

Doc: https://docs.litellm.ai/docs/completion/batching

The LiteLLM Python SDK allows you to:

  1. `litellm.batch_completion`: send a batch of `litellm.completion` requests to a single model.
  2. `litellm.batch_completion_models`: send a request to multiple language models concurrently and return the response from the first model that responds.
  3. `litellm.batch_completion_models_all_responses`: send a request to multiple language models concurrently and return a list of responses from all models that respond.
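A minimal usage sketch, assuming `litellm` is installed and an `OPENAI_API_KEY` is set in the environment (the model names here are illustrative, not prescribed by this module):

```python
import os

# batch_completion takes a list of message lists -- one conversation per element.
prompts = ["What is the capital of France?", "Name one prime number."]
messages = [[{"role": "user", "content": p}] for p in prompts]

# Guarded so the sketch only makes network calls when a key is available.
if os.environ.get("OPENAI_API_KEY"):
    import litellm

    # 1. Batch of requests against a single model.
    responses = litellm.batch_completion(model="gpt-3.5-turbo", messages=messages)
    for r in responses:
        print(r.choices[0].message.content)

    # 2. Race multiple models; returns the first response to arrive.
    fastest = litellm.batch_completion_models(
        models=["gpt-3.5-turbo", "claude-3-haiku-20240307"],
        messages=messages[0],
    )

    # 3. Fan out to multiple models; returns a list of all responses.
    all_responses = litellm.batch_completion_models_all_responses(
        models=["gpt-3.5-turbo", "claude-3-haiku-20240307"],
        messages=messages[0],
    )
```

See the doc link above for the full parameter list; any additional `litellm.completion` kwargs (e.g. `temperature`) are passed through to each underlying request.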