Mirror of https://github.com/BerriAI/litellm.git (synced 2025-04-26 11:14:04 +00:00)
* fix(langfuse.py): prevent double logging requester metadata
  Fixes https://github.com/BerriAI/litellm/issues/5935
* build(model_prices_and_context_window.json): add mistral pixtral cost tracking
  Closes https://github.com/BerriAI/litellm/issues/5837
* handle streaming for azure ai studio error
* [Perf Proxy] parallel request limiter - use one cache update call (#5932)
* fix parallel request limiter - use one cache update call
* ci/cd run again
* run ci/cd again
* use docker username password
* fix config.yml
* fix config
* fix config
* fix config.yml
* ci/cd run again
* use correct typing for batch set cache
* fix async_set_cache_pipeline
* fix only check user id tpm / rpm limits when limits set
* fix test_openai_azure_embedding_with_oidc_and_cf
* fix(groq/chat/transformation.py): Fixes https://github.com/BerriAI/litellm/issues/5839
* feat(anthropic/chat.py): return 'retry-after' headers from anthropic
  Fixes https://github.com/BerriAI/litellm/issues/4387
* feat: raise validation error if message has tool calls without passing `tools` param for anthropic/bedrock
  Closes https://github.com/BerriAI/litellm/issues/5747
* [Feature]#5940, add max_workers parameter for the batch_completion (#5947)
* handle streaming for azure ai studio error
* bump: version 1.48.2 → 1.48.3
* docs(data_security.md): add legal/compliance faq's
  Make it easier for companies to use litellm
* docs: resolve imports
* [Feature]#5940, add max_workers parameter for the batch_completion method

---------

Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>
Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com>
Co-authored-by: josearangos <josearangos@Joses-MacBook-Pro.local>

* fix(converse_transformation.py): fix default message value
* fix(utils.py): fix get_model_info to handle finetuned models
  Fixes issue for standard logging payloads, where model_map_value was null for finetuned openai models
* fix(litellm_pre_call_utils.py): add debug statement for data sent after updating with team/key callbacks
* fix: fix linting errors
* fix(anthropic/chat/handler.py): fix cache creation input tokens
* fix(exception_mapping_utils.py): fix missing imports
* fix(anthropic/chat/handler.py): fix usage block translation
* test: fix test
* test: fix tests
* style(types/utils.py): trigger new build
* test: fix test

---------

Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>
Co-authored-by: Jose Alberto Arango Sanchez <jose.arangos@udea.edu.co>
Co-authored-by: josearangos <josearangos@Joses-MacBook-Pro.local>

AI21
anthropic
azure_ai
AzureOpenAI
bedrock
cerebras
cohere
custom_httpx
databricks
files_apis
fine_tuning_apis
fireworks_ai
groq
huggingface_llms_metadata
mistral
OpenAI
prompt_templates
sagemaker
sambanova
together_ai
tokenizers
vertex_ai_and_google_ai_studio
__init__.py
aleph_alpha.py
azure_text.py
base.py
base_aws_llm.py
baseten.py
clarifai.py
cloudflare.py
custom_llm.py
gemini.py
huggingface_restapi.py
maritalk.py
nlp_cloud.py
nvidia_nim.py
ollama.py
ollama_chat.py
oobabooga.py
openrouter.py
palm.py
petals.py
predibase.py
README.md
replicate.py
text_completion_codestral.py
triton.py
vllm.py
volcengine.py
watsonx.py

File Structure
August 27th, 2024
To make it easy to see how calls are transformed for each model/provider, we are moving all supported litellm providers to a folder structure in which each folder name is the supported litellm provider name.

Each folder will contain a `*_transformation.py` file, which holds all the request/response transformation logic, making it easy to see how calls are modified.

E.g. `cohere/`, `bedrock/`.
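
To check how far a checkout has moved toward this layout, the sketch below walks a directory of provider folders and lists the `*_transformation.py` files found under each one. It is an illustration only and not part of litellm: the function name `find_transformation_files` is hypothetical, the glob pattern is taken literally from the description above, and the recursive search assumes the files may sit in subfolders of a provider folder.

```python
from pathlib import Path
from typing import Dict, List


def find_transformation_files(llms_dir: Path) -> Dict[str, List[str]]:
    """Map each provider folder name to the *_transformation.py files under it.

    Hypothetical helper: top-level modules (e.g. base.py) are ignored, and
    paths are reported relative to the provider folder, with "/" separators.
    """
    result: Dict[str, List[str]] = {}
    # Only directories count as provider folders; loose .py files are skipped.
    for provider_dir in sorted(p for p in llms_dir.iterdir() if p.is_dir()):
        # rglob searches the provider folder and all of its subfolders.
        result[provider_dir.name] = sorted(
            f.relative_to(provider_dir).as_posix()
            for f in provider_dir.rglob("*_transformation.py")
        )
    return result
```

Calling `find_transformation_files(Path("litellm/llms"))` (the path is an assumption about where the folders live in a checkout) returns a dictionary in which a folder with an empty list has not been migrated to the pattern, or uses a different file name.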