litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-27 11:43:54 +00:00

History

Krish Dholakia ec5a354eac add azure o1 pricing (#7715 ) * build(model_prices_and_context_window.json): add azure o1 pricing Closes https://github.com/BerriAI/litellm/issues/7712 * refactor: replace regex with string method for whitespace check in stop-sequences handling (#7713) * Allows overriding keep_alive time in ollama (#7079) * Allows overriding keep_alive time in ollama * Also adds to ollama_chat * Adds some info on the docs about this parameter * fix: together ai warning (#7688) Co-authored-by: Carl Senze <carl.senze@aleph-alpha.com> * fix(proxy_server.py): handle config containing thread locked objects when using get_config_state * fix(proxy_server.py): add exception to debug * build(model_prices_and_context_window.json): update 'supports_vision' for azure o1 --------- Co-authored-by: Wolfram Ravenwolf <52386626+WolframRavenwolf@users.noreply.github.com> Co-authored-by: Regis David Souza Mesquita <github@rdsm.dev> Co-authored-by: Carl <45709281+capsenz@users.noreply.github.com> Co-authored-by: Carl Senze <carl.senze@aleph-alpha.com>		2025-01-12 18:15:35 -08:00
..
ai21/chat	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
aiohttp_openai/chat	(proxy - RPS) - Get 2K RPS at 4 instances, minor fix `aiohttp_openai/` (#7659 )	2025-01-09 17:24:18 -08:00
anthropic	add azure o1 pricing (#7715 )	2025-01-12 18:15:35 -08:00
azure	Litellm dev 12 30 2024 p2 (#7495 )	2025-01-01 18:57:29 -08:00
azure_ai	fix unused imports	2025-01-02 22:28:22 -08:00
base_llm	[BETA] Add OpenAI `/images/variations` + Topaz API support (#7700 )	2025-01-11 23:27:46 -08:00
bedrock	Litellm dev 12 30 2024 p2 (#7495 )	2025-01-01 18:57:29 -08:00
cerebras	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
clarifai	Litellm dev 12 30 2024 p2 (#7495 )	2025-01-01 18:57:29 -08:00
cloudflare/chat	Litellm dev 12 30 2024 p2 (#7495 )	2025-01-01 18:57:29 -08:00
codestral/completion	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
cohere	Litellm dev 12 30 2024 p2 (#7495 )	2025-01-01 18:57:29 -08:00
custom_httpx	[BETA] Add OpenAI `/images/variations` + Topaz API support (#7700 )	2025-01-11 23:27:46 -08:00
databricks	LiteLLM Minor Fixes & Improvements (01/08/2025) - p2 (#7643 )	2025-01-08 19:45:19 -08:00
deepgram	Litellm dev 01 02 2025 p2 (#7512 )	2025-01-02 21:57:51 -08:00
deepinfra/chat	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
deepseek	LiteLLM Minor Fixes & Improvements (12/23/2024) - p3 (#7394 )	2024-12-23 22:02:52 -08:00
deprecated_providers	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
empower/chat	LiteLLM Common Base LLM Config (pt.3): Move all OAI compatible providers to base llm config (#7148 )	2024-12-10 17:12:42 -08:00
fireworks_ai	[BETA] Add OpenAI `/images/variations` + Topaz API support (#7700 )	2025-01-11 23:27:46 -08:00
friendliai/chat	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
galadriel/chat	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
gemini	Litellm dev 12 28 2024 p1 (#7463 )	2024-12-28 20:26:00 -08:00
github/chat	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
groq	fix(groq/chat/transformation.py): fix groq response_format transformation (#7565 )	2025-01-04 19:39:04 -08:00
hosted_vllm	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
huggingface	Litellm dev 12 30 2024 p2 (#7495 )	2025-01-01 18:57:29 -08:00
infinity/rerank	(feat) add infinity rerank models (#7321 )	2024-12-19 18:30:28 -08:00
jina_ai	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
litellm_proxy/chat	[BETA] Add OpenAI `/images/variations` + Topaz API support (#7700 )	2025-01-11 23:27:46 -08:00
lm_studio	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
mistral	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
nlp_cloud	Litellm dev 12 30 2024 p2 (#7495 )	2025-01-01 18:57:29 -08:00
nvidia_nim	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
ollama	Litellm dev 12 30 2024 p2 (#7495 )	2025-01-01 18:57:29 -08:00
oobabooga	Litellm dev 12 30 2024 p2 (#7495 )	2025-01-01 18:57:29 -08:00
openai	[BETA] Add OpenAI `/images/variations` + Topaz API support (#7700 )	2025-01-11 23:27:46 -08:00
openai_like	Litellm dev 01 10 2025 p3 (#7682 )	2025-01-10 21:56:42 -08:00
openrouter/chat	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
perplexity/chat	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
petals	Litellm dev 12 30 2024 p2 (#7495 )	2025-01-01 18:57:29 -08:00
predibase	Litellm dev 12 30 2024 p2 (#7495 )	2025-01-01 18:57:29 -08:00
replicate	Litellm dev 12 30 2024 p2 (#7495 )	2025-01-01 18:57:29 -08:00
sagemaker	Litellm dev 12 30 2024 p2 (#7495 )	2025-01-01 18:57:29 -08:00
sambanova	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
together_ai	add azure o1 pricing (#7715 )	2025-01-12 18:15:35 -08:00
topaz	[BETA] Add OpenAI `/images/variations` + Topaz API support (#7700 )	2025-01-11 23:27:46 -08:00
triton	Litellm dev 12 30 2024 p2 (#7495 )	2025-01-01 18:57:29 -08:00
vertex_ai	fix(vertex_ai/gemini/transformation.py): handle 'http://' in gemini p… (#7660 )	2025-01-10 07:31:59 -08:00
vllm/completion	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
voyage/embedding	Litellm dev 12 30 2024 p2 (#7495 )	2025-01-01 18:57:29 -08:00
watsonx	Support checking provider-specific `/models` endpoints for available models based on key (#7538 )	2025-01-03 19:29:59 -08:00
xai/chat	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
__init__.py	add linting	2023-08-18 11:05:05 -07:00
base.py	Complete 'requests' library removal (#7350 )	2024-12-22 07:21:25 -08:00
baseten.py	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
custom_llm.py	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
maritalk.py	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00
ollama_chat.py	add azure o1 pricing (#7715 )	2025-01-12 18:15:35 -08:00
README.md	LiteLLM Minor Fixes and Improvements (09/13/2024) (#5689 )	2024-09-14 10:02:55 -07:00
volcengine.py	(code quality) run ruff rule to ban unused imports (#7313 )	2024-12-19 12:33:42 -08:00

README.md

File Structure

August 27th, 2024

To make it easy to see how calls are transformed for each model/provider:

we are working on moving all supported litellm providers to a folder structure, where folder name is the supported litellm provider name.

Each folder will contain a *_transformation.py file, which has all the request/response transformation logic, making it easy to see how calls are modified.

E.g. cohere/, bedrock/.