# litellm-proxy

A local, fast, and lightweight OpenAI-compatible server to call 100+ LLM APIs.

## Usage

```shell
$ pip install litellm
$ litellm --model ollama/codellama

#INFO: Ollama running on http://0.0.0.0:8000
```
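Once the server is running, you can sanity-check it with a plain HTTP request. This is a minimal sketch, assuming the proxy exposes the standard OpenAI-style `/chat/completions` route on the address printed above; the model name in the payload is illustrative.

```python
import requests

# send an OpenAI-format chat request to the local proxy
response = requests.post(
    "http://0.0.0.0:8000/chat/completions",
    json={
        "model": "ollama/codellama",  # illustrative; the proxy serves the model it was started with
        "messages": [{"role": "user", "content": "Hey!"}],
    },
)
print(response.json())
```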

## Replace the OpenAI API base

```python
import openai

openai.api_base = "http://0.0.0.0:8000"  # point the client at the local proxy
openai.api_key = "anything"              # the client requires a key; the local proxy does not check it

# the proxy forwards the request to whichever model it was started with
print(openai.ChatCompletion.create(model="test", messages=[{"role": "user", "content": "Hey!"}]))
```
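Since the proxy speaks the OpenAI wire format, streaming should work through the same client. A hedged sketch, assuming the proxy forwards OpenAI-style streaming chunks (the key is a placeholder; the local proxy should not need a real one):

```python
import openai

openai.api_base = "http://0.0.0.0:8000"
openai.api_key = "anything"  # placeholder; not checked by the local proxy

response = openai.ChatCompletion.create(
    model="test",
    messages=[{"role": "user", "content": "Hey!"}],
    stream=True,
)
for chunk in response:
    # each chunk carries an OpenAI-style delta
    print(chunk["choices"][0]["delta"].get("content", ""), end="")
```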

See how to call Huggingface, Bedrock, TogetherAI, Anthropic, and more.