Commit graph

3406 commits

Author SHA1 Message Date
ishaan-jaff
b1deaa1246 (fix) proxy raise exceptions from litellm.completion() 2023-11-15 16:26:04 -08:00
ishaan-jaff
f3827bc6da (feat) proxy - show loaded config for users 2023-11-15 16:26:04 -08:00
Krrish Dholakia
b42cf80585 fix(utils): fixing exception mapping 2023-11-15 15:51:17 -08:00
Krrish Dholakia
6d1b8945dd fix(huggingface_restapi.py): fix linting errors 2023-11-15 15:34:21 -08:00
Krrish Dholakia
0ede0e836e feat(get_max_tokens): get max tokens for huggingface hub models 2023-11-15 15:25:40 -08:00
Krrish Dholakia
e35ce15a89 refactor(huggingface_restapi.py): moving async completion + streaming to real async calls 2023-11-15 15:14:21 -08:00
ishaan-jaff
4fa4a7f082 (docs) using litellm proxy + langfuse 2023-11-15 14:29:14 -08:00
Krrish Dholakia
04ce14e404 fix(utils.py): fix langfuse integration 2023-11-15 14:05:40 -08:00
ishaan-jaff
7d3b653a99 (docs) using langfuse with litellm 2023-11-15 13:53:12 -08:00
ishaan-jaff
a0ff9e7d7b (docs) using litellm proxy + Langfuse 2023-11-15 13:52:47 -08:00
ishaan-jaff
b10e7b7973 (fix) proxy safe access router model names 2023-11-15 13:44:17 -08:00
ishaan-jaff
d920801fb8 (docs) show how to debug proxy logs 2023-11-15 13:44:17 -08:00
Krrish Dholakia
e324388520 fix(utils.py): check for none params 2023-11-15 13:39:09 -08:00
ishaan-jaff
500e55ce2b (docs) proxy cli settings 2023-11-15 13:33:29 -08:00
Krrish Dholakia
8eaa1eb37f fix(utils.py): azure streaming initial format 2023-11-15 13:30:08 -08:00
ishaan-jaff
20e0682138 (docs) using litellm with curl, OpenAI v1.0.0+ 2023-11-15 13:29:20 -08:00
Krrish Dholakia
54bdca4ba6 docs(routing.md): add contributor s/o 2023-11-15 13:20:02 -08:00
Krrish Dholakia
e5929f2f7e fix(azure.py-+-proxy_server.py): fix function calling response object + support router on proxy 2023-11-15 13:15:16 -08:00
ishaan-jaff
24cc1f620d add ollama/zephyr to config 2023-11-15 13:04:34 -08:00
ishaan-jaff
53e3253759 (docs) proxy add example on using local hf llm 2023-11-15 13:00:43 -08:00
ishaan-jaff
33a69bca72 (fix) remove bloat from ci/cid 2023-11-15 12:39:51 -08:00
Krrish Dholakia
c47ca6cc50 refactor(router.py): renaming variable 2023-11-15 12:31:29 -08:00
Krrish Dholakia
4676b3dabd feat(router.py): enable passing chat completion params for Router.chat.completion.create 2023-11-15 12:28:16 -08:00
Krish Dholakia
ace4e52060 Merge pull request #820 from kfsone/main
need to re-attempt backoff and yaml imports if the first import attempt fails
2023-11-15 11:28:34 -08:00
Oliver Smith
ab80055cad Correct pypi package name for yaml 2023-11-15 10:46:26 -08:00
Oliver Smith
c5147a6f07 Ensure the imports needed by proxy_server are in requirements.txt to start with 2023-11-15 10:38:05 -08:00
Oliver Smith
4e4e0a2251 need to re-attempt backoff and yaml imports if the first import attempt fails. not sure which import is missing from requirements 2023-11-15 00:10:22 -08:00
Krrish Dholakia
545fcbf786 test(test_completion.py): fix palm test 2023-11-14 22:28:56 -08:00
Krrish Dholakia
a7c05a4d62 bump: version 1.0.2 → 1.0.3.dev1 2023-11-14 22:20:17 -08:00
Krrish Dholakia
29a0c29eb3 fix(utils.py): await async function in client wrapper 2023-11-14 22:07:28 -08:00
Krrish Dholakia
c3f81b9e34 fix: fix linting errors 2023-11-14 21:51:39 -08:00
Krrish Dholakia
0f6713993d fix(router.py): enabling retrying with expo backoff (without tenacity) for router 2023-11-14 20:57:51 -08:00
Krrish Dholakia
cb139d535d bump: version 1.0.1 → 1.0.2 2023-11-14 18:55:20 -08:00
Krrish Dholakia
5d58bb9cd0 fix(main.py): misrouting ollama models to nlp cloud 2023-11-14 18:55:08 -08:00
ishaan-jaff
9585856b9f (feat) debug POST logs 2023-11-14 18:16:45 -08:00
ishaan-jaff
838cb3e20b (fix) debugging with POST request 2023-11-14 18:05:34 -08:00
ishaan-jaff
84e931ce0f (feat) completion debug view HF POST request 2023-11-14 17:57:41 -08:00
ishaan-jaff
e0f7120459 (feat) improve logging of raw POST curl command 2023-11-14 17:54:09 -08:00
ishaan-jaff
d25d4d26bd (feat) debug ollama POST request 2023-11-14 17:53:48 -08:00
ishaan-jaff
c7fbbe8764 (feat) add ability to view POST requests from litellm.completion() 2023-11-14 17:27:20 -08:00
Krrish Dholakia
9b582b2c85 fix(main.py): keep client consistent across calls + exponential backoff retry on ratelimit errors 2023-11-14 16:26:05 -08:00
ishaan-jaff
3110b6d7d6 (docs) config for azure ad 2023-11-14 15:47:37 -08:00
ishaan-jaff
301ef86f5a (docs) proxy 2023-11-14 15:42:23 -08:00
ishaan-jaff
9f1d050967 (docs) passing azure_ad_token to config 2023-11-14 15:27:59 -08:00
ishaan-jaff
a1a041b305 (fix) linting error 2023-11-14 15:07:43 -08:00
ishaan-jaff
52e2730e44 (test) AI21 test 2023-11-14 15:01:50 -08:00
ishaan-jaff
e1ea6f2cc7 (fix) AI21 exception mapping - raise error when status !=200 2023-11-14 15:01:22 -08:00
ishaan-jaff
ebdd559175 (fix) APIError set body=None 2023-11-14 14:50:22 -08:00
ishaan-jaff
c5e81bfb19 (fix) linting fix 2023-11-14 14:40:59 -08:00
ishaan-jaff
99207ae9ec (docs) azure ad 2023-11-14 14:38:33 -08:00