Commit graph

1706 commits

Author
SHA1 Message Date
ishaan-jaff
0e2811d878 (chore) remove bloat 2023-11-15 17:52:57 -08:00
Krrish Dholakia
ef4e5b9636 test: set request timeout at request level 2023-11-15 17:42:31 -08:00
ishaan-jaff
a451db6ac7 (chore) remove bloat testing 2023-11-15 17:40:24 -08:00
ishaan-jaff
563c70dc87 (fix) testing model alias map 2023-11-15 17:40:15 -08:00
ishaan-jaff
9ad8cf396f (chore) remvoe bloat testing 2023-11-15 17:33:15 -08:00
Rodney Rodriguez
f2d8bfd40d bugfixes for ollama 2023-11-15 19:27:06 -06:00
ishaan-jaff
3f27c7392e (chore) linting fixes 2023-11-15 17:25:06 -08:00
ishaan-jaff
fd0b284a92 (chore) remove bloat 2023-11-15 17:16:14 -08:00
ishaan-jaff
396f0023dd (chore) fix config yaml 2023-11-15 17:12:28 -08:00
ishaan-jaff
c44fb66d7d (chore) removing bloat 2023-11-15 17:11:56 -08:00
ishaan-jaff
c4f74a94c6 (chore) litellm_server rename utils -> server utils 2023-11-15 17:05:50 -08:00
Krrish Dholakia
c4c24dafb8 fix(openai-+-azure): async calling 2023-11-15 17:03:02 -08:00
ishaan-jaff
07f6b16af2 (chore) remove bloat deprecated config.json 2023-11-15 17:00:14 -08:00
ishaan-jaff
c1ae717a9c (chore) remove bloat - deprecated gpt cache integration 2023-11-15 16:59:31 -08:00
ishaan-jaff
7e51f00338 (chore) remove bloat old testing.py file 2023-11-15 16:58:54 -08:00
ishaan-jaff
17fe89aaf3 (chore) rename litellm server 2023-11-15 16:57:42 -08:00
ishaan-jaff
dd16595f52 (chore) move litellm_server inside litellm 2023-11-15 16:56:35 -08:00
Krrish Dholakia
e28b9534e4 fix(huggingface_restapi.py): async implementation 2023-11-15 16:54:15 -08:00
ishaan-jaff
ff498be65c (docs) add example configs to proxy 2023-11-15 16:41:26 -08:00
ishaan-jaff
b1deaa1246 (fix) proxy raise exceptions from litellm.completion() 2023-11-15 16:26:04 -08:00
ishaan-jaff
f3827bc6da (feat) proxy - show loaded config for users 2023-11-15 16:26:04 -08:00
Krrish Dholakia
b42cf80585 fix(utils): fixing exception mapping 2023-11-15 15:51:17 -08:00
Krrish Dholakia
6d1b8945dd fix(huggingface_restapi.py): fix linting errors 2023-11-15 15:34:21 -08:00
Krrish Dholakia
0ede0e836e feat(get_max_tokens): get max tokens for huggingface hub models 2023-11-15 15:25:40 -08:00
Krrish Dholakia
e35ce15a89 refactor(huggingface_restapi.py): moving async completion + streaming to real async calls 2023-11-15 15:14:21 -08:00
Krrish Dholakia
04ce14e404 fix(utils.py): fix langfuse integration 2023-11-15 14:05:40 -08:00
ishaan-jaff
7d3b653a99 (docs) using langfuse with litellm 2023-11-15 13:53:12 -08:00
ishaan-jaff
b10e7b7973 (fix) proxy safe access router model names 2023-11-15 13:44:17 -08:00
Krrish Dholakia
e324388520 fix(utils.py): check for none params 2023-11-15 13:39:09 -08:00
Krrish Dholakia
8eaa1eb37f fix(utils.py): azure streaming initial format 2023-11-15 13:30:08 -08:00
Krrish Dholakia
54bdca4ba6 docs(routing.md): add contributor s/o 2023-11-15 13:20:02 -08:00
Krrish Dholakia
e5929f2f7e fix(azure.py-+-proxy_server.py): fix function calling response object + support router on proxy 2023-11-15 13:15:16 -08:00
ishaan-jaff
24cc1f620d add ollama/zephyr to config 2023-11-15 13:04:34 -08:00
Krrish Dholakia
c47ca6cc50 refactor(router.py): renaming variable 2023-11-15 12:31:29 -08:00
Krrish Dholakia
4676b3dabd feat(router.py): enable passing chat completion params for Router.chat.completion.create 2023-11-15 12:28:16 -08:00
Oliver Smith
4e4e0a2251 need to re-attempt backoff and yaml imports if the first import attempt fails. not sure which import is missing from requirements 2023-11-15 00:10:22 -08:00
Krrish Dholakia
545fcbf786 test(test_completion.py): fix palm test 2023-11-14 22:28:56 -08:00
Krrish Dholakia
29a0c29eb3 fix(utils.py): await async function in client wrapper 2023-11-14 22:07:28 -08:00
Krrish Dholakia
c3f81b9e34 fix: fix linting errors 2023-11-14 21:51:39 -08:00
Krrish Dholakia
0f6713993d fix(router.py): enabling retrying with expo backoff (without tenacity) for router 2023-11-14 20:57:51 -08:00
Krrish Dholakia
5d58bb9cd0 fix(main.py): misrouting ollama models to nlp cloud 2023-11-14 18:55:08 -08:00
ishaan-jaff
9585856b9f (feat) debug POST logs 2023-11-14 18:16:45 -08:00
ishaan-jaff
838cb3e20b (fix) debugging with POST request 2023-11-14 18:05:34 -08:00
ishaan-jaff
84e931ce0f (feat) completion debug view HF POST request 2023-11-14 17:57:41 -08:00
ishaan-jaff
e0f7120459 (feat) improve logging of raw POST curl command 2023-11-14 17:54:09 -08:00
ishaan-jaff
d25d4d26bd (feat) debug ollama POST request 2023-11-14 17:53:48 -08:00
ishaan-jaff
c7fbbe8764 (feat) add ability to view POST requests from litellm.completion() 2023-11-14 17:27:20 -08:00
Krrish Dholakia
9b582b2c85 fix(main.py): keep client consistent across calls + exponential backoff retry on ratelimit errors 2023-11-14 16:26:05 -08:00
ishaan-jaff
9f1d050967 (docs) passing azure_ad_token to config 2023-11-14 15:27:59 -08:00
ishaan-jaff
a1a041b305 (fix) linting error 2023-11-14 15:07:43 -08:00