ishaan-jaff
|
0e2811d878
|
(chore) remove bloat
|
2023-11-15 17:52:57 -08:00 |
|
Krrish Dholakia
|
ef4e5b9636
|
test: set request timeout at request level
|
2023-11-15 17:42:31 -08:00 |
|
ishaan-jaff
|
a451db6ac7
|
(chore) remove bloat testing
|
2023-11-15 17:40:24 -08:00 |
|
ishaan-jaff
|
563c70dc87
|
(fix) testing model alias map
|
2023-11-15 17:40:15 -08:00 |
|
ishaan-jaff
|
9ad8cf396f
|
(chore) remvoe bloat testing
|
2023-11-15 17:33:15 -08:00 |
|
Rodney Rodriguez
|
f2d8bfd40d
|
bugfixes for ollama
|
2023-11-15 19:27:06 -06:00 |
|
ishaan-jaff
|
3f27c7392e
|
(chore) linting fixes
|
2023-11-15 17:25:06 -08:00 |
|
ishaan-jaff
|
fd0b284a92
|
(chore) remove bloat
|
2023-11-15 17:16:14 -08:00 |
|
ishaan-jaff
|
396f0023dd
|
(chore) fix config yaml
|
2023-11-15 17:12:28 -08:00 |
|
ishaan-jaff
|
c44fb66d7d
|
(chore) removing bloat
|
2023-11-15 17:11:56 -08:00 |
|
ishaan-jaff
|
c4f74a94c6
|
(chore) litellm_server rename utils -> server utils
|
2023-11-15 17:05:50 -08:00 |
|
Krrish Dholakia
|
c4c24dafb8
|
fix(openai-+-azure): async calling
|
2023-11-15 17:03:02 -08:00 |
|
ishaan-jaff
|
07f6b16af2
|
(chore) remove bloat deprecated config.json
|
2023-11-15 17:00:14 -08:00 |
|
ishaan-jaff
|
c1ae717a9c
|
(chore) remove bloat - deprecated gpt cache integration
|
2023-11-15 16:59:31 -08:00 |
|
ishaan-jaff
|
7e51f00338
|
(chore) remove bloat old testing.py file
|
2023-11-15 16:58:54 -08:00 |
|
ishaan-jaff
|
17fe89aaf3
|
(chore) rename litellm server
|
2023-11-15 16:57:42 -08:00 |
|
ishaan-jaff
|
dd16595f52
|
(chore) move litellm_server inside litellm
|
2023-11-15 16:56:35 -08:00 |
|
Krrish Dholakia
|
e28b9534e4
|
fix(huggingface_restapi.py): async implementation
|
2023-11-15 16:54:15 -08:00 |
|
ishaan-jaff
|
ff498be65c
|
(docs) add example configs to proxy
|
2023-11-15 16:41:26 -08:00 |
|
ishaan-jaff
|
b1deaa1246
|
(fix) proxy raise exceptions from litellm.completion()
|
2023-11-15 16:26:04 -08:00 |
|
ishaan-jaff
|
f3827bc6da
|
(feat) proxy - show loaded config for users
|
2023-11-15 16:26:04 -08:00 |
|
Krrish Dholakia
|
b42cf80585
|
fix(utils): fixing exception mapping
|
2023-11-15 15:51:17 -08:00 |
|
Krrish Dholakia
|
6d1b8945dd
|
fix(huggingface_restapi.py): fix linting errors
|
2023-11-15 15:34:21 -08:00 |
|
Krrish Dholakia
|
0ede0e836e
|
feat(get_max_tokens): get max tokens for huggingface hub models
|
2023-11-15 15:25:40 -08:00 |
|
Krrish Dholakia
|
e35ce15a89
|
refactor(huggingface_restapi.py): moving async completion + streaming to real async calls
|
2023-11-15 15:14:21 -08:00 |
|
Krrish Dholakia
|
04ce14e404
|
fix(utils.py): fix langfuse integration
|
2023-11-15 14:05:40 -08:00 |
|
ishaan-jaff
|
7d3b653a99
|
(docs) using langfuse with litellm
|
2023-11-15 13:53:12 -08:00 |
|
ishaan-jaff
|
b10e7b7973
|
(fix) proxy safe access router model names
|
2023-11-15 13:44:17 -08:00 |
|
Krrish Dholakia
|
e324388520
|
fix(utils.py): check for none params
|
2023-11-15 13:39:09 -08:00 |
|
Krrish Dholakia
|
8eaa1eb37f
|
fix(utils.py): azure streaming initial format
|
2023-11-15 13:30:08 -08:00 |
|
Krrish Dholakia
|
54bdca4ba6
|
docs(routing.md): add contributor s/o
|
2023-11-15 13:20:02 -08:00 |
|
Krrish Dholakia
|
e5929f2f7e
|
fix(azure.py-+-proxy_server.py): fix function calling response object + support router on proxy
|
2023-11-15 13:15:16 -08:00 |
|
ishaan-jaff
|
24cc1f620d
|
add ollama/zephyr to config
|
2023-11-15 13:04:34 -08:00 |
|
Krrish Dholakia
|
c47ca6cc50
|
refactor(router.py): renaming variable
|
2023-11-15 12:31:29 -08:00 |
|
Krrish Dholakia
|
4676b3dabd
|
feat(router.py): enable passing chat completion params for Router.chat.completion.create
|
2023-11-15 12:28:16 -08:00 |
|
Oliver Smith
|
4e4e0a2251
|
need to re-attempt backoff and yaml imports if the first import attempt fails. not sure which import is missing from requirements
|
2023-11-15 00:10:22 -08:00 |
|
Krrish Dholakia
|
545fcbf786
|
test(test_completion.py): fix palm test
|
2023-11-14 22:28:56 -08:00 |
|
Krrish Dholakia
|
29a0c29eb3
|
fix(utils.py): await async function in client wrapper
|
2023-11-14 22:07:28 -08:00 |
|
Krrish Dholakia
|
c3f81b9e34
|
fix: fix linting errors
|
2023-11-14 21:51:39 -08:00 |
|
Krrish Dholakia
|
0f6713993d
|
fix(router.py): enabling retrying with expo backoff (without tenacity) for router
|
2023-11-14 20:57:51 -08:00 |
|
Krrish Dholakia
|
5d58bb9cd0
|
fix(main.py): misrouting ollama models to nlp cloud
|
2023-11-14 18:55:08 -08:00 |
|
ishaan-jaff
|
9585856b9f
|
(feat) debug POST logs
|
2023-11-14 18:16:45 -08:00 |
|
ishaan-jaff
|
838cb3e20b
|
(fix) debugging with POST request
|
2023-11-14 18:05:34 -08:00 |
|
ishaan-jaff
|
84e931ce0f
|
(feat) completion debug view HF POST request
|
2023-11-14 17:57:41 -08:00 |
|
ishaan-jaff
|
e0f7120459
|
(feat) improve logging of raw POST curl command
|
2023-11-14 17:54:09 -08:00 |
|
ishaan-jaff
|
d25d4d26bd
|
(feat) debug ollama POST request
|
2023-11-14 17:53:48 -08:00 |
|
ishaan-jaff
|
c7fbbe8764
|
(feat) add ability to view POST requests from litellm.completion()
|
2023-11-14 17:27:20 -08:00 |
|
Krrish Dholakia
|
9b582b2c85
|
fix(main.py): keep client consistent across calls + exponential backoff retry on ratelimit errors
|
2023-11-14 16:26:05 -08:00 |
|
ishaan-jaff
|
9f1d050967
|
(docs) passing azure_ad_token to config
|
2023-11-14 15:27:59 -08:00 |
|
ishaan-jaff
|
a1a041b305
|
(fix) linting error
|
2023-11-14 15:07:43 -08:00 |
|