Ishaan Jaff
|
aba600a892
|
fix triton linting
|
2024-07-23 11:03:34 -07:00 |
|
Sophia Loris
|
d779253949
|
resolve merge conflicts
|
2024-07-19 09:45:53 -05:00 |
|
Sophia Loris
|
d5c65c6be2
|
Add support for Triton streaming & triton async completions
|
2024-07-19 09:35:27 -05:00 |
|
Ishaan Jaff
|
b04d20d367
|
fix linting error
|
2024-07-16 21:21:50 -07:00 |
|
davidschuler-8451
|
35b733f14d
|
feat: enables batch embedding support for triton
|
2024-07-16 13:31:59 -04:00 |
|
Krrish Dholakia
|
4f91205530
|
refactor(utils.py): refactor Logging to it's own class. Cut down utils.py to <10k lines.
Easier debugging
Reference: https://github.com/BerriAI/litellm/issues/4206
|
2024-06-15 10:57:20 -07:00 |
|
Giri Tatavarty
|
1b3050477a
|
#added type ignore for httpx and requests
|
2024-05-29 15:47:23 -07:00 |
|
Giri Tatavarty
|
51b9178630
|
#Fixed mypy errors. The requests package and stubs need to be imported - waiting to hear from Ishaan/Krrish before changing requirements.txt
|
2024-05-29 15:08:56 -07:00 |
|
Ishaan Jaff
|
99e506525c
|
Revert "Added support for Triton chat completion using trtlllm generate endpo…"
|
2024-05-29 13:42:49 -07:00 |
|
Giri Tatavarty
|
a58dc68418
|
Added support for Triton chat completion using trtlllm generate endpoint and custom infer endpoint
|
2024-05-28 07:54:11 -07:00 |
|
Ishaan Jaff
|
d3550379b0
|
feat - triton embeddings
|
2024-05-10 18:57:06 -07:00 |
|