davidschuler-8451
|
35b733f14d
|
feat: enables batch embedding support for triton
|
2024-07-16 13:31:59 -04:00 |
|
Krrish Dholakia
|
4f91205530
|
refactor(utils.py): refactor Logging to its own class. Cut down utils.py to <10k lines.
Easier debugging
Reference: https://github.com/BerriAI/litellm/issues/4206
|
2024-06-15 10:57:20 -07:00 |
|
Ishaan Jaff
|
99e506525c
|
Revert "Added support for Triton chat completion using trtlllm generate endpo…"
|
2024-05-29 13:42:49 -07:00 |
|
Giri Tatavarty
|
a58dc68418
|
Added support for Triton chat completion using trtlllm generate endpoint and custom infer endpoint
|
2024-05-28 07:54:11 -07:00 |
|
Ishaan Jaff
|
d3550379b0
|
feat - triton embeddings
|
2024-05-10 18:57:06 -07:00 |
|