Ishaan Jaff | a6e850f0cd | fix linting error | 2024-07-16 21:21:50 -07:00
davidschuler-8451 | a3a4867695 | feat: enables batch embedding support for triton | 2024-07-16 13:31:59 -04:00
Krrish Dholakia | 9d7f5d503c | refactor(utils.py): refactor Logging to its own class. Cut down utils.py to <10k lines. Easier debugging. Reference: https://github.com/BerriAI/litellm/issues/4206 | 2024-06-15 10:57:20 -07:00
Ishaan Jaff | 93bf4c2dc4 | Revert "Added support for Triton chat completion using trtlllm generate endpo…" | 2024-05-29 13:42:49 -07:00
Giri Tatavarty | ff18d93a3a | Added support for Triton chat completion using trtlllm generate endpoint and custom infer endpoint | 2024-05-28 07:54:11 -07:00
Ishaan Jaff | 5eca68d504 | feat - triton embeddings | 2024-05-10 18:57:06 -07:00