Ishaan Jaff
|
d6e7cb2df7
|
fix triton linting
|
2024-07-23 11:03:34 -07:00 |
|
Sophia Loris
|
adae0777d6
|
resolve merge conflicts
|
2024-07-19 09:45:53 -05:00 |
|
Sophia Loris
|
91fa69c0c2
|
Add support for Triton streaming & triton async completions
|
2024-07-19 09:35:27 -05:00 |
|
Ishaan Jaff
|
a6e850f0cd
|
fix linting error
|
2024-07-16 21:21:50 -07:00 |
|
davidschuler-8451
|
a3a4867695
|
feat: enables batch embedding support for triton
|
2024-07-16 13:31:59 -04:00 |
|
Krrish Dholakia
|
9d7f5d503c
|
refactor(utils.py): refactor Logging to it's own class. Cut down utils.py to <10k lines.
Easier debugging
Reference: https://github.com/BerriAI/litellm/issues/4206
|
2024-06-15 10:57:20 -07:00 |
|
Giri Tatavarty
|
82c0944490
|
#added type ignore for httpx and requests
|
2024-05-29 15:47:23 -07:00 |
|
Giri Tatavarty
|
2d8b4928bf
|
#Fixed mypy errors. The requests package and stubs need to be imported - waiting to hear from Ishaan/Krrish before changing requirements.txt
|
2024-05-29 15:08:56 -07:00 |
|
Ishaan Jaff
|
93bf4c2dc4
|
Revert "Added support for Triton chat completion using trtlllm generate endpo…"
|
2024-05-29 13:42:49 -07:00 |
|
Giri Tatavarty
|
ff18d93a3a
|
Added support for Triton chat completion using trtlllm generate endpoint and custom infer endpoint
|
2024-05-28 07:54:11 -07:00 |
|
Ishaan Jaff
|
5eca68d504
|
feat - triton embeddings
|
2024-05-10 18:57:06 -07:00 |
|