mirror of
https://github.com/BerriAI/litellm.git
synced 2025-04-26 11:14:04 +00:00
* migrate triton to base llm http handler * clean up triton handler.py * use transform functions for triton * add TritonConfig * get openai params for triton * use triton embedding config * test_completion_triton_generate_api * test_completion_triton_infer_api * fix TritonConfig doc string * use TritonResponseIterator * fix triton embeddings * docs triton chat usage
5 lines
145 B
Python
5 lines
145 B
Python
"""
|
|
Triton Completion - uses `llm_http_handler.py` to make httpx requests
|
|
|
|
Request/Response transformation is handled in `transformation.py`
|
|
"""
|