Ishaan Jaff
|
6107f9f3f3
|
[Bug fix]: Triton /infer handler incompatible with batch responses (#7337)
* migrate triton to base llm http handler
* clean up triton handler.py
* use transform functions for triton
* add TritonConfig
* get openai params for triton
* use triton embedding config
* test_completion_triton_generate_api
* test_completion_triton_infer_api
* fix TritonConfig doc string
* use TritonResponseIterator
* fix triton embeddings
* docs triton chat usage
|
2024-12-20 20:59:40 -08:00 |
|
Ishaan Jaff
|
c7f14e936a
|
(code quality) run ruff rule to ban unused imports (#7313)
* remove unused imports
* fix AmazonConverseConfig
* fix test
* fix import
* ruff check fixes
* test fixes
* fix testing
* fix imports
|
2024-12-19 12:33:42 -08:00 |
|
Ishaan Jaff
|
fe768a9ab7
|
fix - handle merge conflicts
|
2024-12-11 01:06:40 -08:00 |
|
Krrish Dholakia
|
06074bb13b
|
build: Squashed commit of https://github.com/BerriAI/litellm/pull/7170
Closes https://github.com/BerriAI/litellm/pull/7170
|
2024-12-11 01:03:57 -08:00 |
|