mirror of
https://github.com/meta-llama/llama-stack.git
synced 2025-07-02 20:40:36 +00:00
* TGI adapter and some refactoring of other inference adapters * Use the lower-level `generate_stream()` method for correct tool calling --------- Co-authored-by: Ashwin Bharambe <ashwin@meta.com> |
||
---|---|---|
.. | ||
agentic_system | ||
batch_inference | ||
cli | ||
common | ||
core | ||
dataset/api | ||
evaluations/api | ||
inference | ||
memory | ||
models/api | ||
observability | ||
post_training/api | ||
reward_scoring/api | ||
safety | ||
synthetic_data_generation/api | ||
tools | ||
__init__.py | ||
stack.py |