llama-stack-mirror/llama_toolchain/inference
Ashwin Bharambe 45987996c4 Several smaller fixes to make adapters work
Also, reorganized the pattern of __init__ inside providers so
configuration can stay lightweight
2024-08-28 15:55:21 -07:00
Name                 Last commit                                  Date
adapters             Several smaller fixes to make adapters work  2024-08-28 15:55:21 -07:00
api                  split batch_inference from inference         2024-08-26 13:21:37 -07:00
meta_reference       Several smaller fixes to make adapters work  2024-08-28 15:55:21 -07:00
quantization         Fix api.datatypes imports                    2024-08-26 14:43:30 -07:00
__init__.py          Initial commit                               2024-07-23 08:32:33 -07:00
client.py            Several smaller fixes to make adapters work  2024-08-28 15:55:21 -07:00
event_logger.py      formatting                                   2024-08-14 17:03:43 -04:00
prepare_messages.py  basic RAG seems to work                      2024-08-24 23:36:58 -07:00
providers.py         Several smaller fixes to make adapters work  2024-08-28 15:55:21 -07:00