llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-05 18:27:22 +00:00

Ashwin Bharambe c1ab66f1e6 Further generalize Xi's changes (#88)	2024-09-22 16:31:18 -07:00
* Further generalize Xi's changes
  - introduce a slightly more general notion of an AutoRouted provider
  - the AutoRouted provider is associated with a RoutingTable provider
  - e.g. inference -> models
  - Introduced safety -> shields and memory -> memory_banks correspondences
* typo
* Basic build and run succeeded
quantization	Add a test runner and 2 very simple tests for agents	2024-09-19 12:22:48 -07:00
__init__.py	API Updates (#73)	2024-09-17 19:51:35 -07:00
config.py	Further generalize Xi's changes (#88)	2024-09-22 16:31:18 -07:00
generation.py	API Updates (#73)	2024-09-17 19:51:35 -07:00
inference.py	memory routers working	2024-09-21 16:40:23 -07:00
model_parallel.py	API Updates (#73)	2024-09-17 19:51:35 -07:00
parallel_utils.py	API Updates (#73)	2024-09-17 19:51:35 -07:00