llama-stack-mirror/llama_stack/providers/impls/meta_reference/inference
Ashwin Bharambe c1ab66f1e6
Further generalize Xi's changes (#88)
* Further generalize Xi's changes

- introduce a slightly more general notion of an AutoRouted provider
- the AutoRouted provider is associated with a RoutingTable provider
- e.g. inference -> models
- Introduced safety -> shields and memory -> memory_banks
  correspondences

* typo

* Basic build and run succeeded
2024-09-22 16:31:18 -07:00
..
quantization Add a test runner and 2 very simple tests for agents 2024-09-19 12:22:48 -07:00
__init__.py API Updates (#73) 2024-09-17 19:51:35 -07:00
config.py Further generalize Xi's changes (#88) 2024-09-22 16:31:18 -07:00
generation.py API Updates (#73) 2024-09-17 19:51:35 -07:00
inference.py memory routers working 2024-09-21 16:40:23 -07:00
model_parallel.py API Updates (#73) 2024-09-17 19:51:35 -07:00
parallel_utils.py API Updates (#73) 2024-09-17 19:51:35 -07:00