forked from phoenix-oss/llama-stack-mirror

Ashwin Bharambe 04de2f84e9 fix: register provider model name and HF alias in run.yaml (#1304)	2025-02-27 16:39:23 -08:00

Each model known to the system has two identifiers:

- the `provider_resource_id` (what the provider calls it) -- e.g., `accounts/fireworks/models/llama-v3p1-8b-instruct`
- the `identifier` (`model_id`) under which it is registered and gets routed to the appropriate provider.

We have so far used the HuggingFace repo alias as the standardized identifier you can use to refer to the model. So in the above example, we'd use `meta-llama/Llama-3.1-8B-Instruct` as the name under which it gets registered. This makes it convenient for users to refer to these models across providers.

However, we forgot to register the _actual_ provider model ID also. You should be able to route via `provider_resource_id` also, of course. This change fixes this (somewhat grave) omission.

Note: this change is additive -- more aliases work now compared to before.

## Test Plan

Run the following for distro=(ollama fireworks together)

```
LLAMA_STACK_CONFIG=$distro \
   pytest -s -v tests/client-sdk/inference/test_text_inference.py \
   --inference-model=meta-llama/Llama-3.1-8B-Instruct --vision-inference-model=""
```
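The two-identifier scheme described in the commit message can be illustrated with a small sketch. This is not Llama Stack's actual registry code: `ModelEntry` and `build_routing_table` are hypothetical names invented here, and the only values taken from the commit message are the two example identifiers. It shows the additive behaviour the commit describes, where both the HuggingFace alias and the provider's own model ID resolve to the same provider.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelEntry:
    """Hypothetical record for one model served by one provider."""
    provider_id: str
    provider_resource_id: str  # what the provider calls the model
    hf_alias: str              # HuggingFace repo alias used as the standard identifier


def build_routing_table(entries):
    """Map every accepted identifier to its entry.

    Both the HF alias and the provider_resource_id are registered,
    so a request may use either name.
    """
    table = {}
    for entry in entries:
        for name in (entry.hf_alias, entry.provider_resource_id):
            table[name] = entry
    return table


entries = [
    ModelEntry(
        provider_id="fireworks",
        provider_resource_id="accounts/fireworks/models/llama-v3p1-8b-instruct",
        hf_alias="meta-llama/Llama-3.1-8B-Instruct",
    )
]
table = build_routing_table(entries)

# Either identifier routes to the same provider entry.
by_alias = table["meta-llama/Llama-3.1-8B-Instruct"]
by_provider_name = table["accounts/fireworks/models/llama-v3p1-8b-instruct"]
print(by_alias.provider_id, by_alias is by_provider_name)
```

Before the fix, only the first of the two lookups would have succeeded in a scheme like this, because only the HF alias was registered.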
_static	feat: tool outputs metadata (#1155)	2025-02-21 13:15:31 -08:00
notebooks	fix: update notebooks to avoid using the nutsy --image-name __system__ thing (#1308)	2025-02-27 16:39:04 -08:00
openapi_generator	fix: some telemetry APIs don't currently work (#1188)	2025-02-20 14:09:25 -08:00
resources	Several documentation fixes and fix link to API reference	2025-02-04 14:00:43 -08:00
source	fix: register provider model name and HF alias in run.yaml (#1304)	2025-02-27 16:39:23 -08:00
zero_to_hero_guide	chore: update the zero_to_hero_guide doc link (#1220)	2025-02-25 17:16:02 -08:00
conftest.py	No spaces in ipynb tests	2025-02-07 11:56:22 -08:00
contbuild.sh	Fix broken links with docs	2024-11-22 20:42:17 -08:00
dog.jpg	Support for Llama3.2 models and Swift SDK (#98)	2024-09-25 10:29:58 -07:00
getting_started.ipynb	fix: update notebooks to avoid using the nutsy --image-name __system__ thing (#1308)	2025-02-27 16:39:04 -08:00
license_header.txt	Initial commit	2024-07-23 08:32:33 -07:00
make.bat	first version of readthedocs (#278)	2024-10-22 10:15:58 +05:30
Makefile	first version of readthedocs (#278)	2024-10-22 10:15:58 +05:30
readme.md	Fix README.md notebook links (#976)	2025-02-05 14:33:46 -08:00
requirements.txt	Pin sphinx	2025-02-19 20:20:46 -08:00

readme.md

Llama Stack Documentation

Here's a collection of comprehensive guides, examples, and resources for building AI applications with Llama Stack. For the complete documentation, visit our ReadTheDocs page.

Content

Try out Llama Stack's capabilities through our detailed Jupyter notebooks:

Building AI Applications Notebook - A comprehensive guide to building production-ready AI applications using Llama Stack
Benchmark Evaluations Notebook - Detailed performance evaluations and benchmarking results
Zero-to-Hero Guide - Step-by-step guide for getting started with Llama Stack