commit 2153957ab6
parent 723a870171
Author: Young Han
Date:   2025-07-14 17:52:27 -07:00
2 changed files with 6 additions and 4 deletions


@@ -1,8 +1,9 @@
 <!-- This file was auto-generated by distro_codegen.py, please edit source -->
 # Llama Stack with llama.cpp
-This template demonstrates how to utilize Llama Stack with [llama.cpp](https://github.com/ggerganov/llama.cpp) as the inference provider. \n
-Previously, the use of quantized models with Llama Stack was restricted, but now it is fully supported through llama.cpp. \n
+This template demonstrates how to utilize Llama Stack with [llama.cpp](https://github.com/ggerganov/llama.cpp) as the inference provider.
+Previously, the use of quantized models with Llama Stack was restricted, but now it is fully supported through llama.cpp.
+You can employ any .gguf models available on [Hugging Face](https://huggingface.co/models) with this template.


@@ -1,7 +1,8 @@
 # Llama Stack with llama.cpp
-This template demonstrates how to utilize Llama Stack with [llama.cpp](https://github.com/ggerganov/llama.cpp) as the inference provider. \n
-Previously, the use of quantized models with Llama Stack was restricted, but now it is fully supported through llama.cpp. \n
+This template demonstrates how to utilize Llama Stack with [llama.cpp](https://github.com/ggerganov/llama.cpp) as the inference provider.
+Previously, the use of quantized models with Llama Stack was restricted, but now it is fully supported through llama.cpp.
+You can employ any .gguf models available on [Hugging Face](https://huggingface.co/models) with this template.