mirror of
https://github.com/meta-llama/llama-stack.git
synced 2025-12-30 08:04:17 +00:00
docs: Remove docs for meta-reference-quantized-gpu
The distribution was removed in 530d4bdfe1
but these files were left behind.
Signed-off-by: Derek Higgins <derekh@redhat.com>
parent 14e60e3c02
commit 7397534497
3 changed files with 0 additions and 126 deletions
@@ -109,8 +109,6 @@ llama stack build --list-templates
+------------------------------+-----------------------------------------------------------------------------+
| nvidia                       | Use NVIDIA NIM for running LLM inference                                    |
+------------------------------+-----------------------------------------------------------------------------+
| meta-reference-quantized-gpu | Use Meta Reference with fp8, int4 quantization for running LLM inference    |
+------------------------------+-----------------------------------------------------------------------------+
| cerebras                     | Use Cerebras for running LLM inference                                      |
+------------------------------+-----------------------------------------------------------------------------+
| ollama                       | Use (an external) Ollama server for running LLM inference                   |