forked from phoenix-oss/llama-stack-mirror
# What does this PR do? * Removes the use of `huggingface-cli` * Simplifies HF cache mount path * Simplifies vLLM server startup command * Separates PVC/secret creation from deployment/service * Fixes a typo: "pod" should be "deployment" Signed-off-by: Yuan Tang <terrytangyuan@gmail.com> |
||
|---|---|---|
| .. | ||
| building_applications | ||
| concepts | ||
| contributing | ||
| distributions | ||
| getting_started | ||
| introduction | ||
| playground | ||
| providers | ||
| references | ||
| conf.py | ||
| index.md | ||