forked from phoenix-oss/llama-stack-mirror

History

Xi Yan 15dcc4ea5e openapi gen return type fix for streaming/non-streaming (#910 ) # What does this PR do? We need to change ```yaml /v1/inference/chat-completion: post: responses: '200': description: >- If stream=False, returns a ChatCompletionResponse with the full completion. If stream=True, returns an SSE event stream of ChatCompletionResponseStreamChunk content: text/event-stream: schema: oneOf: - $ref: '#/components/schemas/ChatCompletionResponse' - $ref: '#/components/schemas/ChatCompletionResponseStreamChunk' ``` into ```yaml /v1/inference/chat-completion: post: responses: '200': description: >- If stream=False, returns a ChatCompletionResponse with the full completion. If stream=True, returns an SSE event stream of ChatCompletionResponseStreamChunk content: text/event-stream: schema: $ref: '#/components/schemas/ChatCompletionResponseStreamChunk' application/json: schema: $ref: '#/components/schemas/ChatCompletionResponse' ``` ## Test Plan Python - tested in SDK sync: https://github.com/meta-llama/llama-stack-client-python/pull/108 Node - tested w/ https://gist.github.com/yanxi0830/b782f4b91e21dcccdfef8898ce55157e (SDK udpate follow up) ## Sources Please link relevant resources if necessary. ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case). - [ ] Ran pre-commit to handle lint / formatting issues. - [ ] Read the [contributor guideline](https://github.com/meta-llama/llama-stack/blob/main/CONTRIBUTING.md), Pull Request section? - [ ] Updated relevant documentation. - [ ] Wrote necessary unit or integration tests.		2025-01-30 18:03:02 -08:00
..
_static	Make a new llama stack image	2024-11-22 23:49:22 -08:00
notebooks	Llama_Stack_Building_AI_Applications.ipynb -> getting_started.ipynb (#854 )	2025-01-23 12:04:06 -08:00
openapi_generator	openapi gen return type fix for streaming/non-streaming (#910 )	2025-01-30 18:03:02 -08:00
resources	openapi gen return type fix for streaming/non-streaming (#910 )	2025-01-30 18:03:02 -08:00
source	SambaNova supports Llama 3.3 (#905 )	2025-01-30 09:24:46 -08:00
zero_to_hero_guide	Update default port from 5000 -> 8321	2025-01-16 15:26:48 -08:00
contbuild.sh	Fix broken links with docs	2024-11-22 20:42:17 -08:00
dog.jpg	Support for Llama3.2 models and Swift SDK (#98 )	2024-09-25 10:29:58 -07:00
getting_started.ipynb	Llama_Stack_Building_AI_Applications.ipynb -> getting_started.ipynb (#854 )	2025-01-23 12:04:06 -08:00
license_header.txt	Initial commit	2024-07-23 08:32:33 -07:00
make.bat	first version of readthedocs (#278 )	2024-10-22 10:15:58 +05:30
Makefile	first version of readthedocs (#278 )	2024-10-22 10:15:58 +05:30
readme.md	adding readme to docs folder for easier discoverability of notebooks … (#857 )	2025-01-28 04:58:46 -08:00
requirements.txt	[docs] add playground ui docs (#592 )	2024-12-12 10:40:38 -08:00

readme.md

Llama Stack Documentation

Here's a collection of comprehensive guides, examples, and resources for building AI applications with Llama Stack. For the complete documentation, visit our ReadTheDocs page.

Content

Try out Llama Stack's capabilities through our detailed Jupyter notebooks:

Building AI Applications Notebook - A comprehensive guide to building production-ready AI applications using Llama Stack
Benchmark Evaluations Notebook - Detailed performance evaluations and benchmarking results
Zero-to-Hero Guide - Step-by-step guide for getting started with Llama Stack