llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-08 03:00:56 +00:00

Author	SHA1	Message	Date
Justin Lee	ecad16b904	doc enhancements, converted md into jupyter, reorganize files	2024-11-05 13:12:30 -08:00
Justin Lee	5f5220b8fc	git commit created a dir for zero-hero	2024-11-05 11:11:27 -08:00
Kai Wu	36d3520bf2	refactor notebooks	2024-11-05 11:10:08 -08:00
Sanyam Bhutani	6d3769aedd	Update tool_Calling101.ipynb	2024-11-04 19:35:31 -08:00
Sanyam Bhutani	800a9dc134	Create tool_Calling101.ipynb	2024-11-04 19:34:51 -08:00
Sanyam Bhutani	579cb110b4	add image	2024-11-04 19:28:37 -08:00
Sanyam Bhutani	d37853074b	Merge branch 'docs_improvement' of https://github.com/meta-llama/llama-stack into docs_improvement	2024-11-04 19:11:53 -08:00
Sanyam Bhutani	19fa3bde97	push	2024-11-04 19:11:37 -08:00
Kai Wu	08ccf47ff1	Merge branch 'main' into docs_improvement	2024-11-04 17:25:33 -08:00
Kai Wu	556ec85d63	change safety101 agent101 to notebook	2024-11-04 17:22:23 -08:00
Kai Wu	ddf9a2b62c	change safety101 agent101 to notebook	2024-11-04 17:19:21 -08:00
Sanyam Bhutani	af8453436e	Update Memory101.ipynb	2024-11-04 17:17:26 -08:00
Sanyam Bhutani	09cd99fc2d	Create Memory101.ipynb	2024-11-04 17:16:23 -08:00
Xi Yan	c810a4184d	[docs] update documentations (#356 ) * move docs -> source * Add files via upload * mv image * Add files via upload * colocate iOS setup doc * delete image * Add files via upload * fix * delete image * Add files via upload * Update developer_cookbook.md * toctree * wip subfolder * docs update * subfolder * updates * name * updates * index * updates * refactor structure * depth * docs * content * docs * getting started * distributions * fireworks * fireworks * update * theme * theme * theme * pdj theme * pytorch theme * css * theme * agents example * format * index * headers * copy button * test tabs * test tabs * fix * tabs * tab * tabs * sphinx_design * quick start commands * size * width * css * css * download models * asthetic fix * tab format * update * css * width * css * docs * tab based * tab * tabs * docs * style * image * css * color * typo * update docs * missing links * list templates * links * links update * troubleshooting * fix * distributions * docs * fix table * kill llamastack-local-gpu/cpu * Update index.md * Update index.md * mv ios_setup.md * Update ios_setup.md * Add remote_or_local.gif * Update ios_setup.md * release notes * typos * Add ios_setup to index * nav bar * hide torctree * ios image * links update * rename * rename * docs * rename * links * distributions * distributions * distributions * distributions * remove release * remote --------- Co-authored-by: dltn <6599399+dltn@users.noreply.github.com> Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>	2024-11-04 16:52:38 -08:00
Kai Wu	87904d329f	add more about safety and agent docs	2024-11-04 16:23:46 -08:00
Kai Wu	d61f328ffb	Merge branch 'docs_improvement' of github.com:meta-llama/llama-stack into docs_improvement	2024-11-04 16:22:44 -08:00
Sanyam Bhutani	f8da92900e	Create zero_to_getting_started.ipynb	2024-11-04 16:08:55 -08:00
Justin Lee	d200a6b002	beef up quickstart	2024-11-04 14:56:03 -08:00
Kai Wu	1422d631a8	Merge branch 'main' into docs_improvement	2024-11-04 12:40:18 -08:00
Kai Wu	2898a9bc9e	prompt guide added	2024-11-04 12:38:44 -08:00
Justin Lee	4f31f1b4cc	added few-shot-guide	2024-11-01 14:14:35 -07:00
Justin Lee	78083c4e0a	removed unnecessary files	2024-11-01 13:56:54 -07:00
Justin Lee	43289c36e1	added todo	2024-11-01 13:55:30 -07:00
Justin Lee	b41abff4fb	minor enhancement md	2024-11-01 13:50:24 -07:00
Justin Lee	1794ebc627	added cloud-local-inference-guide	2024-11-01 13:39:11 -07:00
Justin Lee	46763bc001	quick fix on title	2024-11-01 11:46:35 -07:00
Justin Lee	ed70e140eb	added streaming guide	2024-11-01 11:41:03 -07:00
Justin Lee	bf16d7729f	wrote guide for chat completion	2024-11-01 11:33:15 -07:00
Justin Lee	b514f1ec3a	quickstart guide (might be dated)	2024-10-31 15:49:05 -07:00
Justin Lee	703d7ebb6e	adding a few more inference examples	2024-10-31 15:46:47 -07:00
Justin Lee	626dffa0d9	added simple inferences	2024-10-31 15:45:47 -07:00
Ashwin Bharambe	37b330b4ef	add dynamic clients for all APIs (#348 ) * add dynamic clients for all APIs * fix openapi generator * inference + memory + agents tests now pass with "remote" providers * Add docstring which fixes openapi generator :/	2024-10-31 14:46:25 -07:00
Kai Wu	e4560a5e74	second draft	2024-10-31 13:37:55 -07:00
Xi Yan	3b1917d5ea	run openapi generator	2024-10-30 16:17:35 -07:00
Kai Wu	050b1ae718	agents101 draft	2024-10-30 14:01:28 -07:00
Kai Wu	384b31c4c2	first draft	2024-10-29 17:42:41 -07:00
Ashwin Bharambe	4aa1bf6a60	Kill --name from llama stack build (#340 )	2024-10-28 23:07:32 -07:00
Xi Yan	8a74e400d6	Update getting_started.md	2024-10-25 13:30:33 -07:00
Xi Yan	f168752bba	Update getting_started.md	2024-10-25 13:27:43 -07:00
Xi Yan	abdf7cddf3	[Evals API][4/n] evals with generation meta-reference impl (#303 ) * wip * dataset validation * test_scoring * cleanup * clean up test * comments * error checking * dataset client * test client: * datasetio client * clean up * basic scoring function works * scorer wip * equality scorer * score batch impl * score batch * update scoring test * refactor * validate scorer input * address comments * evals with generation * add all rows scores to ScoringResult * minor typing * bugfix * scoring function def rename * rebase name * refactor * address comments * Update iOS inference instructions for new quantization * Small updates to quantization config * Fix score threshold in faiss * Bump version to 0.0.45 * Handle both ipv6 and ipv4 interfaces together * update manifest for build templates * Update getting_started.md * chatcompletion & completion input type validation * inclusion->subsetof * error checking * scoring_function -> scoring_fn rename, scorer -> scoring_fn rename * address comments * [Evals API][5/n] fixes to generate openapi spec (#323) * generate openapi * typing comment, dataset -> dataset_id * remove custom type * sample eval run.yaml --------- Co-authored-by: Dalton Flanagan <6599399+dltn@users.noreply.github.com> Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>	2024-10-25 13:12:39 -07:00
Xi Yan	07f9bf723f	fix broken --list-templates with adding build.yaml files for packaging (#327 ) * add build files to templates * fix templates * manifest * symlink * symlink * precommit * change everything to docker build.yaml * remove image_type in templates * fix build from templates CLI * fix readmes	2024-10-25 12:51:22 -07:00
Justin Lee	b6d8246b82	added templates and enhanced readme (#307 ) Co-authored-by: Justin Lee <justinai@fb.com>	2024-10-24 17:07:06 -07:00
Xi Yan	e70420a06e	Update getting_started.md	2024-10-24 14:19:35 -07:00
Suraj Subramanian	668a495aba	Add REST api example for chat_completion (#286 )	2024-10-22 10:35:20 -07:00
raghotham	e2a5a2e10d	first version of readthedocs (#278 )	2024-10-22 10:15:58 +05:30
Ashwin Bharambe	1944405dca	Update new_api_provider.md	2024-10-21 14:02:51 -07:00
Ashwin Bharambe	606c48309e	Small updates to encourage integration testing	2024-10-21 13:52:33 -07:00
Xi Yan	3a7884345a	Update new_api_provider.md	2024-10-21 13:41:56 -07:00
Xi Yan	25b37c9ff7	Update new_api_provider.md	2024-10-21 13:41:46 -07:00
Xi Yan	23210e8679	llama stack distributions / templates / docker refactor (#266 ) * docker compose ollama * comment * update compose file * readme for distributions * readme * move distribution folders * move distribution/templates to distributions/ * rename * kill distribution/templates * readme * readme * build/developer cookbook/new api provider * developer cookbook * readme * readme * [bugfix] fix case for agent when memory bank registered without specifying provider_id (#264) * fix case where memory bank is registered without provider_id * memory test * agents unit test * Add an option to not use elastic agents for meta-reference inference (#269) * Allow overridding checkpoint_dir via config * Small rename * Make all methods `async def` again; add completion() for meta-reference (#270) PR #201 had made several changes while trying to fix issues with getting the stream=False branches of inference and agents API working. As part of this, it made a change which was slightly gratuitous. Namely, making chat_completion() and brethren "def" instead of "async def". The rationale was that this allowed the user (within llama-stack) of this to use it as: ``` async for chunk in api.chat_completion(params) ``` However, it causes unnecessary confusion for several folks. Given that clients (e.g., llama-stack-apps) anyway use the SDK methods (which are completely isolated) this choice was not ideal. Let's revert back so the call now looks like: ``` async for chunk in await api.chat_completion(params) ``` Bonus: Added a completion() implementation for the meta-reference provider. Technically should have been another PR :) * Improve an important error message * update ollama for llama-guard3 * Add vLLM inference provider for OpenAI compatible vLLM server (#178) This PR adds vLLM inference provider for OpenAI compatible vLLM server. * Create .readthedocs.yaml Trying out readthedocs * Update event_logger.py (#275) spelling error * vllm * build templates * delete templates * tmp add back build to avoid merge conflicts * vllm * vllm --------- Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com> Co-authored-by: Ashwin Bharambe <ashwin@meta.com> Co-authored-by: Yuan Tang <terrytangyuan@gmail.com> Co-authored-by: raghotham <rsm@meta.com> Co-authored-by: nehal-a2z <nehal@coderabbit.ai>	2024-10-21 11:17:53 -07:00

1 2 3

112 commits