llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-15 22:47:59 +00:00

Author	SHA1	Message	Date
Xi Yan	39872ca4b4	distributions	2024-10-29 15:00:58 -07:00
Xi Yan	3fb9a8e82e	getting started	2024-10-29 14:30:51 -07:00
Xi Yan	3df549f958	docs	2024-10-29 14:21:40 -07:00
Xi Yan	e4c503d8f4	content	2024-10-29 14:19:49 -07:00
Xi Yan	4d8bfcea1b	docs	2024-10-29 14:14:05 -07:00
Xi Yan	eeab1278f2	depth	2024-10-29 14:08:48 -07:00
Xi Yan	42104361a3	refactor structure	2024-10-29 14:04:41 -07:00
Ashwin Bharambe	9ddc28eca7	updates	2024-10-29 12:39:09 -07:00
Xi Yan	557e1f9fe7	index	2024-10-29 12:34:10 -07:00
Ashwin Bharambe	2c34f1eb52	updates	2024-10-29 12:32:46 -07:00
Xi Yan	f42f459f5d	name	2024-10-29 12:32:00 -07:00
Ashwin Bharambe	99dc10f6f4	updates	2024-10-29 12:28:27 -07:00
Xi Yan	9759298884	subfolder	2024-10-29 12:26:14 -07:00
Ashwin Bharambe	73b878d358	docs update	2024-10-29 12:21:30 -07:00
Xi Yan	3e5b002335	wip subfolder	2024-10-29 12:20:43 -07:00
Xi Yan	c497cb57f3	toctree	2024-10-29 12:12:39 -07:00
Dalton Flanagan	3c042cb92e	Update developer_cookbook.md	2024-10-29 14:43:59 -04:00
Xi Yan	caf15777f8	Add files via upload	2024-10-29 11:42:45 -07:00
Xi Yan	960fbf90d6	delete image	2024-10-29 11:41:36 -07:00
Ashwin Bharambe	1ffbcd4a9b	fix	2024-10-29 11:38:44 -07:00
Xi Yan	1cc322b4be	Add files via upload	2024-10-29 11:36:17 -07:00
Xi Yan	54c11f8ad1	delete image	2024-10-29 11:35:42 -07:00
dltn	0fed2480b1	colocate iOS setup doc	2024-10-29 11:34:05 -07:00
Xi Yan	6a0a6a218f	Add files via upload	2024-10-29 11:32:36 -07:00
Xi Yan	0813a020a2	mv image	2024-10-29 11:31:43 -07:00
Xi Yan	167976ef56	Add files via upload	2024-10-29 11:29:42 -07:00
Xi Yan	044b13bd36	move docs -> source	2024-10-29 11:20:28 -07:00
Ashwin Bharambe	4aa1bf6a60	Kill --name from llama stack build (#340 )	2024-10-28 23:07:32 -07:00
Xi Yan	8a74e400d6	Update getting_started.md	2024-10-25 13:30:33 -07:00
Xi Yan	f168752bba	Update getting_started.md	2024-10-25 13:27:43 -07:00
Xi Yan	abdf7cddf3	[Evals API][4/n] evals with generation meta-reference impl (#303 ) * wip * dataset validation * test_scoring * cleanup * clean up test * comments * error checking * dataset client * test client: * datasetio client * clean up * basic scoring function works * scorer wip * equality scorer * score batch impl * score batch * update scoring test * refactor * validate scorer input * address comments * evals with generation * add all rows scores to ScoringResult * minor typing * bugfix * scoring function def rename * rebase name * refactor * address comments * Update iOS inference instructions for new quantization * Small updates to quantization config * Fix score threshold in faiss * Bump version to 0.0.45 * Handle both ipv6 and ipv4 interfaces together * update manifest for build templates * Update getting_started.md * chatcompletion & completion input type validation * inclusion->subsetof * error checking * scoring_function -> scoring_fn rename, scorer -> scoring_fn rename * address comments * [Evals API][5/n] fixes to generate openapi spec (#323) * generate openapi * typing comment, dataset -> dataset_id * remove custom type * sample eval run.yaml --------- Co-authored-by: Dalton Flanagan <6599399+dltn@users.noreply.github.com> Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>	2024-10-25 13:12:39 -07:00
Xi Yan	07f9bf723f	fix broken --list-templates with adding build.yaml files for packaging (#327 ) * add build files to templates * fix templates * manifest * symlink * symlink * precommit * change everything to docker build.yaml * remove image_type in templates * fix build from templates CLI * fix readmes	2024-10-25 12:51:22 -07:00
Justin Lee	b6d8246b82	added templates and enhanced readme (#307 ) Co-authored-by: Justin Lee <justinai@fb.com>	2024-10-24 17:07:06 -07:00
Xi Yan	e70420a06e	Update getting_started.md	2024-10-24 14:19:35 -07:00
Suraj Subramanian	668a495aba	Add REST api example for chat_completion (#286 )	2024-10-22 10:35:20 -07:00
raghotham	e2a5a2e10d	first version of readthedocs (#278 )	2024-10-22 10:15:58 +05:30
Ashwin Bharambe	1944405dca	Update new_api_provider.md	2024-10-21 14:02:51 -07:00
Ashwin Bharambe	606c48309e	Small updates to encourage integration testing	2024-10-21 13:52:33 -07:00
Xi Yan	3a7884345a	Update new_api_provider.md	2024-10-21 13:41:56 -07:00
Xi Yan	25b37c9ff7	Update new_api_provider.md	2024-10-21 13:41:46 -07:00
Xi Yan	23210e8679	llama stack distributions / templates / docker refactor (#266 ) * docker compose ollama * comment * update compose file * readme for distributions * readme * move distribution folders * move distribution/templates to distributions/ * rename * kill distribution/templates * readme * readme * build/developer cookbook/new api provider * developer cookbook * readme * readme * [bugfix] fix case for agent when memory bank registered without specifying provider_id (#264) * fix case where memory bank is registered without provider_id * memory test * agents unit test * Add an option to not use elastic agents for meta-reference inference (#269) * Allow overridding checkpoint_dir via config * Small rename * Make all methods `async def` again; add completion() for meta-reference (#270) PR #201 had made several changes while trying to fix issues with getting the stream=False branches of inference and agents API working. As part of this, it made a change which was slightly gratuitous. Namely, making chat_completion() and brethren "def" instead of "async def". The rationale was that this allowed the user (within llama-stack) of this to use it as: ``` async for chunk in api.chat_completion(params) ``` However, it causes unnecessary confusion for several folks. Given that clients (e.g., llama-stack-apps) anyway use the SDK methods (which are completely isolated) this choice was not ideal. Let's revert back so the call now looks like: ``` async for chunk in await api.chat_completion(params) ``` Bonus: Added a completion() implementation for the meta-reference provider. Technically should have been another PR :) * Improve an important error message * update ollama for llama-guard3 * Add vLLM inference provider for OpenAI compatible vLLM server (#178) This PR adds vLLM inference provider for OpenAI compatible vLLM server. * Create .readthedocs.yaml Trying out readthedocs * Update event_logger.py (#275) spelling error * vllm * build templates * delete templates * tmp add back build to avoid merge conflicts * vllm * vllm --------- Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com> Co-authored-by: Ashwin Bharambe <ashwin@meta.com> Co-authored-by: Yuan Tang <terrytangyuan@gmail.com> Co-authored-by: raghotham <rsm@meta.com> Co-authored-by: nehal-a2z <nehal@coderabbit.ai>	2024-10-21 11:17:53 -07:00
Ashwin Bharambe	2089427d60	Make all methods `async def` again; add completion() for meta-reference (#270 ) PR #201 had made several changes while trying to fix issues with getting the stream=False branches of inference and agents API working. As part of this, it made a change which was slightly gratuitous. Namely, making chat_completion() and brethren "def" instead of "async def". The rationale was that this allowed the user (within llama-stack) of this to use it as: ``` async for chunk in api.chat_completion(params) ``` However, it causes unnecessary confusion for several folks. Given that clients (e.g., llama-stack-apps) anyway use the SDK methods (which are completely isolated) this choice was not ideal. Let's revert back so the call now looks like: ``` async for chunk in await api.chat_completion(params) ``` Bonus: Added a completion() implementation for the meta-reference provider. Technically should have been another PR :)	2024-10-18 20:50:59 -07:00
Xi Yan	02be26098a	getting started	2024-10-16 23:56:21 -07:00
Xi Yan	cf9e5b76b2	Update getting_started.md	2024-10-16 23:52:29 -07:00
Xi Yan	7cc47da8f2	Update getting_started.md	2024-10-16 23:50:31 -07:00
Xi Yan	d787d1e84f	config templates restructure, docs (#262 ) * wip * config templates * readmes	2024-10-16 23:25:10 -07:00
ATH	319a6b5f83	Update getting_started.md (#260 )	2024-10-16 18:05:36 -07:00
Matthieu FRONTON	770647dede	Fix broken rendering in Google Colab (#247 )	2024-10-15 15:41:49 -07:00
Yuan Tang	2128e61da2	Fix incorrect completion() signature for Databricks provider (#236 )	2024-10-11 08:47:57 -07:00
Xi Yan	7ff5800dea	generate openapi	2024-10-10 15:30:34 -07:00

1 2 3

103 commits