llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-08 11:07:22 +00:00

Author	SHA1	Message	Date
Kai Wu	87904d329f	add more about safety and agent docs	2024-11-04 16:23:46 -08:00
Kai Wu	d61f328ffb	Merge branch 'docs_improvement' of github.com:meta-llama/llama-stack into docs_improvement	2024-11-04 16:22:44 -08:00
Sanyam Bhutani	f8da92900e	Create zero_to_getting_started.ipynb	2024-11-04 16:08:55 -08:00
Justin Lee	d200a6b002	beef up quickstart	2024-11-04 14:56:03 -08:00
Kai Wu	1422d631a8	Merge branch 'main' into docs_improvement	2024-11-04 12:40:18 -08:00
Kai Wu	2898a9bc9e	prompt guide added	2024-11-04 12:38:44 -08:00
Dinesh Yeduguru	ac93dd89cf	fix bedrock impl (#359 ) * fix bedrock impl * fix linter errors * fix return type and remove debug print	2024-11-03 07:32:30 -08:00
Justin Lee	4f31f1b4cc	added few-shot-guide	2024-11-01 14:14:35 -07:00
Justin Lee	78083c4e0a	removed unnecessary files	2024-11-01 13:56:54 -07:00
Justin Lee	43289c36e1	added todo	2024-11-01 13:55:30 -07:00
Justin Lee	b41abff4fb	minor enhancement md	2024-11-01 13:50:24 -07:00
Justin Lee	1794ebc627	added cloud-local-inference-guide	2024-11-01 13:39:11 -07:00
Ashwin Bharambe	bf4f97a2e1	Fix vLLM adapter chat_completion signature	2024-11-01 13:09:03 -07:00
Justin Lee	46763bc001	quick fix on title	2024-11-01 11:46:35 -07:00
Justin Lee	ed70e140eb	added streaming guide	2024-11-01 11:41:03 -07:00
Dalton Flanagan	adecb2a2d3	update for message parsing on ios	2024-11-01 14:37:19 -04:00
Justin Lee	bf16d7729f	wrote guide for chat completion	2024-11-01 11:33:15 -07:00
Justin Lee	b514f1ec3a	quickstart guide (might be dated)	2024-10-31 15:49:05 -07:00
Justin Lee	703d7ebb6e	adding a few more inference examples	2024-10-31 15:46:47 -07:00
Justin Lee	626dffa0d9	added simple inferences	2024-10-31 15:45:47 -07:00
Ashwin Bharambe	37b330b4ef	add dynamic clients for all APIs (#348 ) * add dynamic clients for all APIs * fix openapi generator * inference + memory + agents tests now pass with "remote" providers * Add docstring which fixes openapi generator :/	2024-10-31 14:46:25 -07:00
Kai Wu	e4560a5e74	second draft	2024-10-31 13:37:55 -07:00
Steve Grubb	f04b566c5c	Do not cache pip (#349 ) Pip has a 3.3GB cache of torch and friends. Do not keep this in the image.	2024-10-31 09:52:40 -07:00
Xi Yan	3b1917d5ea	run openapi generator	2024-10-30 16:17:35 -07:00
Kai Wu	050b1ae718	agents101 draft	2024-10-30 14:01:28 -07:00
Kai Wu	384b31c4c2	first draft	2024-10-29 17:42:41 -07:00
Ashwin Bharambe	4aa1bf6a60	Kill --name from llama stack build (#340 )	2024-10-28 23:07:32 -07:00
Ashwin Bharambe	26d1668f7d	Revert "remove Field for return_type" This reverts commit `ffb3965ade`.	2024-10-28 21:39:48 -07:00
Ashwin Bharambe	eccd7dc4a9	Avoid warnings from pydantic for overriding schema Also fix structured output in completions	2024-10-28 21:39:48 -07:00
Xi Yan	ed833bb758	[Evals API][7/n] braintrust scoring provider (#333 ) * wip scoring refactor * llm as judge, move folders * test full generation + eval * extract score regex to llm context * remove prints, cleanup braintrust in this branch * braintrust skeleton * datasetio test fix * braintrust provider * remove prints * dependencies * change json -> class * json -> class * remove initialize * address nits * check identifier prefix * braintrust scoring identifier check, rebase * udpate MANIFEST * manifest * remove braintrust scoring_fn * remove comments * tests * imports fix	2024-10-28 18:59:35 -07:00
Xi Yan	ae671eaf7a	distro readmes with model serving instructions (#339 ) * readme updates * quantied compose * dell tgi * config update * readme * update model serving readmes * update * update * config	2024-10-28 17:47:14 -07:00
Xi Yan	a70a4706fc	update distributions compose/readme (#338 ) * readme updates * quantied compose * dell tgi * config update	2024-10-28 16:34:43 -07:00
Xi Yan	985ff4d6ce	update distributions/readmes	2024-10-28 15:10:40 -07:00
Xi Yan	7b8748c53e	[Evals API][6/n] meta-reference llm as judge, registration for ScoringFnDefs (#330 ) * wip scoring refactor * llm as judge, move folders * test full generation + eval * extract score regex to llm context * remove prints, cleanup braintrust in this branch * change json -> class * remove initialize * address nits * check identifier prefix * udpate MANIFEST	2024-10-28 14:08:42 -07:00
Xi Yan	04a4784287	Update README.md	2024-10-28 13:25:44 -07:00
Xi Yan	3fa1eaf37d	Update README.md	2024-10-28 13:18:55 -07:00
Xi Yan	0d4215e125	Update README.md	2024-10-28 13:18:34 -07:00
Xi Yan	8f5a850de9	Update README.md	2024-10-28 13:16:23 -07:00
Xi Yan	ffb3965ade	remove Field for return_type	2024-10-28 13:04:41 -07:00
Ashwin Bharambe	b7d2b83d55	Allow passing provider_registry to resolve_impls()	2024-10-28 11:58:16 -07:00
Ashwin Bharambe	8a3b64d1be	Bump version to 0.0.47	2024-10-27 22:30:38 -07:00
Xi Yan	46bb8884a7	distributions readme typos	2024-10-27 11:57:21 -07:00
Dalton Flanagan	44c05c6e7d	add vision instruct models for fireworks	2024-10-27 17:54:54 +00:00
Dinesh Yeduguru	9b85d9a841	completion() for fireworks (#329 )	2024-10-25 16:12:10 -07:00
Dinesh Yeduguru	7ec79f3b9d	completion() for together (#324 ) * completion() for together * test fixes * fix client building	2024-10-25 14:21:12 -07:00
Xi Yan	8a74e400d6	Update getting_started.md	2024-10-25 13:30:33 -07:00
Xi Yan	f168752bba	Update getting_started.md	2024-10-25 13:27:43 -07:00
Xi Yan	abdf7cddf3	[Evals API][4/n] evals with generation meta-reference impl (#303 ) * wip * dataset validation * test_scoring * cleanup * clean up test * comments * error checking * dataset client * test client: * datasetio client * clean up * basic scoring function works * scorer wip * equality scorer * score batch impl * score batch * update scoring test * refactor * validate scorer input * address comments * evals with generation * add all rows scores to ScoringResult * minor typing * bugfix * scoring function def rename * rebase name * refactor * address comments * Update iOS inference instructions for new quantization * Small updates to quantization config * Fix score threshold in faiss * Bump version to 0.0.45 * Handle both ipv6 and ipv4 interfaces together * update manifest for build templates * Update getting_started.md * chatcompletion & completion input type validation * inclusion->subsetof * error checking * scoring_function -> scoring_fn rename, scorer -> scoring_fn rename * address comments * [Evals API][5/n] fixes to generate openapi spec (#323) * generate openapi * typing comment, dataset -> dataset_id * remove custom type * sample eval run.yaml --------- Co-authored-by: Dalton Flanagan <6599399+dltn@users.noreply.github.com> Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>	2024-10-25 13:12:39 -07:00
Ashwin Bharambe	426d821e7f	Bump version to 0.0.46	2024-10-25 13:10:55 -07:00
Sachin Mehta	c05fbf14b3	Added hadamard transform for spinquant (#326 ) * Added hadamard transform for spinquant * Changed from config to model_args * Added an assertion for model args * Use enum.value to check against str * pre-commit --------- Co-authored-by: Sachin Mehta <sacmehta@fb.com> Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>	2024-10-25 12:58:48 -07:00

1 2 3 4 5 ...

413 commits