llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-17 11:49:48 +00:00

Author	SHA1	Message	Date
Ashwin Bharambe	5dce17668c	Move run-*.yaml to templates/ so they can be packaged	2024-11-18 14:54:20 -08:00
Ashwin Bharambe	dd732f037f	Docs for meta-reference-gpu	2024-11-18 14:03:30 -08:00
Ashwin Bharambe	a562668dcd	Update Fireworks + Togther documentation	2024-11-18 13:01:10 -08:00
Ashwin Bharambe	1ecaf2cb3c	Add ollama/pull-models.sh	2024-11-18 11:44:03 -08:00
Ashwin Bharambe	fa1d29cfdc	kill built_at field in run config	2024-11-17 20:42:11 -08:00
Ashwin Bharambe	b1d119466e	Allow setting environment variables from llama stack run and fix ollama	2024-11-17 19:49:15 -08:00
Ashwin Bharambe	a061f3f8c1	Convert ollama to the new model	2024-11-17 15:19:55 -08:00
Ashwin Bharambe	028530546f	Convert TGI	2024-11-17 14:49:41 -08:00
Ashwin Bharambe	9bb07ce298	Run the script to produce vllm outputs	2024-11-17 14:10:15 -08:00
Ashwin Bharambe	0218e68849	Write a script to perform the codegen	2024-11-17 14:03:16 -08:00
Ashwin Bharambe	f38e76ee98	Adding docker-compose.yaml, starting to simplify	2024-11-16 21:48:34 -08:00
Ashwin Bharambe	e4509cb568	more progress on auto-generation	2024-11-15 14:17:16 -08:00
Ashwin Bharambe	cfa913fdd5	Start auto-generating { build, run, doc.md } for distributions	2024-11-15 14:17:16 -08:00
Ashwin Bharambe	2c294346ae	Update provider types and prefix with inline::	2024-11-12 12:54:44 -08:00
Ashwin Bharambe	c1f7ba3aed	Split safety into (llama-guard, prompt-guard, code-scanner) (#400 ) Splits the meta-reference safety implementation into three distinct providers: - inline::llama-guard - inline::prompt-guard - inline::code-scanner Note that this PR is a backward incompatible change to the llama stack server. I have added deprecation_error field to ProviderSpec -- the server reads it and immediately barfs. This is used to direct the user with a specific message on what action to perform. An automagical "config upgrade" is a bit too much work to implement right now :/ (Note that we will be gradually prefixing all inline providers with inline:: -- I am only doing this for this set of new providers because otherwise existing configuration files will break even more badly.)	2024-11-11 09:29:18 -08:00
Ashwin Bharambe	4986e46188	Distributions updates (slight updates to ollama, add inline-vllm and remote-vllm) (#408 ) * remote vllm distro * add inline-vllm details, fix things * Write some docs	2024-11-08 18:09:39 -08:00
Ashwin Bharambe	064d2a5287	Remove the safety adapter for Together; we can just use "meta-reference" (#387 )	2024-11-06 17:36:57 -08:00
Xi Yan	748606195b	Kill `llama stack configure` (#371 ) * remove configure * build msg * wip * build->run * delete prints * docs * fix docs, kill configure * precommit * update fireworks build * docs * clean up build * comments * fix * test * remove baking build.yaml into docker * fix msg, urls * configure msg	2024-11-06 13:32:10 -08:00
Xi Yan	a70a4706fc	update distributions compose/readme (#338 ) * readme updates * quantied compose * dell tgi * config update	2024-10-28 16:34:43 -07:00
Xi Yan	07f9bf723f	fix broken --list-templates with adding build.yaml files for packaging (#327 ) * add build files to templates * fix templates * manifest * symlink * symlink * precommit * change everything to docker build.yaml * remove image_type in templates * fix build from templates CLI * fix readmes	2024-10-25 12:51:22 -07:00

20 commits