llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-31 23:50:02 +00:00

Author	SHA1	Message	Date
Xi Yan	cbc288680d	bugfix and add requirements	2025-01-09 18:57:01 -08:00
Xi Yan	7e574417ab	bugfix	2025-01-09 18:54:22 -08:00
Xi Yan	650016ffca	trigger models build	2025-01-09 18:53:28 -08:00
Xi Yan	7cdd8b94d6	trigger models build	2025-01-09 18:51:29 -08:00
Xi Yan	97d31d7ab3	add back requirements	2025-01-09 18:37:07 -08:00
Xi Yan	1ea46660a5	add back requirements	2025-01-09 18:35:05 -08:00
Xi Yan	2847d70f38	remove dispatch on push	2025-01-09 17:25:35 -08:00
Xi Yan	5f051b210c	final workflow	2025-01-09 17:24:31 -08:00
Xi Yan	4387863a19	final workflow	2025-01-09 17:24:15 -08:00
Xi Yan	dc74675dc8	add ver	2025-01-09 17:19:46 -08:00
Xi Yan	cca27819b9	fix versions	2025-01-09 17:15:47 -08:00
Xi Yan	63232d7771	remove double quotes	2025-01-09 17:09:46 -08:00
Xi Yan	d8c9798ca8	test	2025-01-09 17:07:07 -08:00
Xi Yan	0b0446f219	fix	2025-01-09 17:02:35 -08:00
Xi Yan	df55ec654e	fix	2025-01-09 16:59:49 -08:00
Xi Yan	2644e096d6	bugfix	2025-01-09 16:54:04 -08:00
Xi Yan	19887139b4	update requirements	2025-01-09 16:51:49 -08:00
Xi Yan	ccd3ec142a	test	2025-01-09 16:45:20 -08:00
Xi Yan	7ca2f5edb1	llama-stack-client-python	2025-01-09 16:34:20 -08:00
Xi Yan	16af87c822	test trigger	2025-01-09 16:33:18 -08:00
Xi Yan	620250324c	initial test	2025-01-09 16:15:37 -08:00
Xi Yan	8527b79bfd	test	2025-01-09 15:37:43 -08:00
Xi Yan	20dc1860c6	test	2025-01-09 15:22:25 -08:00
Xi Yan	45cf46e62f	rebase	2025-01-09 11:45:51 -08:00
Yuan Tang	b8df87bd85	Add automatic PyPI release GitHub workflow (#618) This PR adds a workflow to automatically publish the package (including attestations) to PyPI upon tag/release creation. Note that this relies on trusted publishing: https://docs.pypi.org/trusted-publishers/ --------- Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>	2025-01-09 11:29:26 -08:00
Xi Yan	a45ce85ec1	change schedule	2025-01-08 17:24:01 -08:00
Xi Yan	6c3b9fa09b	back to rc	2025-01-08 17:22:22 -08:00
Xi Yan	3ce9601f9d	nightly	2025-01-08 17:20:42 -08:00
Xi Yan	10b136055a	remove hash	2025-01-08 17:19:08 -08:00
Xi Yan	8640a30e6a	rc?	2025-01-08 17:17:47 -08:00
Xi Yan	8ffdff1c7a	rc?	2025-01-08 17:16:42 -08:00
Xi Yan	a6e1740464	test 0.0.64	2025-01-08 17:13:58 -08:00
Xi Yan	c7becdaffc	test	2025-01-08 17:07:19 -08:00
Xi Yan	e855291d3b	test	2025-01-08 17:04:49 -08:00
Xi Yan	87e2cb8029	test	2025-01-08 17:03:16 -08:00
Xi Yan	94d619b58e	nightly	2025-01-08 17:02:24 -08:00
Xi Yan	efb14c154e	cleanup setup	2025-01-08 16:57:37 -08:00
Xi Yan	665c088adb	on workflow dispatch	2025-01-08 16:56:15 -08:00
Xi Yan	074d8561e5	test	2025-01-08 16:52:51 -08:00
Xi Yan	bc27343c75	test workflow	2025-01-08 16:45:44 -08:00
Xi Yan	596afc6497	add --version to llama stack CLI & /version endpoint (#732) # What does this PR do? - add --version to llama stack CLI - add /version endpoint - run OpenAPI generator for the new endpoint ## Test Plan CLI [screenshot: https://github.com/user-attachments/assets/3acb1d22-453e-4b79-baf6-e98e88d0671c] endpoint [screenshot: https://github.com/user-attachments/assets/79cdd670-493b-40cf-8f9e-28a4ac0988ac] ## Sources Please link relevant resources if necessary. ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case). - [ ] Ran pre-commit to handle lint / formatting issues. - [ ] Read the [contributor guideline](https://github.com/meta-llama/llama-stack/blob/main/CONTRIBUTING.md), Pull Request section? - [ ] Updated relevant documentation. - [ ] Wrote necessary unit or integration tests.	2025-01-08 16:30:06 -08:00
Xi Yan	a5e6f10e33	fix links for distro (#733) # What does this PR do? - fix links for distro docs ## Test Plan [screenshot: https://github.com/user-attachments/assets/a546a11e-2071-4d72-8232-8f30552b7341]	2025-01-08 14:47:09 -08:00
Sixian Yi	ca66a1b188	Update CODEOWNERS - add sixianyi0721 as the owner (#731) # What does this PR do? Add my own GitHub ID to the CODEOWNERS file.	2025-01-07 21:11:59 -08:00
Xi Yan	7a4383e4c1	add 3.3 to together inference provider (#729) # What does this PR do? - add llama3.3 model for together - fix fireworks distro_codegen ``` python llama_stack/scripts/distro_codegen.py ``` ## Test Plan [screenshot: https://github.com/user-attachments/assets/bf94b933-9200-4e73-878e-d1a95d450a88] Tests ``` pytest -v -s -k "together" --inference-model="meta-llama/Llama-3.3-70B-Instruct" ./llama_stack/providers/tests/inference/test_text_inference.py ``` [screenshot: https://github.com/user-attachments/assets/407dc98b-8de3-4841-8cb1-75e4b5128544]	2025-01-06 15:39:41 -08:00
Xi Yan	7a90fc5854	move DataSchemaValidatorMixin into standalone utils (#720) # What does this PR do? - there's no value in keeping data schema validation logic in a DataSchemaValidatorMixin - move data schema validation logic into standalone utils ## Test Plan ``` pytest -v -s -m llm_as_judge_scoring_together_inference scoring/test_scoring.py --judge-model meta-llama/Llama-3.2-3B-Instruct pytest -v -s -m basic_scoring_together_inference scoring/test_scoring.py pytest -v -s -m braintrust_scoring_together_inference scoring/test_scoring.py pytest -v -s -m meta_reference_eval_together_inference eval/test_eval.py pytest -v -s -m meta_reference_eval_together_inference_huggingface_datasetio eval/test_eval.py ```	2025-01-06 13:25:09 -08:00
Dinesh Yeduguru	0bc5d05243	remove default logger handlers when using libcli with notebook (#718) # What does this PR do? Remove the default log handlers for notebook to avoid polluting logs	2025-01-06 13:06:22 -08:00
Botao Chen	e86271aeac	support llama3.1 8B instruct in post training (#698) ## What does this PR do? - Change to support llama3.1 8B instruct model other than llama3 8B model, as llama3.1 8B instruct model is a better model to finetune on top of - Make the copy files logic in checkpointer safer in case the file to be copied doesn't exist in the source path ## test issue a post training request from client and verify training works as expected [screenshot "Screenshot 2025-01-02 at 12 18 45 PM": https://github.com/user-attachments/assets/47cc4df9-3edc-4afd-b5dd-abe1f039f1ed] [screenshot "Screenshot 2025-01-02 at 12 18 52 PM": https://github.com/user-attachments/assets/b9435274-ef1d-4570-bd8e-0880c3a4b2e9]	2025-01-03 17:33:05 -08:00
Aidan Do	485476c29a	Fix Groq invalid self.config reference (#719) # What does this PR do? Contributes towards: #432 RE: https://github.com/meta-llama/llama-stack/pull/609 I missed this one while refactoring. Fixes: ```python Traceback (most recent call last): File "/Users/aidand/dev/llama-stack/llama_stack/distribution/server/server.py", line 191, in endpoint return await maybe_await(value) File "/Users/aidand/dev/llama-stack/llama_stack/distribution/server/server.py", line 155, in maybe_await return await value File "/Users/aidand/dev/llama-stack/llama_stack/providers/utils/telemetry/trace_protocol.py", line 101, in async_wrapper result = await method(self, args, kwargs) File "/Users/aidand/dev/llama-stack/llama_stack/distribution/routers/routers.py", line 156, in chat_completion return await provider.chat_completion(params) File "/Users/aidand/dev/llama-stack/llama_stack/providers/utils/telemetry/trace_protocol.py", line 101, in async_wrapper result = await method(self, args, kwargs) File "/Users/aidand/dev/llama-stack/llama_stack/providers/remote/inference/groq/groq.py", line 127, in chat_completion response = self._get_client().chat.completions.create(request) File "/Users/aidand/dev/llama-stack/llama_stack/providers/remote/inference/groq/groq.py", line 143, in _get_client return Groq(api_key=self.config.api_key) AttributeError: 'GroqInferenceAdapter' object has no attribute 'config'. Did you mean: '_config'? ``` ## Test Plan Environment: ```shell export GROQ_API_KEY=<api-key> # build.yaml and run.yaml files wget https://raw.githubusercontent.com/aidando73/llama-stack/9165502582cd7cb178bc1dcf89955b45768ab6c1/build.yaml wget https://raw.githubusercontent.com/aidando73/llama-stack/9165502582cd7cb178bc1dcf89955b45768ab6c1/run.yaml # Create environment if not already conda create --prefix ./envs python=3.10 conda activate ./envs # Build pip install -e . && llama stack build --config ./build.yaml --image-type conda # Activate built environment conda activate llamastack-groq ``` <details> <summary>Manual</summary> ```bash llama stack run ./run.yaml --port 5001 ``` Via this Jupyter notebook: `9165502582/hello.ipynb` </details> ## Sources Please link relevant resources if necessary. ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case). - [x] Ran pre-commit to handle lint / formatting issues. - [x] Read the [contributor guideline](https://github.com/meta-llama/llama-stack/blob/main/CONTRIBUTING.md), Pull Request section? - [x] Updated relevant documentation. - [ ] Wrote necessary unit or integration tests.	2025-01-03 15:47:10 -08:00
Yuan Tang	04d5b9814f	Fix assert message and call to completion_request_to_prompt in remote:vllm (#709) The current message is incorrect and model arg is not needed in `completion_request_to_prompt`. Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>	2025-01-03 13:44:49 -08:00
Yuan Tang	96d8375663	Fix incorrect entrypoint for broken `llama stack run` (#706) This fixes the issue when using `llama stack run` by correctly specifying entrypoint: ``` LLAMA_STACK_DIR=. llama stack run /home/yutang/.llama/distributions/llamastack-vllm/vllm-run.yaml Using config file: /home/yutang/.llama/distributions/llamastack-vllm/vllm-run.yaml + command -v selinuxenabled + selinuxenabled + DOCKER_OPTS=' --security-opt label=disable' + mounts= + '[' -n . ']' ++ readlink -f . + mounts=' -v /home/yutang/repos/llama-stack:/app/llama-stack-source' + '[' -n '' ']' + version_tag=latest + '[' -n '' ']' + '[' -n . ']' + version_tag=dev + podman run --security-opt label=disable -it -p 5000:5000 -v /home/yutang/.llama/distributions/llamastack-vllm/vllm-run.yaml:/app/config.yaml -v /home/yutang/repos/llama-stack:/app/llama-stack-source localhost/distribution-vllm:dev python -m llama_stack.distribution.server.server --yaml-config /app/config.yaml --port 5000 usage: server.py [-h] [--yaml-config YAML_CONFIG] [--template TEMPLATE] [--port PORT] [--disable-ipv6] [--env ENV] server.py: error: unrecognized arguments: python -m llama_stack.distribution.server.server ++ error_handler 88 ++ echo 'Error occurred in script at line: 88' Error occurred in script at line: 88 ++ exit 1 ``` --------- Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>	2025-01-03 09:47:10 -08:00

842 commits