Commit graph

935 commits

Author SHA1 Message Date
Xi Yan
adead707cd Merge branch 'api_2' into api_3 2025-03-12 11:38:25 -07:00
Xi Yan
eddf3e1bb2 Merge branch 'pr1573' into api_2 2025-03-12 11:38:13 -07:00
Xi Yan
31e3409909 Merge branch 'main' into pr1573 2025-03-12 11:38:02 -07:00
Botao Chen
90ca4d94de
fix: fix passthrough inference provider to make it work for agent (#1577)
## What does this PR do?
We noticed that the passthrough inference provider doesn't work with the agent
due to a type mismatch between client and server. We manually cast
the llama stack client types to llama stack server types to fix the issue.
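
Roughly, the cast is shaped like the sketch below (illustrative only: the model
names are placeholders assuming Pydantic models on both the client and server
side, not the exact types this PR touches):

```
from pydantic import BaseModel

# Illustrative stand-ins for a llama-stack-client type and the matching
# llama-stack server type; the real cast applies to the inference types
# the agent passes through.
class ClientCompletionMessage(BaseModel):
    role: str
    content: str

class ServerCompletionMessage(BaseModel):
    role: str
    content: str

def to_server_type(msg: ClientCompletionMessage) -> ServerCompletionMessage:
    # Round-trip through a plain dict so the server-side model re-validates
    # the payload instead of receiving the client-side class directly.
    return ServerCompletionMessage(**msg.model_dump())
```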

## test 
run `python -m examples.agents.hello localhost 8321` within
llama-stack-apps

<img width="1073" alt="Screenshot 2025-03-11 at 8 43 44 PM"
src="https://github.com/user-attachments/assets/bd1bdd31-606a-420c-a249-95f6184cc0b1"
/>

fix https://github.com/meta-llama/llama-stack/issues/1560
2025-03-12 11:16:17 -07:00
Botao Chen
0b0be70605
feat: Add open benchmark template codegen (#1579)
## What does this PR do?

As the title says, add codegen for the open-benchmark template.

## test 

Checked the newly generated run.yaml file; it is identical before and after the
change.

Also add a small improvement to the together template so that a missing
TOGETHER_API_KEY won't crash the server, keeping the user experience consistent
with other remote providers.
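
A minimal sketch of the intended behavior, assuming the key is read from the
environment; the helper name below is illustrative, not the actual together
template code:

```
import os

# Illustrative only: don't fail at server startup if the key is missing;
# resolve it lazily and raise a clear error at request time instead.
def get_together_api_key() -> str:
    key = os.getenv("TOGETHER_API_KEY")
    if not key:
        raise RuntimeError(
            "TOGETHER_API_KEY is not set; set it before making inference requests."
        )
    return key
```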
2025-03-12 11:12:08 -07:00
Charlie Doern
4eee349acd
fix: respect log_level in uvicorn and third party libs (#1524)
# What does this PR do?

uvicorn has a `log_level` arg in `uvicorn.run`; pass in the effective
level set by the logger.

Additionally, third-party libraries like httpx use our logging format
but do not honor our log level.

This seems unintended, so loop through all items in the loggerDict and
apply the same log level we have set.
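
A rough sketch of both changes, assuming the effective level has already been
computed; `run_server` and its arguments are illustrative, not the actual server
code:

```
import logging
import uvicorn

def run_server(app, effective_level: int = logging.WARNING) -> None:
    # Apply our effective level to every logger third-party libraries have
    # already registered (httpx, datasets, ...) so they stop emitting below it.
    for name in logging.root.manager.loggerDict:
        logging.getLogger(name).setLevel(effective_level)

    # Pass the same level to uvicorn so its own loggers match.
    uvicorn.run(
        app,
        host="0.0.0.0",
        port=8321,
        log_level=logging.getLevelName(effective_level).lower(),
    )
```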


## Test Plan

before:

```
llama stack run --image-type venv ~/.llama/distributions/ollama/ollama-run.yaml
Environment variable LLAMA_STACK_LOGGING found: all=warn
Using virtual environment: /Users/charliedoern/projects/Documents/llama-stack/venv
+ python -m llama_stack.distribution.server.server --yaml-config /Users/charliedoern/.llama/distributions/ollama/ollama-run.yaml --port 8321
Environment variable LLAMA_STACK_LOGGING found: all=warn
WARNING  2025-03-10 16:05:49,706 root:71 uncategorized: Warning: `bwrap` is not available. Code interpreter tool will
         not work correctly.
INFO     2025-03-10 16:05:49,916 datasets:54 uncategorized: PyTorch version 2.5.1 available.
INFO     2025-03-10 16:05:50,010 httpx:1740 uncategorized: HTTP Request: GET http://localhost:11434/api/ps "HTTP/1.1 200
         OK"
INFO     2025-03-10 16:05:50,297 httpx:1740 uncategorized: HTTP Request: POST http://localhost:11434/api/pull "HTTP/1.1
         200 OK"
INFO     2025-03-10 16:05:50,314 httpx:1740 uncategorized: HTTP Request: GET http://localhost:11434/api/tags "HTTP/1.1
         200 OK"
INFO:     Started server process [89663]
INFO:     Waiting for application startup.
INFO:     ASGI 'lifespan' protocol appears unsupported.
INFO:     Application startup complete.
INFO:     Uvicorn running on http://['::', '0.0.0.0']:8321 (Press CTRL+C to quit)
```

after:

```
llama stack run --image-type venv ~/.llama/distributions/ollama/ollama-run.yaml
Environment variable LLAMA_STACK_LOGGING found: all=warn
Using virtual environment: /Users/charliedoern/projects/Documents/llama-stack/venv
+ python -m llama_stack.distribution.server.server --yaml-config /Users/charliedoern/.llama/distributions/ollama/ollama-run.yaml --port 8321
Environment variable LLAMA_STACK_LOGGING found: all=warn
WARNING  2025-03-10 16:05:20,429 root:71 uncategorized: Warning: `bwrap` is not available. Code interpreter tool will
         not work correctly.
INFO     2025-03-10 16:05:20,639 datasets:54 uncategorized: PyTorch version 2.5.1 available.
```

Signed-off-by: Charlie Doern <cdoern@redhat.com>
2025-03-12 11:07:28 -07:00
Xi Yan
fecfb32855 remove benchmark_id in eval path 2025-03-12 01:45:50 -07:00
Xi Yan
ddcbf5e8d5 scoring job 2025-03-12 01:18:46 -07:00
Xi Yan
f88755eb93 scoring job 2025-03-12 01:16:03 -07:00
Xi Yan
5f330c3971 update scoring 2025-03-12 01:09:54 -07:00
Xi Yan
f2ede05f3f update scoring 2025-03-12 01:01:53 -07:00
Xi Yan
fdf251234e eval job 2025-03-12 00:59:23 -07:00
Xi Yan
ea0b29ae50 Merge branch 'api_2' into api_3 2025-03-12 00:49:09 -07:00
Xi Yan
1d62ae595d include benchmarks 2025-03-12 00:45:32 -07:00
Xi Yan
b4d868a1e5 include benchmarks 2025-03-12 00:43:24 -07:00
Xi Yan
4acd1e404e tmp 2025-03-12 00:39:49 -07:00
Xi Yan
3e0d4901da Merge branch 'api_2' into api_3 2025-03-12 00:22:46 -07:00
Xi Yan
e68e8c96ae fix 2025-03-12 00:21:34 -07:00
Xi Yan
d0e372058d Merge branch 'api_2' into api_3 2025-03-12 00:21:03 -07:00
Xi Yan
124040af77 params -> fn 2025-03-12 00:20:41 -07:00
Xi Yan
af4216f34f Merge branch 'pr1573' into api_2 2025-03-12 00:19:25 -07:00
Xi Yan
1d80ec7f81 upgrade doc 2025-03-12 00:17:58 -07:00
Xi Yan
0abedd070c comment 2025-03-12 00:13:27 -07:00
Xi Yan
78b4cdad67 wip 2025-03-12 00:09:03 -07:00
Xi Yan
5c954dd033 single type 2025-03-11 23:25:19 -07:00
Xi Yan
bec5a46915 single type 2025-03-11 23:20:16 -07:00
Xi Yan
bc71980769 alternative 2025-03-11 23:14:35 -07:00
Xi Yan
cd3a3a5e26 add alternative 2025-03-11 23:10:17 -07:00
Xi Yan
4236769b65 precommit 2025-03-11 22:49:44 -07:00
Xi Yan
58d9cb1276 docs 2025-03-11 22:46:52 -07:00
Xi Yan
f9ea90c4f7 docs 2025-03-11 22:45:48 -07:00
Xi Yan
11e57e17e6 custom 2025-03-11 22:39:50 -07:00
Xi Yan
504eeef413 custom 2025-03-11 22:39:22 -07:00
Xi Yan
8952e40201 custom 2025-03-11 22:14:06 -07:00
Xi Yan
5162889709 precommit 2025-03-11 22:13:05 -07:00
Xi Yan
685e863bb5 remove json_schema_type decorator 2025-03-11 22:08:15 -07:00
Xi Yan
98dfc99584 docs 2025-03-11 22:06:55 -07:00
Xi Yan
de382e7b45 merge description with metadata 2025-03-11 22:06:22 -07:00
Xi Yan
2bb6ca818a scoring api update 2025-03-11 21:53:47 -07:00
Xi Yan
bbb1947fb4 scoring api update 2025-03-11 21:52:01 -07:00
Xi Yan
b3ee4c00ce scoring function type 2025-03-11 21:50:25 -07:00
Xi Yan
70fdf6c04b precommit 2025-03-11 21:43:43 -07:00
ehhuang
59dddafd12
feat: convert typehints from client_tool to litellm format (#1565)
Summary:
Supports https://github.com/meta-llama/llama-stack-client-python/pull/193
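
For context on what the conversion involves, a minimal sketch of mapping Python
type hints on a client tool's parameters to the JSON-schema style types used in
litellm/OpenAI tool definitions; the mapping table and helper below are
assumptions, not this PR's code:

```
from typing import get_type_hints

# Assumed mapping from Python type hints to JSON-schema type names.
_TYPE_MAP = {str: "string", int: "integer", float: "number", bool: "boolean", list: "array", dict: "object"}

def param_schema(fn) -> dict:
    # Build an OpenAI/litellm-style "parameters" schema from a function signature.
    hints = get_type_hints(fn)
    props = {
        name: {"type": _TYPE_MAP.get(hint, "string")}
        for name, hint in hints.items()
        if name != "return"
    }
    # Simplification: treat every parameter as required.
    return {"type": "object", "properties": props, "required": list(props)}
```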

Test Plan:
LLAMA_STACK_CONFIG=fireworks pytest -s -v
tests/integration/agents/test_agents.py --safety-shield
meta-llama/Llama-Guard-3-8B --text-model
meta-llama/Llama-3.1-8B-Instruct
2025-03-11 20:02:11 -07:00
Xi Yan
817331e76e precommit 2025-03-11 18:34:38 -07:00
Xi Yan
0e47c65051 update 2025-03-11 18:29:55 -07:00
Xi Yan
02aa9a1e85 remove json_schema_type decorator 2025-03-11 16:08:06 -07:00
Xi Yan
8592c2b48a precommit 2025-03-11 14:56:12 -07:00
Xi Yan
bc551e6459 datasets api
2025-03-11 14:44:49 -07:00
Josh Salomon
5f90be5388
fix: Fixed bad file name in inline::localfs (#1358)
Bug https://github.com/meta-llama/llama-stack/issues/1357

# What does this PR do?
Fix a bug where a wrong file name was used in the inline::localfs datasetio provider.

Closes #1357


Signed-off-by: Josh Salomon <jsalomon@redhat.com>
2025-03-11 12:46:11 -07:00
Xi Yan
43044f29e2
fix: fix llama stack run with missing agent impl (#1559)
# What does this PR do?

- the recent merge https://github.com/meta-llama/llama-stack/pull/1410
introduced an error:
```
ValueError: Provider meta-reference (Api.agents) does not implement the following methods:
[('list_agent_sessions', 'not_actually_implemented'), ('list_agents', 'not_actually_implemented')]
```
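
One plausible shape of the fix, as a sketch only (the class name and return
types below are assumptions, not the actual change): give the meta-reference
agents provider concrete, even if minimal, implementations of the two methods
named in the error so the provider-method check passes.

```
# Sketch only: minimal async implementations of the methods the resolver
# reported as missing; real signatures come from the Agents API definition.
class MetaReferenceAgentsImpl:
    async def list_agents(self) -> list:
        return []

    async def list_agent_sessions(self, agent_id: str) -> list:
        return []
```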


## Test Plan
```
llama stack run
```

```
LLAMA_STACK_CONFIG=fireworks pytest -v tests/integration/agents/test_agents.py --text-model meta-llama/Llama-3.3-70B-Instruct
```

1379530386

2025-03-11 11:22:22 -07:00