llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-31 18:54:30 +00:00

History

Ihar Hrachyshka 2433ef218d feat: implement async job scheduler for torchtune Now a separate thread is started to execute training jobs. Training requests now return job ID before the job completes. (Which fixes API timeouts for any jobs that take longer than a minute.) Note: the scheduler code is meant to be spun out in the future into a common provider service that can be reused for different APIs and providers. It is also expected to back the /jobs API proposed here: https://github.com/meta-llama/llama-stack/discussions/1238 Hence its somewhat generalized form which is expected to simplify its adoption elsewhere in the future. Note: this patch doesn't attempt to implement missing APIs (e.g. cancel or job removal). This work will belong to follow-up PRs. Signed-off-by: Ihar Hrachyshka <ihar.hrachyshka@gmail.com>		2025-03-28 12:11:59 -04:00
..
agents	feat(rag): entire document context with attachments (#1763 )	2025-03-23 16:57:48 -07:00
datasetio	fix: Call pandas.read_* in a seperate thread (#1698 )	2025-03-19 10:46:37 -07:00
eval	fix: fix jobs api literal return type (#1757 )	2025-03-21 14:04:21 -07:00
inference	fix: Updating `ToolCall.arguments` to allow for json strings that can be decoded on client side (#1685 )	2025-03-19 10:36:19 -07:00
ios/inference	chore: removed executorch submodule (#1265 )	2025-02-25 21:57:21 -08:00
post_training	feat: implement async job scheduler for torchtune	2025-03-28 12:11:59 -04:00
safety	feat(agent): support multiple tool groups (#1556 )	2025-03-17 22:13:09 -07:00
scoring	fix: a couple of tests were broken and not yet exercised by our per-PR test workflow	2025-03-21 12:12:14 -07:00
telemetry	chore: Revert "chore(telemetry): remove service_name entirely" (#1785 )	2025-03-25 14:42:05 -07:00
tool_runtime	chore: mypy violations cleanup for inline::{telemetry,tool_runtime,vector_io} (#1711 )	2025-03-20 10:01:10 -07:00
vector_io	chore: Updating sqlite-vec to make non-blocking calls (#1762 )	2025-03-23 17:25:44 -07:00
__init__.py	`impls` -> `inline`, `adapters` -> `remote` (#381 )	2024-11-06 14:54:05 -08:00