llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-31 04:50:01 +00:00

Ihar Hrachyshka 2433ef218d feat: implement async job scheduler for torchtune	2025-03-28 12:11:59 -04:00

Now a separate thread is started to execute training jobs. Training requests now return job ID before the job completes. (Which fixes API timeouts for any jobs that take longer than a minute.)

Note: the scheduler code is meant to be spun out in the future into a common provider service that can be reused for different APIs and providers. It is also expected to back the /jobs API proposed here: https://github.com/meta-llama/llama-stack/discussions/1238

Hence its somewhat generalized form which is expected to simplify its adoption elsewhere in the future.

Note: this patch doesn't attempt to implement missing APIs (e.g. cancel or job removal). This work will belong to follow-up PRs.

Signed-off-by: Ihar Hrachyshka <ihar.hrachyshka@gmail.com>
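
The pattern the commit message describes (run the job on a separate thread and hand the caller a job ID right away) can be sketched roughly as follows. This is an illustrative sketch only, not the code from the patch: the names `Scheduler`, `Job`, `JobStatus`, `schedule`, and `get_job` are hypothetical, and starting one thread per job is a simplifying assumption that may differ from how the real scheduler manages its thread.

```python
import threading
import uuid
from enum import Enum
from typing import Any, Callable, Dict, Optional


class JobStatus(Enum):
    # Hypothetical status values, chosen for this sketch only.
    SCHEDULED = "scheduled"
    RUNNING = "running"
    COMPLETED = "completed"
    FAILED = "failed"


class Job:
    """Book-keeping for one background job (hypothetical structure)."""

    def __init__(self, job_id: str, handler: Callable[[], Any]) -> None:
        self.id = job_id
        self.handler = handler
        self.status = JobStatus.SCHEDULED
        self.result: Any = None
        self.error: Optional[BaseException] = None
        self.done = threading.Event()  # set once the job has finished


class Scheduler:
    """Runs handlers off the calling thread and tracks them by ID."""

    def __init__(self) -> None:
        self._jobs: Dict[str, Job] = {}
        self._lock = threading.Lock()

    def schedule(self, handler: Callable[[], Any]) -> str:
        # Register the job, start a worker thread, and return the ID
        # immediately, without waiting for the handler to finish.
        job = Job(str(uuid.uuid4()), handler)
        with self._lock:
            self._jobs[job.id] = job
        threading.Thread(target=self._run, args=(job,), daemon=True).start()
        return job.id

    def _run(self, job: Job) -> None:
        # Executed on the worker thread.
        job.status = JobStatus.RUNNING
        try:
            job.result = job.handler()
            job.status = JobStatus.COMPLETED
        except Exception as exc:
            job.error = exc
            job.status = JobStatus.FAILED
        finally:
            job.done.set()

    def get_job(self, job_id: str) -> Job:
        with self._lock:
            return self._jobs[job_id]
```

In a request handler, the idea would be to call `schedule(...)` with the long-running training function and return the resulting ID in the response; the client can then look the job up later instead of holding the request open until training finishes.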
inline	feat: implement async job scheduler for torchtune	2025-03-28 12:11:59 -04:00
registry	feat(api): don't return a payload on file delete (#1640)	2025-03-25 17:12:36 -07:00
remote	feat: Add nemo customizer (#1448)	2025-03-25 11:01:10 -07:00
tests	refactor(test): introduce --stack-config and simplify options (#1404)	2025-03-05 17:02:02 -08:00
utils	feat: implement async job scheduler for torchtune	2025-03-28 12:11:59 -04:00
__init__.py	API Updates (#73)	2024-09-17 19:51:35 -07:00
datatypes.py	chore: move all Llama Stack types from llama-models to llama-stack (#1098)	2025-02-14 09:10:59 -08:00