llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-07 18:57:21 +00:00

History

Charlie Doern 65b4fae51d fix: proper checkpointing logic for HF trainer (#2429 ) # What does this PR do? currently only the last saved model is reported as a checkpoint and associated with the job UUID. since the HF trainer handles checkpoint collection during training, we need to add all of the `checkpoint-*` folders as Checkpoint objects. Adjust the save strategy to be per-epoch to make this easier and to use less storage Signed-off-by: Charlie Doern <cdoern@redhat.com>		2025-06-27 17:36:25 -04:00
..
agents	chore: remove nested imports (#2515 )	2025-06-26 08:01:05 +05:30
datasetio	chore(refact): move paginate_records fn outside of datasetio (#2137 )	2025-05-12 10:56:14 -07:00
eval	chore: remove nested imports (#2515 )	2025-06-26 08:01:05 +05:30
files/localfs	refactor(env)!: enhanced environment variable substitution (#2490 )	2025-06-26 08:20:08 +05:30
inference	refactor(env)!: enhanced environment variable substitution (#2490 )	2025-06-26 08:20:08 +05:30
ios/inference	chore: removed executorch submodule (#1265 )	2025-02-25 21:57:21 -08:00
post_training	fix: proper checkpointing logic for HF trainer (#2429 )	2025-06-27 17:36:25 -04:00
safety	feat: add cpu/cuda config for prompt guard (#2194 )	2025-05-28 12:23:15 -07:00
scoring	refactor(env)!: enhanced environment variable substitution (#2490 )	2025-06-26 08:20:08 +05:30
telemetry	fix: fix test of root span to match what is being set (#2494 )	2025-06-26 11:41:35 -04:00
tool_runtime	feat: Add ChunkMetadata to Chunk (#2497 )	2025-06-25 15:55:23 -04:00
vector_io	fix: ValueError in faiss vector database serialization (resolves #2519 ) (#2526 )	2025-06-27 14:34:52 -04:00
__init__.py	`impls` -> `inline`, `adapters` -> `remote` (#381 )	2024-11-06 14:54:05 -08:00