mirror of
				https://github.com/meta-llama/llama-stack.git
				synced 2025-10-26 09:15:40 +00:00 
			
		
		
		
	
		
			Some checks failed
		
		
	
	Integration Tests (Replay) / discover-tests (push) Successful in 3s
				
			Test External Providers Installed via Module / test-external-providers-from-module (venv) (push) Has been skipped
				
			Integration Auth Tests / test-matrix (oauth2_token) (push) Failing after 9s
				
			Python Package Build Test / build (3.12) (push) Failing after 4s
				
			Vector IO Integration Tests / test-matrix (3.12, inline::milvus) (push) Failing after 12s
				
			Test Llama Stack Build / generate-matrix (push) Successful in 11s
				
			Test Llama Stack Build / build-ubi9-container-distribution (push) Failing after 12s
				
			Vector IO Integration Tests / test-matrix (3.12, inline::faiss) (push) Failing after 14s
				
			SqlStore Integration Tests / test-postgres (3.12) (push) Failing after 22s
				
			Test External API and Providers / test-external (venv) (push) Failing after 14s
				
			Integration Tests (Replay) / Integration Tests (, , , client=, vision=) (push) Failing after 12s
				
			Vector IO Integration Tests / test-matrix (3.12, remote::pgvector) (push) Failing after 15s
				
			SqlStore Integration Tests / test-postgres (3.13) (push) Failing after 22s
				
			Test Llama Stack Build / build-custom-container-distribution (push) Failing after 14s
				
			Unit Tests / unit-tests (3.13) (push) Failing after 14s
				
			Test Llama Stack Build / build-single-provider (push) Failing after 13s
				
			Vector IO Integration Tests / test-matrix (3.12, remote::chromadb) (push) Failing after 18s
				
			Unit Tests / unit-tests (3.12) (push) Failing after 16s
				
			Vector IO Integration Tests / test-matrix (3.12, remote::qdrant) (push) Failing after 18s
				
			Vector IO Integration Tests / test-matrix (3.13, remote::weaviate) (push) Failing after 10s
				
			Vector IO Integration Tests / test-matrix (3.13, inline::faiss) (push) Failing after 11s
				
			Vector IO Integration Tests / test-matrix (3.12, remote::weaviate) (push) Failing after 16s
				
			Vector IO Integration Tests / test-matrix (3.13, remote::qdrant) (push) Failing after 18s
				
			Test Llama Stack Build / build (push) Failing after 12s
				
			Vector IO Integration Tests / test-matrix (3.13, remote::chromadb) (push) Failing after 18s
				
			Vector IO Integration Tests / test-matrix (3.13, remote::pgvector) (push) Failing after 20s
				
			Vector IO Integration Tests / test-matrix (3.13, inline::sqlite-vec) (push) Failing after 16s
				
			Python Package Build Test / build (3.13) (push) Failing after 53s
				
			Vector IO Integration Tests / test-matrix (3.13, inline::milvus) (push) Failing after 59s
				
			Vector IO Integration Tests / test-matrix (3.12, inline::sqlite-vec) (push) Failing after 1m1s
				
			Update ReadTheDocs / update-readthedocs (push) Failing after 1m6s
				
			Pre-commit / pre-commit (push) Successful in 1m53s
				
			A bunch of miscellaneous cleanup focusing on tests, but ended up speeding up starter distro substantially. - Pulled llama stack client init for tests into `pytest_sessionstart` so it does not clobber output - Profiling of that told me where we were doing lots of heavy imports for starter, so lazied them - starter now starts 20seconds+ faster on my Mac - A few other smallish refactors for `compat_client`
		
			
				
	
	
	
	
		
			1.5 KiB
		
	
	
	
	
	
	
	
			
		
		
	
	
			1.5 KiB
		
	
	
	
	
	
	
	
inline::huggingface
Description
HuggingFace-based post-training provider for fine-tuning models using the HuggingFace ecosystem.
Configuration
| Field | Type | Required | Default | Description | 
|---|---|---|---|---|
| device | <class 'str'> | No | cuda | |
| distributed_backend | Literal['fsdp', 'deepspeed' | No | ||
| checkpoint_format | Literal['full_state', 'huggingface' | No | huggingface | |
| chat_template | <class 'str'> | No | < | user | 
| {input} | ||||
| < | assistant | > | ||
| {output} | ||||
| model_specific_config | <class 'dict'> | No | {'trust_remote_code': True, 'attn_implementation': 'sdpa'} | |
| max_seq_length | <class 'int'> | No | 2048 | |
| gradient_checkpointing | <class 'bool'> | No | False | |
| save_total_limit | <class 'int'> | No | 3 | |
| logging_steps | <class 'int'> | No | 10 | |
| warmup_ratio | <class 'float'> | No | 0.1 | |
| weight_decay | <class 'float'> | No | 0.01 | |
| dataloader_num_workers | <class 'int'> | No | 4 | |
| dataloader_pin_memory | <class 'bool'> | No | True | |
| dpo_beta | <class 'float'> | No | 0.1 | |
| use_reference_model | <class 'bool'> | No | True | |
| dpo_loss_type | Literal['sigmoid', 'hinge', 'ipo', 'kto_pair' | No | sigmoid | |
| dpo_output_dir | <class 'str'> | No | 
Sample Configuration
checkpoint_format: huggingface
distributed_backend: null
device: cpu
dpo_output_dir: ~/.llama/dummy/dpo_output