phoenix-oss/llama-stack-mirror

Fork 1

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-08-16 06:27:58 +00:00

Ashwin Bharambe 7f834339ba

Integration Tests (Replay) / discover-tests (push) Successful in 3s

Details

Test External Providers Installed via Module / test-external-providers-from-module (venv) (push) Has been skipped

Details

Integration Auth Tests / test-matrix (oauth2_token) (push) Failing after 9s

Details

Python Package Build Test / build (3.12) (push) Failing after 4s

Details

Vector IO Integration Tests / test-matrix (3.12, inline::milvus) (push) Failing after 12s

Details

Test Llama Stack Build / generate-matrix (push) Successful in 11s

Details

Test Llama Stack Build / build-ubi9-container-distribution (push) Failing after 12s

Details

Vector IO Integration Tests / test-matrix (3.12, inline::faiss) (push) Failing after 14s

Details

SqlStore Integration Tests / test-postgres (3.12) (push) Failing after 22s

Details

Test External API and Providers / test-external (venv) (push) Failing after 14s

Details

Integration Tests (Replay) / Integration Tests (, , , client=, vision=) (push) Failing after 12s

Details

Vector IO Integration Tests / test-matrix (3.12, remote::pgvector) (push) Failing after 15s

Details

SqlStore Integration Tests / test-postgres (3.13) (push) Failing after 22s

Details

Test Llama Stack Build / build-custom-container-distribution (push) Failing after 14s

Details

Unit Tests / unit-tests (3.13) (push) Failing after 14s

Details

Test Llama Stack Build / build-single-provider (push) Failing after 13s

Details

Vector IO Integration Tests / test-matrix (3.12, remote::chromadb) (push) Failing after 18s

Details

Unit Tests / unit-tests (3.12) (push) Failing after 16s

Details

Vector IO Integration Tests / test-matrix (3.12, remote::qdrant) (push) Failing after 18s

Details

Vector IO Integration Tests / test-matrix (3.13, remote::weaviate) (push) Failing after 10s

Details

Vector IO Integration Tests / test-matrix (3.13, inline::faiss) (push) Failing after 11s

Details

Vector IO Integration Tests / test-matrix (3.12, remote::weaviate) (push) Failing after 16s

Details

Vector IO Integration Tests / test-matrix (3.13, remote::qdrant) (push) Failing after 18s

Details

Test Llama Stack Build / build (push) Failing after 12s

Details

Vector IO Integration Tests / test-matrix (3.13, remote::chromadb) (push) Failing after 18s

Details

Vector IO Integration Tests / test-matrix (3.13, remote::pgvector) (push) Failing after 20s

Details

Vector IO Integration Tests / test-matrix (3.13, inline::sqlite-vec) (push) Failing after 16s

Details

Python Package Build Test / build (3.13) (push) Failing after 53s

Details

Vector IO Integration Tests / test-matrix (3.13, inline::milvus) (push) Failing after 59s

Details

Vector IO Integration Tests / test-matrix (3.12, inline::sqlite-vec) (push) Failing after 1m1s

Details

Update ReadTheDocs / update-readthedocs (push) Failing after 1m6s

Details

Pre-commit / pre-commit (push) Successful in 1m53s

Details

chore(misc): make tests and starter faster (#3042 )

A bunch of miscellaneous cleanup focusing on tests, but ended up
speeding up starter distro substantially.

- Pulled llama stack client init for tests into `pytest_sessionstart` so
it does not clobber output
- Profiling of that told me where we were doing lots of heavy imports
for starter, so lazied them
- starter now starts 20seconds+ faster on my Mac
- A few other smallish refactors for `compat_client`

2025-08-05 14:55:05 -07:00

1.5 KiB

Raw Permalink Blame History

inline::huggingface

Description

HuggingFace-based post-training provider for fine-tuning models using the HuggingFace ecosystem.

Configuration

Field	Type	Required	Default	Description
`device`	`<class 'str'>`	No	cuda
`distributed_backend`	`Literal['fsdp', 'deepspeed'`	No
`checkpoint_format`	`Literal['full_state', 'huggingface'`	No	huggingface
`chat_template`	`<class 'str'>`	No	<	user
{input}
<	assistant	>
{output}
`model_specific_config`	`<class 'dict'>`	No	{'trust_remote_code': True, 'attn_implementation': 'sdpa'}
`max_seq_length`	`<class 'int'>`	No	2048
`gradient_checkpointing`	`<class 'bool'>`	No	False
`save_total_limit`	`<class 'int'>`	No	3
`logging_steps`	`<class 'int'>`	No	10
`warmup_ratio`	`<class 'float'>`	No	0.1
`weight_decay`	`<class 'float'>`	No	0.01
`dataloader_num_workers`	`<class 'int'>`	No	4
`dataloader_pin_memory`	`<class 'bool'>`	No	True
`dpo_beta`	`<class 'float'>`	No	0.1
`use_reference_model`	`<class 'bool'>`	No	True
`dpo_loss_type`	`Literal['sigmoid', 'hinge', 'ipo', 'kto_pair'`	No	sigmoid
`dpo_output_dir`	`<class 'str'>`	No

Sample Configuration

checkpoint_format: huggingface
distributed_backend: null
device: cpu
dpo_output_dir: ~/.llama/dummy/dpo_output

1.5 KiB Raw Permalink Blame History

inline::huggingface

Description

Configuration

Sample Configuration

1.5 KiB

Raw Permalink Blame History