Mirror of https://github.com/meta-llama/llama-stack.git, synced 2025-10-25 17:11:12 +00:00
			
		
		
		
Code completion apps need "fill in the middle" capabilities. Added a `suffix` option to `openai_completion` to enable this, and updated the ollama provider to showcase it.

### Test Plan

```
pytest -sv --stack-config="inference=ollama" tests/integration/inference/test_openai_completion.py --text-model qwen2.5-coder:1.5b -k test_openai_completion_non_streaming_suffix
```

### OpenAI Sample script

```
from openai import OpenAI

# The OpenAI client requires an API key (argument or OPENAI_API_KEY env var).
# Assumption: a placeholder value is enough for a local server without auth.
client = OpenAI(base_url="http://localhost:8321/v1/openai/v1", api_key="none")

# `prompt` is the text before the gap and `suffix` is the text after it.
response = client.completions.create(
    model="qwen2.5-coder:1.5b",
    prompt="The capital of ",
    suffix="is Paris.",
    max_tokens=10,
)
print(response.choices[0].text)
```

### Output

```
France is ____. To answer this question, we
```

| | | |
|---|---|---|
| chat_completion.json | ||
| completion.json | ||
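
For a code completion app, the `suffix` option described in the commit message would typically carry the text after the editor's caret, with `prompt` carrying the text before it. The following is a minimal sketch of that split, not code from the repository: the `build_fim_request` helper and the `<CURSOR>` marker are hypothetical names chosen for illustration, and only the `model`, `prompt`, `suffix`, and `max_tokens` parameters come from the sample script.

```python
# Hypothetical helper: the function name and cursor marker are illustrative
# assumptions, not part of llama-stack or the OpenAI client.
CURSOR = "<CURSOR>"  # assumed marker for the editor's caret position


def build_fim_request(source: str, model: str, max_tokens: int = 32) -> dict:
    """Split `source` at the cursor marker into prompt (before) and suffix (after)."""
    if source.count(CURSOR) != 1:
        raise ValueError("source must contain exactly one cursor marker")
    before, after = source.split(CURSOR)
    # These keys mirror the keyword arguments used in the sample script.
    return {
        "model": model,
        "prompt": before,
        "suffix": after,
        "max_tokens": max_tokens,
    }


snippet = "def add(a, b):\n    return <CURSOR>\n\nprint(add(1, 2))\n"
request = build_fim_request(snippet, model="qwen2.5-coder:1.5b")
print(repr(request["prompt"]))
print(repr(request["suffix"]))
```

With a server running, the resulting dictionary could presumably be passed as `client.completions.create(**request)`, in the same way as the sample script. Keeping the split in a pure function makes it checkable without a model or network access.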