llama-stack-mirror/client-sdks/stainless
Shabana Baig 805abf573f
feat!: Implement include parameter specifically for adding logprobs in the output message (#4261)
# Problem
As an Application Developer, I want to use the include parameter with
the value message.output_text.logprobs, so that I can receive log
probabilities for output tokens to assess the model's confidence in its
response.

# What does this PR do?

- Updates the include parameter in various resource definitions
- Updates the inline provider to return logprobs when
"message.output_text.logprobs" is passed in the include parameter
- Converts the logprobs returned by the inference provider from chat
completion format to responses format

Closes [#4260](https://github.com/llamastack/llama-stack/issues/4260)

## Test Plan

- Created a script to explore OpenAI behavior:
https://github.com/s-akhtar-baig/llama-stack-examples/blob/main/responses/src/include.py
- Added integration tests and new recordings
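
For reference, here is a minimal sketch of the kind of request this change enables, using an OpenAI-compatible Python client pointed at a Llama Stack server. The base URL, API key, model name, and prompt are placeholders and are not part of this PR:

```python
# Minimal sketch: request per-token logprobs via the Responses API `include` parameter.
# Base URL, API key, and model are placeholders; adjust them for your deployment.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8321/v1", api_key="none")

response = client.responses.create(
    model="meta-llama/Llama-3.2-3B-Instruct",
    input="Which planet do humans live on?",
    include=["message.output_text.logprobs"],
)

# Output messages contain output_text parts; with the include value above,
# each part should also carry per-token log probabilities.
for item in response.output:
    for part in getattr(item, "content", None) or []:
        if getattr(part, "type", None) == "output_text":
            print(part.text)
            print(getattr(part, "logprobs", None))
```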

---------

Co-authored-by: Matthew Farrellee <matt@cs.wisc.edu>
Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>
2025-12-11 11:11:21 -08:00
| File | Last commit | Date |
| --- | --- | --- |
| config.yml | feat(openapi): generate stainless config "more" programmatically (#4164) | 2025-11-17 12:48:03 -08:00 |
| openapi.yml | feat!: Implement include parameter specifically for adding logprobs in the output message (#4261) | 2025-12-11 11:11:21 -08:00 |
| README.md | feat(openapi): generate stainless config "more" programmatically (#4164) | 2025-11-17 12:48:03 -08:00 |

These are the source-of-truth configuration files that Stainless uses to generate the client SDKs.

  • openapi.yml: the OpenAPI specification for the Llama Stack API.
  • config.yml: the Stainless configuration that tells Stainless how to generate the client SDKs.

A small side note: notice the .yml suffixes; Stainless typically uses that suffix for its configuration files.

These files go hand-in-hand. Both openapi.yml and config.yml are generated by scripts/run_openapi_generator.sh:

  • openapi.yml comes from the FastAPI-based generator.
  • config.yml is rendered from scripts/openapi_generator/stainless_config/config_data.py so the Stainless config stays in lock-step with the spec.
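
To make the rendering step concrete, here is a purely illustrative Python sketch of the general idea: keep the configuration as Python data and serialize it to config.yml. The keys and paths below are hypothetical and are not taken from the actual config_data.py:

```python
# Purely illustrative: hypothetical config data serialized to YAML.
# The real scripts/openapi_generator/stainless_config/config_data.py may be
# organized quite differently; the keys below are made up for illustration.
import yaml  # PyYAML

CONFIG_DATA = {
    "organization": {"name": "llama-stack"},  # hypothetical key
    "targets": {"python": {"package_name": "llama_stack_client"}},  # hypothetical key
}

with open("client-sdks/stainless/config.yml", "w") as f:
    yaml.safe_dump(CONFIG_DATA, f, sort_keys=False)
```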