llama-stack

forked from phoenix-oss/llama-stack-mirror

History

Dinesh Yeduguru 99bbe0e70b feat: Add new compact MetricInResponse type (#1593 ) # What does this PR do? This change adds a compact type to include metrics in response as opposed to the full MetricEvent which is relevant for internal logging purposes. ## Test Plan ``` LLAMA_STACK_CONFIG=~/.llama/distributions/fireworks/fireworks-run.yaml pytest -s -v agents/test_agents.py --safety-shield meta-llama/Llama-Guard-3-8B --text-model meta-llama/Llama-3.1-8B-Instruct llama stack run ~/.llama/distributions/fireworks/fireworks-run.yaml curl --request POST \ --url http://localhost:8321/v1/inference/chat-completion \ --header 'content-type: application/json' \ --data '{ "model_id": "meta-llama/Llama-3.1-70B-Instruct", "messages": [ { "role": "user", "content": { "type": "text", "text": "where do humans live" } } ], "stream": false }' { "metrics": [ { "metric": "prompt_tokens", "value": 10, "unit": null }, { "metric": "completion_tokens", "value": 522, "unit": null }, { "metric": "total_tokens", "value": 532, "unit": null } ], "completion_message": { "role": "assistant", "content": "Humans live in various parts of the world...............", "stop_reason": "out_of_tokens", "tool_calls": [] }, "logprobs": null } ```		2025-03-12 15:45:44 -07:00
..
agents	chore: deprecate ToolResponseMessage in agent.resume API (#1566 )	2025-03-12 12:10:21 -07:00
batch_inference	fix: solve ruff B008 warnings (#1444 )	2025-03-06 16:48:35 -08:00
benchmarks	chore!: deprecate eval/tasks (#1186 )	2025-02-20 14:06:21 -08:00
common	ci: add mypy for static type checking (#1101 )	2025-02-21 13:15:40 -08:00
datasetio	docs: api documentation for agents/eval/scoring/datasets (#1400 )	2025-03-05 09:40:24 -08:00
datasets	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
eval	docs: api documentation for agents/eval/scoring/datasets (#1400 )	2025-03-05 09:40:24 -08:00
files	feat: adding endpoints for files and uploads (#1070 )	2025-02-20 13:09:00 -08:00
inference	feat: Add back inference metrics and preserve context variables across asyncio boundary (#1552 )	2025-03-12 12:01:03 -07:00
inspect	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
models	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
post_training	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
safety	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
scoring	docs: api documentation for agents/eval/scoring/datasets (#1400 )	2025-03-05 09:40:24 -08:00
scoring_functions	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
shields	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
synthetic_data_generation	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
telemetry	feat: Add new compact MetricInResponse type (#1593 )	2025-03-12 15:45:44 -07:00
tools	feat: tool outputs metadata (#1155 )	2025-02-21 13:15:31 -08:00
vector_dbs	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
vector_io	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
__init__.py	API Updates (#73 )	2024-09-17 19:51:35 -07:00
datatypes.py	feat: enhance OpenAPI spec to include Error types (#1320 )	2025-02-28 11:16:12 -08:00
resource.py	fix!: update eval-tasks -> benchmarks (#1032 )	2025-02-13 16:40:58 -08:00
version.py	llama-stack version alpha -> v1	2025-01-15 05:58:09 -08:00