llama-stack-mirror/llama_stack/providers/inline
Dinesh Yeduguru 60e7f3d705
fix: Revert "feat: record token usage for inference API (#1300)" (#1476)
This reverts commit b8535417e0.

Test plan:
LLAMA_STACK_DISABLE_VERSION_CHECK=true llama stack run ~/.llama/distributions/together/together-run.yaml
python -m examples.agents.e2e_loop_with_client_tools localhost 8321
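For a quick sanity check of the inference path touched by the reverted change, a minimal client-side sketch (assuming the server from the test plan above is listening on localhost:8321, the llama-stack-client Python package is installed, and the model id below is illustrative, not prescribed by this commit):

from llama_stack_client import LlamaStackClient

# Point the client at the locally running stack started by the test plan above.
client = LlamaStackClient(base_url="http://localhost:8321")

# Illustrative model id; substitute one registered with the running distribution.
response = client.inference.chat_completion(
    model_id="meta-llama/Llama-3.1-8B-Instruct",
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(response.completion_message.content)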
2025-03-07 10:16:47 -08:00
agents refactor(test): introduce --stack-config and simplify options (#1404) 2025-03-05 17:02:02 -08:00
datasetio build: format codebase imports using ruff linter (#1028) 2025-02-13 10:06:21 -08:00
eval chore: rename task_config to benchmark_config (#1397) 2025-03-04 12:44:04 -08:00
inference fix: solve ruff B008 warnings (#1444) 2025-03-06 16:48:35 -08:00
ios/inference chore: removed executorch submodule (#1265) 2025-02-25 21:57:21 -08:00
post_training fix: replace eval with json decoding for format_adapter (#1328) 2025-02-28 11:25:23 -08:00
safety chore: move all Llama Stack types from llama-models to llama-stack (#1098) 2025-02-14 09:10:59 -08:00
scoring test: revamp eval related integration tests (#1433) 2025-03-06 10:51:35 -08:00
telemetry fix: Revert "feat: record token usage for inference API (#1300)" (#1476) 2025-03-07 10:16:47 -08:00
tool_runtime chore: remove dependency on llama_models completely (#1344) 2025-03-01 12:48:08 -08:00
vector_io feat: add Milvus vectorDB (#1467) 2025-03-06 20:59:31 -08:00
__init__.py impls -> inline, adapters -> remote (#381) 2024-11-06 14:54:05 -08:00