llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-23 06:22:25 +00:00

History

Charlie Doern 49b729b30a feat: api level request metrics via middleware add RequestMetricsMiddleware which tracks key metrics related to each request the LLS server will recieve: 1. llama_stack_requests_total: tracks the total amount of requests the server has processed 2. llama_stack_request_duration_seconds: tracks the duration of each request 3. llama_stack_concurrent_requests: tracks concurrently processed requests by the server The usage of a middleware allows this to be done on the server level without having to add custom handling to each router like the inference router has today for its API specific metrics. Also, add some unit tests for this functionality resolves #2597 Signed-off-by: Charlie Doern <cdoern@redhat.com>		2025-08-03 13:14:25 -04:00
..
access_control	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
routers	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
routing_tables	fix: remove redundant code from unregister_vector_db (#2983 )	2025-07-31 09:22:04 -07:00
server	feat: api level request metrics via middleware	2025-08-03 13:14:25 -04:00
store	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
ui	refactor: remove Conda support from Llama Stack (#2969 )	2025-08-02 15:52:59 -07:00
utils	refactor: remove Conda support from Llama Stack (#2969 )	2025-08-02 15:52:59 -07:00
__init__.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
build.py	refactor: remove Conda support from Llama Stack (#2969 )	2025-08-02 15:52:59 -07:00
build_conda_env.sh	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
build_container.sh	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
build_venv.sh	refactor: remove Conda support from Llama Stack (#2969 )	2025-08-02 15:52:59 -07:00
client.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
common.sh	refactor: remove Conda support from Llama Stack (#2969 )	2025-08-02 15:52:59 -07:00
configure.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
datatypes.py	refactor: remove Conda support from Llama Stack (#2969 )	2025-08-02 15:52:59 -07:00
distribution.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
external.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
inspect.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
library_client.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
providers.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
request_headers.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
resolver.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
stack.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
start_stack.sh	refactor: remove Conda support from Llama Stack (#2969 )	2025-08-02 15:52:59 -07:00