llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 18:00:36 +00:00

History

ehhuang 0b5a794c27 fix: telemetry logger spams when queue is full (#3070 ) # What does this PR do? ## Test Plan Ran a stress test on chat completion endpoint locally: For 10 concurrent users over 3 minutes: Before: <img width="1440" height="201" alt="image" src="https://github.com/user-attachments/assets/24e0d580-186e-4e24-931e-2b936c5859b6" /> After: <img width="1434" height="204" alt="image" src="https://github.com/user-attachments/assets/4b806d88-f822-41e9-b25a-018cc4bec866" /> (Will send scripts in a future PR.)		2025-08-08 13:47:36 -07:00
..
apis	feat: Add moderations create api (#3020 )	2025-08-06 13:51:23 -07:00
cli	chore: rename templates to distributions (#3035 )	2025-08-04 11:34:17 -07:00
core	feat: Add moderations create api (#3020 )	2025-08-06 13:51:23 -07:00
distributions	docs: fix the docs for NVIDIA Inference Provider (#3055 )	2025-08-08 11:27:55 +02:00
models	chore(api): add `mypy` coverage to `chat_format` (#2654 )	2025-07-18 11:56:53 +02:00
providers	fix: telemetry logger spams when queue is full (#3070 )	2025-08-08 13:47:36 -07:00
strong_typing	chore: enable pyupgrade fixes (#1806 )	2025-05-01 14:23:50 -07:00
testing	fix(recording): endpoint resolution (#3013 )	2025-08-01 16:23:54 -07:00
ui	feat(ui): Adding Vector Store Files to Admin UI (#3041 )	2025-08-08 07:44:06 -07:00
__init__.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
env.py	refactor(test): move tools, evals, datasetio, scoring and post training tests (#1401 )	2025-03-04 14:53:47 -08:00
log.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
schema_utils.py	feat(auth): API access control (#2822 )	2025-07-24 15:30:48 -07:00