llama-stack-mirror/llama_stack
Ben Browning ac5dc8fae2 Add prompt_logprobs and guided_choice to OpenAI completions
This adds the vLLM-specific extra_body parameters prompt_logprobs
and guided_choice to our openai_completion inference endpoint. The
plan is to expand this to support all common optional parameters
across the OpenAI-compatible providers, so that each provider can
use or ignore these parameters depending on what its server supports.

Signed-off-by: Ben Browning <bbrownin@redhat.com>
2025-04-09 15:47:02 -04:00
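These extras travel alongside the standard OpenAI request fields, so a provider that doesn't understand them must be able to drop them cleanly. A minimal sketch of that split, assuming a plain-dict request body (the function and constant names here are illustrative, not the actual llama-stack API):

```python
# Hypothetical sketch: separating vLLM-specific extra_body parameters
# (prompt_logprobs, guided_choice) from standard OpenAI completion
# parameters, so non-vLLM providers can simply ignore the extras.
VLLM_EXTRA_KEYS = {"prompt_logprobs", "guided_choice"}

def split_vllm_extras(params: dict) -> tuple[dict, dict]:
    """Return (standard_params, vllm_extras)."""
    standard = {k: v for k, v in params.items() if k not in VLLM_EXTRA_KEYS}
    extras = {k: v for k, v in params.items() if k in VLLM_EXTRA_KEYS}
    return standard, extras

request = {
    "model": "meta-llama/Llama-3.1-8B-Instruct",
    "prompt": "Is the sky blue? Answer yes or no.",
    "max_tokens": 1,
    "prompt_logprobs": 1,            # vLLM-only: logprobs for prompt tokens
    "guided_choice": ["yes", "no"],  # vLLM-only: constrain output to a choice
}
standard, extras = split_vllm_extras(request)
# A vLLM provider would forward `extras` via extra_body; others drop it.
```

With the real OpenAI Python client against a vLLM server, the same two parameters would be passed through `extra_body` on `client.completions.create(...)`.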
apis           | Add prompt_logprobs and guided_choice to OpenAI completions                  | 2025-04-09 15:47:02 -04:00
cli            | refactor: move all llama code to models/llama out of meta reference (#1887)  | 2025-04-07 15:03:58 -07:00
distribution   | Add prompt_logprobs and guided_choice to OpenAI completions                  | 2025-04-09 15:47:02 -04:00
models         | fix: Mirror llama4 rope scaling fixes, small model simplify (#1917)          | 2025-04-09 11:28:45 -07:00
providers      | Add prompt_logprobs and guided_choice to OpenAI completions                  | 2025-04-09 15:47:02 -04:00
strong_typing  | chore: more mypy checks (ollama, vllm, ...) (#1777)                          | 2025-04-01 17:12:39 +02:00
templates      | docs: Update remote-vllm.md with AMD GPU vLLM server supported. (#1858)      | 2025-04-08 21:35:32 -07:00
__init__.py    | export LibraryClient                                                         | 2024-12-13 12:08:00 -08:00
env.py         | refactor(test): move tools, evals, datasetio, scoring and post training tests (#1401) | 2025-03-04 14:53:47 -08:00
log.py         | chore: Remove style tags from log formatter (#1808)                          | 2025-03-27 10:18:21 -04:00
schema_utils.py | chore: make mypy happy with webmethod (#1758)                               | 2025-03-22 08:17:23 -07:00