llama-stack/llama_stack/apis
Ashwin Bharambe c06718fbd5
Add support for Structured Output / Guided decoding (#281)
Added support for structured output in the API and added a reference implementation for meta-reference.

A few notes:

* Two formats are specified in the API: Json schema and EBNF based grammar
* Implementation only supports Json for now
We use lm-format-enhancer to provide the implementation right now but may change this especially because BNF grammars aren't supported by that library.
Fireworks has support for structured output and Together has limited supported for it too. Subsequent PRs will add these changes. We would like all our inference providers to provide structured output for llama models since it is an extremely important and highly sought-after need by the developers.
2024-10-22 12:53:34 -07:00
..
agents Update event_logger.py (#275) 2024-10-21 10:46:53 -07:00
batch_inference Remove "routing_table" and "routing_key" concepts for the user (#201) 2024-10-10 10:24:13 -07:00
common [Evals API] [1/n] Initial API (#287) 2024-10-22 09:31:19 -07:00
datasetio [Evals API] [1/n] Initial API (#287) 2024-10-22 09:31:19 -07:00
datasets [Evals API] [1/n] Initial API (#287) 2024-10-22 09:31:19 -07:00
eval [Evals API] [1/n] Initial API (#287) 2024-10-22 09:31:19 -07:00
inference Add support for Structured Output / Guided decoding (#281) 2024-10-22 12:53:34 -07:00
inspect Remove "routing_table" and "routing_key" concepts for the user (#201) 2024-10-10 10:24:13 -07:00
memory Remove "routing_table" and "routing_key" concepts for the user (#201) 2024-10-10 10:24:13 -07:00
memory_banks [bugfix] fix case for agent when memory bank registered without specifying provider_id (#264) 2024-10-17 17:28:17 -07:00
models Remove "routing_table" and "routing_key" concepts for the user (#201) 2024-10-10 10:24:13 -07:00
post_training API Updates (#73) 2024-09-17 19:51:35 -07:00
safety Remove "routing_table" and "routing_key" concepts for the user (#201) 2024-10-10 10:24:13 -07:00
scoring [Evals API] [1/n] Initial API (#287) 2024-10-22 09:31:19 -07:00
scoring_functions [Evals API] [1/n] Initial API (#287) 2024-10-22 09:31:19 -07:00
shields Remove "routing_table" and "routing_key" concepts for the user (#201) 2024-10-10 10:24:13 -07:00
synthetic_data_generation API Updates (#73) 2024-09-17 19:51:35 -07:00
telemetry Remove "routing_table" and "routing_key" concepts for the user (#201) 2024-10-10 10:24:13 -07:00
__init__.py API Updates (#73) 2024-09-17 19:51:35 -07:00