llama-stack/llama_stack/apis
ehhuang c9ab72fa82
Support sys_prompt behavior in inference (#937)
# What does this PR do?

The current default system prompt for llama3.2 tends to over-index on
tool calling and works poorly when the prompt does not require tool
calling.

This PR adds an option to override the default system prompt, and
organizes tool-related configs into a new config object.
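
As a rough illustration of the idea, here is a minimal, self-contained sketch of a tool config object with a switch that controls whether a caller-supplied system message is appended to the default prompt or replaces it. Every name here (`ToolConfig`, `SystemMessageBehavior`, `build_system_prompt`, `DEFAULT_TOOL_SYSTEM_PROMPT`) and the default prompt text are assumptions made for the example; this is not the code from the PR and may not match the actual llama-stack API.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class SystemMessageBehavior(Enum):
    """Hypothetical switch: how a caller's system message interacts with the default."""

    APPEND = "append"    # keep the default prompt, add the caller's message after it
    REPLACE = "replace"  # drop the default prompt, use only the caller's message


@dataclass
class ToolConfig:
    """Hypothetical container grouping tool-related settings in one place."""

    tool_choice: str = "auto"
    tool_prompt_format: Optional[str] = None
    system_message_behavior: SystemMessageBehavior = SystemMessageBehavior.APPEND


# Placeholder text standing in for a tool-heavy default system prompt.
DEFAULT_TOOL_SYSTEM_PROMPT = "You are an expert in composing function calls."


def build_system_prompt(user_system_message: Optional[str], config: ToolConfig) -> str:
    """Return the system prompt to send, honoring the configured behavior."""
    if user_system_message is None:
        # Nothing to override with: fall back to the default prompt.
        return DEFAULT_TOOL_SYSTEM_PROMPT
    if config.system_message_behavior is SystemMessageBehavior.REPLACE:
        return user_system_message
    return f"{DEFAULT_TOOL_SYSTEM_PROMPT}\n{user_system_message}"
```

With a shape like this, a caller whose prompt does not need tool calling could pass `ToolConfig(system_message_behavior=SystemMessageBehavior.REPLACE)` along with its own system message, so the tool-oriented default is left out entirely.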


## Test Plan

`python -m unittest llama_stack.providers.tests.inference.test_prompt_adapter`



2025-02-03 23:35:16 -08:00
| Name | Last commit | Date |
| --- | --- | --- |
| `agents` | Fix precommit check after moving to ruff (#927) | 2025-02-02 06:46:45 -08:00 |
| `batch_inference` | Update OpenAPI generator to add param and field documentation (#896) | 2025-01-29 10:04:30 -08:00 |
| `common` | fix ImageContentItem to take base64 string as image.data (#909) | 2025-01-30 15:58:23 -08:00 |
| `datasetio` | Fix precommit check after moving to ruff (#927) | 2025-02-02 06:46:45 -08:00 |
| `datasets` | More idiomatic REST API (#765) | 2025-01-15 13:20:09 -08:00 |
| `eval` | Fix precommit check after moving to ruff (#927) | 2025-02-02 06:46:45 -08:00 |
| `eval_tasks` | More idiomatic REST API (#765) | 2025-01-15 13:20:09 -08:00 |
| `inference` | Support sys_prompt behavior in inference (#937) | 2025-02-03 23:35:16 -08:00 |
| `inspect` | REST API fixes (#789) | 2025-01-16 13:47:08 -08:00 |
| `models` | More idiomatic REST API (#765) | 2025-01-15 13:20:09 -08:00 |
| `post_training` | Fix precommit check after moving to ruff (#927) | 2025-02-02 06:46:45 -08:00 |
| `safety` | More idiomatic REST API (#765) | 2025-01-15 13:20:09 -08:00 |
| `scoring` | More idiomatic REST API (#765) | 2025-01-15 13:20:09 -08:00 |
| `scoring_functions` | Fix precommit check after moving to ruff (#927) | 2025-02-02 06:46:45 -08:00 |
| `shields` | More idiomatic REST API (#765) | 2025-01-15 13:20:09 -08:00 |
| `synthetic_data_generation` | [remove import *] clean up import *'s (#689) | 2024-12-27 15:45:44 -08:00 |
| `telemetry` | Fix precommit check after moving to ruff (#927) | 2025-02-02 06:46:45 -08:00 |
| `tools` | Fix precommit check after moving to ruff (#927) | 2025-02-02 06:46:45 -08:00 |
| `vector_dbs` | [memory refactor][1/n] Rename Memory -> VectorIO, MemoryBanks -> VectorDBs (#828) | 2025-01-22 09:59:30 -08:00 |
| `vector_io` | [memory refactor][6/n] Update naming and routes (#839) | 2025-01-22 10:39:13 -08:00 |
| `__init__.py` | API Updates (#73) | 2024-09-17 19:51:35 -07:00 |
| `datatypes.py` | [memory refactor][1/n] Rename Memory -> VectorIO, MemoryBanks -> VectorDBs (#828) | 2025-01-22 09:59:30 -08:00 |
| `resource.py` | Fix precommit check after moving to ruff (#927) | 2025-02-02 06:46:45 -08:00 |
| `version.py` | llama-stack version alpha -> v1 | 2025-01-15 05:58:09 -08:00 |