Commit graph

  • dd62a2388c
    docs: add notes to websearch tool and two extra example scripts (#1354) AN YU (安宇) 2025-04-18 01:20:52 +01:00
  • 0ed41aafbf
    test: add multi_image test (#1972) ehhuang 2025-04-17 12:51:42 -07:00
  • 2976b5d992
    fix: OAI compat endpoint for meta reference inference provider (#1962) ehhuang 2025-04-17 11:16:04 -07:00
  • 8bd6665775
    chore(verification): update README and reorganize generate_report.py (#1978) ehhuang 2025-04-17 10:41:22 -07:00
  • cb874287a4
    fix: resync api spec (#1987) Sébastien Han 2025-04-17 17:36:04 +02:00
  • 326cbba579
    feat(agents): add agent naming functionality (#1922) Alexey Rybak 2025-04-17 07:02:47 -07:00
  • 5b8e75b392
    fix: OpenAI spec cleanup for assistant requests (#1963) Ben Browning 2025-04-17 09:56:10 -04:00
  • 4205376653
    chore: add meta/llama-3.3-70b-instruct as supported nvidia inference provider model (#1985) Matthew Farrellee 2025-04-17 09:50:40 -04:00
  • 2ae1d7f4e6
    docs: Add NVIDIA platform distro docs (#1971) Jash Gulabrai 2025-04-17 08:54:30 -04:00
  • 45e08ff417
    fix: Handle case when Customizer Job status is unknown (#1965) Jash Gulabrai 2025-04-17 04:27:07 -04:00
  • 6f97f9a593
    chore: Use hashes to pull actions for build-single-provider job (#1977) Ihar Hrachyshka 2025-04-17 04:26:08 -04:00
  • 8f57b08f2c
    fix(build): always pass path when no template/config provided (#1982) Alexey Rybak 2025-04-17 01:20:43 -07:00
  • 6ed92e03bc
    fix: print traceback on build failure (#1966) Sébastien Han 2025-04-17 09:45:21 +02:00
  • f12011794b
    fix: Updated tools playground to allow vdb selection (#1960) Michael Clifford 2025-04-17 03:29:40 -04:00
  • b44f84ce18
    test: disable flaky dataset (#1979) ehhuang 2025-04-16 15:33:37 -07:00
  • 30fc66923b
    fix: Add llama-3.2-1b-instruct to NVIDIA fine-tuned model list (#1975) Jash Gulabrai 2025-04-16 18:02:08 -04:00
  • 00b232c282
    chore: Fix to persist the theme preference across page navigation. (#1974) Francisco Arceo 2025-04-16 14:58:25 -06:00
  • b5a9ef4c6d
    fix: Do not send an empty 'tools' list to remote vllm (#1957) Daniel Alvarez Sanchez 2025-04-16 02:31:12 +02:00
  • fb8ff77ff2
    docs: 0.2.2 doc updates (#1961) Chirag Modi 2025-04-15 13:26:17 -07:00
  • 093881071a
    fix: add max_tokens slider to playground tools page (#1958) Michael Clifford 2025-04-15 12:11:08 -04:00
  • 71ed47ea76
    docs: add example for intel gpu in vllm remote (#1952) Dmitry Rogozhkin 2025-04-15 07:56:23 -07:00
  • 83b5523e2d
    feat: add --providers to llama stack build (#1718) Charlie Doern 2025-04-15 08:17:03 -04:00
  • 32e3da7392
    test(verification): more tests, multiturn tool use tests (#1954) ehhuang 2025-04-14 18:45:22 -07:00
  • 86c6f1f112
    fix: FastAPI built-in paths bypass custom routing (Docs) and update r… (#1841) Peter Double 2025-04-14 13:28:25 -04:00
  • cf158f2cb9
    feat: allow ollama to use 'latest' if available but not specified (#1903) Nathan Weinberg 2025-04-14 12:03:54 -04:00
  • 3ed4316ed5
    feat: Implement async job execution for torchtune training (#1437) Ihar Hrachyshka 2025-04-14 11:59:11 -04:00
  • 7641a5cd0b
    fix: 100% OpenAI API verification for together and fireworks (#1946) Ben Browning 2025-04-14 11:56:29 -04:00
  • 68eeacec0e
    docs: resync missing nvidia doc (#1947) Sébastien Han 2025-04-14 15:09:16 +02:00
  • 2ec5879f14
    chore(github-deps): bump astral-sh/setup-uv from 5.4.0 to 5.4.1 (#1881) dependabot[bot] 2025-04-14 14:33:43 +02:00
  • 030ca4b2be
    docs: Move Llama 4 instructions in a collapsed section (#1936) Yuan Tang 2025-04-14 08:14:59 -04:00
  • 6d6b40983e
    refactor: update integration test workflow (#1856) Matthew Farrellee 2025-04-14 06:17:51 -04:00
  • 69554158fa
    feat: add health to all providers through providers endpoint (#1418) Sébastien Han 2025-04-14 11:59:36 +02:00
  • ff14773fa7 fix: update llama stack client dependency Ashwin Bharambe 2025-04-12 18:14:33 -07:00
  • 429f6de7d7 fix: misc fixes for tests kill horrible warnings Ashwin Bharambe 2025-04-12 17:12:11 -07:00
  • 8b4158169f fix: dont check protocol compliance for experimental methods Ashwin Bharambe 2025-04-12 12:17:39 -07:00
  • ad86a68a32
    feat: support '-' in tool names (#1807) ehhuang 2025-04-12 14:23:03 -07:00
  • ef3dc143ec fix: test_registration was borked somehow Ashwin Bharambe 2025-04-12 12:04:01 -07:00
  • 1e5bf6c19d
    feat: update default tool use prompt (#1803) ehhuang 2025-04-12 11:54:22 -07:00
  • f34f22f8c7
    feat: add batch inference API to llama stack inference (#1945) Ashwin Bharambe 2025-04-12 11:41:12 -07:00
  • 854c2ad264
    fix: misleading help text for 'llama stack build' and 'llama stack run' (#1910) Nathan Weinberg 2025-04-12 04:19:11 -04:00
  • 0751a960a5
    feat: make training config fields optional (#1861) Charlie Doern 2025-04-12 04:13:45 -04:00
  • 70a7e4d51e fix: unhide python_start, python_end Ashwin Bharambe 2025-04-11 20:30:44 -07:00
  • 51492bd9b6
    docs: Update docs and fix warning in start-stack.sh (#1937) Aidan Reilly 2025-04-12 00:26:17 +01:00
  • ed58a94b30
    docs: fixes to quick start (#1943) raghotham 2025-04-11 13:41:23 -07:00
  • 2b2db5fbda
    feat: OpenAI-Compatible models, completions, chat/completions (#1894) Ben Browning 2025-04-11 16:14:17 -04:00
  • 24d70cedca
    docs: Updated docs to show minimal RAG example and some other minor changes (#1935) Francisco Arceo 2025-04-11 12:50:36 -06:00
  • c1cb6aad11
    feat: Add unit tests for NVIDIA safety (#1897) Jash Gulabrai 2025-04-11 14:49:55 -04:00
  • 2a74f0db39
    fix: remove extra sft args in NvidiaPostTrainingAdapter (#1939) Ben Browning 2025-04-11 13:17:57 -04:00
  • 40f41af2f7
    feat: Add a direct (non-agentic) RAG option to the Playground RAG page (#1940) Ilya Kolchinsky 2025-04-11 19:16:10 +02:00
  • c6fa47db6f
    fix: ensure resource registration arguments are typed (#1941) Matthew Farrellee 2025-04-11 12:25:57 -04:00
  • 6aa459b00c
    docs: fix errors in kubernetes deployment guide (#1914) Mark Campbell 2025-04-11 12:04:13 +01:00
  • 2fcb70b789
    test(verification): overwrite test result instead of creating new ones (#1934) ehhuang 2025-04-10 16:59:28 -07:00
  • a4cc4b7e31
    test(verification): add streaming tool calling test (#1933) ehhuang 2025-04-10 16:58:06 -07:00
  • 49955a06b1
    docs: Update quickstart page to structure things a little more for the novices (#1873) Francisco Arceo 2025-04-10 15:09:00 -06:00
  • edd9aaac3b
    fix: use torchao 0.8.0 for inference (#1925) Sébastien Han 2025-04-10 22:39:20 +02:00
  • 79fc81f78f
    fix: Playground RAG page errors (#1928) Ilya Kolchinsky 2025-04-10 22:38:31 +02:00
  • de6ec5803e
    fix: Fix linter failures from #1921 (#1932) Francisco Arceo 2025-04-10 11:37:31 -06:00
  • 14146e4b3f
    feat(verification): various improvements (#1921) ehhuang 2025-04-10 10:26:19 -07:00
  • 09a83b1ec1
    docs: Updating background color for code in darkmode (#1930) Francisco Arceo 2025-04-10 10:38:57 -06:00
  • 1f2df59ece
    docs: fix model name (#1926) Sébastien Han 2025-04-10 18:37:48 +02:00
  • 1be66d754e
    docs: Redirect instructions for additional hardware accelerators for remote vLLM provider (#1923) Yuan Tang 2025-04-10 04:04:17 -04:00
  • 712c6758c6
    docs: Avoid bash script syntax highlighting for dark mode (#1918) Yuan Tang 2025-04-09 18:43:43 -04:00
  • 36a31fe5dd
    fix: on-the-fly int4 quantize parameter (#1920) Jiawen Liu 2025-04-09 15:00:12 -07:00
  • e2299291c4
    fix: Mirror llama4 rope scaling fixes, small model simplify (#1917) Ashwin Bharambe 2025-04-09 11:28:45 -07:00
  • 770b38f8b5
    chore: simplify running the demo UI (#1907) Sébastien Han 2025-04-09 20:22:29 +02:00
  • b93318e40b
    chore: Detect browser setting for dark/light mode and set default to light mode (#1913) Francisco Arceo 2025-04-09 10:40:56 -06:00
  • 5c010e234a
    fix: add tavily_search option to playground api (#1909) Michael Clifford 2025-04-09 09:56:41 -04:00
  • 692f56068c
    docs: Add recent release notes (#1899) Yuan Tang 2025-04-09 09:34:41 -04:00
  • 9657105304
    feat: Add tools page to playground (#1904) Michael Clifford 2025-04-09 09:26:52 -04:00
  • 30b49d8dfa
    fix: Playground Container Issue (#1868) Jaland 2025-04-09 10:45:15 +01:00
  • 22814299b0
    fix: solve unregister_toolgroup error (#1608) Paolo Dettori 2025-04-09 04:56:07 -04:00
  • a2cf299906
    fix: update getting started guide to use ollama pull (#1855) Matthew Farrellee 2025-04-09 04:35:19 -04:00
  • 3a9be58523
    fix: use ollama list to find models (#1854) Matthew Farrellee 2025-04-09 04:34:26 -04:00
  • 389767010b
    feat: ability to execute external providers (#1672) Sébastien Han 2025-04-09 10:30:41 +02:00
  • 45e210fd0c fix: llama3 bf16 model load Ashwin Bharambe 2025-04-09 01:09:16 -07:00
  • e3d22d8de7
    chore: fix hash for thollander/actions-comment-pull-request (#1900) Ihar Hrachyshka 2025-04-09 04:10:07 -04:00
  • 8001c30a4f fix: meta reference + llama4 tokenizer fix Ashwin Bharambe 2025-04-09 00:46:02 -07:00
  • 10882bf478
    chore: remove unused tempdir in agent (#1896) Sébastien Han 2025-04-09 09:43:48 +02:00
  • 983f6feeb8
    docs: Update remote-vllm.md with AMD GPU vLLM server supported. (#1858) AlexHe99 2025-04-09 12:35:32 +08:00
  • bcbc56baa2
    feat: adds test suite to verify provider's OAI compat endpoints (#1901) ehhuang 2025-04-08 21:21:38 -07:00
  • 7d9adf22ad
    refactor: move missing tests to test directory (#1892) Sébastien Han 2025-04-09 03:54:00 +02:00
  • 0431a6e90b
    docs: colorize Discord badge & add icon in README (#1865) wesley chun 2025-04-08 11:42:47 -07:00
  • 031a40bec0
    fix: type (#1898) ehhuang 2025-04-08 09:07:25 -07:00
  • c6e93e32f6
    feat: Updated playground rag to use session id for persistent conversation (#1870) Michael Clifford 2025-04-08 03:46:13 -04:00
  • 7b4eb0967e
    test: verification on provider's OAI endpoints (#1893) ehhuang 2025-04-07 23:06:28 -07:00
  • 530d4bdfe1
    refactor: move all llama code to models/llama out of meta reference (#1887) Ashwin Bharambe 2025-04-07 15:03:58 -07:00
  • c52ccc4bbd
    docs: update importing_as_library.md (#1863) Matthew Farrellee 2025-04-07 06:31:04 -04:00
  • c1973f6528
    docs: Fix typo in README.md (#1880) Francisco Arceo 2025-04-07 03:58:33 -06:00
  • 28e262ecdc
    feat: make multi-turn tool call tests work with llama4 (#1886) Hardik Shah 2025-04-06 19:14:21 -07:00
  • 5a31e66a91 fix: update llama-stack-client dependency to fix integration tests Ashwin Bharambe 2025-04-06 19:11:05 -07:00
  • 378f0de439
    docs: llama4 getting started nb (#1878) ehhuang 2025-04-06 18:51:34 -07:00
  • 3f92b2bf85 fix: kill the usage of python_start and python_end tokens Ashwin Bharambe 2025-04-05 19:00:26 -07:00
  • 3021c87271 fix: bump version to 0.2.1 for bugfix release v0.2.1 Ashwin Bharambe 2025-04-05 16:05:37 -07:00
  • fd7ab37c14
    docs: fixing sphinx imports (#1884) raghotham 2025-04-05 14:21:45 -07:00
  • e2213265bc
    docs: Update README.md (#1879) Hardik Shah 2025-04-05 12:15:55 -07:00
  • b8f1561956
    feat: introduce llama4 support (#1877) v0.2.0 Ashwin Bharambe 2025-04-05 11:53:35 -07:00
  • 23a99a4b22
    docs: Minor updates to docs to make them a little friendlier to new users (#1871) Francisco Arceo 2025-04-04 06:10:35 -06:00
  • 66d6c2580e
    chore: more mypy checks (ollama, vllm, ...) (#1777) Ihar Hrachyshka 2025-04-01 11:12:39 -04:00
  • d5e0f32485
    ci: pin github actions to hashes (#1776) Ihar Hrachyshka 2025-04-01 11:09:39 -04:00
  • 19f504e9e2
    docs: Updating docs to source from CONTRIBUTING.md (#1850) Francisco Arceo 2025-04-01 06:50:04 -06:00