Commit graph

  • 68eeacec0e
    docs: resync missing nvidia doc (#1947) Sébastien Han 2025-04-14 15:09:16 +02:00
  • 2ec5879f14
    chore(github-deps): bump astral-sh/setup-uv from 5.4.0 to 5.4.1 (#1881) dependabot[bot] 2025-04-14 14:33:43 +02:00
  • 030ca4b2be
    docs: Move Llama 4 instructions in a collapsed section (#1936) Yuan Tang 2025-04-14 08:14:59 -04:00
  • 6d6b40983e
    refactor: update integration test workflow (#1856) Matthew Farrellee 2025-04-14 06:17:51 -04:00
  • 69554158fa
    feat: add health to all providers through providers endpoint (#1418) Sébastien Han 2025-04-14 11:59:36 +02:00
  • ff14773fa7
    fix: update llama stack client dependency Ashwin Bharambe 2025-04-12 18:14:33 -07:00
  • ca26faa7fd
    build: Bump version to 0.2.2 v0.2.2 release-0.2.2 github-actions[bot] 2025-04-13 01:07:44 +00:00
  • 1079e22b11
    Release candidate 0.2.2rc1 v0.2.2rc1 github-actions[bot] 2025-04-13 00:54:05 +00:00
  • 429f6de7d7
    fix: misc fixes for tests kill horrible warnings Ashwin Bharambe 2025-04-12 17:12:11 -07:00
  • 8b4158169f
    fix: dont check protocol compliance for experimental methods Ashwin Bharambe 2025-04-12 12:17:39 -07:00
  • ad86a68a32
    feat: support '-' in tool names (#1807) ehhuang 2025-04-12 14:23:03 -07:00
  • ef3dc143ec
    fix: test_registration was borked somehow Ashwin Bharambe 2025-04-12 12:04:01 -07:00
  • 1e5bf6c19d
    feat: update default tool use prompt (#1803) ehhuang 2025-04-12 11:54:22 -07:00
  • f34f22f8c7
    feat: add batch inference API to llama stack inference (#1945) Ashwin Bharambe 2025-04-12 11:41:12 -07:00
  • 854c2ad264
    fix: misleading help text for 'llama stack build' and 'llama stack run' (#1910) Nathan Weinberg 2025-04-12 04:19:11 -04:00
  • 0751a960a5
    feat: make training config fields optional (#1861) Charlie Doern 2025-04-12 04:13:45 -04:00
  • 70a7e4d51e
    fix: unhide python_start, python_end Ashwin Bharambe 2025-04-11 20:30:44 -07:00
  • 51492bd9b6
    docs: Update docs and fix warning in start-stack.sh (#1937) Aidan Reilly 2025-04-12 00:26:17 +01:00
  • ed58a94b30
    docs: fixes to quick start (#1943) raghotham 2025-04-11 13:41:23 -07:00
  • 2b2db5fbda
    feat: OpenAI-Compatible models, completions, chat/completions (#1894) Ben Browning 2025-04-11 16:14:17 -04:00
  • 24d70cedca
    docs: Updated docs to show minimal RAG example and some other minor changes (#1935) Francisco Arceo 2025-04-11 12:50:36 -06:00
  • c1cb6aad11
    feat: Add unit tests for NVIDIA safety (#1897) Jash Gulabrai 2025-04-11 14:49:55 -04:00
  • 2a74f0db39
    fix: remove extra sft args in NvidiaPostTrainingAdapter (#1939) Ben Browning 2025-04-11 13:17:57 -04:00
  • 40f41af2f7
    feat: Add a direct (non-agentic) RAG option to the Playground RAG page (#1940) Ilya Kolchinsky 2025-04-11 19:16:10 +02:00
  • c6fa47db6f
    fix: ensure resource registration arguments are typed (#1941) Matthew Farrellee 2025-04-11 12:25:57 -04:00
  • 6aa459b00c
    docs: fix errors in kubernetes deployment guide (#1914) Mark Campbell 2025-04-11 12:04:13 +01:00
  • 2fcb70b789
    test(verification): overwrite test result instead of creating new ones (#1934) ehhuang 2025-04-10 16:59:28 -07:00
  • a4cc4b7e31
    test(verification): add streaming tool calling test (#1933) ehhuang 2025-04-10 16:58:06 -07:00
  • 49955a06b1
    docs: Update quickstart page to structure things a little more for the novices (#1873) Francisco Arceo 2025-04-10 15:09:00 -06:00
  • edd9aaac3b
    fix: use torchao 0.8.0 for inference (#1925) Sébastien Han 2025-04-10 22:39:20 +02:00
  • 79fc81f78f
    fix: Playground RAG page errors (#1928) Ilya Kolchinsky 2025-04-10 22:38:31 +02:00
  • de6ec5803e
    fix: Fix linter failures from #1921 (#1932) Francisco Arceo 2025-04-10 11:37:31 -06:00
  • 14146e4b3f
    feat(verification): various improvements (#1921) ehhuang 2025-04-10 10:26:19 -07:00
  • 09a83b1ec1
    docs: Updating background color for code in darkmode (#1930) Francisco Arceo 2025-04-10 10:38:57 -06:00
  • 1f2df59ece
    docs: fix model name (#1926) Sébastien Han 2025-04-10 18:37:48 +02:00
  • 1be66d754e
    docs: Redirect instructions for additional hardware accelerators for remote vLLM provider (#1923) Yuan Tang 2025-04-10 04:04:17 -04:00
  • 712c6758c6
    docs: Avoid bash script syntax highlighting for dark mode (#1918) Yuan Tang 2025-04-09 18:43:43 -04:00
  • 36a31fe5dd
    fix: on-the-fly int4 quantize parameter (#1920) Jiawen Liu 2025-04-09 15:00:12 -07:00
  • e2299291c4
    fix: Mirror llama4 rope scaling fixes, small model simplify (#1917) Ashwin Bharambe 2025-04-09 11:28:45 -07:00
  • 770b38f8b5
    chore: simplify running the demo UI (#1907) Sébastien Han 2025-04-09 20:22:29 +02:00
  • b93318e40b
    chore: Detect browser setting for dark/light mode and set default to light mode (#1913) Francisco Arceo 2025-04-09 10:40:56 -06:00
  • 5c010e234a
    fix: add tavily_search option to playground api (#1909) Michael Clifford 2025-04-09 09:56:41 -04:00
  • 692f56068c
    docs: Add recent release notes (#1899) Yuan Tang 2025-04-09 09:34:41 -04:00
  • 9657105304
    feat: Add tools page to playground (#1904) Michael Clifford 2025-04-09 09:26:52 -04:00
  • 30b49d8dfa
    fix: Playground Container Issue (#1868) Jaland 2025-04-09 10:45:15 +01:00
  • 22814299b0
    fix: solve unregister_toolgroup error (#1608) Paolo Dettori 2025-04-09 04:56:07 -04:00
  • a2cf299906
    fix: update getting started guide to use ollama pull (#1855) Matthew Farrellee 2025-04-09 04:35:19 -04:00
  • 3a9be58523
    fix: use ollama list to find models (#1854) Matthew Farrellee 2025-04-09 04:34:26 -04:00
  • 389767010b
    feat: ability to execute external providers (#1672) Sébastien Han 2025-04-09 10:30:41 +02:00
  • 45e210fd0c
    fix: llama3 bf16 model load Ashwin Bharambe 2025-04-09 01:09:16 -07:00
  • e3d22d8de7
    chore: fix hash for thollander/actions-comment-pull-request (#1900) Ihar Hrachyshka 2025-04-09 04:10:07 -04:00
  • 8001c30a4f
    fix: meta reference + llama4 tokenizer fix Ashwin Bharambe 2025-04-09 00:46:02 -07:00
  • 10882bf478
    chore: remove unused tempdir in agent (#1896) Sébastien Han 2025-04-09 09:43:48 +02:00
  • 983f6feeb8
    docs: Update remote-vllm.md with AMD GPU vLLM server supported. (#1858) AlexHe99 2025-04-09 12:35:32 +08:00
  • bcbc56baa2
    feat: adds test suite to verify provider's OAI compat endpoints (#1901) ehhuang 2025-04-08 21:21:38 -07:00
  • 7d9adf22ad
    refactor: move missing tests to test directory (#1892) Sébastien Han 2025-04-09 03:54:00 +02:00
  • 0431a6e90b
    docs: colorize Discord badge & add icon in README (#1865) wesley chun 2025-04-08 11:42:47 -07:00
  • 031a40bec0
    fix: type (#1898) ehhuang 2025-04-08 09:07:25 -07:00
  • c6e93e32f6
    feat: Updated playground rag to use session id for persistent conversation (#1870) Michael Clifford 2025-04-08 03:46:13 -04:00
  • 7b4eb0967e
    test: verification on provider's OAI endpoints (#1893) ehhuang 2025-04-07 23:06:28 -07:00
  • 530d4bdfe1
    refactor: move all llama code to models/llama out of meta reference (#1887) Ashwin Bharambe 2025-04-07 15:03:58 -07:00
  • c52ccc4bbd
    docs: update importing_as_library.md (#1863) Matthew Farrellee 2025-04-07 06:31:04 -04:00
  • c1973f6528
    docs: Fix typo in README.md (#1880) Francisco Arceo 2025-04-07 03:58:33 -06:00
  • 28e262ecdc
    feat: make multi-turn tool call tests work with llama4 (#1886) Hardik Shah 2025-04-06 19:14:21 -07:00
  • 5a31e66a91
    fix: update llama-stack-client dependency to fix integration tests Ashwin Bharambe 2025-04-06 19:11:05 -07:00
  • 378f0de439
    docs: llama4 getting started nb (#1878) ehhuang 2025-04-06 18:51:34 -07:00
  • 3f92b2bf85
    fix: kill the usage of python_start and python_end tokens Ashwin Bharambe 2025-04-05 19:00:26 -07:00
  • 3021c87271
    fix: bump version to 0.2.1 for bugfix release v0.2.1 Ashwin Bharambe 2025-04-05 16:05:37 -07:00
  • fd7ab37c14
    docs: fixing sphinx imports (#1884) raghotham 2025-04-05 14:21:45 -07:00
  • e2213265bc
    docs: Update README.md (#1879) Hardik Shah 2025-04-05 12:15:55 -07:00
  • b8f1561956
    feat: introduce llama4 support (#1877) v0.2.0 Ashwin Bharambe 2025-04-05 11:53:35 -07:00
  • 23a99a4b22
    docs: Minor updates to docs to make them a little friendlier to new users (#1871) Francisco Arceo 2025-04-04 06:10:35 -06:00
  • 66d6c2580e
    chore: more mypy checks (ollama, vllm, ...) (#1777) Ihar Hrachyshka 2025-04-01 11:12:39 -04:00
  • d5e0f32485
    ci: pin github actions to hashes (#1776) Ihar Hrachyshka 2025-04-01 11:09:39 -04:00
  • 19f504e9e2
    docs: Updating docs to source from CONTRIBUTING.md (#1850) Francisco Arceo 2025-04-01 06:50:04 -06:00
  • c169c164b3
    fix: NVIDIA embedding results in InternalServerError (#1851) Rashmi Pawar 2025-04-01 17:01:29 +05:30
  • 0a895c70d1
    fix(api): don't return list for runtime tools (#1686) Ihar Hrachyshka 2025-04-01 03:53:11 -04:00
  • b440a1dc42
    test: make sure integration tests runs against the server (#1743) Ashwin Bharambe 2025-03-31 13:38:47 -07:00
  • 2ffa2b77ed
    refactor: extract pagination logic into shared helper function (#1770) Sébastien Han 2025-03-31 22:08:29 +02:00
  • d495922949
    docs: Updated documentation and Sphinx configuration (#1845) Francisco Arceo 2025-03-31 14:08:05 -06:00
  • 60430da48a
    docs: Update readme for integration tests (#1846) Francisco Arceo 2025-03-31 14:00:02 -06:00
  • 9b478f3756
    docs: Adding darkmode to documentation (#1843) Francisco Arceo 2025-03-31 09:31:53 -06:00
  • 7e51a83eac
    docs: Add link to integration tests instructions and minor clarification (#1838) Yuan Tang 2025-03-31 05:37:42 -04:00
  • 90efafafb7
    chore: change context to content for agent (#1840) Xi Yan 2025-03-30 10:33:58 -07:00
  • 3a2314dcef
    fix(telemetry): library client does not log span (#1833) ehhuang 2025-03-29 14:55:31 -07:00
  • d8a8a734b5
    fix: update sink name for traces and metrics in LlamaStack 0.1.8 (#1836) Anamika 2025-03-29 17:09:08 +00:00
  • a4c086cee0
    fix: skip apis with no providers during llama stack build (#1835) Matthew Farrellee 2025-03-29 11:39:35 -04:00
  • a182705ade
    fix(telemetry): query_spans (#1831) ehhuang 2025-03-28 20:58:17 -07:00
  • 74a2584cdb
    chore: Updating Milvus Client calls to be non-blocking (#1830) Francisco Arceo 2025-03-28 20:14:07 -06:00
  • daa34909a0
    build: Bump version to 0.1.9 github-actions[bot] 2025-03-29 00:22:35 +00:00
  • 337aa6d183
    build: Bump version to 0.1.9 v0.1.9 release-0.1.9 github-actions[bot] 2025-03-29 00:22:07 +00:00
  • b7ab1a9710
    build: Bump version to 0.1.19 github-actions[bot] 2025-03-29 00:18:38 +00:00
  • 54747c28fc
    Release candidate 0.1.19rc8 v0.1.19rc8 github-actions[bot] 2025-03-28 23:53:32 +00:00
  • e58c7f6c37
    fix(telemetry): root span not yet received (#1828) ehhuang 2025-03-28 14:40:17 -07:00
  • 7e7bea66ba
    fix: skip code interp (#1827) Xi Yan 2025-03-28 12:58:08 -07:00
  • af6594f670
    fix: Adding chunk_size_in_tokens to playground rag_tool insert (#1826) Francisco Arceo 2025-03-28 13:56:25 -06:00
  • 37b6da37ba
    docs: Document sqlite-vec faiss comparison (#1821) Francisco Arceo 2025-03-28 10:41:33 -06:00
  • a4f458e1c1
    ci: add myself to CODEOWNERS (#1823) Sébastien Han 2025-03-28 17:37:42 +01:00
  • 18bac27d4e
    fix: Use CONDA_DEFAULT_ENV presence as a flag to use conda mode (#1555) Ihar Hrachyshka 2025-03-27 17:13:22 -04:00
  • b5c27f77ad
    chore: clean up distro doc (#1804) Xi Yan 2025-03-27 12:12:14 -07:00