llama-stack

phoenix-oss/llama-stack

Fork 0

forked from phoenix-oss/llama-stack-mirror

Commit graph

68eeacec0e

docs: resync missing nvidia doc (#1947) Sébastien Han 2025-04-14 15:09:16 +02:00
2ec5879f14

chore(github-deps): bump astral-sh/setup-uv from 5.4.0 to 5.4.1 (#1881) dependabot[bot] 2025-04-14 14:33:43 +02:00
030ca4b2be

docs: Move Llama 4 instructions in a collapsed section (#1936) Yuan Tang 2025-04-14 08:14:59 -04:00
6d6b40983e

refactor: update integration test workflow (#1856) Matthew Farrellee 2025-04-14 06:17:51 -04:00
69554158fa

feat: add health to all providers through providers endpoint (#1418) Sébastien Han 2025-04-14 11:59:36 +02:00
ff14773fa7 fix: update llama stack client dependency Ashwin Bharambe 2025-04-12 18:14:33 -07:00
ca26faa7fd build: Bump version to 0.2.2 v0.2.2 release-0.2.2 github-actions[bot] 2025-04-13 01:07:44 +00:00
1079e22b11 Release candidate 0.2.2rc1 v0.2.2rc1 github-actions[bot] 2025-04-13 00:54:05 +00:00
429f6de7d7 fix: misc fixes for tests kill horrible warnings Ashwin Bharambe 2025-04-12 17:12:11 -07:00
8b4158169f fix: dont check protocol compliance for experimental methods Ashwin Bharambe 2025-04-12 12:17:39 -07:00
ad86a68a32

feat: support '-' in tool names (#1807) ehhuang 2025-04-12 14:23:03 -07:00
ef3dc143ec fix: test_registration was borked somehow Ashwin Bharambe 2025-04-12 12:04:01 -07:00
1e5bf6c19d

feat: update default tool use prompt (#1803) ehhuang 2025-04-12 11:54:22 -07:00
f34f22f8c7

feat: add batch inference API to llama stack inference (#1945) Ashwin Bharambe 2025-04-12 11:41:12 -07:00
854c2ad264

fix: misleading help text for 'llama stack build' and 'llama stack run' (#1910) Nathan Weinberg 2025-04-12 04:19:11 -04:00
0751a960a5

feat: make training config fields optional (#1861) Charlie Doern 2025-04-12 04:13:45 -04:00
70a7e4d51e fix: unhide python_start, python_end Ashwin Bharambe 2025-04-11 20:30:44 -07:00
51492bd9b6

docs: Update docs and fix warning in start-stack.sh (#1937) Aidan Reilly 2025-04-12 00:26:17 +01:00
ed58a94b30

docs: fixes to quick start (#1943) raghotham 2025-04-11 13:41:23 -07:00
2b2db5fbda

feat: OpenAI-Compatible models, completions, chat/completions (#1894) Ben Browning 2025-04-11 16:14:17 -04:00
24d70cedca

docs: Updated docs to show minimal RAG example and some other minor changes (#1935) Francisco Arceo 2025-04-11 12:50:36 -06:00
c1cb6aad11

feat: Add unit tests for NVIDIA safety (#1897) Jash Gulabrai 2025-04-11 14:49:55 -04:00
2a74f0db39

fix: remove extra sft args in NvidiaPostTrainingAdapter (#1939) Ben Browning 2025-04-11 13:17:57 -04:00
40f41af2f7

feat: Add a direct (non-agentic) RAG option to the Playground RAG page (#1940) Ilya Kolchinsky 2025-04-11 19:16:10 +02:00
c6fa47db6f

fix: ensure resource registration arguments are typed (#1941) Matthew Farrellee 2025-04-11 12:25:57 -04:00
6aa459b00c

docs: fix errors in kubernetes deployment guide (#1914) Mark Campbell 2025-04-11 12:04:13 +01:00
2fcb70b789

test(verification): overwrite test result instead of creating new ones (#1934) ehhuang 2025-04-10 16:59:28 -07:00
a4cc4b7e31

test(verification): add streaming tool calling test (#1933) ehhuang 2025-04-10 16:58:06 -07:00
49955a06b1

docs: Update quickstart page to structure things a little more for the novices (#1873) Francisco Arceo 2025-04-10 15:09:00 -06:00
edd9aaac3b

fix: use torchao 0.8.0 for inference (#1925) Sébastien Han 2025-04-10 22:39:20 +02:00
79fc81f78f

fix: Playground RAG page errors (#1928) Ilya Kolchinsky 2025-04-10 22:38:31 +02:00
de6ec5803e

fix: Fix linter failures from #1921 (#1932) Francisco Arceo 2025-04-10 11:37:31 -06:00
14146e4b3f

feat(verification): various improvements (#1921) ehhuang 2025-04-10 10:26:19 -07:00
09a83b1ec1

docs: Updating background color for code in darkmode (#1930) Francisco Arceo 2025-04-10 10:38:57 -06:00
1f2df59ece

docs: fix model name (#1926) Sébastien Han 2025-04-10 18:37:48 +02:00
1be66d754e

docs: Redirect instructions for additional hardware accelerators for remote vLLM provider (#1923) Yuan Tang 2025-04-10 04:04:17 -04:00
712c6758c6

docs: Avoid bash script syntax highlighting for dark mode (#1918) Yuan Tang 2025-04-09 18:43:43 -04:00
36a31fe5dd

fix: on-the-fly int4 quantize parameter (#1920) Jiawen Liu 2025-04-09 15:00:12 -07:00
e2299291c4

fix: Mirror llama4 rope scaling fixes, small model simplify (#1917) Ashwin Bharambe 2025-04-09 11:28:45 -07:00
770b38f8b5

chore: simplify running the demo UI (#1907) Sébastien Han 2025-04-09 20:22:29 +02:00
b93318e40b

chore: Detect browser setting for dark/light mode and set default to light mode (#1913) Francisco Arceo 2025-04-09 10:40:56 -06:00
5c010e234a

fix: add tavily_search option to playground api (#1909) Michael Clifford 2025-04-09 09:56:41 -04:00
692f56068c

docs: Add recent release notes (#1899) Yuan Tang 2025-04-09 09:34:41 -04:00
9657105304

feat: Add tools page to playground (#1904) Michael Clifford 2025-04-09 09:26:52 -04:00
30b49d8dfa

fix: Playground Container Issue (#1868) Jaland 2025-04-09 10:45:15 +01:00
22814299b0

fix: solve unregister_toolgroup error (#1608) Paolo Dettori 2025-04-09 04:56:07 -04:00
a2cf299906

fix: update getting started guide to use ollama pull (#1855) Matthew Farrellee 2025-04-09 04:35:19 -04:00
3a9be58523

fix: use ollama list to find models (#1854) Matthew Farrellee 2025-04-09 04:34:26 -04:00
389767010b

feat: ability to execute external providers (#1672) Sébastien Han 2025-04-09 10:30:41 +02:00
45e210fd0c fix: llama3 bf16 model load Ashwin Bharambe 2025-04-09 01:09:16 -07:00
e3d22d8de7

chore: fix hash for thollander/actions-comment-pull-request (#1900) Ihar Hrachyshka 2025-04-09 04:10:07 -04:00
8001c30a4f fix: meta reference + llama4 tokenizer fix Ashwin Bharambe 2025-04-09 00:46:02 -07:00
10882bf478

chore: remove unused tempdir in agent (#1896) Sébastien Han 2025-04-09 09:43:48 +02:00
983f6feeb8

docs: Update remote-vllm.md with AMD GPU vLLM server supported. (#1858) AlexHe99 2025-04-09 12:35:32 +08:00
bcbc56baa2

feat: adds test suite to verify provider's OAI compat endpoints (#1901) ehhuang 2025-04-08 21:21:38 -07:00
7d9adf22ad

refactor: move missing tests to test directory (#1892) Sébastien Han 2025-04-09 03:54:00 +02:00
0431a6e90b

docs: colorize Discord badge & add icon in README (#1865) wesley chun 2025-04-08 11:42:47 -07:00
031a40bec0

fix: type (#1898) ehhuang 2025-04-08 09:07:25 -07:00
c6e93e32f6

feat: Updated playground rag to use session id for persistent conversation (#1870) Michael Clifford 2025-04-08 03:46:13 -04:00
7b4eb0967e

test: verification on provider's OAI endpoints (#1893) ehhuang 2025-04-07 23:06:28 -07:00
530d4bdfe1

refactor: move all llama code to models/llama out of meta reference (#1887) Ashwin Bharambe 2025-04-07 15:03:58 -07:00
c52ccc4bbd

docs: update importing_as_library.md (#1863) Matthew Farrellee 2025-04-07 06:31:04 -04:00
c1973f6528

docs: Fix typo in README.md (#1880) Francisco Arceo 2025-04-07 03:58:33 -06:00
28e262ecdc

feat: make multi-turn tool call tests work with llama4 (#1886) Hardik Shah 2025-04-06 19:14:21 -07:00
5a31e66a91 fix: update llama-stack-client dependency to fix integration tests Ashwin Bharambe 2025-04-06 19:11:05 -07:00
378f0de439

docs: llama4 getting started nb (#1878) ehhuang 2025-04-06 18:51:34 -07:00
3f92b2bf85 fix: kill the usage of python_start and python_end tokens Ashwin Bharambe 2025-04-05 19:00:26 -07:00
3021c87271 fix: bump version to 0.2.1 for bugfix release v0.2.1 Ashwin Bharambe 2025-04-05 16:05:37 -07:00
fd7ab37c14

docs: fixing sphinx imports (#1884) raghotham 2025-04-05 14:21:45 -07:00
e2213265bc

docs: Update README.md (#1879) Hardik Shah 2025-04-05 12:15:55 -07:00
b8f1561956

feat: introduce llama4 support (#1877) v0.2.0 Ashwin Bharambe 2025-04-05 11:53:35 -07:00
23a99a4b22

docs: Minor updates to docs to make them a little friendlier to new users (#1871) Francisco Arceo 2025-04-04 06:10:35 -06:00
66d6c2580e

chore: more mypy checks (ollama, vllm, ...) (#1777) Ihar Hrachyshka 2025-04-01 11:12:39 -04:00
d5e0f32485

ci: pin github actions to hashes (#1776) Ihar Hrachyshka 2025-04-01 11:09:39 -04:00
19f504e9e2

docs: Updating docs to source from CONTRIBUTING.md (#1850) Francisco Arceo 2025-04-01 06:50:04 -06:00
c169c164b3

fix: NVIDIA embedding results in InternalServerError (#1851) Rashmi Pawar 2025-04-01 17:01:29 +05:30
0a895c70d1

fix(api): don't return list for runtime tools (#1686) Ihar Hrachyshka 2025-04-01 03:53:11 -04:00
b440a1dc42

test: make sure integration tests runs against the server (#1743) Ashwin Bharambe 2025-03-31 13:38:47 -07:00
2ffa2b77ed

refactor: extract pagination logic into shared helper function (#1770) Sébastien Han 2025-03-31 22:08:29 +02:00
d495922949

docs: Updated documentation and Sphinx configuration (#1845) Francisco Arceo 2025-03-31 14:08:05 -06:00
60430da48a

docs: Update readme for integration tests (#1846) Francisco Arceo 2025-03-31 14:00:02 -06:00
9b478f3756

docs: Adding darkmode to documentation (#1843) Francisco Arceo 2025-03-31 09:31:53 -06:00
7e51a83eac

docs: Add link to integration tests instructions and minor clarification (#1838) Yuan Tang 2025-03-31 05:37:42 -04:00
90efafafb7

chore: change context to content for agent (#1840) Xi Yan 2025-03-30 10:33:58 -07:00
3a2314dcef

fix(telemetry): library client does not log span (#1833) ehhuang 2025-03-29 14:55:31 -07:00
d8a8a734b5

fix: update sink name for traces and metrics in LlamaStack 0.1.8 (#1836) Anamika 2025-03-29 17:09:08 +00:00
a4c086cee0

fix: skip apis with no providers during llama stack build (#1835) Matthew Farrellee 2025-03-29 11:39:35 -04:00
a182705ade

fix(telemetry): query_spans (#1831) ehhuang 2025-03-28 20:58:17 -07:00
74a2584cdb

chore: Updating Milvus Client calls to be non-blocking (#1830) Francisco Arceo 2025-03-28 20:14:07 -06:00
daa34909a0 build: Bump version to 0.1.9 github-actions[bot] 2025-03-29 00:22:35 +00:00
337aa6d183 build: Bump version to 0.1.9 v0.1.9 release-0.1.9 github-actions[bot] 2025-03-29 00:22:07 +00:00
b7ab1a9710 build: Bump version to 0.1.19 github-actions[bot] 2025-03-29 00:18:38 +00:00
54747c28fc Release candidate 0.1.19rc8 v0.1.19rc8 github-actions[bot] 2025-03-28 23:53:32 +00:00
e58c7f6c37

fix(telemetry): root span not yet received (#1828) ehhuang 2025-03-28 14:40:17 -07:00
7e7bea66ba

fix: skip code interp (#1827) Xi Yan 2025-03-28 12:58:08 -07:00
af6594f670

fix: Adding chunk_size_in_tokens to playground rag_tool insert (#1826) Francisco Arceo 2025-03-28 13:56:25 -06:00
37b6da37ba

docs: Document sqlite-vec faiss comparison (#1821) Francisco Arceo 2025-03-28 10:41:33 -06:00
a4f458e1c1

ci: add myself to CODEOWNERS (#1823) Sébastien Han 2025-03-28 17:37:42 +01:00
18bac27d4e

fix: Use CONDA_DEFAULT_ENV presence as a flag to use conda mode (#1555) Ihar Hrachyshka 2025-03-27 17:13:22 -04:00
b5c27f77ad

chore: clean up distro doc (#1804) Xi Yan 2025-03-27 12:12:14 -07:00