Commit graph

  • a34cef925b
    fix(faiss): handle case where distance is 0 by setting d to minimum positive… (#2387) Ibrahim Haroon 2025-06-07 16:09:46 -04:00
  • 33ecefd284
    feat: To add health status check for remote VLLM (#2303) Sumit Jaiswal 2025-06-07 01:03:12 +05:30
  • 32c651e3a7
    chore: update CODEOWNERS (#2414) Alexey Rybak 2025-06-06 11:35:15 -07:00
  • 1f48577a02
    fix: ChromaDB provider (#2413) Hardik Shah 2025-06-06 11:25:58 -07:00
  • 0d0b8d2be1
    ci: use ollama container image with loaded models (#2410) Sébastien Han 2025-06-06 12:08:20 +02:00
  • 692709cd45 build: Bump version to 0.2.10 github-actions[bot] 2025-06-05 22:56:39 +00:00
  • 102516f33c
    fix: Pin fastapi to avoid picking up spurious versions in test pypi (#2409) Hardik Shah 2025-06-05 15:33:30 -07:00
  • 446893f791
    feat: add deps dynamically based on metastore config (#2405) ehhuang 2025-06-05 14:07:25 -07:00
  • 92b59a3377
    test: skip files integrations tests for library client (#2407) ehhuang 2025-06-05 13:42:10 -07:00
  • ee6feaa2d5
    chore: remove dead code (#2403) ehhuang 2025-06-05 12:17:54 -07:00
  • 04592b9590
    fix: update pyproject to include recursive LS deps (#2404) Hardik Shah 2025-06-05 11:46:48 -07:00
  • 4fb228a1d8
    ci: run integration test on more python version (#2400) Sébastien Han 2025-06-05 20:40:21 +02:00
  • 3251b44d8a
    refactor: unify stream and non-stream impls for responses (#2388) Ashwin Bharambe 2025-06-05 17:48:09 +02:00
  • ef885d2147
    fix(server): Add missing OpenTelemetry dependencies to resolve telemetry import errors (#2391) Jose Angel Morena Simon 2025-06-05 09:34:46 +02:00
  • 179d72615b
    docs: update contributing guidance around uv python versions (#2398) Nathan Weinberg 2025-06-05 02:12:03 -04:00
  • a58c0639d5
    chore: update postgres_demo distro config (#2396) ehhuang 2025-06-04 17:41:27 -07:00
  • c8c742ba45
    fix: vllm starter name (#2392) Sébastien Han 2025-06-04 16:21:36 +02:00
  • 0de9536717
    fix: remove debug print accidentally merged (#2393) grs 2025-06-04 09:14:14 -04:00
  • e9d9f01b8b
    docs: Add OpenAI API compatibility page (#2316) Ben Browning 2025-06-04 06:51:52 -04:00
  • ed69c1b3cc
    feat(responses): add more streaming response types (#2375) Ashwin Bharambe 2025-06-03 15:48:41 -07:00
  • d96f6ec763
    chore(ui): use proxy server for backend API calls; simplified k8s deployment (#2350) ehhuang 2025-06-03 14:57:10 -07:00
  • 7c1998db25
    feat: fine grained access control policy (#2264) grs 2025-06-03 17:51:12 -04:00
  • 8bee2954be
    feat: Structured output for Responses API (#2324) Ben Browning 2025-06-03 17:43:00 -04:00
  • c70ca8344f
    fix: resolve template name to config path in llama stack run (#2361) Ignas Baranauskas 2025-06-03 22:39:12 +01:00
  • cba55808ab
    feat(distro): add more providers to starter distro, prefix conflicting models (#2362) Ashwin Bharambe 2025-06-03 12:10:46 -07:00
  • b380cb463f
    feat: add postgres deps to starter distro (#2360) Ashwin Bharambe 2025-06-03 11:04:23 -07:00
  • e743257d1d
    docs: Add missing dependencies in quickstart demo command (#2347) Jorge 2025-06-03 18:01:36 +02:00
  • 3c9a10d2fe
    feat: reference implementation for files API (#2330) ehhuang 2025-06-02 21:54:24 -07:00
  • ba25c5e7e1
    docs(k8s): add UI template (#2343) Ashwin Bharambe 2025-06-02 17:55:18 -07:00
  • e92f571f47
    fix: ollama chat completion needs unique ids (#2344) Ben Browning 2025-06-02 20:43:20 -04:00
  • 4540c9b3e5
    chore: revert llama-stack-client dep (#2342) ehhuang 2025-06-02 16:05:21 -07:00
  • dbe4e84aca
    feat(responses): implement full multi-turn support (#2295) Ashwin Bharambe 2025-06-02 15:35:49 -07:00
  • cac7d404a2
    fix: remove openai dep (#2337) ehhuang 2025-06-02 15:15:12 -07:00
  • 76dcf47320
    docs(mcp): add a few lines for how to specify Auth headers in MCP tools (#2336) Ashwin Bharambe 2025-06-02 14:28:38 -07:00
  • 6bb174bb05
    revert: "chore: Remove zero-width space characters from OTEL service" (#2331) Sébastien Han 2025-06-02 23:21:35 +02:00
  • 3511af7c33
    fix: fireworks provider for openai compat inference endpoint (#2335) Hardik Shah 2025-06-02 14:11:15 -07:00
  • 7fb4bdabea
    docs(kubernetes): add more fleshed-out example of a Demo Kubernetes cluster (#2329) Ashwin Bharambe 2025-06-02 13:07:08 -07:00
  • 31a3ae60f4
    feat: openai files api (#2321) ehhuang 2025-06-02 11:45:53 -07:00
  • 17f4414be9
    fix: remote-vllm event loop blocking unit test on Mac (#2332) Ben Browning 2025-06-02 11:24:12 -04:00
  • 1c0c6e1e17
    chore: remove usage of load_tiktoken_bpe (#2276) Sébastien Han 2025-06-02 16:33:37 +02:00
  • af65207ebd
    chore: help setuptools finding the project path (#2333) Sébastien Han 2025-06-02 16:20:46 +02:00
  • c7be73fb16
    refactor: remove container from list of run image types (#2178) Mark Campbell 2025-06-02 08:57:55 +01:00
  • b21050935e
    feat: New OpenAI compat embeddings API (#2314) Hardik Shah 2025-05-31 22:11:47 -07:00
  • 277f8690ef
    fix: Responses streaming tools don't concatenate None and str (#2326) Ben Browning 2025-05-31 21:24:04 -04:00
  • f328436831
    feat: Enable ingestion of precomputed embeddings (#2317) Francisco Arceo 2025-05-31 04:03:37 -06:00
  • 31ce208bda
    fix: Fix requirements from broken github-actions[bot] (#2323) Francisco Arceo 2025-05-30 20:05:47 -06:00
  • ad15276da1 build: Bump version to 0.2.9 github-actions[bot] 2025-05-30 19:43:09 +00:00
  • 2603f10f95
    feat: support postgresql inference store (#2310) ehhuang 2025-05-29 14:33:09 -07:00
  • 168c7113df
    fix(providers): update sambanova json schema mode (#2306) Jorge Piedrahita Ortiz 2025-05-29 11:54:23 -05:00
  • f0d8ceb242
    chore: fix flaky distro_codegen script (#2305) Mark Campbell 2025-05-29 17:53:45 +01:00
  • bfdd15d1fa
    fix(responses): use input, not original_input when storing the Response (#2300) Ashwin Bharambe 2025-05-28 13:17:48 -07:00
  • a654467552
    feat: add cpu/cuda config for prompt guard (#2194) Michael Dawson 2025-05-28 12:23:15 -07:00
  • 63a9f08c9e
    chore: use starlette built-in Route class (#2267) Sébastien Han 2025-05-28 18:53:33 +02:00
  • 56e5ddb39f
    feat(ui): add views for Responses (#2293) ehhuang 2025-05-28 09:51:22 -07:00
  • 6352078e4b
    chore: use groups when running commands (#2298) Sébastien Han 2025-05-28 18:13:16 +02:00
  • a7ecc92be1
    docs: add post training to providers list (#2280) Charlie Doern 2025-05-28 09:32:00 -04:00
  • 9b7f9db05c
    fix: build docs without requirements.txt (#2294) raghotham 2025-05-27 16:27:57 -07:00
  • 0b695538af
    fix: chat completion with more than one choice (#2288) ehhuang 2025-05-27 15:39:15 -07:00
  • 1d46f3102e
    fix: enable test_responses_store (#2290) ehhuang 2025-05-27 15:37:28 -07:00
  • 4f3f28f718
    chore: use dependency-groups for dev (#2287) Sébastien Han 2025-05-27 23:00:17 +02:00
  • 484abe3116
    chore: bump uv version (#2289) Sébastien Han 2025-05-27 22:44:27 +02:00
  • 7105a25b0f build: Bump version to 0.2.8 github-actions[bot] 2025-05-27 20:28:29 +00:00
  • 5cdb29758a
    feat(responses): add output_text delta events to responses (#2265) Ashwin Bharambe 2025-05-27 13:07:14 -07:00
  • 6ee319ae08
    fix: convert boolean string to boolean (#2284) Sébastien Han 2025-05-27 22:05:38 +02:00
  • a8f75d3897
    chore: remove dependencies.json (#2281) Sébastien Han 2025-05-27 19:26:57 +02:00
  • e7e9ec0379
    chore: fix visible comments in pr template (#2279) Mark Campbell 2025-05-27 14:42:33 +01:00
  • b2adaa3f60
    docs: fix evals notebook preview (#2277) Mark Campbell 2025-05-27 14:18:20 +01:00
  • 448f00903d
    chore: mark blobpath as optional (#2271) Sébastien Han 2025-05-27 10:55:24 +02:00
  • 28930cdab6
    fix: handle None external_providers_dir in build with run arg (#2269) Ignas Baranauskas 2025-05-27 08:41:12 +01:00
  • 7504c2f430
    test: disable test_inference_store test urrrggg (#2273) Ashwin Bharambe 2025-05-26 22:48:41 -07:00
  • 51e6f529f3
    fix: index non-MCP toolgroups at registration time (#2272) Ashwin Bharambe 2025-05-26 20:33:36 -07:00
  • 39b33a3b01
    chore: allow to pass CA cert to remote vllm (#2266) Sébastien Han 2025-05-26 20:59:03 +02:00
  • 7710b2f43b
    chore: removed unused class (#2268) Sébastien Han 2025-05-26 17:41:37 +02:00
  • 9623d5d230
    fix: match mcp headers in provider data to Responses API shape (#2263) Ashwin Bharambe 2025-05-25 14:33:10 -07:00
  • ce33d02443
    fix(tools): do not index tools, only index toolgroups (#2261) Ashwin Bharambe 2025-05-25 13:27:52 -07:00
  • 5a422e236c
    chore: make cprint write to stderr (#2250) raghotham 2025-05-24 23:39:57 -07:00
  • c25bd0ad58
    fix: use pypi browser agent (#2260) raghotham 2025-05-24 23:26:30 -07:00
  • 298721c238
    chore: split routing_tables into individual files (#2259) Ashwin Bharambe 2025-05-24 23:15:05 -07:00
  • eedf21f19c
    chore: split routers into individual files (inference, tool, vector_io, eval_scoring) (#2258) Ashwin Bharambe 2025-05-24 22:59:07 -07:00
  • ae7272d8ff
    chore: split routers into individual files (datasets) (#2249) Ashwin Bharambe 2025-05-24 22:11:43 -07:00
  • a2160dc0af
    chore: split routers into individual files (safety) Ashwin Bharambe 2025-05-24 22:00:32 -07:00
  • c290999c63
    fix(telemetry): get rid of annoying sqlite span export error (#2245) Ashwin Bharambe 2025-05-24 20:24:34 -07:00
  • 3faf1e4a79
    feat: enable MCP execution in Responses impl (#2240) Ashwin Bharambe 2025-05-24 14:20:42 -07:00
  • 66f09f24ed
    fix: disable test_responses_store (#2244) Ashwin Bharambe 2025-05-24 08:18:06 -07:00
  • 84751f3e55
    fix: skip failing tests (#2243) raghotham 2025-05-24 07:31:08 -07:00
  • a411029d7e
    docs: Update CHANGELOG.md (#2241) Yuan Tang 2025-05-24 10:06:36 -04:00
  • 15b0a67555
    feat: add responses input items api (#2239) ehhuang 2025-05-24 07:05:53 -07:00
  • 055f48b6a2
    fix(security): Upgrade setuptools to v80.8.0. Fixes CVE-2025-47273 (#2242) Yuan Tang 2025-05-24 09:57:24 -04:00
  • ca65617a71
    feat: start ui server in llama stack run (#2170) ehhuang 2025-05-23 20:00:09 -07:00
  • 5844c2da68
    feat: add list responses API (#2233) ehhuang 2025-05-23 13:16:48 -07:00
  • 6463ee7633
    feat: allow using llama-stack-library-client from verifications (#2238) Ashwin Bharambe 2025-05-23 11:43:41 -07:00
  • 558d109ab7
    fix: signature change to match OpenAI SDK (#2237) Ashwin Bharambe 2025-05-23 10:59:30 -07:00
  • b054023800
    chore: add sqlalchemy to test dependencies (#2236) ehhuang 2025-05-23 10:33:38 -07:00
  • 51945f1e57
    feat: accept MCP authorization headers for MCP toolgroups (#2230) Ashwin Bharambe 2025-05-23 08:52:18 -07:00
  • 2708312168
    feat(ui): implement chat completion views (#2201) ehhuang 2025-05-22 22:05:54 -07:00
  • d8c6ab9bfc
    feat: add MCP tool signature to Responses API (#2232) Ashwin Bharambe 2025-05-22 16:43:08 -07:00
  • 8feb1827c8
    fix: openai provider model id (#2229) ehhuang 2025-05-22 14:51:01 -07:00
  • 549812f51e
    feat: implement get chat completions APIs (#2200) ehhuang 2025-05-21 22:21:52 -07:00
  • 633bb9c5b3
    feat(providers): sambanova safety provider (#2221) Jorge Piedrahita Ortiz 2025-05-21 17:33:02 -05:00
  • 02e5e8a633
    fix: only print routes that match the runtime config (#2226) Sébastien Han 2025-05-22 00:30:29 +02:00