Commit graph

  • 2fb82c9ed0 fixed None to default of 10 Hardik Shah 2025-06-11 17:23:07 -07:00
  • 764026c0d6 set defaults so that stainless does not pick as required params Hardik Shah 2025-06-11 17:19:33 -07:00
  • d1ba53f257
    Merge branch 'main' into oai_mixin Hardik Shah 2025-06-11 15:31:34 -07:00
  • 3a8adf0c39 vector store name should be mandatory Hardik Shah 2025-06-11 15:28:24 -07:00
  • f042df3844
    Merge branch 'main' into feat/add-url-to-paginated-response Rohan Awhad 2025-06-11 06:32:40 -04:00
  • 3d1045fb16
    Merge branch 'main' into bug/inference-router-no-attribute-formatter Rohan Awhad 2025-06-11 06:32:03 -04:00
  • cf4bac1149 introduce OpenAI Mixin class for VectorIO Hardik Shah 2025-06-10 13:10:08 -07:00
  • 5d65c017b0
    Merge branch 'meta-llama:main' into main Sébastien Han 2025-06-10 13:06:53 +02:00
  • b45c650063 Add delete_openai_response route, define delete OpenAI message schema and make an integration test 2000krysztof 2025-05-28 15:53:32 +01:00
  • 3d72a73f8a
    fix(security): Upgrade requests to 2.32.4. Fixes CVE-2024-47081 Yuan Tang 2025-06-09 19:18:49 -04:00
  • 1299bfa16f
    docs: Add recent releases Yuan Tang 2025-06-09 19:14:26 -04:00
  • 1a888a6bfe Update sqlite-vec provider to support OpenAI vector store apis Hardik Shah 2025-06-09 14:38:26 -07:00
  • b55f1249e0 Add OpenAI compat /v1/vector_store APIs Hardik Shah 2025-06-09 14:01:11 -07:00
  • 38faf57db6 Fixes: #1867 InferenceRouter has no attribute formatter Rohan Awhad 2025-06-09 15:49:41 -04:00
  • adc373600e test: added integeration test that queries for identical vectors and verifies no divide by zero exception occurs Ibrahim Haroon 2025-06-09 09:09:51 -04:00
  • f5d2108191 fix: handle case where distance is 0 by setting score to infinity Ibrahim Haroon 2025-06-06 15:33:05 -04:00
  • 91a5b1d921
    Merge branch 'main' into feat/add-url-to-paginated-response Rohan Awhad 2025-06-09 09:04:53 -04:00
  • 06be99f1a2 fix: loosen tool call checks in inference store Ben Browning 2025-06-07 19:47:51 -04:00
  • e4d7651ffa Don't downgrade llama-stack version in uv.lock Ben Browning 2025-06-07 15:58:55 -04:00
  • e28a57ad76 feat: Add url field to PaginatedResponse and populate it using route path Rohan Awhad 2025-06-06 22:24:30 -04:00
  • 6f8312ddd0 fix: handle case where distance is 0 by setting score to infinity Ibrahim Haroon 2025-06-06 15:31:25 -04:00
  • 379b6530ee build: Bump version to 0.2.10.1 v0.2.10.1 release-0.2.10.1 github-actions[bot] 2025-06-06 18:54:51 +00:00
  • 10d4c5a80f Release candidate 0.2.10.1rc1 v0.2.10.1rc1 github-actions[bot] 2025-06-06 18:32:20 +00:00
  • 9239e09181 chore: update CODEOWNERS Alexey Rybak 2025-06-06 11:30:34 -07:00
  • ea110857df update chroma interface to match new spec Hardik Shah 2025-06-06 11:11:36 -07:00
  • b05a3db358
    Merge branch 'main' into fix/divide-by-zero-exception-faiss-query-vector Ibrahim Haroon 2025-06-06 11:29:14 -04:00
  • 57494665e1 buid: test_faiss.py requires faiss-cpu Ibrahim Haroon 2025-06-06 10:33:23 -04:00
  • 1a492ad0cc Merge branch 'main' into nvidia-e2e-notebook Jash Gulabrai 2025-06-06 11:11:53 -04:00
  • 6b6d8d70a5 Remove print statement Jash Gulabrai 2025-06-06 10:43:24 -04:00
  • 6346024fa3 fix: Don't reuse session in NVIDIA post_training request handler Jash Gulabrai 2025-06-06 10:42:10 -04:00
  • 4b32cfa846 chore: fixed formatting issues Ibrahim Haroon 2025-06-06 10:32:39 -04:00
  • c8b5774ff3
    ci: use ollama container image with loaded models Sébastien Han 2025-06-06 09:54:39 +02:00
  • 6ed971c67d
    chore: resync requirements.txt Sébastien Han 2025-06-06 11:49:44 +02:00
  • 8bc53d047c build: Bump version to 0.2.10 v0.2.10 release-0.2.10 github-actions[bot] 2025-06-05 22:55:45 +00:00
  • 1979132115 Release candidate 0.2.10rc3 v0.2.10rc3 github-actions[bot] 2025-06-05 22:35:04 +00:00
  • 18333d1ac8 fix version Hardik Shah 2025-06-05 15:29:56 -07:00
  • acabeec42f Release candidate 0.2.10rc2 v0.2.10rc2 github-actions[bot] 2025-06-05 20:59:41 +00:00
  • d211f5188b internal_providers Eric Huang 2025-06-05 13:44:09 -07:00
  • 7b7a958106 test: skip files integrations tests for library client Eric Huang 2025-06-05 13:41:47 -07:00
  • 460c67be62 kvstore Eric Huang 2025-06-05 11:59:11 -07:00
  • f0170c5d3a chore: remove explicit sqlite dependency Raghotham Murthy 2025-06-05 02:03:21 -07:00
  • fe50ec2190 chore: remove dead code Eric Huang 2025-06-05 11:58:15 -07:00
  • 51c05ae762 Add fastapi dependency Hardik Shah 2025-06-05 11:42:28 -07:00
  • 7e643b9b12 update to import all dependencies inside llama-stack Hardik Shah 2025-06-05 11:27:00 -07:00
  • e6cbe94ace update doc Gordon Sim 2025-06-05 18:06:13 +01:00
  • 1135af30cf remove cluster role binding as it is not needed Gordon Sim 2025-06-05 16:33:32 +01:00
  • a996ce2af2 change value of test token for clarity Gordon Sim 2025-06-05 16:20:46 +01:00
  • d01d3d998b reuse existing token Gordon Sim 2025-06-05 16:11:39 +01:00
  • f60c3c4acf feat: created unit test to verify query_vector function handles the edge case when the query embedding and an embedding in the vector db are identical doesn't lead to zero divison exception. Ibrahim Haroon 2025-06-05 11:06:31 -04:00
  • fea01c5a25 fix: handle case where distance is 0 by setting score to infinity Ibrahim Haroon 2025-06-03 19:20:31 -04:00
  • bc36e50b61 pre-commit Ashwin Bharambe 2025-06-05 15:43:36 +02:00
  • c18b585d32
    Merge branch 'main' into vllm_health_check Sumit Jaiswal 2025-06-05 18:09:36 +05:30
  • 1b36bd377e
    ci: run integration test on more python version Sébastien Han 2025-06-05 12:17:00 +02:00
  • d9c5931363 chore: remove explicit sqlite dependency Raghotham Murthy 2025-06-05 02:03:21 -07:00
  • cc1197cfb8
    Merge branch 'main' into add_opentelemetry_pip_packages raghotham 2025-06-04 23:28:29 -07:00
  • 73f68783c1 docs: update contributin guidance around uv python versions Nathan Weinberg 2025-06-04 22:14:01 -04:00
  • eb5f4c4cf2 chore: update postgres_demo distro config Eric Huang 2025-06-04 16:51:52 -07:00
  • 74d891db72 feat(auth): allow token to be provided for use against jwks endpoint Gordon Sim 2025-06-04 19:12:15 +01:00
  • 34b3b6e049 fix(server): add missing OpenTelemetry dependencies to avoid runtime import errors Jose Angel Morena 2025-06-04 11:15:40 +02:00
  • af0d6014c1
    fix: vllm starter name Sébastien Han 2025-06-04 11:55:26 +02:00
  • b30ddb1d2b fix: remove debug print accidentally merged Gordon Sim 2025-06-04 13:47:44 +01:00
  • db2fb7e3c4 feat: Add experimental integration tests with cached providers Derek Higgins 2025-05-14 14:41:25 +01:00
  • c4f644a1ea Remove remote::openai from openai_completion support Derek Higgins 2025-06-04 09:08:30 +01:00
  • 8afad3f63a refactor: unify stream and non-stream impls for responses Ashwin Bharambe 2025-06-03 16:51:06 -07:00
  • c4c67ac775 chore: cleanups from review feedback on openai api docs Ben Browning 2025-06-03 20:42:56 -04:00
  • 252249cf91 feat(responses): add more streaming response types Ashwin Bharambe 2025-06-03 15:41:07 -07:00
  • 7cd2a1c031 k8s Eric Huang 2025-06-03 12:09:01 -07:00
  • e45e4f947a bugfix Ashwin Bharambe 2025-06-03 12:00:51 -07:00
  • 471d40e80c kill verif template Ashwin Bharambe 2025-06-03 11:56:07 -07:00
  • 96cd51a0c8 Changes to access rule conditions: Gordon Sim 2025-05-29 20:21:20 +01:00
  • 528a391c5f feat(distro): add more providers to starter distro, prefix conflicting models Ashwin Bharambe 2025-06-03 11:43:19 -07:00
  • a8ba160852
    fix: resolve template name to config path in llama stack run Ignas Baranauskas 2025-06-03 19:18:38 +01:00
  • 032f92b3e1 feat: add postgres deps to starter distro Ashwin Bharambe 2025-06-03 10:57:08 -07:00
  • 7e30b5a466
    fix: remove sentence-transformers from remote vllm Sébastien Han 2025-06-03 18:00:27 +02:00
  • 643c0bb747 fix: Add fictional liquids to get_boiling_point Derek Higgins 2025-05-14 12:14:37 +01:00
  • 73ca0fb37a
    chore: remove torch dep from sentence-transformers Sébastien Han 2025-06-03 15:13:11 +02:00
  • 59830e5a22
    chore: return NotImplementedError instead of ValueError Sébastien Han 2025-06-03 15:11:54 +02:00
  • 9e757c433a Add missing dependencies in quickstart command Jorge Garcia Oncins 2025-06-03 14:28:08 +02:00
  • 0ea429c163 fix Ashwin Bharambe 2025-06-02 17:53:57 -07:00
  • badf8594d1 feat: Structured output for Responses API Ben Browning 2025-05-31 13:44:20 -04:00
  • c754d9af7a fixes Ashwin Bharambe 2025-06-02 16:31:24 -07:00
  • 48fdbf7188 fix: ollama chat completion needs unique ids Ben Browning 2025-06-02 18:59:30 -04:00
  • 375546ade3 fix Ashwin Bharambe 2025-06-02 16:07:13 -07:00
  • fd54727aef docs(k8s): add UI template Ashwin Bharambe 2025-06-02 16:04:06 -07:00
  • e7ab5a3649 chore: revert llama-stack-client dep Eric Huang 2025-06-02 16:03:46 -07:00
  • 8779f32e59 update api Ashwin Bharambe 2025-06-02 15:20:13 -07:00
  • dd9e0ec23b multi turn Eric Huang 2025-06-02 15:19:28 -07:00
  • 9011a156a7 files impl Eric Huang 2025-06-02 15:18:09 -07:00
  • 2d40ce2271 many fixes Ashwin Bharambe 2025-06-02 15:03:52 -07:00
  • 17e9b14ccf fix: remove openai dep Eric Huang 2025-06-02 14:38:26 -07:00
  • 021976713b fix is_function_tool_call Ashwin Bharambe 2025-06-02 14:36:37 -07:00
  • fd15a6832c feat(responses): implement full multi-turn support Ashwin Bharambe 2025-05-27 14:32:21 -07:00
  • 8dcdce317d docs(mcp): add a few lines for how to specify Auth headers in MCP tools Ashwin Bharambe 2025-06-02 13:58:58 -07:00
  • 3f43ad7c9e fix fireworks open ai compat endpoint Hardik Shah 2025-06-02 13:57:05 -07:00
  • f427e3092f add ingres Ashwin Bharambe 2025-06-02 13:06:44 -07:00
  • 44401f0a88
    fix ruff pre-commit Sumit Jaiswal 2025-06-02 23:49:55 +05:30
  • 3840ef7a98
    update the code with aysnc iterator as suggested by Ben Sumit Jaiswal 2025-06-02 23:49:08 +05:30
  • c69e52c262 feat: openai files api, api, response to string Eric Huang 2025-06-02 11:04:40 -07:00
  • 697338ec57 kill gp2 Ashwin Bharambe 2025-06-02 09:41:24 -07:00
  • 6e9e870cca Ensure use of string representation of event.name Michael Anstis 2025-06-02 14:58:36 +01:00