Commit graph

  • 88cdab5105 Clarify bash requirement in install flow dltn 2025-06-14 03:56:56 +00:00
  • 200f10b093 test: add test for PaginatedResponse url field population in dynamic routes Rohan Awhad 2025-06-13 19:48:54 -04:00
  • 985d0b156c
    feat: Add suffix to openai_completions (#2449) Hardik Shah 2025-06-13 16:06:06 -07:00
  • 2edb9eb7e0
    Merge branch 'main' into suffic Hardik Shah 2025-06-13 16:03:50 -07:00
  • 2e10617c6a removed test added by mistake Hardik Shah 2025-06-13 15:53:44 -07:00
  • 1cfb5b1205 add suffix to openai.completions Hardik Shah 2025-06-13 15:39:45 -07:00
  • 2e8054bede
    feat: Implement hybrid search in SQLite-vec (#2312) Varsha 2025-06-13 12:54:06 -07:00
  • 6ea5c10d48 feat: Introduce weighted and rrf reranker implementations Varsha Prasad Narsing 2025-06-04 15:59:44 -07:00
  • eab85a7121 feat: Implement hybrid search in SQLite-vec Varsha Prasad Narsing 2025-05-29 14:32:54 -07:00
  • 941f505eb0
    feat: File search tool for Responses API (#2426) Ben Browning 2025-06-13 14:32:48 -04:00
  • 554ada57b0
    chore: Add OpenAI compatibility for Ollama embeddings (#2440) Francisco Arceo 2025-06-13 12:28:51 -06:00
  • 5ef31f6752 removing the base64 encoding Francisco Javier Arceo 2025-06-13 14:13:20 -04:00
  • 7a71d9ebd8 sqlite-vec support for Responses file_search Ben Browning 2025-06-13 14:01:34 -04:00
  • ec09524a91 Fix typo in llama_experts_pdf test case naming Ben Browning 2025-06-13 13:41:37 -04:00
  • b5047db685
    Merge branch 'main' into feat/add-url-to-paginated-response Rohan Awhad 2025-06-13 13:07:45 -04:00
  • e0f1788e9e refactoring some code into openai_compat Francisco Javier Arceo 2025-06-13 12:06:58 -04:00
  • 46da232e83 Make Files API an optional dependency of faiss provider Ben Browning 2025-06-12 08:51:41 -04:00
  • 8a5ea57253 Responses file_search wire up additional params Ben Browning 2025-06-12 07:05:58 -04:00
  • 788d34d8b4 Move vector file attach code to OpenAIVectorStoreMixin Ben Browning 2025-06-11 20:06:30 -04:00
  • 055885bd5a Add pdf support to file_search for Responses API Ben Browning 2025-06-11 16:45:28 -04:00
  • 57eccf023d Test responses queries of empty vector stores Ben Browning 2025-06-11 11:42:07 -04:00
  • de84ee0748 Implement attaching files to vector stores Ben Browning 2025-06-11 11:14:38 -04:00
  • 8ede67b809 More work on file_search verification test Ben Browning 2025-06-11 09:28:33 -04:00
  • fa34468308 feat: File search tool for Responses API Ben Browning 2025-06-10 12:42:09 -04:00
  • 89d4a05303
    Merge branch 'main' into ollama-embeddings Francisco Arceo 2025-06-13 04:11:26 -06:00
  • f63520630e
    fix pre-commit ci Sumit Jaiswal 2025-06-13 15:28:56 +05:30
  • 6cffb6e871 fixing type Francisco Javier Arceo 2025-06-13 05:57:09 -04:00
  • eb05b0f3a3
    to fix watsonx unit test failure Sumit Jaiswal 2025-06-13 15:03:14 +05:30
  • 04825e854c
    fix var name Sumit Jaiswal 2025-06-04 20:49:13 +05:30
  • 61cd8540de
    fix review comment against health check env var naming Sumit Jaiswal 2025-06-04 20:47:58 +05:30
  • f235e2eca1
    update tests based on logic update Sumit Jaiswal 2025-06-04 12:42:40 +05:30
  • 1a4ae92502
    update querying logic to avoid token use Sumit Jaiswal 2025-06-04 12:34:27 +05:30
  • 8de3feb53b
    PR to implement watsonx health check Sumit Jaiswal 2025-06-01 18:10:32 +05:30
  • e2e15ebb6c
    feat(auth): allow token to be provided for use against jwks endpoint (#2394) grs 2025-06-13 04:13:41 -04:00
  • ddaee42650
    test: Update integration-tests.yml (#2443) Hardik Shah 2025-06-13 01:04:08 -07:00
  • 8fff7ebdca
    dropping files since the api is not included in the distro Hardik Shah 2025-06-12 21:55:43 -07:00
  • 179399edf4 fix for model in OpenAIEmbeddingsResponse and rebase Francisco Javier Arceo 2025-06-12 22:05:54 -04:00
  • 15042c6c31 chore: Add OpenAI compatibility for Ollama embeddings Francisco Javier Arceo 2025-06-12 16:31:54 -04:00
  • 834770e521
    Update integration-tests.yml Hardik Shah 2025-06-12 16:37:36 -07:00
  • fef670b024
    feat: update openai tests to work with both clients (#2442) Hardik Shah 2025-06-12 16:30:23 -07:00
  • cebca78f13 updated tests to work with both clients Hardik Shah 2025-06-12 16:23:35 -07:00
  • 0bc1747ed8
    feat: update search for vector_stores (#2441) Hardik Shah 2025-06-12 15:34:22 -07:00
  • 7893641de1
    Merge branch 'main' into vest_search Hardik Shah 2025-06-12 15:32:41 -07:00
  • d54c58c8dd update tests to ignore with library client Hardik Shah 2025-06-12 15:31:27 -07:00
  • f8b85c2176 update vector_stores.search Hardik Shah 2025-06-12 15:19:30 -07:00
  • ce48d47543 feat: DistributedJobScheduler Charlie Doern 2025-06-12 13:59:06 -04:00
  • 6494658a10 feat: add finetune_multi_device recipe with fsdp support Charlie Doern 2025-05-19 13:21:35 -04:00
  • 35c2817d0a
    fix(weaviate): handle case where distance is 0 by setting score to infinity (#2415) Ibrahim Haroon 2025-06-12 11:23:59 -04:00
  • eb04731750
    ci: fix external provider test (#2438) Sébastien Han 2025-06-12 16:14:32 +02:00
  • 69138957d8
    ci: fix external provider test Sébastien Han 2025-06-12 15:18:28 +02:00
  • de37a04c3e
    fix: set appropriate defaults for params (#2434) Hardik Shah 2025-06-11 17:30:34 -07:00
  • 1fd6bf6da9 Release candidate 0.0.0.dev20250612002501 rc-0.0.0.dev20250612002501 github-actions[bot] 2025-06-12 00:25:50 +00:00
  • 2fb82c9ed0 fixed None to default of 10 Hardik Shah 2025-06-11 17:23:07 -07:00
  • 764026c0d6 set defaults so that stainless does not pick as required params Hardik Shah 2025-06-11 17:19:33 -07:00
  • d55100d9b7
    feat: OpenAIVectorIOMixin for vector_stores common logic (#2427) Hardik Shah 2025-06-11 15:40:57 -07:00
  • d1ba53f257
    Merge branch 'main' into oai_mixin Hardik Shah 2025-06-11 15:31:34 -07:00
  • 3a8adf0c39 vector store name should be mandatory Hardik Shah 2025-06-11 15:28:24 -07:00
  • 4e37b49cdc
    fix: #1867 InferenceRouter has no attribute formatter (#2422) Rohan Awhad 2025-06-11 12:14:41 -04:00
  • f042df3844
    Merge branch 'main' into feat/add-url-to-paginated-response Rohan Awhad 2025-06-11 06:32:40 -04:00
  • 3d1045fb16
    Merge branch 'main' into bug/inference-router-no-attribute-formatter Rohan Awhad 2025-06-11 06:32:03 -04:00
  • 99981fc707 Release candidate 0.0.0.dev20250611002524 rc-0.0.0.dev20250611002524 github-actions[bot] 2025-06-11 00:26:13 +00:00
  • cf4bac1149 introduce OpenAI Mixin class for VectorIO Hardik Shah 2025-06-10 13:10:08 -07:00
  • 5ac43268e8
    feat: Add OpenAI compat /v1/vector_store APIs (#2423) Hardik Shah 2025-06-10 13:07:39 -07:00
  • ee57e58f29
    fix: loosen tool call checks in inference store (#2420) Ben Browning 2025-06-10 08:45:55 -04:00
  • b45c650063 Add delete_openai_response route, define delete OpenAI message schema and make an integration test 2000krysztof 2025-05-28 15:53:32 +01:00
  • 5639ad7466
    docs: Add recent releases (#2424) Yuan Tang 2025-06-09 22:13:02 -05:00
  • f6718b2408
    fix(security): Upgrade requests to 2.32.4. Fixes CVE-2024-47081 (#2425) Yuan Tang 2025-06-09 22:03:28 -05:00
  • 34832414b4 Release candidate 0.0.0.dev20250610002515 rc-0.0.0.dev20250610002515 github-actions[bot] 2025-06-10 00:26:01 +00:00
  • 3d72a73f8a
    fix(security): Upgrade requests to 2.32.4. Fixes CVE-2024-47081 Yuan Tang 2025-06-09 19:18:49 -04:00
  • 1299bfa16f
    docs: Add recent releases Yuan Tang 2025-06-09 19:14:26 -04:00
  • 1a888a6bfe Update sqlite-vec provider to support OpenAI vector store apis Hardik Shah 2025-06-09 14:38:26 -07:00
  • b55f1249e0 Add OpenAI compat /v1/vector_store APIs Hardik Shah 2025-06-09 14:01:11 -07:00
  • 38faf57db6 Fixes: #1867 InferenceRouter has no attribute formatter Rohan Awhad 2025-06-09 15:49:41 -04:00
  • adc373600e test: added integeration test that queries for identical vectors and verifies no divide by zero exception occurs Ibrahim Haroon 2025-06-09 09:09:51 -04:00
  • f5d2108191 fix: handle case where distance is 0 by setting score to infinity Ibrahim Haroon 2025-06-06 15:33:05 -04:00
  • 91a5b1d921
    Merge branch 'main' into feat/add-url-to-paginated-response Rohan Awhad 2025-06-09 09:04:53 -04:00
  • dbe4f745e5 Release candidate 0.0.0.dev20250609002706 rc-0.0.0.dev20250609002706 github-actions[bot] 2025-06-09 00:27:49 +00:00
  • cd7a246050 Release candidate 0.0.0.dev20250608002809 rc-0.0.0.dev20250608002809 github-actions[bot] 2025-06-08 00:28:54 +00:00
  • 06be99f1a2 fix: loosen tool call checks in inference store Ben Browning 2025-06-07 19:47:51 -04:00
  • 28ca00d0d9
    fix(pgvector): handle case where distance is 0 by setting score to infinity (#2416) Ibrahim Haroon 2025-06-07 16:31:30 -04:00
  • a34cef925b
    fix(faiss): handle case where distance is 0 by setting d to minimum positive… (#2387) Ibrahim Haroon 2025-06-07 16:09:46 -04:00
  • e4d7651ffa Don't downgrade llama-stack version in uv.lock Ben Browning 2025-06-07 15:58:55 -04:00
  • e28a57ad76 feat: Add url field to PaginatedResponse and populate it using route path Rohan Awhad 2025-06-06 22:24:30 -04:00
  • 75d9d0ad74 Release candidate 0.0.0.dev20250607002501 rc-0.0.0.dev20250607002501 github-actions[bot] 2025-06-07 00:26:30 +00:00
  • 6f8312ddd0 fix: handle case where distance is 0 by setting score to infinity Ibrahim Haroon 2025-06-06 15:31:25 -04:00
  • 33ecefd284
    feat: To add health status check for remote VLLM (#2303) Sumit Jaiswal 2025-06-07 01:03:12 +05:30
  • 379b6530ee build: Bump version to 0.2.10.1 v0.2.10.1 release-0.2.10.1 github-actions[bot] 2025-06-06 18:54:51 +00:00
  • 32c651e3a7
    chore: update CODEOWNERS (#2414) Alexey Rybak 2025-06-06 11:35:15 -07:00
  • 10d4c5a80f Release candidate 0.2.10.1rc1 v0.2.10.1rc1 rc-0.2.10.1rc1 github-actions[bot] 2025-06-06 18:32:20 +00:00
  • 9239e09181 chore: update CODEOWNERS Alexey Rybak 2025-06-06 11:30:34 -07:00
  • 1f48577a02
    fix: ChromaDB provider (#2413) Hardik Shah 2025-06-06 11:25:58 -07:00
  • ea110857df update chroma interface to match new spec Hardik Shah 2025-06-06 11:11:36 -07:00
  • b05a3db358
    Merge branch 'main' into fix/divide-by-zero-exception-faiss-query-vector Ibrahim Haroon 2025-06-06 11:29:14 -04:00
  • 57494665e1 buid: test_faiss.py requires faiss-cpu Ibrahim Haroon 2025-06-06 10:33:23 -04:00
  • 1a492ad0cc Merge branch 'main' into nvidia-e2e-notebook Jash Gulabrai 2025-06-06 11:11:53 -04:00
  • 6b6d8d70a5 Remove print statement Jash Gulabrai 2025-06-06 10:43:24 -04:00
  • 6346024fa3 fix: Don't reuse session in NVIDIA post_training request handler Jash Gulabrai 2025-06-06 10:42:10 -04:00
  • 4b32cfa846 chore: fixed formatting issues Ibrahim Haroon 2025-06-06 10:32:39 -04:00
  • 0d0b8d2be1
    ci: use ollama container image with loaded models (#2410) Sébastien Han 2025-06-06 12:08:20 +02:00
  • c8b5774ff3
    ci: use ollama container image with loaded models Sébastien Han 2025-06-06 09:54:39 +02:00