Commit graph

  • 227760d7ec
    Update openai_compat.py Yuan Tang 2025-02-11 19:32:26 -05:00
  • 89c82dce4f feat: support listing all for llama stack list-providers Ihar Hrachyshka 2025-02-11 17:44:40 -05:00
  • 24385cfd03
    fix: filter out remote::sample providers when listing (#1057) Ihar Hrachyshka 2025-02-11 19:12:46 -05:00
  • ee98eb279b fix: filter out remote::sample providers when listing Ihar Hrachyshka 2025-02-11 18:58:42 -05:00
  • c9265524aa adding fiddlecube as a safety provider to a few distros Kaushik 2025-02-11 15:22:57 -08:00
  • e1ebeea950 add fiddlecube to the docs as a hosted safety provider Kaushik 2025-02-11 15:11:28 -08:00
  • 6562a40980 Revert "add fiddlecube to distros, update docs" Kaushik 2025-02-11 15:10:52 -08:00
  • d8a20e034b
    feat: make telemetry attributes be dict[str,PrimitiveType] (#1055) Dinesh Yeduguru 2025-02-11 15:10:17 -08:00
  • e23f005858 make telemetry attributes be dict[str,PrimitiveType] Dinesh Yeduguru 2025-02-11 15:00:03 -08:00
  • ab7f802698
    feat: add MetricResponseMixin to chat completion response types (#1050) Dinesh Yeduguru 2025-02-11 14:58:12 -08:00
  • b62f7e82bd address feedback Dinesh Yeduguru 2025-02-11 14:53:08 -08:00
  • 96c88397da
    fix: agent config validation (#1053) ehhuang 2025-02-11 14:48:42 -08:00
  • dec0687091 fix: agent config validation Eric Huang 2025-02-11 13:26:21 -08:00
  • 3978df54b6 add fiddlecube to distros, update docs Kaushik 2025-02-11 14:21:35 -08:00
  • 23f492100d nit Xi Yan 2025-02-11 14:18:13 -08:00
  • 6ad272927d
    docs: reflect actual number of spaces for indent (#1052) Ihar Hrachyshka 2025-02-11 17:07:26 -05:00
  • 6e63e461ce docs: reflect actual number of spaces for indent Ihar Hrachyshka 2025-02-11 17:04:14 -05:00
  • 183e9a08cc add MetricResponseMixin to chat completion response types Dinesh Yeduguru 2025-02-11 12:39:10 -08:00
  • afb81da91a feat: add optional metrics to all responses inject-metrics-response-v2 Dinesh Yeduguru 2025-02-11 10:36:27 -08:00
  • 71cae67d7b
    docs: remove changelog mention from PR template (#1049) Sébastien Han 2025-02-11 19:24:53 +01:00
  • 14a1a04b8e remove comments Dinesh Yeduguru 2025-02-11 09:29:47 -08:00
  • 3f432a279c
    build: remove changelog mention from PR template Sébastien Han 2025-02-11 17:54:07 +01:00
  • 2214de9e54 add metrics to all response types Dinesh Yeduguru 2025-02-11 07:55:47 -08:00
  • 947b811022 renaming VecImpl -> VectorIO and SQLiteVecVectorIOImpl -> SQLiteVecVectorIOAdapter Francisco Javier Arceo 2025-02-11 10:12:45 -05:00
  • d947ddd255
    docs: Updating wording and nits in the README.md (#992) Kelly Brown 2025-02-11 09:53:26 -05:00
  • f388778950 Renaming sqlite_vec to sqlite-vec in registry and extra comment Francisco Javier Arceo 2025-02-11 09:20:13 -05:00
  • 71c9063657 clean up Xi Yan 2025-02-10 21:55:36 -08:00
  • ba0b620532 all providers Xi Yan 2025-02-10 21:52:04 -08:00
  • aa04867d3a unit test + fireworks streaming Xi Yan 2025-02-10 21:47:10 -08:00
  • d954f2752e
    fix: Added missing tool_config arg in SambaNova chat_completion() (#1042) Yuan Tang 2025-02-11 00:20:50 -05:00
  • 0e10e38ee3
    fix: Added missing tool_config arg in SambaNova chat_completion() Yuan Tang 2025-02-11 00:07:24 -05:00
  • 0f062a15ec clean up fireworks non-stream Xi Yan 2025-02-10 20:37:13 -08:00
  • f4be05341f Fixed template provider to allow for both sqlite and faiss Francisco Javier Arceo 2025-02-10 22:46:41 -05:00
  • b34c1dd8ad
    test: replace blocked image URLs with GitHub-hosted (#1025) Sébastien Han 2025-02-11 04:38:11 +01:00
  • 34366f0b01
    Make utils non-public Yuan Tang 2025-02-10 21:54:36 -05:00
  • c9ef88854c do not popualte tool calls if it is not in request Xi Yan 2025-02-10 18:43:51 -08:00
  • 42d6e7e4a1
    Merge branch 'main' into fiddlecube-guard Kaushik Srinivasan 2025-02-10 18:14:45 -08:00
  • e419e81ef3 reverting changes to run.yaml for ollama Francisco Javier Arceo 2025-02-10 20:46:06 -05:00
  • f9087d3a56
    Merge branch 'meta-llama:main' into jwm4-add-qdrant-to-provider-tests Bill Murdock 2025-02-10 20:43:11 -05:00
  • c6aae0f81b fix: Add test support for qdrant provider Bill Murdock 2025-02-10 20:19:59 -05:00
  • 24c8824535 import httpx Kaushik 2025-02-10 17:06:31 -08:00
  • 553c1fcee0 reverting ollama changes Francisco Javier Arceo 2025-02-10 20:01:13 -05:00
  • b4c94c9066 add fiddlecube dependencies Kaushik 2025-02-10 16:57:11 -08:00
  • 319df22390 add safety violation code Kaushik 2025-02-10 15:51:51 -08:00
  • 49f7e04f83 adding prod url Kaushik 2025-02-10 15:30:48 -08:00
  • aea5b2745d adding the fiddlecube provider Kaushik 2025-02-06 20:01:31 -08:00
  • 3856927ee8
    fix: Update Qdrant support post-refactor (#1022) Bill Murdock 2025-02-10 18:08:33 -05:00
  • 36d35406a7
    fix: a bad newline in ollama docs (#1036) Ellis Tarn 2025-02-10 14:27:17 -08:00
  • 20ddc987c1 revert kvstore config Francisco Javier Arceo 2025-02-10 17:17:39 -05:00
  • 99a83f17de removing print statements and reverting some changing in routing tables Francisco Javier Arceo 2025-02-10 17:15:43 -05:00
  • b635175a87 fix: a bad newline in ollama docs Ellis Tarn 2025-02-10 13:58:05 -08:00
  • afca9d92f9
    fix: Readthedocs cannot parse comments, resulting in docs bugs (#1033) Ellis Tarn 2025-02-10 13:35:16 -08:00
  • 75a480267a removing some print logs Francisco Javier Arceo 2025-02-10 16:29:21 -05:00
  • 902baa91d2 fix: Readthedocs cannot parse comments, resulting in docs bugs Ellis Tarn 2025-02-10 11:13:00 -08:00
  • ab9516c789
    fix: Gaps in doc codegen (#1035) Ellis Tarn 2025-02-10 13:24:15 -08:00
  • 8fb182e760 fix: Gaps in doc codegen Ellis Tarn 2025-02-10 11:13:00 -08:00
  • af7748a4d5 feat: Adding sqlite-vec as vectordb Francisco Javier Arceo 2025-02-10 16:16:55 -05:00
  • b2a86532a2
    Handle response Yuan Tang 2025-02-10 15:23:09 -05:00
  • cc3bb0938a
    fix: Handle tool calling in remote vLLM provider Yuan Tang 2025-02-10 14:55:59 -05:00
  • 65ffcddd84 deprecation Xi Yan 2025-02-10 11:35:21 -08:00
  • 79e7253625 deprecation in OpenAPI spec Xi Yan 2025-02-10 11:21:51 -08:00
  • e013b9066c fix path Xi Yan 2025-02-10 10:47:28 -08:00
  • b11c38ea55 openapi Xi Yan 2025-02-10 09:41:21 -08:00
  • 5fe3ddb27d update eval_task_id -> task_id Xi Yan 2025-02-10 09:41:01 -08:00
  • f1844a88c4 update eval-tasks -> eval/task Xi Yan 2025-02-10 09:37:21 -08:00
  • 371f11a569
    build: update uv lock to sync package versions (#1026) Sébastien Han 2025-02-10 17:42:30 +01:00
  • 076213165c
    docs: update rag.md example code to prevent errors (#1009) Michael Clifford 2025-02-10 09:25:30 -05:00
  • a12e8b3641
    build: update uv lock to sync package versions Sébastien Han 2025-02-10 09:38:52 +01:00
  • 787e78d7d4
    chore: update return type to Optional[str] Sébastien Han 2025-02-06 13:45:38 +01:00
  • 3df95a07a0
    fix: replace blocked image URLs with GitHub-hosted Sébastien Han 2025-02-10 09:29:04 +01:00
  • 8186c88021
    docs: Render check marks correctly on PyPI (#1024) Yuan Tang 2025-02-09 22:26:36 -05:00
  • 24d012cb33
    docs: Render check marks correctly on PyPI Yuan Tang 2025-02-09 22:18:05 -05:00
  • eeae5f6cbc fix: Update Qdrant support post-refactor Bill Murdock 2025-02-09 17:02:00 -05:00
  • 162cfb280e added note of the image understanding working with LS 0.1.0 and 0.1.2 jeff/getting_started Jeff Tang 2025-02-09 09:27:15 -08:00
  • 44f1a4fd5c fix of the agent image understanding example error for LS 0.1.2 Jeff Tang 2025-02-09 09:24:15 -08:00
  • b981b49bfa
    test: Use JSON tool prompt format for remote::vllm provider (#1019) Yuan Tang 2025-02-08 23:42:57 -05:00
  • adb83e0465
    test: Use JSON tool prompt format for remote::vllm provider Yuan Tang 2025-02-08 23:18:39 -05:00
  • 80ba9deab1
    chore: Updated requirements.txt (#1017) Sarthak Deshpande 2025-02-09 01:20:35 +05:30
  • 413099ef6a
    test: Make text-based chat completion tests run 10x faster (#1016) Yuan Tang 2025-02-08 14:49:46 -05:00
  • aaa624ed64 chore: added pypdf and chardet to requirements.txt sarthakdeshpande 2025-02-08 15:30:53 +05:30
  • 5e393f851d Updated requirements.txt sarthakdeshpande 2025-02-08 15:18:04 +05:30
  • ba33494aee
    edits Yuan Tang 2025-02-07 23:18:24 -05:00
  • 6dea61609d
    test: Make text-based chat completion tests run faster Yuan Tang 2025-02-07 22:35:22 -05:00
  • 7766e68e92
    docs: update index.md for 0.1.2 (#1013) raghotham 2025-02-07 15:36:20 -08:00
  • a229de6d1e
    Getting started notebook update (#936) Jeff Tang 2025-02-07 15:36:15 -08:00
  • 533e39bd08
    Update index.md for 0.1.2 raghotham 2025-02-07 15:30:48 -08:00
  • ddd06105a4 Bump version to 0.1.2 v0.1.2 github-actions[bot] 2025-02-07 21:52:50 +00:00
  • c335ed8765 raise when client initialize fails v0.1.2rc4 Hardik Shah 2025-02-07 12:24:07 -08:00
  • 62e5461da7 No spaces in ipynb tests Ashwin Bharambe 2025-02-07 11:56:22 -08:00
  • a8820597ee Minor clean up of notebook Ashwin Bharambe 2025-02-07 11:36:29 -08:00
  • 54c2513c1b update rag.md example code Michael Clifford 2025-02-07 11:46:05 -05:00
  • 04174bafa0 fix: Clarify llama model prompt-format help text Alina Ryan 2025-02-07 13:10:00 -05:00
  • 10bda65b94 Nuke use_proxy from code execution v0.1.2rc3 Ashwin Bharambe 2025-02-07 09:55:48 -08:00
  • 316c43fdaf
    refactor(ollama): model availability check (#986) Sébastien Han 2025-02-07 18:52:16 +01:00
  • 2a4a612373
    fix: Ensure a better error stack trace when llama-stack is not built (#950) Charlie Doern 2025-02-07 12:47:02 -05:00
  • 2493eb794f
    Avoid error catching Ashwin Bharambe 2025-02-07 09:46:02 -08:00
  • 0b7098493a
    test: encode image data as base64 (#1003) Sébastien Han 2025-02-07 18:44:16 +01:00
  • f8f2f7f9bb
    feat: Add HTTPS serving option (#1000) Ashwin Bharambe 2025-02-07 09:39:08 -08:00
  • c97e05f75e
    test: Split inference tests to text and vision (#1008) Yuan Tang 2025-02-07 12:35:49 -05:00
  • a9950ce806
    test: remove flaky agent test (#1006) ehhuang 2025-02-07 09:35:38 -08:00