Commit graph

  • 050d4b8cc9 chore: rename ramalama provider Charlie Doern 2025-04-23 15:34:46 -04:00
  • cfa4b61a01 fix: Additional streaming error handling Ben Browning 2025-04-23 13:04:16 -04:00
  • fa5dfee07b fix: Return HTTP 400 for OpenAI API validation errors (#2002) Ben Browning 2025-04-23 11:48:32 -04:00
  • 6a44e7ba20 docs: add API to external providers table (#2006) Nathan Weinberg 2025-04-23 09:58:10 -04:00
  • 64f747fe09 feat: add tool name to chat output in playground (#1996) Michael Clifford 2025-04-23 09:57:54 -04:00
  • e9ee8eb812 feat: add tool name to chat output Michael Clifford 2025-04-18 15:10:03 -04:00
  • 52f465e967 docs: add API to external providers table Nathan Weinberg 2025-04-23 09:45:16 -04:00
  • dc46725f56 fix: properly handle streaming client disconnects (#2000) Ben Browning 2025-04-23 09:44:28 -04:00
  • e0fa67c81c docs: add examples for how to define RAG docs (#1981) Kevin Postlethwait 2025-04-23 09:39:18 -04:00
  • deee355952 fix: Added lazy initialization of the remote vLLM client to avoid issues with expired asyncio event loop (#1969) Ilya Kolchinsky 2025-04-23 15:33:19 +02:00
  • d39462d073 feat: Hide tool output under an expander in Playground UI (#2003) Ilya Kolchinsky 2025-04-23 15:32:12 +02:00
  • d6e88e0bc6 docs: add RamaLama to list of known external providers (#2004) Nathan Weinberg 2025-04-23 03:44:18 -04:00
  • af5d0b4c16 docs: add RamaLama to list of known external providers Nathan Weinberg 2025-04-22 14:51:25 -04:00
  • cf42b6f801 Playground UI: hide tool output under an expander widget. ilya-kolchinsky 2025-04-22 17:49:48 +02:00
  • 825ce39879 fix: Together provider shutdown and default to non-streaming (#2001) Ben Browning 2025-04-22 11:47:53 -04:00
  • 6fe64ee169 Including tool call in chat Derek Higgins 2025-04-10 13:20:09 +00:00
  • 922ac2d501 address disagreement between ruff versions Matthew Farrellee 2025-04-22 10:00:46 -04:00
  • 519afdde5f vllm unit test, check for exception on error Matthew Farrellee 2025-04-22 09:54:57 -04:00
  • c436269cd4 update ollama register_model to follow aliases Matthew Farrellee 2025-04-22 09:35:08 -04:00
  • 84351c2d67 flatten alias to provider map, test registering to existing alias Matthew Farrellee 2025-04-22 09:33:06 -04:00
  • 9982aa64f0 fix: allow lookup of models registered at runtime Matthew Farrellee 2025-04-16 15:51:08 -04:00
  • e4d001c4e4 feat: cleanup sidebar formatting on tools playground (#1998) Michael Clifford 2025-04-22 04:40:37 -04:00
  • 43d4b7527b fix: Return HTTP 400 for OpenAI API validation errors Ben Browning 2025-04-21 22:07:04 -04:00
  • 544a804678 fix: Together provider shutdown and default to non-streaming Ben Browning 2025-04-21 17:06:44 -04:00
  • 45ef6eac10 fix: properly handle streaming client disconnects Ben Browning 2025-04-21 16:10:45 -04:00
  • 3110ad1e7c fix: update ref to raw_errors due to new version of pydantic (#1995) Kevin Postlethwait 2025-04-21 14:50:12 -04:00
  • 602e949a46 fix: OpenAI Completions API and Fireworks (#1997) Ben Browning 2025-04-21 14:49:12 -04:00
  • 3e2c418524 RFC: Configuring search modes for RAG Varsha Prasad Narsing 2025-04-11 12:50:25 -07:00
  • 0d06c654d0 feat: Update NVIDIA to GA docs; remove notebook reference until ready (#1999) Jash Gulabrai 2025-04-18 19:13:18 -04:00
  • 9a1b93abb5 Update NVIDIA to GA docs; remove notebook reference until ready Jash Gulabrai 2025-04-18 17:01:51 -04:00
  • 8fd656dcac Add changes Jash Gulabrai 2025-04-18 16:28:04 -04:00
  • 4131e8146f Clean up instructions and implementation; reorganize notebooks Jash Gulabrai 2025-04-18 16:27:19 -04:00
  • 4ee828a277 feat: cleanup sidebar formatting on tools playground Michael Clifford 2025-04-18 16:15:20 -04:00
  • 1dcde0de67 fix: OpenAI Completions API and Fireworks Ben Browning 2025-04-18 15:57:02 -04:00
  • 909fc692ce add examples for how to define RAG docs Kevin 2025-04-16 21:39:37 -04:00
  • c9a41288a3 feat: RamaLama Documentation and Templates Daniel J Walsh 2025-02-11 13:47:13 -05:00
  • 4de45560bf feat: remote ramalama provider implementation Charlie Doern 2025-03-11 18:15:45 -04:00
  • 94f83382eb feat: allow building distro with external providers (#1967) Sébastien Han 2025-04-18 17:18:28 +02:00
  • 89dbf91323 feat: allow building distro with external providers Sébastien Han 2025-04-16 13:04:43 +02:00
  • c66245b4e7 update ref to raw_errors due to new version of pydantic Kevin 2025-04-18 10:19:17 -04:00
  • c4570bcb48 docs: Add tips for debugging remote vLLM provider (#1992) Yuan Tang 2025-04-18 08:47:47 -04:00
  • a2b7075fe9 More specific guidance Yuan Tang 2025-04-18 08:30:41 -04:00
  • 2b1620f8d8 regenerate Yuan Tang 2025-04-17 20:29:52 -04:00
  • ca43978809 docs: Add tips for debugging remote vLLM provider Yuan Tang 2025-04-17 20:28:24 -04:00
  • 9845631d51 feat: update nvidia inference provider to use model_store (#1988) Matthew Farrellee 2025-04-18 04:16:43 -04:00
  • e72b1076ca fix(build): add UBI 9 compiler tool‑chain (#1983) Alexey Rybak 2025-04-18 00:49:10 -07:00
  • 4c6b7005fa fix: Fix docs lint issues (#1993) Yuan Tang 2025-04-18 02:33:13 -04:00
  • 4b1aa2987f fix: Fix docs lint issues Yuan Tang 2025-04-17 20:33:04 -04:00
  • dd62a2388c docs: add notes to websearch tool and two extra example scripts (#1354) AN YU (安宇) 2025-04-18 01:20:52 +01:00
  • d4ec593a35 docs(build): clarify UBI9 compiler requirement and remove redundant test reluctantfuturist 2025-04-17 17:02:36 -07:00
  • 220da33402 add todo for schema validation Kevin 2025-04-17 19:35:06 -04:00
  • 0ed41aafbf test: add multi_image test (#1972) ehhuang 2025-04-17 12:51:42 -07:00
  • c4b754c9b2 tests: add multi_image test Eric Huang 2025-04-17 12:22:55 -07:00
  • 36de927fd6 add check for interleavedContent Kevin 2025-04-16 15:47:22 -04:00
  • d5abe2ec2e tests: add multi_image test Eric Huang 2025-04-17 11:38:32 -07:00
  • 2976b5d992 fix: OAI compat endpoint for meta reference inference provider (#1962) ehhuang 2025-04-17 11:16:04 -07:00
  • c407f3c340 Merge branch 'main' into add-watsonx-inference-adapter Sajikumar JS 2025-04-17 23:45:57 +05:30
  • efe5b124f3 pre-commit issues fix Sajikumar JS 2025-04-17 23:45:27 +05:30
  • 8e0217c7bf Fix reverted change to distro_type Jash Gulabrai 2025-04-17 14:14:12 -04:00
  • c171fc6062 fix: OAI compat endpoint for meta reference inference provider Eric Huang 2025-04-17 11:10:09 -07:00
  • 9303fa61c5 Remove file that was removed Jash Gulabrai 2025-04-17 14:03:28 -04:00
  • 397651b69e Merge branch 'main' into nvidia-eval-integration Jash Gulabrai 2025-04-17 14:02:55 -04:00
  • 92aac3ec4d Add back file that was removed Jash Gulabrai 2025-04-17 14:01:05 -04:00
  • 8bd6665775 chore(verification): update README and reorganize generate_report.py (#1978) ehhuang 2025-04-17 10:41:22 -07:00
  • 2117af25a7 Merge branch 'main' into nvidia-eval-integration Jash Gulabrai 2025-04-17 13:36:42 -04:00
  • c996ca6a64 chore(verification): update README and reorganize generate_report.py Eric Huang 2025-04-17 10:27:46 -07:00
  • 26393e2186 update nvidia inference provider to use model_store Matthew Farrellee 2025-04-17 10:43:39 -04:00
  • cb874287a4 fix: resync api spec (#1987) Sébastien Han 2025-04-17 17:36:04 +02:00
  • ceecd1c3e6 fix: resync api spec Sébastien Han 2025-04-17 17:34:10 +02:00
  • bb4ff1dd1f update nvidia inference provider to use model_store Matthew Farrellee 2025-04-17 10:43:39 -04:00
  • bfc960a691 Fallback to provided model in run_eval Jash Gulabrai 2025-04-17 10:39:09 -04:00
  • 0d9d333a4e Ensure sampling_params param is included in run_eval calls Jash Gulabrai 2025-04-17 10:23:21 -04:00
  • 326cbba579 feat(agents): add agent naming functionality (#1922) Alexey Rybak 2025-04-17 07:02:47 -07:00
  • 120987f9e1 Update llama_stack/apis/agents/agents.py raghotham 2025-04-17 07:02:12 -07:00
  • 5b8e75b392 fix: OpenAI spec cleanup for assistant requests (#1963) Ben Browning 2025-04-17 09:56:10 -04:00
  • 4205376653 chore: add meta/llama-3.3-70b-instruct as supported nvidia inference provider model (#1985) Matthew Farrellee 2025-04-17 09:50:40 -04:00
  • e466962e11 add meta/llama-3.3-70b-instruct as supported nvidia inference provider model Matthew Farrellee 2025-04-17 09:08:57 -04:00
  • 2ae1d7f4e6 docs: Add NVIDIA platform distro docs (#1971) Jash Gulabrai 2025-04-17 08:54:30 -04:00
  • 6c77d7f693 Add link to public docs Jash Gulabrai 2025-04-17 08:44:21 -04:00
  • a5ab39a497 fix: add missing openai_ methods Philippe Martin 2025-04-17 11:53:22 +02:00
  • 6f77ca1755 Update llama_stack/providers/remote/inference/vllm/vllm.py Ilya Kolchinsky 2025-04-17 11:35:41 +02:00
  • cfc68012f7 fix: use model-id Jeff MAURY 2025-04-03 19:06:17 +02:00
  • 0397c0cd44 fix: telemetry configuration Jeff MAURY 2025-03-26 18:16:33 +01:00
  • aa68e98b7a fix: rewording Jeff MAURY 2025-03-26 15:51:05 +01:00
  • dd86427ce3 feat: Podman AI Lab provider and distribution Jeff MAURY 2025-03-20 16:09:15 +01:00
  • 45e08ff417 fix: Handle case when Customizer Job status is unknown (#1965) Jash Gulabrai 2025-04-17 04:27:07 -04:00
  • 6f97f9a593 chore: Use hashes to pull actions for build-single-provider job (#1977) Ihar Hrachyshka 2025-04-17 04:26:08 -04:00
  • 8f57b08f2c fix(build): always pass path when no template/config provided (#1982) Alexey Rybak 2025-04-17 01:20:43 -07:00
  • 6ed92e03bc fix: print traceback on build failure (#1966) Sébastien Han 2025-04-17 09:45:21 +02:00
  • 64d8cde4a0 fix: print traceback on build failure Sébastien Han 2025-04-16 13:10:17 +02:00
  • f12011794b fix: Updated tools playground to allow vdb selection (#1960) Michael Clifford 2025-04-17 03:29:40 -04:00
  • 0de530e99f chore: revert CI workflow YAMLs to main versions reluctantfuturist 2025-04-17 00:10:26 -07:00
  • bbbff4309b test fix reluctantfuturist 2025-04-17 00:02:59 -07:00
  • 2396b1e13a fix(build): add UBI 9 compiler tool‑chain and split fast/slow tests (#1970) reluctantfuturist 2025-04-16 23:34:29 -07:00
  • 34a3f1a749 Merge branch 'main' into add-watsonx-inference-adapter Sajikumar JS 2025-04-17 10:43:38 +05:30
  • 48567ebfa3 fix(build): always pass build file path when no template/config is provided reluctantfuturist 2025-04-16 22:11:16 -07:00
  • 88aa70add7 Remove remote_hosted_distro nvidia doc Jash Gulabrai 2025-04-16 18:46:26 -04:00
  • b44f84ce18 test: disable flaky dataset (#1979) ehhuang 2025-04-16 15:33:37 -07:00
  • 138cff00b8 test: fix dataset Eric Huang 2025-04-16 15:19:55 -07:00
  • 7fd247b65a Change distro type to self_hosted Jash Gulabrai 2025-04-16 18:21:41 -04:00