Commit graph

  • b8d2e33955 Update Eval ReadME Jash Gulabrai 2025-04-15 14:01:01 -04:00
  • 970ff974f2 Sync with main Jash Gulabrai 2025-04-15 13:39:40 -04:00
  • 72711287ec Merge branch 'main' into nvidia-eval-integration Jash Gulabrai 2025-04-15 13:36:42 -04:00
  • 841d8fdf4f Add back nvidia docs Jash Gulabrai 2025-04-15 13:20:16 -04:00
  • 14420e880a Fix MODEL_ENTRIES import Jash Gulabrai 2025-04-15 13:03:36 -04:00
  • 5f2f838656 fix: ensure run_eval accepts model alias and converts to nvidia model ID Jash Gulabrai 2025-04-15 12:56:55 -04:00
  • 0f50cfa561 feat(api): define a more coherent jobs api across different flows Ihar Hrachyshka 2025-03-24 20:54:04 -04:00
  • 538d601472 Do not send an empty 'tools' param to remote vllm Daniel Alvarez 2025-04-15 16:06:51 +02:00
  • daf0c26420 Merge remote-tracking branch 'refs/remotes/origin/feat/litellm_sambanova_usage' into feat/litellm_sambanova_usage jhpiedrahitao 2025-04-15 10:27:52 -05:00
  • 63e3c5812d update sambanova models jhpiedrahitao 2025-04-15 10:26:57 -05:00
  • 241a42bb26 docs: add example for intel gpu in vllm remote Dmitry Rogozhkin 2025-04-11 19:00:48 +00:00
  • 95619892ea Revert model in ModelCandidate to type string Jash Gulabrai 2025-04-15 09:47:58 -04:00
  • 7cdd2a0410 Merge branch 'main' into nvidia-e2e-notebook Jash Gulabrai 2025-04-15 08:38:41 -04:00
  • 581ae9f393 ci: add test to build a distro with a single provider Sébastien Han 2025-04-15 14:05:09 +02:00
  • 35dd1c27d3 issue fix for doc_template Sajikumar JS 2025-04-15 14:52:42 +05:30
  • 62d426d552 updated code for workflow issues Sajikumar JS 2025-04-15 14:00:20 +05:30
  • ebf994475d Merge branch 'main' into add-watsonx-inference-adapter Sajikumar JS 2025-04-15 11:47:56 +05:30
  • f27f617629 test(verification): more tests, multiturn Eric Huang 2025-04-14 18:20:08 -07:00
  • 4a7b26042d add max_token slider to playground tools page Michael Clifford 2025-04-14 17:21:27 -04:00
  • 488eb8f249 Merge branch 'main' into feat/litellm_sambanova_usage Jorge Piedrahita Ortiz 2025-04-14 12:15:44 -05:00
  • ff468d91ce fix: apply pre-commit formatting Peter Double 2025-04-14 12:35:32 -04:00
  • dd808a8c1e Merge branch 'meta-llama:main' into feat/litellm_sambanova_usage Jorge Piedrahita Ortiz 2025-04-14 08:51:59 -05:00
  • 383ce4f9ed chore(github-deps): bump astral-sh/setup-uv from 5.4.0 to 5.4.1 dependabot[bot] 2025-04-05 20:09:11 +00:00
  • f60a6e76d9 Add documentation for postgresql uri Josh Salomon 2025-04-09 21:46:29 +03:00
  • e244a692ed Add remote::postgresql dataset provider to the registry Josh Salomon 2025-04-09 21:33:22 +03:00
  • 73b63ea318 Add remote::postgresql to build files Josh Salomon 2025-04-09 13:21:23 +03:00
  • 8e3b579df2 Initial commit for postgresql dataset provider Josh Salomon 2025-03-26 12:15:40 +02:00
  • c636af9a7c docs: resync missing nvidia doc Sébastien Han 2025-04-14 11:58:23 +02:00
  • a53814f835 feat: add health to all providers through providers endpoint Sébastien Han 2025-03-10 12:08:11 +01:00
  • 8a1c0a1008 Improve groq OpenAI API compatibility Ben Browning 2025-04-13 13:35:53 -04:00
  • 657bb12e85 Get fireworks provider to 100% on OpenAI API verification Ben Browning 2025-04-13 10:45:32 -04:00
  • da2d39a836 Handle chunks with null text in test_openai_completion.py Ben Browning 2025-04-12 17:40:47 -04:00
  • c014571258 fix: OpenAI API - together.ai extra usage chunks Ben Browning 2025-04-12 17:27:43 -04:00
  • a4b573d750 Fix OpenAI API response format handling Ben Browning 2025-04-12 16:29:02 -04:00
  • 1e673010e4 fix: OpenAI API chat completion messages with image_url Ben Browning 2025-04-12 14:51:39 -04:00
  • ca26faa7fd build: Bump version to 0.2.2 v0.2.2 release-0.2.2 github-actions[bot] 2025-04-13 01:07:44 +00:00
  • 1079e22b11 Release candidate 0.2.2rc1 v0.2.2rc1 github-actions[bot] 2025-04-13 00:54:05 +00:00
  • 0e21c33b47 feat: support '-' in tool names Eric Huang 2025-04-12 14:15:31 -07:00
  • 399a68d246 Update README.md Yuan Tang 2025-04-12 15:44:59 -04:00
  • 14ff4c647c include content in the message even if you have parsed out a tool call Ashwin Bharambe 2025-04-12 11:23:25 -07:00
  • 771daa4b91 fix test, fix llama3 generator Ashwin Bharambe 2025-04-12 10:51:43 -07:00
  • a3b921a5a8 update integration test workflow Matthew Farrellee 2025-04-01 16:29:15 -04:00
  • 172a918fe3 Merge branch 'main' into feat/litellm_sambanova_usage jhpiedrahitao 2025-04-11 19:28:02 -05:00
  • 07488dbfb6 Update README.md Yuan Tang 2025-04-11 20:19:34 -04:00
  • a3cee70014 kill experimental attr on webmethod Ashwin Bharambe 2025-04-11 17:13:46 -07:00
  • 1d855461d5 kill batch inference registry Ashwin Bharambe 2025-04-11 16:21:21 -07:00
  • 73d927850e updates Ashwin Bharambe 2025-04-11 16:15:59 -07:00
  • 0cfb2e2473 feat: add batch inference API to llama stack inference Ashwin Bharambe 2025-04-08 13:50:52 -07:00
  • 43993cc29c Merge branch 'main' into nvidia-eval-integration Jash Gulabrai 2025-04-11 17:28:26 -04:00
  • b5438c9c82 Update docs/source/getting_started/index.md raghotham 2025-04-11 13:07:58 -07:00
  • 8253d44c5c docs fixes Raghotham Murthy 2025-04-11 12:48:39 -07:00
  • 3465565df1 fix: ensure resource registration arguments are typed Matthew Farrellee 2025-04-11 11:24:36 -04:00
  • 6cf036f52e Add a direct (non-agentic) RAG option to the Playground UI. ilya-kolchinsky 2025-04-11 16:14:41 +02:00
  • c2d23ddd75 fix: remove extra sft args in NvidiaPostTrainingAdapter Ben Browning 2025-04-11 09:46:16 -04:00
  • dae8fd0a36 update start-stack.sh with missing color and if statment logic Aidan Reilly 2025-04-11 10:30:12 +01:00
  • 1322bb9bf7 Merge branch 'refs/heads/main' into rag-demo ilya-kolchinsky 2025-04-11 13:55:38 +02:00
  • d40d3a9b31 docs: Move Llama 4 instructions in a collapsed section Yuan Tang 2025-04-10 22:32:31 -04:00
  • be112fad4f changed copy Francisco Javier Arceo 2025-04-10 22:15:00 -04:00
  • 59861a4ea5 docs: Updated docs to show minimal RAG example and some other minor changes Francisco Javier Arceo 2025-04-10 22:08:05 -04:00
  • 0e5574cf9d feat: allow ollama to use 'latest' if available but not specified Nathan Weinberg 2025-04-08 15:56:19 -04:00
  • d402623f96 fix: misleading help text for 'llama stack build' and 'llama stack run' Nathan Weinberg 2025-04-09 10:17:05 -04:00
  • 2f67f67b43 text(verification): overwrite test result instead of creating new ones Eric Huang 2025-04-10 16:52:52 -07:00
  • 373e392b10 test(verification): add streaming tool calling test Eric Huang 2025-04-10 16:23:11 -07:00
  • 913e9679c2 docs: update tmp directory creation Bobbins228 2025-04-09 17:05:21 +01:00
  • 1a065c7d63 docs: alter hf token substitution Bobbins228 2025-04-09 16:58:35 +01:00
  • 6735344604 docs: fix errors in kubernetes deployment guide Bobbins228 2025-04-09 16:33:36 +01:00
  • d7c976c6d2 Merge branch 'main' into docs-4 Francisco Arceo 2025-04-10 14:15:54 -06:00
  • 82b485b177 Added unit tests for the query() method. ilya-kolchinsky 2025-04-10 21:40:32 +02:00
  • 31181c070b Fireworks provider support for OpenAI API endpoints Ben Browning 2025-04-10 15:29:32 -04:00
  • 9120e07d9d Add support for RamaLama Daniel J Walsh 2025-02-11 13:47:13 -05:00
  • ffae192540 Bug fixes for together.ai OpenAI endpoints Ben Browning 2025-04-10 14:19:48 -04:00
  • a5827f7cb3 Nvidia provider support for OpenAI API endpoints Ben Browning 2025-04-10 13:43:28 -04:00
  • afa1082813 fix: Fix linter failure Francisco Javier Arceo 2025-04-10 13:34:22 -04:00
  • 178a5c3b93 moved the test from test_telemetry to test_agents reluctantfuturist 2025-04-10 10:30:10 -07:00
  • 13c660f5a5 Merge branch 'meta-llama:main' into feat/litellm_sambanova_usage Jorge Piedrahita Ortiz 2025-04-10 11:01:51 -05:00
  • 1a76c55df4 fix: Use NAMESPACE global variable Jash Gulabrai 2025-04-10 11:31:56 -04:00
  • 84e85e824a Add high-level instructions Jash Gulabrai 2025-04-10 11:14:17 -04:00
  • 7faec2380a Clear notebook output Jash Gulabrai 2025-04-10 10:58:11 -04:00
  • a671b33589 Add back Guardrails section Jash Gulabrai 2025-04-10 10:57:25 -04:00
  • 76aa2782a8 fix: Fix URL path in POST request helper Jash Gulabrai 2025-04-10 10:29:03 -04:00
  • ed1b24f59a doc: Updating background color for code in darkmode Francisco Javier Arceo 2025-04-10 09:20:34 -04:00
  • d8ccc32d67 1) Recreate the agent upon a change in the settings. 2) When mid-session, disable the widgets triggering the change in the settings. ilya-kolchinsky 2025-04-10 13:31:32 +02:00
  • ec9e4116d5 docs: fix model name Sébastien Han 2025-04-10 12:10:57 +02:00
  • 609a8d63d9 fix: use torchao 0.8.0 for inference Sébastien Han 2025-04-10 10:35:13 +02:00
  • be527ba711 move models, model display name, case, reorg config Eric Huang 2025-04-09 22:56:01 -07:00
  • 33117e3012 Updated CoreModelId to get from sku_types Sajikumar JS 2025-04-10 10:17:43 +05:30
  • 47d919333a Merge branch 'main' into add-watsonx-inference-adapter Sajikumar JS 2025-04-10 10:17:08 +05:30
  • 5ffe6cee36 small copy change Francisco Javier Arceo 2025-04-09 23:17:29 -04:00
  • 5bdd767e8d added tabs for the tutorial output and rephrased thing based on feedback Francisco Javier Arceo 2025-04-09 23:16:52 -04:00
  • 57813f5606 Updates to notebook; use direct requests to NeMo where needed Jash Gulabrai 2025-04-09 23:03:34 -04:00
  • 098a09cfa3 docs: Redirect instructions for additional hardware accelerators for remote vLLM Yuan Tang 2025-04-09 21:18:28 -04:00
  • 6a5b73ca7c feat(agents): add agent naming functionality reluctantfuturist 2025-04-09 16:22:00 -07:00
  • 5ecedc12e7 clean up jiawenliu64 2025-04-09 14:55:30 -07:00
  • 8f5cd49159 vllm prompt_logprobs can also be 0 Ben Browning 2025-04-09 17:32:03 -04:00
  • 0961987962 adding server example back in and restructuring steps Francisco Javier Arceo 2025-04-09 17:01:52 -04:00
  • c8a0b110c0 fix: on-the-fly int4 quantize parameter jiawenliu64 2025-04-09 13:35:11 -07:00
  • 8d10556ce3 Add basic tests for OpenAI Chat Completions API Ben Browning 2025-04-09 16:18:13 -04:00
  • 7840a53a12 fix: Fix paths in Eval helper functions; update ModelCandidate to support Evals that use chat datasets Jash Gulabrai 2025-04-09 15:48:46 -04:00
  • ac5dc8fae2 Add prompt_logprobs and guided_choice to OpenAI completions Ben Browning 2025-04-09 15:43:53 -04:00
  • ef684ff178 Fix openai_completion tests for ollama Ben Browning 2025-04-09 15:22:52 -04:00