Commit graph

  • 94614b26c3 Add docs for Inference and Safety Jash Gulabrai 2025-04-16 18:18:29 -04:00
  • 0d3dd3c558 chore: Use hashes to pull actions for build-single-provider job Ihar Hrachyshka 2025-04-16 18:07:40 -04:00
  • 30fc66923b fix: Add llama-3.2-1b-instruct to NVIDIA fine-tuned model list (#1975) Jash Gulabrai 2025-04-16 18:02:08 -04:00
  • ccd90b2fa0 fix: Add llama-3.2-1b-instruct to NVIDIA fine-tuned model list Jash Gulabrai 2025-04-16 17:19:50 -04:00
  • 00b232c282 chore: Fix to persist the theme preference across page navigation. (#1974) Francisco Arceo 2025-04-16 14:58:25 -06:00
  • 0bd3912ebd chore: Fix for theme change to persist after switching theme toggle Francisco Javier Arceo 2025-04-16 16:15:44 -04:00
  • 931ee14a88 Add reference to e2e notebook Jash Gulabrai 2025-04-16 13:08:03 -04:00
  • b54c0c61aa updated tools playground to allow vdb selection Michael Clifford 2025-04-15 14:39:40 -04:00
  • 8d9c35f054 fix tests reluctantfuturist 2025-04-16 09:07:56 -07:00
  • 0cd7f5dfe5 feat: Add NVIDIA docs Jash Gulabrai 2025-04-16 11:29:24 -04:00
  • 6a0ee7180b Added a comment explaining the client handling in register_model. ilya-kolchinsky 2025-04-16 15:29:23 +02:00
  • 0eaf5c21c3 update the env parsing logic Rashmi Pawar 2025-04-16 17:27:01 +05:30
  • f1fd382d51 Remote vLLM client is now initialized directly prior to first use instead of VLLMInferenceAdapter.initialize. ilya-kolchinsky 2025-04-16 13:54:51 +02:00
  • 38ff88ba64 fix linting post merge Rashmi Pawar 2025-04-16 14:38:20 +05:30
  • afb792b9c1 Merge branch 'main' into register_custom_model Rashmi Pawar 2025-04-16 14:35:51 +05:30
  • 1d960bacdc add version env variable Rashmi Pawar 2025-04-16 14:33:41 +05:30
  • 436b3269dc update readme model registration code Rashmi Pawar 2025-04-16 14:22:52 +05:30
  • 6c2153a6d0 linting changes Rashmi Pawar 2025-04-16 14:12:15 +05:30
  • 8000b0287f Merge branch 'main' into register_custom_model Rashmi Pawar 2025-04-16 13:47:33 +05:30
  • 75f7789de9 fix typo Jash Gulabrai 2025-04-16 00:07:42 -04:00
  • 48f952fc62 fix: Handle case when Customizer Job status is unknown Jash Gulabrai 2025-04-16 00:07:14 -04:00
  • 6927cdf5ce feat: NVIDIA beginner e2e notebook Jash Gulabrai 2025-04-15 23:26:38 -04:00
  • a411746df3 fix: OpenAI spec cleanup for assistant requests Ben Browning 2025-04-15 20:54:16 -04:00
  • b5a9ef4c6d fix: Do not send an empty 'tools' list to remote vllm (#1957) Daniel Alvarez Sanchez 2025-04-16 02:31:12 +02:00
  • 334cd8b34a Address PR comments Jash Gulabrai 2025-04-15 18:04:19 -04:00
  • fb8ff77ff2 docs: 0.2.2 doc updates (#1961) Chirag Modi 2025-04-15 13:26:17 -07:00
  • 2a8b5b75ac Update android_sdk.md Chirag Modi 2025-04-15 12:55:54 -07:00
  • 8a15ebf520 Update android_sdk.md Chirag Modi 2025-04-15 12:53:38 -07:00
  • b8d2e33955 Update Eval ReadME Jash Gulabrai 2025-04-15 14:01:01 -04:00
  • 970ff974f2 Sync with main Jash Gulabrai 2025-04-15 13:39:40 -04:00
  • 72711287ec Merge branch 'main' into nvidia-eval-integration Jash Gulabrai 2025-04-15 13:36:42 -04:00
  • 841d8fdf4f Add back nvidia docs Jash Gulabrai 2025-04-15 13:20:16 -04:00
  • 14420e880a Fix MODEL_ENTRIES import Jash Gulabrai 2025-04-15 13:03:36 -04:00
  • 5f2f838656 fix: ensure run_eval accepts model alias and converts to nvidia model ID Jash Gulabrai 2025-04-15 12:56:55 -04:00
  • 0f50cfa561 feat(api): define a more coherent jobs api across different flows Ihar Hrachyshka 2025-03-24 20:54:04 -04:00
  • 093881071a fix: add max_tokens slider to playground tools page (#1958) Michael Clifford 2025-04-15 12:11:08 -04:00
  • 538d601472 Do not send an empty 'tools' param to remote vllm Daniel Alvarez 2025-04-15 16:06:51 +02:00
  • daf0c26420 Merge remote-tracking branch 'refs/remotes/origin/feat/litellm_sambanova_usage' into feat/litellm_sambanova_usage jhpiedrahitao 2025-04-15 10:27:52 -05:00
  • 63e3c5812d update sambanova models jhpiedrahitao 2025-04-15 10:26:57 -05:00
  • 71ed47ea76 docs: add example for intel gpu in vllm remote (#1952) Dmitry Rogozhkin 2025-04-15 07:56:23 -07:00
  • 241a42bb26 docs: add example for intel gpu in vllm remote Dmitry Rogozhkin 2025-04-11 19:00:48 +00:00
  • 95619892ea Revert model in ModelCandidate to type string Jash Gulabrai 2025-04-15 09:47:58 -04:00
  • 7cdd2a0410 Merge branch 'main' into nvidia-e2e-notebook Jash Gulabrai 2025-04-15 08:38:41 -04:00
  • 83b5523e2d feat: add --providers to llama stack build (#1718) Charlie Doern 2025-04-15 08:17:03 -04:00
  • 581ae9f393 ci: add test to build a distro with a single provider Sébastien Han 2025-04-15 14:05:09 +02:00
  • 35dd1c27d3 issue fix for doc_template Sajikumar JS 2025-04-15 14:52:42 +05:30
  • 62d426d552 updated code for workflow issues Sajikumar JS 2025-04-15 14:00:20 +05:30
  • ebf994475d Merge branch 'main' into add-watsonx-inference-adapter Sajikumar JS 2025-04-15 11:47:56 +05:30
  • 32e3da7392 test(verification): more tests, multiturn tool use tests (#1954) ehhuang 2025-04-14 18:45:22 -07:00
  • f27f617629 test(verification): more tests, multiturn Eric Huang 2025-04-14 18:20:08 -07:00
  • 4a7b26042d add max_token slider to playground tools page Michael Clifford 2025-04-14 17:21:27 -04:00
  • 86c6f1f112 fix: FastAPI built-in paths bypass custom routing (Docs) and update r… (#1841) Peter Double 2025-04-14 13:28:25 -04:00
  • 488eb8f249 Merge branch 'main' into feat/litellm_sambanova_usage Jorge Piedrahita Ortiz 2025-04-14 12:15:44 -05:00
  • ff468d91ce fix: apply pre-commit formatting Peter Double 2025-04-14 12:35:32 -04:00
  • cf158f2cb9 feat: allow ollama to use 'latest' if available but not specified (#1903) Nathan Weinberg 2025-04-14 12:03:54 -04:00
  • 3ed4316ed5 feat: Implement async job execution for torchtune training (#1437) Ihar Hrachyshka 2025-04-14 11:59:11 -04:00
  • 7641a5cd0b fix: 100% OpenAI API verification for together and fireworks (#1946) Ben Browning 2025-04-14 11:56:29 -04:00
  • dd808a8c1e Merge branch 'meta-llama:main' into feat/litellm_sambanova_usage Jorge Piedrahita Ortiz 2025-04-14 08:51:59 -05:00
  • 68eeacec0e docs: resync missing nvidia doc (#1947) Sébastien Han 2025-04-14 15:09:16 +02:00
  • 2ec5879f14 chore(github-deps): bump astral-sh/setup-uv from 5.4.0 to 5.4.1 (#1881) dependabot[bot] 2025-04-14 14:33:43 +02:00
  • 030ca4b2be docs: Move Llama 4 instructions in a collapsed section (#1936) Yuan Tang 2025-04-14 08:14:59 -04:00
  • 383ce4f9ed chore(github-deps): bump astral-sh/setup-uv from 5.4.0 to 5.4.1 dependabot[bot] 2025-04-05 20:09:11 +00:00
  • f60a6e76d9 Add documentation for postgresql uri Josh Salomon 2025-04-09 21:46:29 +03:00
  • e244a692ed Add remote::postgresql dataset provider to the registry Josh Salomon 2025-04-09 21:33:22 +03:00
  • 73b63ea318 Add remote::postgresql to build files Josh Salomon 2025-04-09 13:21:23 +03:00
  • 8e3b579df2 Initial commit for postgresql dataset provider Josh Salomon 2025-03-26 12:15:40 +02:00
  • 6d6b40983e refactor: update integration test workflow (#1856) Matthew Farrellee 2025-04-14 06:17:51 -04:00
  • 69554158fa feat: add health to all providers through providers endpoint (#1418) Sébastien Han 2025-04-14 11:59:36 +02:00
  • c636af9a7c docs: resync missing nvidia doc Sébastien Han 2025-04-14 11:58:23 +02:00
  • a53814f835 feat: add health to all providers through providers endpoint Sébastien Han 2025-03-10 12:08:11 +01:00
  • 8a1c0a1008 Improve groq OpenAI API compatibility Ben Browning 2025-04-13 13:35:53 -04:00
  • 657bb12e85 Get fireworks provider to 100% on OpenAI API verification Ben Browning 2025-04-13 10:45:32 -04:00
  • da2d39a836 Handle chunks with null text in test_openai_completion.py Ben Browning 2025-04-12 17:40:47 -04:00
  • c014571258 fix: OpenAI API - together.ai extra usage chunks Ben Browning 2025-04-12 17:27:43 -04:00
  • a4b573d750 Fix OpenAI API response format handling Ben Browning 2025-04-12 16:29:02 -04:00
  • 1e673010e4 fix: OpenAI API chat completion messages with image_url Ben Browning 2025-04-12 14:51:39 -04:00
  • ff14773fa7 fix: update llama stack client dependency Ashwin Bharambe 2025-04-12 18:14:33 -07:00
  • ca26faa7fd build: Bump version to 0.2.2 v0.2.2 release-0.2.2 github-actions[bot] 2025-04-13 01:07:44 +00:00
  • 1079e22b11 Release candidate 0.2.2rc1 v0.2.2rc1 github-actions[bot] 2025-04-13 00:54:05 +00:00
  • 429f6de7d7 fix: misc fixes for tests kill horrible warnings Ashwin Bharambe 2025-04-12 17:12:11 -07:00
  • 8b4158169f fix: dont check protocol compliance for experimental methods Ashwin Bharambe 2025-04-12 12:17:39 -07:00
  • ad86a68a32 feat: support '-' in tool names (#1807) ehhuang 2025-04-12 14:23:03 -07:00
  • 0e21c33b47 feat: support '-' in tool names Eric Huang 2025-04-12 14:15:31 -07:00
  • 399a68d246 Update README.md Yuan Tang 2025-04-12 15:44:59 -04:00
  • ef3dc143ec fix: test_registration was borked somehow Ashwin Bharambe 2025-04-12 12:04:01 -07:00
  • 1e5bf6c19d feat: update default tool use prompt (#1803) ehhuang 2025-04-12 11:54:22 -07:00
  • f34f22f8c7 feat: add batch inference API to llama stack inference (#1945) Ashwin Bharambe 2025-04-12 11:41:12 -07:00
  • 14ff4c647c include content in the message even if you have parsed out a tool call Ashwin Bharambe 2025-04-12 11:23:25 -07:00
  • 771daa4b91 fix test, fix llama3 generator Ashwin Bharambe 2025-04-12 10:51:43 -07:00
  • a3b921a5a8 update integration test workflow Matthew Farrellee 2025-04-01 16:29:15 -04:00
  • 854c2ad264 fix: misleading help text for 'llama stack build' and 'llama stack run' (#1910) Nathan Weinberg 2025-04-12 04:19:11 -04:00
  • 0751a960a5 feat: make training config fields optional (#1861) Charlie Doern 2025-04-12 04:13:45 -04:00
  • 70a7e4d51e fix: unhide python_start, python_end Ashwin Bharambe 2025-04-11 20:30:44 -07:00
  • 172a918fe3 Merge branch 'main' into feat/litellm_sambanova_usage jhpiedrahitao 2025-04-11 19:28:02 -05:00
  • 07488dbfb6 Update README.md Yuan Tang 2025-04-11 20:19:34 -04:00
  • a3cee70014 kill experimental attr on webmethod Ashwin Bharambe 2025-04-11 17:13:46 -07:00
  • 51492bd9b6 docs: Update docs and fix warning in start-stack.sh (#1937) Aidan Reilly 2025-04-12 00:26:17 +01:00
  • 1d855461d5 kill batch inference registry Ashwin Bharambe 2025-04-11 16:21:21 -07:00
  • 73d927850e updates Ashwin Bharambe 2025-04-11 16:15:59 -07:00
  • 0cfb2e2473 feat: add batch inference API to llama stack inference Ashwin Bharambe 2025-04-08 13:50:52 -07:00