Commit graph

  • 9d23c063d5 Fix regressions in inline vLLM provider Fred Reiss 2024-12-19 11:28:15 -08:00
  • b74a05114f Add inline vLLM provider to regression tests Fred Reiss 2024-12-19 11:27:59 -08:00
  • 03607a68c7
    remove unused telemetry related code for console (#659) Dinesh Yeduguru 2024-12-19 11:21:11 -08:00
  • 506d9289f6
    add response_format in AgentConfig Zhonglin Han 2024-12-19 09:19:07 -08:00
  • 92e4ec2a57
    add response com agent_instance.py Zhonglin Han 2024-12-19 09:18:09 -08:00
  • dc21e14f64 make api_key optional in config Dinesh Yeduguru 2024-12-19 07:42:30 -08:00
  • 71434d67f3 rebase and address feedback Dinesh Yeduguru 2024-12-19 07:38:08 -08:00
  • 1c770508df add brave tool provider Dinesh Yeduguru 2024-12-18 16:37:02 -08:00
  • aff79158af remove unused telemetry related code for console Dinesh Yeduguru 2024-12-19 06:55:52 -08:00
  • 4331796692 Update prompt adapter Aidan Do 2024-12-19 22:09:23 +11:00
  • a2be32c27d JKL Aidan Do 2024-12-19 20:13:29 +11:00
  • 69ad23d41f fix trace starting in library client Dinesh Yeduguru 2024-12-18 21:46:29 -08:00
  • 94645dd5f6 . Aidan Do 2024-12-19 15:08:29 +11:00
  • 95f43f8dfa . Aidan Do 2024-12-19 15:01:17 +11:00
  • ea0ca7454a address feedback Dinesh Yeduguru 2024-12-18 18:42:11 -08:00
  • 36b4fe02cc
    [4/n][torchtune integration] support lazy load model during inference (#620) Botao Chen 2024-12-18 16:30:53 -08:00
  • 78b6518b2c refine Botao Chen 2024-12-18 16:14:51 -08:00
  • cd1fc4fd17 refine Botao Chen 2024-12-18 16:08:17 -08:00
  • 7b0deee899 refine Botao Chen 2024-12-18 16:05:35 -08:00
  • 92a367340c refine Botao Chen 2024-12-18 15:59:45 -08:00
  • 7ab807ad76 refine Botao Chen 2024-12-18 15:58:51 -08:00
  • 72dab3e4bf add tools api with a stub provider impl Dinesh Yeduguru 2024-12-13 12:09:12 -08:00
  • 9e5b7d5c9e address comment Botao Chen 2024-12-18 14:32:23 -08:00
  • 75c881770a Merge branch 'main' into inference_refactor Botao Chen 2024-12-18 14:14:14 -08:00
  • 0000e1e8c6 address comments Botao Chen 2024-12-18 14:12:57 -08:00
  • 3b4b2ea30c fix replace_env_vars bug Ashwin Bharambe 2024-12-18 13:48:30 -08:00
  • 12cbed1617 Register Message and ResponseFormat Ashwin Bharambe 2024-12-18 10:32:25 -08:00
  • ceadaf1840 Dont include 3B / 1B models for bedrock since they arent ondemand Ashwin Bharambe 2024-12-18 06:30:02 -08:00
  • c39a3777b5 Make bedrock "just" work Ashwin Bharambe 2024-12-18 06:22:14 -08:00
  • d6fcdefec7 Bump version to 0.0.63 v0.0.63 Ashwin Bharambe 2024-12-17 23:15:27 -08:00
  • f1d6cb22d7 Update URL type to avoid string-ifying and creating complexity Ashwin Bharambe 2024-12-17 22:48:47 -08:00
  • d021983b0e refine Botao Chen 2024-12-17 20:43:20 -08:00
  • fadb7deae5 Merge branch 'main' into inference_refactor Botao Chen 2024-12-17 20:10:23 -08:00
  • 75e72cf2fc model_type=llm for filering available models for playground Xi Yan 2024-12-17 19:42:38 -08:00
  • 2f9fdb0ea7 Update notebook Ashwin Bharambe 2024-12-17 18:51:51 -08:00
  • 0fb4b7de6f Add more debugging logs to when llama guard fails Ashwin Bharambe 2024-12-17 17:11:21 -08:00
  • eea478618d Bump version to 0.0.62 v0.0.62 Ashwin Bharambe 2024-12-17 18:19:47 -08:00
  • af8f1b3531 model selection playground fix Xi Yan 2024-12-17 18:12:59 -08:00
  • 3700022d6f
    store attributes values in builtin types to avoid otel warnings (#649) Dinesh Yeduguru 2024-12-17 17:10:43 -08:00
  • 0e2a99e223
    Update Cerebras from Llama 3.1 to 3.3 (#645) Henry Tu 2024-12-17 19:28:24 -05:00
  • a7dd22988b tool def Dinesh Yeduguru 2024-12-17 16:06:57 -08:00
  • 482a0e4839 migrate tools and make tool runtime discover Dinesh Yeduguru 2024-12-17 14:00:29 -08:00
  • 69a17e93b7 rebase fixes Dinesh Yeduguru 2024-12-17 16:18:34 -08:00
  • 84d01fe8f8 working tools runtime Dinesh Yeduguru 2024-12-17 12:55:59 -08:00
  • 744eb0888c add tools api with a stub provider impl Dinesh Yeduguru 2024-12-16 12:42:37 -08:00
  • b7ad53ca93 minor fixes Dinesh Yeduguru 2024-12-16 13:01:52 -08:00
  • e5ac10f803 add tools api with a stub provider impl Dinesh Yeduguru 2024-12-16 12:42:37 -08:00
  • e65a6fac9d init Dinesh Yeduguru 2024-12-13 12:09:12 -08:00
  • b0bfa845a9 store attributes values in builtin types to avoid otel warnings Dinesh Yeduguru 2024-12-17 15:40:54 -08:00
  • 85d0f5f528 modify doc Botao Chen 2024-12-17 14:09:32 -08:00
  • b7a7caa9a8 Fix conversion to RawMessage everywhere Ashwin Bharambe 2024-12-17 13:38:01 -08:00
  • a63f920a02 Update Cerebras from Llama 3.1 to 3.3 Henry Tu 2024-12-17 13:54:21 -08:00
  • 486c0bc9c8 refine Botao Chen 2024-12-17 13:41:36 -08:00
  • 48482ff9c3 refine Botao Chen 2024-12-17 13:38:19 -08:00
  • fbca51d6da Fix to conda env build script Ashwin Bharambe 2024-12-17 12:19:34 -08:00
  • 0452c6a0c7 add missing init file Ashwin Bharambe 2024-12-17 11:48:28 -08:00
  • 8de8eb03c8
    Update the "InterleavedTextMedia" type (#635) Ashwin Bharambe 2024-12-17 11:18:31 -08:00
  • 10eb31badf
    docs: Update getting_started.ipynb link to correct jupyter notebook path in README.md (#636) Arun Brahma 2024-12-18 00:41:13 +05:30
  • 99f331f5c8
    [bugfix] no shield_call when there's no shields configured (#642) Xi Yan 2024-12-17 11:10:19 -08:00
  • cf9fce6b6e Update to the ImageContentItem datatype so url + data is not in a confusing union Ashwin Bharambe 2024-12-17 10:49:47 -08:00
  • 4936794de1 Move InterleavedContent to api/common/content_types.py Ashwin Bharambe 2024-12-17 10:22:14 -08:00
  • a30aaaa2e5 fix meta-reference, test vllm Ashwin Bharambe 2024-12-16 23:45:15 -08:00
  • b75e4eb6b9 refactor and fix safety test Ashwin Bharambe 2024-12-16 22:57:16 -08:00
  • 415b8f2dbd temp commit Botao Chen 2024-12-16 22:39:08 -08:00
  • 720ff235a0 pre-commit? Ashwin Bharambe 2024-12-16 22:34:54 -08:00
  • 7a1b60fccf Minor Ashwin Bharambe 2024-12-16 22:30:10 -08:00
  • 1bcc26ccd1 Fix LibraryClient completely correctly; also make tests pass Ashwin Bharambe 2024-12-16 22:16:21 -08:00
  • 81e1957446 temp commit Botao Chen 2024-12-16 21:43:30 -08:00
  • 093ffe4273 precommit fix Xi Yan 2024-12-16 21:17:20 -08:00
  • aab4084ee3 fix shields step Xi Yan 2024-12-16 21:12:18 -08:00
  • bf961f8aa5 fix shields step Xi Yan 2024-12-16 20:59:10 -08:00
  • 30f6eb282f temp commit Botao Chen 2024-12-16 19:04:47 -08:00
  • b2dbb5e3fe merge Botao Chen 2024-12-16 16:49:02 -08:00
  • 6a51e2268d Merge branch 'main' into inference_refactor Botao Chen 2024-12-16 16:47:57 -08:00
  • 35b1a6f2dc temp commit Botao Chen 2024-12-16 16:44:15 -08:00
  • d4935ca439 Update OpenAPI Ashwin Bharambe 2024-12-16 14:41:11 -08:00
  • 3b844afd00 Ollama Setup Aidan Do 2024-12-17 09:49:34 +11:00
  • f6caf6ef74 README update with email agent links Jeff Tang 2024-12-16 14:48:44 -08:00
  • e0731ba353 Fix agent executor Ashwin Bharambe 2024-12-16 11:21:51 -08:00
  • 59ce047aea Memory tests pass now Ashwin Bharambe 2024-12-15 20:55:06 -08:00
  • e51154964f Tests pass with Ollama now Ashwin Bharambe 2024-12-15 17:31:21 -08:00
  • a9a041a1de Rework InterleavedContentMedia datatype so URL downloading is in llama-stack Ashwin Bharambe 2024-12-15 13:23:30 -08:00
  • c2f7905fa4 Fix bedrock inference impl Ashwin Bharambe 2024-12-16 14:22:34 -08:00
  • eb37fba9da Small fix to library client Ashwin Bharambe 2024-12-16 14:08:30 -08:00
  • 5e08812bcb Add Dinesh to be a code owner Ashwin Bharambe 2024-12-16 13:00:50 -08:00
  • 2e5bfcd42a
    Update Telemetry API so OpenAPI generation can work (#640) Ashwin Bharambe 2024-12-16 13:00:14 -08:00
  • 787f5461bc Fix the openapi generator Ashwin Bharambe 2024-12-16 12:51:39 -08:00
  • 78e2bfbe7a
    [tests] add client-sdk pytests & delete client.py (#638) Xi Yan 2024-12-16 12:04:56 -08:00
  • 6abaf4574d test custom tool Xi Yan 2024-12-16 12:04:05 -08:00
  • b1f311982f delete client.py Xi Yan 2024-12-16 11:52:58 -08:00
  • 04ccb2db3e safety Xi Yan 2024-12-16 11:47:01 -08:00
  • 5bbdcf70f6 Use a Dict instead of tree for spans Ashwin Bharambe 2024-12-16 11:39:33 -08:00
  • 546fd84177 mv sdk tests Xi Yan 2024-12-16 11:26:00 -08:00
  • 3cf4fdb740 mv sdk tests Xi Yan 2024-12-16 11:17:57 -08:00
  • 22875a5ce0 some improvements for none cases Vladislav 2024-12-16 19:02:51 +01:00
  • 89aadb793a add build params and co update Vladislav 2024-12-16 18:23:11 +01:00
  • 12858d6d61 add build and dt types for platfrom Vladislav 2024-12-13 18:15:17 +01:00
  • 54e48d555d
    Merge branch 'meta-llama:main' into main Shrinit Goyal 2024-12-16 18:14:20 +05:30
  • e7a349e969
    Update README to correct Jupyter notebook link for Llama Stack usage Arun Brahma 2024-12-16 10:21:55 +05:30
  • 0dd56fe7ec Approach #2 Aidan Do 2024-12-16 11:55:50 +11:00