Commit graph

  • 6a863f9b78 Bump version to 0.0.14 Xi Yan 2024-09-12 21:24:07 -07:00
  • 16635508bd Bump version to 0.0.14 Xi Yan 2024-09-12 15:11:15 -07:00
  • 5712566061
    Remove request wrapper migration (#64) Xi Yan 2024-09-12 15:03:49 -07:00
  • 1d0e91d802
    Support data: in URL for memory. Add ootb support for pdfs (#67) Hardik Shah 2024-09-12 13:00:21 -07:00
  • 487e16dc3f moved utility to common and updated data_url parsing logic Hardik Shah 2024-09-12 11:58:04 -07:00
  • 5f49dce839 support data: in URL for memory. Add ootb support for pdfs Hardik Shah 2024-09-12 10:54:55 -07:00
  • 736092f6bc
    [Inference] Use huggingface_hub inference client for TGI adapter (#53) Celina Hanouti 2024-09-12 18:11:35 +02:00
  • c8808b4700 Move helper into impl file + fix merging conflicts Celina Hanouti 2024-09-12 15:55:42 +02:00
  • 04f0b8fe11 Merge branch 'main' into tgi-integration Celina Hanouti 2024-09-12 15:31:07 +02:00
  • 7d6ebf4b72 update inference adapters Xi Yan 2024-09-11 19:58:20 -07:00
  • 29d1ef3fdc together adapter inference Xi Yan 2024-09-11 18:41:00 -07:00
  • f55ffa8b53 fix agentic calling inference Xi Yan 2024-09-11 18:30:09 -07:00
  • 2501b3d7de
    Merge branch 'main' into migrate_request_wrapper Xi Yan 2024-09-11 16:06:38 -07:00
  • 2aa76e4d81 fix api to work with openapi generator Xi Yan 2024-09-11 16:05:35 -07:00
  • cd493b8228 Simplified Telemetry API and tying it to logger (#57) Ashwin Bharambe 2024-09-11 14:25:37 -07:00
  • 83ffdcc1ef openapi generator rerun Xi Yan 2024-09-11 15:30:51 -07:00
  • 8385a45aca fix inference Xi Yan 2024-09-11 15:15:16 -07:00
  • 2d0163b47b fix inference Xi Yan 2024-09-11 14:51:06 -07:00
  • 191cd28831
    Simplified Telemetry API and tying it to logger (#57) Ashwin Bharambe 2024-09-11 14:25:37 -07:00
  • e8c2f068a3 move span events one level down into structured log events Ashwin Bharambe 2024-09-11 14:24:54 -07:00
  • 96f3058145 remove hack from openapi generator Xi Yan 2024-09-11 14:20:52 -07:00
  • 75ac0b2db1 re-generate openapi spec Xi Yan 2024-09-11 14:17:57 -07:00
  • a3081f28fc migrate apis without implementations Xi Yan 2024-09-11 14:15:13 -07:00
  • 6049aada71 migrate agentic system Xi Yan 2024-09-11 13:57:39 -07:00
  • 4b34f741d0 safety api Xi Yan 2024-09-11 13:41:15 -07:00
  • 959c499cac inference regenerate openapi spec Xi Yan 2024-09-11 12:36:23 -07:00
  • 8b558336b4 inference/completion Xi Yan 2024-09-11 12:32:12 -07:00
  • a7be58e4e1 migrate inference/completion Xi Yan 2024-09-11 12:29:22 -07:00
  • 0c7c6b7e02 [1/n] migrate inference/chat_completion Xi Yan 2024-09-11 12:21:19 -07:00
  • 99af14b18c Merge remote-tracking branch 'origin/main' into telemetry Ashwin Bharambe 2024-09-11 12:18:12 -07:00
  • f294875396 small update which adds a METRIC type Ashwin Bharambe 2024-09-11 12:17:04 -07:00
  • 1433aaf9f7 add CODEOWNERS file Xi Yan 2024-09-11 11:40:37 -07:00
  • 89300df5dc
    Add config file based CLI (#60) Xi Yan 2024-09-11 11:39:46 -07:00
  • ad33c41eb0 update readme Xi Yan 2024-09-11 11:22:17 -07:00
  • dd3d6525fe update readme Xi Yan 2024-09-11 11:17:59 -07:00
  • 6aa44805c2 update configure to only consume config Xi Yan 2024-09-11 10:59:41 -07:00
  • 987e1cafc4 only consume config as argument Xi Yan 2024-09-11 10:43:36 -07:00
  • aebec57ed7 move import to inline Xi Yan 2024-09-10 22:05:40 -07:00
  • 58def874a9
    add safety to openapi spec (#62) Xi Yan 2024-09-10 17:47:13 -07:00
  • d602c8314e add safety to openapi spec Xi Yan 2024-09-10 16:42:28 -07:00
  • 0df4d9c9bd API Keys passed from Client instead of distro configuration Hardik Shah 2024-09-10 12:36:30 -07:00
  • a11d92601b
    Enable Bing search (#59) Hardik Shah 2024-09-10 12:34:29 -07:00
  • 6c97e84372 fix run-config/config-file to config Xi Yan 2024-09-10 12:21:51 -07:00
  • ace3953926 distribution_type -> distribution Xi Yan 2024-09-10 12:06:46 -07:00
  • 03123f718b dropped commented code Hardik Shah 2024-09-10 11:44:08 -07:00
  • be9e488e56 update readme Xi Yan 2024-09-10 11:35:27 -07:00
  • 1e978e16b1 update build.sh Xi Yan 2024-09-10 11:23:30 -07:00
  • f05347be8f fix configure script to work with config file Xi Yan 2024-09-10 11:15:38 -07:00
  • 9d5582245d configure script with config Xi Yan 2024-09-10 11:13:01 -07:00
  • 5bf2fe452d fix build command Xi Yan 2024-09-10 11:06:32 -07:00
  • 0981193d78 config file for build Xi Yan 2024-09-10 11:02:46 -07:00
  • 0964b0a74a Improve TGI adapter initialization condition Celina Hanouti 2024-09-10 18:22:09 +02:00
  • 2b63074676 add /inference/chat_completion to SSE special case Dalton Flanagan 2024-09-10 01:14:11 -04:00
  • bdede6d14e simplify search tool and enable configuration for search engine Hardik Shah 2024-09-09 18:41:11 -07:00
  • 4f021de10f
    API spec update, client demo with Stainless SDK (#58) Xi Yan 2024-09-09 13:09:47 -07:00
  • 6bfcbc678e remove client sdk examples Xi Yan 2024-09-09 12:21:08 -07:00
  • 26209a9d99 add comment todos Xi Yan 2024-09-09 11:50:35 -07:00
  • 8c378fadcc agentic system client sdk Xi Yan 2024-09-09 11:46:08 -07:00
  • b7b8f5c2c3 update script Xi Yan 2024-09-09 11:19:57 -07:00
  • 6ccb0a4c1f Simplified Telemetry API and tying it to logger Ashwin Bharambe 2024-09-07 15:25:35 -07:00
  • 84b8a53a34 update wrapper request Xi Yan 2024-09-09 11:15:22 -07:00
  • 838ab91ebf update generator & yaml spec Xi Yan 2024-09-09 10:39:29 -07:00
  • 2ac8e7b901 Remove unecessary method argument Celina Hanouti 2024-09-09 19:04:21 +02:00
  • fff1b6d6bf Use HfApi to get the namespace when not provide in the hf endpoint name Celina Hanouti 2024-09-09 18:59:10 +02:00
  • 3d660ad938 Rename TGI Adapter class Celina Hanouti 2024-09-09 18:30:34 +02:00
  • eee6c69f46 Update CLI reference and add typing Celina Hanouti 2024-09-09 17:49:07 +02:00
  • b96e705680 Fixes post-review and split TGI adapter into local and Inference Endpoints ones Celina Hanouti 2024-09-09 17:47:49 +02:00
  • ee32de4c3f [wip] client w/ stainless sdk Xi Yan 2024-09-08 18:31:49 -07:00
  • 640c5f8ab9 add tool for bing search Hardik Shah 2024-09-08 17:25:52 -07:00
  • 741310f78e rename observability -> Telemetry; regen Spec Ashwin Bharambe 2024-09-07 15:23:53 -07:00
  • 70e682fbdf Update distribution_id -> distribution_type, provider_id -> provider_type Ashwin Bharambe 2024-09-07 08:42:28 -07:00
  • 3f090d1975
    Add Chroma and PGVector adapters (#56) Ashwin Bharambe 2024-09-06 18:53:17 -07:00
  • c02d8aa3d3 Add Chroma and PGVector adapters Ashwin Bharambe 2024-09-05 23:49:14 -07:00
  • 5de6ed946e
    Query generators for RAG query (#54) Hardik Shah 2024-09-06 13:10:39 -07:00
  • 95a5982524 drop classes for functions Hardik Shah 2024-09-06 12:58:13 -07:00
  • c2b7b462e9 use agent.inference_api instead of passing host/port again Hardik Shah 2024-09-06 12:48:08 -07:00
  • 406c3b24d4
    upgrade llama_models (#55) Yufei (Benny) Chen 2024-09-06 12:03:13 -07:00
  • f1e23075d1 upgrade llama_models benjibc 2024-09-06 18:55:19 +00:00
  • 5ab4fd31f7 Merge branch 'tgi-integration' of github.com:hanouticelina/llama-stack into tgi-integration Celina Hanouti 2024-09-06 17:58:22 +02:00
  • 031dbc0e45 Use InferenceClient.text_generation for TGI inference Celina Hanouti 2024-09-06 17:56:27 +02:00
  • 3858d94edf
    Merge branch 'meta-llama:main' into tgi-integration Celina Hanouti 2024-09-06 15:37:12 +02:00
  • 4a70f3d2ba Query generators for rag query Hardik Shah 2024-09-04 17:58:42 -07:00
  • dd1e1ceb13 Add bubblewrap to the container Ashwin Bharambe 2024-09-05 16:45:58 -07:00
  • f6b5e394ab Remove dependence on os.environ["USER"] Ashwin Bharambe 2024-09-05 15:37:12 -07:00
  • 7aa50934bf Update the default value for TGI URL Celina Hanouti 2024-09-05 19:05:07 +02:00
  • e5bcfdac21 Use huggingface_hub inference client for TGI inference Celina Hanouti 2024-09-05 18:29:04 +02:00
  • 6c69e09c6a Bump version to 0.0.13 Ashwin Bharambe 2024-09-04 23:10:38 -07:00
  • 21bedc1596
    [inference] Add a TGI adapter (#52) Ashwin Bharambe 2024-09-04 22:49:33 -07:00
  • 046afcb945 Use the lower-level generate_stream() method for correct tool calling Ashwin Bharambe 2024-09-04 17:36:45 -07:00
  • f355b9b844 TGI adapter and some refactoring of other inference adapters Ashwin Bharambe 2024-09-04 10:51:27 -07:00
  • 6ad7365676 A little clean up for the Fireworks and Together adapters Ashwin Bharambe 2024-09-04 22:34:15 -07:00
  • 225cd75074
    Update cli_reference.md raghotham 2024-09-04 18:50:10 -07:00
  • bfee50aa83 A few more fixes to the OpenAPI generator Ashwin Bharambe 2024-09-04 10:29:20 -07:00
  • 0167953d2d Update OpenAPI generator for POST requests Ashwin Bharambe 2024-09-04 09:27:00 -07:00
  • 01d971bda6 Bump version to 0.0.12 Ashwin Bharambe 2024-09-03 23:24:02 -07:00
  • 1380d78c19 Fixes to the llama stack configure script + inference adapters Ashwin Bharambe 2024-09-03 23:22:21 -07:00
  • 4869f2b983 Update fireworks and together entries as adapters Ashwin Bharambe 2024-09-03 22:56:52 -07:00
  • f802d481d9 Bump version to 0.0.11 Ashwin Bharambe 2024-09-03 22:41:29 -07:00
  • 7bc7785b0d
    API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) Ashwin Bharambe 2024-09-03 22:39:39 -07:00
  • 86059af5af Added a "--raw" option for model template printing Ashwin Bharambe 2024-09-03 22:04:43 -07:00