Commit graph

  • 73b71d9689 Handle Annotated types more correctly Ashwin Bharambe 2024-09-14 12:22:13 -07:00
  • 085f9fcce3
    Merge branch 'main' into cli Xi Yan 2024-09-14 14:10:34 -07:00
  • e69e1b8309 [tmp fix] hardware requirement tmp fix Xi Yan 2024-09-14 14:06:36 -07:00
  • bafef7ab96 clean up configure Xi Yan 2024-09-14 14:00:10 -07:00
  • 20157487d4 configure update Xi Yan 2024-09-14 13:59:09 -07:00
  • 4ed1f38134 update docker build script Xi Yan 2024-09-14 13:46:17 -07:00
  • d1f0d17644 update build cli Xi Yan 2024-09-14 13:35:09 -07:00
  • 53ab18d6bb Bump version to 0.0.16 Ashwin Bharambe 2024-09-14 08:09:45 -07:00
  • 49ce36426f Make llama model download error message a bit better Ashwin Bharambe 2024-09-14 08:06:34 -07:00
  • 7a283ea076 Bump version to 0.0.15 Ashwin Bharambe 2024-09-13 17:23:12 -07:00
  • 498cf03617 add pypdf Ashwin Bharambe 2024-09-13 17:04:43 -07:00
  • 19a14cd273 Nuke hardware_requirements from SKUs Ashwin Bharambe 2024-09-13 16:39:02 -07:00
  • 768ed09dec rename getting started Xi Yan 2024-09-13 14:48:27 -07:00
  • 27107fbbbd update doc Xi Yan 2024-09-13 14:31:07 -07:00
  • 245cc88081 add reedme Xi Yan 2024-09-13 14:24:42 -07:00
  • dfeb84be5c udpate build script Xi Yan 2024-09-13 14:09:47 -07:00
  • 9315eae645 remove comments Xi Yan 2024-09-13 14:08:55 -07:00
  • 62865ce0d2 update memory providers Xi Yan 2024-09-13 13:58:16 -07:00
  • 73a6589446 configure to regenerate file Xi Yan 2024-09-13 13:39:39 -07:00
  • 84047928ce remove config from build Xi Yan 2024-09-13 13:18:42 -07:00
  • 33772da8e0 remove configure from build Xi Yan 2024-09-13 13:17:17 -07:00
  • d8b3fdbd54
    Update README.md raghotham 2024-09-13 08:56:47 -07:00
  • 6a863f9b78 Bump version to 0.0.14 Xi Yan 2024-09-12 21:24:07 -07:00
  • 16635508bd Bump version to 0.0.14 Xi Yan 2024-09-12 15:11:15 -07:00
  • 5712566061
    Remove request wrapper migration (#64) Xi Yan 2024-09-12 15:03:49 -07:00
  • 1d0e91d802
    Support data: in URL for memory. Add ootb support for pdfs (#67) Hardik Shah 2024-09-12 13:00:21 -07:00
  • 487e16dc3f moved utility to common and updated data_url parsing logic Hardik Shah 2024-09-12 11:58:04 -07:00
  • 5f49dce839 support data: in URL for memory. Add ootb support for pdfs Hardik Shah 2024-09-12 10:54:55 -07:00
  • 736092f6bc
    [Inference] Use huggingface_hub inference client for TGI adapter (#53) Celina Hanouti 2024-09-12 18:11:35 +02:00
  • c8808b4700 Move helper into impl file + fix merging conflicts Celina Hanouti 2024-09-12 15:55:42 +02:00
  • 04f0b8fe11 Merge branch 'main' into tgi-integration Celina Hanouti 2024-09-12 15:31:07 +02:00
  • 7d6ebf4b72 update inference adapters Xi Yan 2024-09-11 19:58:20 -07:00
  • 29d1ef3fdc together adapter inference Xi Yan 2024-09-11 18:41:00 -07:00
  • f55ffa8b53 fix agentic calling inference Xi Yan 2024-09-11 18:30:09 -07:00
  • 2501b3d7de
    Merge branch 'main' into migrate_request_wrapper Xi Yan 2024-09-11 16:06:38 -07:00
  • 2aa76e4d81 fix api to work with openapi generator Xi Yan 2024-09-11 16:05:35 -07:00
  • cd493b8228 Simplified Telemetry API and tying it to logger (#57) Ashwin Bharambe 2024-09-11 14:25:37 -07:00
  • 83ffdcc1ef openapi generator rerun Xi Yan 2024-09-11 15:30:51 -07:00
  • 8385a45aca fix inference Xi Yan 2024-09-11 15:15:16 -07:00
  • 2d0163b47b fix inference Xi Yan 2024-09-11 14:51:06 -07:00
  • 191cd28831
    Simplified Telemetry API and tying it to logger (#57) Ashwin Bharambe 2024-09-11 14:25:37 -07:00
  • e8c2f068a3 move span events one level down into structured log events Ashwin Bharambe 2024-09-11 14:24:54 -07:00
  • 96f3058145 remove hack from openapi generator Xi Yan 2024-09-11 14:20:52 -07:00
  • 75ac0b2db1 re-generate openapi spec Xi Yan 2024-09-11 14:17:57 -07:00
  • a3081f28fc migrate apis without implementations Xi Yan 2024-09-11 14:15:13 -07:00
  • 6049aada71 migrate agentic system Xi Yan 2024-09-11 13:57:39 -07:00
  • 4b34f741d0 safety api Xi Yan 2024-09-11 13:41:15 -07:00
  • 959c499cac inference regenerate openapi spec Xi Yan 2024-09-11 12:36:23 -07:00
  • 8b558336b4 inference/completion Xi Yan 2024-09-11 12:32:12 -07:00
  • a7be58e4e1 migrate inference/completion Xi Yan 2024-09-11 12:29:22 -07:00
  • 0c7c6b7e02 [1/n] migrate inference/chat_completion Xi Yan 2024-09-11 12:21:19 -07:00
  • 99af14b18c Merge remote-tracking branch 'origin/main' into telemetry Ashwin Bharambe 2024-09-11 12:18:12 -07:00
  • f294875396 small update which adds a METRIC type Ashwin Bharambe 2024-09-11 12:17:04 -07:00
  • 1433aaf9f7 add CODEOWNERS file Xi Yan 2024-09-11 11:40:37 -07:00
  • 89300df5dc
    Add config file based CLI (#60) Xi Yan 2024-09-11 11:39:46 -07:00
  • ad33c41eb0 update readme Xi Yan 2024-09-11 11:22:17 -07:00
  • dd3d6525fe update readme Xi Yan 2024-09-11 11:17:59 -07:00
  • 6aa44805c2 update configure to only consume config Xi Yan 2024-09-11 10:59:41 -07:00
  • 987e1cafc4 only consume config as argument Xi Yan 2024-09-11 10:43:36 -07:00
  • aebec57ed7 move import to inline Xi Yan 2024-09-10 22:05:40 -07:00
  • 58def874a9
    add safety to openapi spec (#62) Xi Yan 2024-09-10 17:47:13 -07:00
  • d602c8314e add safety to openapi spec Xi Yan 2024-09-10 16:42:28 -07:00
  • 0df4d9c9bd API Keys passed from Client instead of distro configuration Hardik Shah 2024-09-10 12:36:30 -07:00
  • a11d92601b
    Enable Bing search (#59) Hardik Shah 2024-09-10 12:34:29 -07:00
  • 6c97e84372 fix run-config/config-file to config Xi Yan 2024-09-10 12:21:51 -07:00
  • ace3953926 distribution_type -> distribution Xi Yan 2024-09-10 12:06:46 -07:00
  • 03123f718b dropped commented code Hardik Shah 2024-09-10 11:44:08 -07:00
  • be9e488e56 update readme Xi Yan 2024-09-10 11:35:27 -07:00
  • 1e978e16b1 update build.sh Xi Yan 2024-09-10 11:23:30 -07:00
  • f05347be8f fix configure script to work with config file Xi Yan 2024-09-10 11:15:38 -07:00
  • 9d5582245d configure script with config Xi Yan 2024-09-10 11:13:01 -07:00
  • 5bf2fe452d fix build command Xi Yan 2024-09-10 11:06:32 -07:00
  • 0981193d78 config file for build Xi Yan 2024-09-10 11:02:46 -07:00
  • 0964b0a74a Improve TGI adapter initialization condition Celina Hanouti 2024-09-10 18:22:09 +02:00
  • 2b63074676 add /inference/chat_completion to SSE special case Dalton Flanagan 2024-09-10 01:14:11 -04:00
  • bdede6d14e simplify search tool and enable configuration for search engine Hardik Shah 2024-09-09 18:41:11 -07:00
  • 4f021de10f
    API spec update, client demo with Stainless SDK (#58) Xi Yan 2024-09-09 13:09:47 -07:00
  • 6bfcbc678e remove client sdk examples Xi Yan 2024-09-09 12:21:08 -07:00
  • 26209a9d99 add comment todos Xi Yan 2024-09-09 11:50:35 -07:00
  • 8c378fadcc agentic system client sdk Xi Yan 2024-09-09 11:46:08 -07:00
  • b7b8f5c2c3 update script Xi Yan 2024-09-09 11:19:57 -07:00
  • 6ccb0a4c1f Simplified Telemetry API and tying it to logger Ashwin Bharambe 2024-09-07 15:25:35 -07:00
  • 84b8a53a34 update wrapper request Xi Yan 2024-09-09 11:15:22 -07:00
  • 838ab91ebf update generator & yaml spec Xi Yan 2024-09-09 10:39:29 -07:00
  • 2ac8e7b901 Remove unecessary method argument Celina Hanouti 2024-09-09 19:04:21 +02:00
  • fff1b6d6bf Use HfApi to get the namespace when not provide in the hf endpoint name Celina Hanouti 2024-09-09 18:59:10 +02:00
  • 3d660ad938 Rename TGI Adapter class Celina Hanouti 2024-09-09 18:30:34 +02:00
  • eee6c69f46 Update CLI reference and add typing Celina Hanouti 2024-09-09 17:49:07 +02:00
  • b96e705680 Fixes post-review and split TGI adapter into local and Inference Endpoints ones Celina Hanouti 2024-09-09 17:47:49 +02:00
  • ee32de4c3f [wip] client w/ stainless sdk Xi Yan 2024-09-08 18:31:49 -07:00
  • 640c5f8ab9 add tool for bing search Hardik Shah 2024-09-08 17:25:52 -07:00
  • 741310f78e rename observability -> Telemetry; regen Spec Ashwin Bharambe 2024-09-07 15:23:53 -07:00
  • 70e682fbdf Update distribution_id -> distribution_type, provider_id -> provider_type Ashwin Bharambe 2024-09-07 08:42:28 -07:00
  • 3f090d1975
    Add Chroma and PGVector adapters (#56) Ashwin Bharambe 2024-09-06 18:53:17 -07:00
  • c02d8aa3d3 Add Chroma and PGVector adapters Ashwin Bharambe 2024-09-05 23:49:14 -07:00
  • 5de6ed946e
    Query generators for RAG query (#54) Hardik Shah 2024-09-06 13:10:39 -07:00
  • 95a5982524 drop classes for functions Hardik Shah 2024-09-06 12:58:13 -07:00
  • c2b7b462e9 use agent.inference_api instead of passing host/port again Hardik Shah 2024-09-06 12:48:08 -07:00
  • 406c3b24d4
    upgrade llama_models (#55) Yufei (Benny) Chen 2024-09-06 12:03:13 -07:00
  • f1e23075d1 upgrade llama_models benjibc 2024-09-06 18:55:19 +00:00