Commit graph

  • a0a00f1345 Update telemetry to have TEXT be the default log format Ashwin Bharambe 2024-11-21 15:17:37 -08:00
  • 945db5dac2 fix logging Xi Yan 2024-11-21 15:02:57 -08:00
  • d790be28b3 Don't skip meta-reference for the tests Ashwin Bharambe 2024-11-21 13:29:12 -08:00
  • 55c55b9f51 Update Quick Start significantly Ashwin Bharambe 2024-11-21 13:20:37 -08:00
  • 654722da7d fix model id for llm_as_judge_405b Xi Yan 2024-11-19 19:05:06 -08:00
  • 6395dadc2b
    use logging instead of prints (#499) Dinesh Yeduguru 2024-11-21 11:32:53 -08:00
  • 4e1105e563
    Fix fp8 quantization script. (#500) liyunlu0618 2024-11-21 09:15:28 -08:00
  • cf079a22a0 Plurals Ashwin Bharambe 2024-11-20 23:24:59 -08:00
  • cd6ccb664c Integrate distro docs into the restructured docs Ashwin Bharambe 2024-11-20 23:20:05 -08:00
  • 2411a44833 Update more distribution docs to be simpler and partially codegen'ed Ashwin Bharambe 2024-11-20 14:44:04 -08:00
  • e84d4436b5
    Since we are pushing for HF repos, we should accept them in inference configs (#497) Ashwin Bharambe 2024-11-20 16:14:37 -08:00
  • b3f9e8b2f2
    Restructure docs (#494) Dinesh Yeduguru 2024-11-20 15:54:47 -08:00
  • 068ac00a3b
    Don't depend on templates.py when print llama stack build messages (#496) Ashwin Bharambe 2024-11-20 15:44:49 -08:00
  • 00816cc8ef make sure codegen doesn't cause spurious diffs for no reason v0.0.53 Ashwin Bharambe 2024-11-20 13:55:43 -08:00
  • 681322731b
    Make run yaml optional so dockers can start with just --env (#492) Ashwin Bharambe 2024-11-20 13:11:40 -08:00
  • 1d8d0593af
    register with provider even if present in stack (#491) Dinesh Yeduguru 2024-11-20 11:05:50 -08:00
  • 91e7efbc91
    fall to back to read from chroma/pgvector when not in cache (#489) Dinesh Yeduguru 2024-11-20 10:30:23 -08:00
  • ae49a4cb97
    Reorganizing Zero to Hero Folder structure (#447) Justin Lee 2024-11-20 10:27:29 -08:00
  • 89f5093dfc Fix tgi doc Ashwin Bharambe 2024-11-19 21:05:59 -08:00
  • 1086b500f9
    Support Tavily as built-in search tool. (#485) Mengtao Yuan 2024-11-19 20:59:02 -08:00
  • 08be023290
    Added optional md5 validate command once download is completed (#486) varunfb 2024-11-19 17:42:43 -08:00
  • e670f99ef7
    add changelog (#487) Dinesh Yeduguru 2024-11-19 17:36:08 -08:00
  • dd5466e17d Bump version to 0.0.53 Ashwin Bharambe 2024-11-19 16:44:15 -08:00
  • b0fdf7552a docs Xi Yan 2024-11-19 16:41:45 -08:00
  • c49acc5226 docs Xi Yan 2024-11-19 16:39:40 -08:00
  • f78200b189 docs Xi Yan 2024-11-19 16:37:30 -08:00
  • e605d57fb7 use API version in "remote" stack client Ashwin Bharambe 2024-11-19 15:59:47 -08:00
  • 7bfcfe80b5 Add logs (prints :/) to dump out what URL vllm / tgi is connecting to Ashwin Bharambe 2024-11-19 15:50:26 -08:00
  • 887ccc2143 Ensure llama-stack-client is installed in the container with TEST_PYPI Ashwin Bharambe 2024-11-19 15:20:51 -08:00
  • 2da93c8835 fix 3.2-1b fireworks Xi Yan 2024-11-19 14:20:07 -08:00
  • 189df6358a codegen docs Xi Yan 2024-11-19 14:16:00 -08:00
  • 185df4b568 fix fireworks registration Xi Yan 2024-11-19 14:09:00 -08:00
  • 38ba3b9f0c Fix fireworks stream completion Ashwin Bharambe 2024-11-19 13:36:14 -08:00
  • 05d1ead02f Update condition in tests to handle llama-3.1 vs llama3.1 (HF names) Ashwin Bharambe 2024-11-19 13:25:36 -08:00
  • 394519d68a Add llama-stack-client as a legitimate dependency for llama-stack Ashwin Bharambe 2024-11-19 11:44:35 -08:00
  • c46b462c22 Updates to docker build script Ashwin Bharambe 2024-11-19 11:36:53 -08:00
  • 39e99b39fe
    update quick start to have the working instruction (#467) Henry Tai 2024-11-20 02:32:19 +08:00
  • 1b0f5fff5a fix curl endpoint Xi Yan 2024-11-19 10:26:05 -08:00
  • 1619d37cc6 codegen per-distro dependencies; not hooked into setup.py yet Ashwin Bharambe 2024-11-19 09:54:30 -08:00
  • 5e4ac1b7c1 Make sure server code uses version prefixed routes Ashwin Bharambe 2024-11-19 09:15:05 -08:00
  • 84d5f35a48 Update the model alias for llama guard models in ollama Ashwin Bharambe 2024-11-19 00:22:24 -08:00
  • e8d3eee095 Fix docs yet again Ashwin Bharambe 2024-11-18 23:51:25 -08:00
  • 02f1c47416
    support adding alias for models without hf repo/sku entry (#481) Dinesh Yeduguru 2024-11-18 23:50:18 -08:00
  • 8ed79ad0f3 Fix the pyopenapi generator avoid potential circular imports Ashwin Bharambe 2024-11-18 23:37:52 -08:00
  • d463d68e1e Update docs Ashwin Bharambe 2024-11-18 23:21:25 -08:00
  • 93abb8e208 Include all yamls Ashwin Bharambe 2024-11-18 22:46:07 -08:00
  • 0dc7f5fa89
    Add version to REST API url (#478) Ashwin Bharambe 2024-11-18 22:44:14 -08:00
  • 05e93bd2f7 together default Xi Yan 2024-11-18 22:39:45 -08:00
  • 7693786322 Use HF names for registering fireworks and together models Ashwin Bharambe 2024-11-18 22:34:26 -08:00
  • 6765fd76ff
    fix llama stack build for together & llama stack build from templates (#479) Xi Yan 2024-11-18 22:29:16 -08:00
  • ea52a3ee1c minor enhancement for test fixtures Ashwin Bharambe 2024-11-18 22:20:59 -08:00
  • fcc2132e6f
    remove pydantic namespace warnings using model_config (#470) Matthew Farrellee 2024-11-18 22:24:14 -05:00
  • 2108a779f2
    Update kotlin client docs (#476) Riandy 2024-11-18 19:13:20 -08:00
  • d2b7c5aeae
    add quantized model ollama support (#471) Kai Wu 2024-11-18 18:55:23 -08:00
  • 14c75c3f21 Update CONTRIBUTING to include info about pre-commit Ashwin Bharambe 2024-11-18 18:17:41 -08:00
  • fe19076838
    get stack run config based on template name (#477) Dinesh Yeduguru 2024-11-18 18:05:05 -08:00
  • 50d539e6d7 update tests --inference-model to hf id Xi Yan 2024-11-18 17:36:58 -08:00
  • 939056e265 More documentation fixes Ashwin Bharambe 2024-11-18 17:06:13 -08:00
  • e40404625b Update to docs Ashwin Bharambe 2024-11-18 16:52:48 -08:00
  • 91f3009c67 No more built_at Ashwin Bharambe 2024-11-18 16:38:51 -08:00
  • afa4f0b19f Update remote vllm docs Ashwin Bharambe 2024-11-18 16:34:33 -08:00
  • fb15ff4a97 Move to use argparse, fix issues with multiple --env cmdline options Ashwin Bharambe 2024-11-18 16:31:59 -08:00
  • b87f3ac499 Allow server to accept --env key pairs Ashwin Bharambe 2024-11-18 16:17:59 -08:00
  • 1fb61137ad Add conda_env Ashwin Bharambe 2024-11-18 16:08:03 -08:00
  • b822149098 Update start conda Ashwin Bharambe 2024-11-18 16:07:27 -08:00
  • 47c37fd831 Fixes Ashwin Bharambe 2024-11-18 16:03:20 -08:00
  • 3aedde2ab4 Add a pre-commit for distro_codegen but it does not work yet Ashwin Bharambe 2024-11-18 15:20:49 -08:00
  • 57a9b4d57f
    Allow models to be registered as long as llama model is provided (#472) Dinesh Yeduguru 2024-11-18 15:05:29 -08:00
  • 2a31163178
    Auto-generate distro yamls + docs (#468) Ashwin Bharambe 2024-11-18 14:57:06 -08:00
  • 0784284ab5
    [Agentic Eval] add ability to run agents generation (#469) Xi Yan 2024-11-18 11:43:03 -08:00
  • f1b9578f8d
    Extend shorthand support for the llama stack run command (#465) Vladimir Ivić 2024-11-15 23:16:42 -08:00
  • 57bafd0f8c
    fix faiss serialize and serialize of index (#464) Dinesh Yeduguru 2024-11-15 18:02:48 -08:00
  • ff99025875
    await initialize in faiss (#463) Dinesh Yeduguru 2024-11-15 14:21:31 -08:00
  • 20bf2f50c2 No more model_id warnings Ashwin Bharambe 2024-11-15 12:20:18 -08:00
  • e8112b31ab
    move hf addapter->remote (#459) Xi Yan 2024-11-14 22:41:19 -05:00
  • 788411b680 categorical score for llm as judge Xi Yan 2024-11-14 22:33:20 -05:00
  • 0850ad656a
    unregister for memory banks and remove update API (#458) Dinesh Yeduguru 2024-11-14 17:12:11 -08:00
  • 2eab3b7ed9 skip aggregation for llm_as_judge Xi Yan 2024-11-14 17:50:46 -05:00
  • bba6edd06b Fix OpenAPI generation to have text/event-stream for streamable methods Ashwin Bharambe 2024-11-14 12:51:38 -08:00
  • acbecbf8b3
    Add a verify-download command to llama CLI (#457) Ashwin Bharambe 2024-11-14 11:47:51 -08:00
  • 0713607b68
    Support parallel downloads for llama model download (#448) Ashwin Bharambe 2024-11-14 09:56:22 -08:00
  • 0c750102c6
    Fix build configure deprecation message (#456) Martin Hickey 2024-11-14 17:56:03 +00:00
  • 58381dbe78
    local persistence for eval tasks (#453) Xi Yan 2024-11-14 10:36:23 -05:00
  • 46f0b6606a
    init registry once (#450) Dinesh Yeduguru 2024-11-13 22:20:57 -08:00
  • efe791bab7
    Support model resource updates and deletes (#452) Dinesh Yeduguru 2024-11-13 21:55:41 -08:00
  • 4253cfcd7f
    local persistent for hf dataset provider (#451) Xi Yan 2024-11-14 00:08:37 -05:00
  • e90ea1ab1e
    make distribution registry thread safe and other fixes (#449) Dinesh Yeduguru 2024-11-13 15:12:34 -08:00
  • 15dee2b8b8
    Added link to the Colab notebook of the Llama Stack lesson on the Llama 3.2 course on DLAI (#445) Jeff Tang 2024-11-13 13:59:41 -08:00
  • 787e2034b7
    model registration in ollama and vllm check against the available models in the provider (#446) Dinesh Yeduguru 2024-11-13 13:04:06 -08:00
  • 7f6ac2fbd7 allow seeing warnings with traces optionally Ashwin Bharambe 2024-11-13 12:27:19 -08:00
  • 96e7ef646f
    add support for ${env.FOO_BAR} placeholders in run.yaml files (#439) Ashwin Bharambe 2024-11-13 11:25:58 -08:00
  • 838b8d4fb5
    PR-437-Fixed bug to allow system instructions after first turn (#440) Sarthak Deshpande 2024-11-14 00:04:04 +05:30
  • 94a6f57812
    change schema -> dataset_schema for register_dataset api (#443) Xi Yan 2024-11-13 11:17:46 -05:00
  • d5b1202c83
    change schema -> dataset_schema (#442) Xi Yan 2024-11-13 10:58:12 -05:00
  • c29fa56dde
    add inline:: prefix for localfs provider (#441) Xi Yan 2024-11-13 10:44:39 -05:00
  • 36b052ab10 slightly update README.md Ashwin Bharambe 2024-11-12 22:11:46 -08:00
  • 12947ac19e
    Kill "remote" providers and fix testing with a remote stack properly (#435) Ashwin Bharambe 2024-11-12 21:51:29 -08:00
  • 59a65e34d3
    Update new_api_provider.md Xi Yan 2024-11-13 00:02:13 -05:00
  • fdff24e77a
    Inference to use provider resource id to register and validate (#428) Dinesh Yeduguru 2024-11-12 20:02:00 -08:00
  • e51107e019 Fix compose.yaml Ashwin Bharambe 2024-11-12 15:43:30 -08:00