Commit graph

  • 5d54c2ee70 Use Llama Stack template when streaming Fred Reiss 2025-02-04 13:09:49 -08:00
  • ade413f1e3 Update logging and route Meta Llama requests differently Fred Reiss 2025-01-31 14:12:35 -08:00
  • 24cc7a777c Change line width and remove __del__ Fred Reiss 2025-01-29 18:42:18 -08:00
  • 74c8504f50 Implement unregister_model() and shutdown() Fred Reiss 2025-01-29 18:07:36 -08:00
  • 4302200396 Clean up logging Fred Reiss 2025-01-25 19:00:52 -08:00
  • 59211067d1 Remove unneeded config parameter Fred Reiss 2025-01-25 19:00:08 -08:00
  • 29ae2552fd Use shared code where possible Fred Reiss 2025-01-24 22:30:53 -08:00
  • 25c780802f Update for latest APIs Fred Reiss 2025-01-24 16:55:56 -08:00
  • fcb87faa36 Add completion API Fred Reiss 2025-01-24 10:56:47 -08:00
  • 80c357f434 Import updated vLLM code and get tests working Fred Reiss 2025-01-22 17:15:09 -08:00
  • 441e741ac9 chore: escape tool output for logging Eric Huang 2025-03-07 13:28:06 -08:00
  • b4a22ed1b4 fix scoring Xi Yan 2025-03-07 13:12:10 -08:00
  • e6ce9bbb83
    build(deps): bump actions/upload-artifact from 3 to 4 dependabot[bot] 2025-03-07 20:55:37 +00:00
  • e0bacf917c
    build(deps): bump thollander/actions-comment-pull-request from 2 to 3 dependabot[bot] 2025-03-07 20:55:35 +00:00
  • 3a6febdc0f fix params Xi Yan 2025-03-07 12:43:00 -08:00
  • da373471e4 fix params Xi Yan 2025-03-07 12:39:44 -08:00
  • 2de1b92477 build: add 'tiktoken' to deps Eric Huang 2025-03-07 12:27:49 -08:00
  • ac7e2caa1f build: include .md Eric Huang 2025-03-07 12:07:33 -08:00
  • 823afa7413 feat: add support for LLAMA_STACK_LOG_FILE Charlie Doern 2025-03-06 13:22:42 -05:00
  • d27d959a18 fix(cli): llama model prompt-format Eric Huang 2025-03-07 11:43:59 -08:00
  • 2cbaa1c75a recolorize a bit Ashwin Bharambe 2025-03-07 11:33:38 -08:00
  • e9642379e7 feat(agent): plain function as client tool Eric Huang 2025-03-07 10:46:59 -08:00
  • 11fffe7b95 feat(logging): implement category-based logging Sébastien Han 2025-03-03 13:59:48 +01:00
  • efe1772727 Revert "feat: add a configurable category-based logger (#1352)" Sébastien Han 2025-03-03 11:58:40 +01:00
  • 8da515fe55 refactor: display defaults in help text Charlie Doern 2025-03-07 13:48:57 -05:00
  • 459793578e remove multi-turn inputs Xi Yan 2025-03-07 10:38:36 -08:00
  • 99ebb445ba add cursor rules directory Ashwin Bharambe 2025-03-07 10:08:49 -08:00
  • 5b2f5affb3 Revert "feat: record token usage for inference API (#1300)" Dinesh Yeduguru 2025-03-07 10:08:55 -08:00
  • b34170387e test: first unit test for resolver Ashwin Bharambe 2025-03-07 10:03:14 -08:00
  • 6dd8b7b4c6
    build: bump llama-stack-client version Sébastien Han 2025-03-07 17:25:20 +01:00
  • 6f02bbc952 chore: add pytest-report.xml to gitignore Ihar Hrachyshka 2025-03-07 11:13:56 -05:00
  • 7893ae121a
    raise exception instead Yuan Tang 2025-03-07 11:09:36 -05:00
  • 1eeba2cc8a Changes after merge. ilya-kolchinsky 2025-03-07 16:29:04 +01:00
  • 6b9f673fdb Merge branch 'refs/heads/main' into preprocessors ilya-kolchinsky 2025-03-07 16:20:30 +01:00
  • 3f15349c9d Updated the configuration templates to include the builtin preprocessors. ilya-kolchinsky 2025-03-07 16:08:14 +01:00
  • e895bb111c Added lazy initialization to the docling provider. ilya-kolchinsky 2025-03-07 15:38:34 +01:00
  • 785a281dbd chore: remove the incorrect output reidliu 2025-03-07 21:06:40 +08:00
  • e2cc93c017 Add Clarifai as Inference Provider Srikanth Bachala 2025-03-07 17:01:32 +05:30
  • 2a24eb7f53 readme sanjaychelliah 2024-10-14 14:21:58 +05:30
  • 61f5f6d252 add clarifai inference provider sanjaychelliah 2024-10-09 03:35:31 +05:30
  • a98358dc63
    ci: enable Dependabot for GitHub Actions Sébastien Han 2025-03-07 09:57:20 +01:00
  • aa6ca1ed5f
    test: add inspect unit test Sébastien Han 2025-03-05 16:58:57 +01:00
  • 5cab79646d pre-commit Ashwin Bharambe 2025-03-06 20:55:42 -08:00
  • 2313b1bab5 fix config to use the correct sigil for env var replacement Ashwin Bharambe 2025-03-06 20:49:34 -08:00
  • 2f45b9f2f7 fix: rebase and solve conflict Cheney Zhang 2025-03-05 17:30:01 +08:00
  • fa67d79bfe fix: fixed Milvus integration code Cheney Zhang 2025-02-25 20:30:54 +08:00
  • 5d0d4c3467 feat: add Milvus vectorDB Cheney Zhang 2025-02-20 15:02:05 +08:00
  • 567a086ce6 refine Botao Chen 2025-03-06 19:24:02 -08:00
  • ddaf929f76 refine Botao Chen 2025-03-06 19:21:33 -08:00
  • 9a6385833c
    ci: Add script to generate changelog Yuan Tang 2025-03-06 21:14:51 -05:00
  • 78076e04e1 fix Xi Yan 2025-03-06 18:05:19 -08:00
  • 2949e6a9b4 fix Xi Yan 2025-03-06 18:01:14 -08:00
  • 6a4ce68223
    fix(security): Bump jinja2 to >=3.1.6 Yuan Tang 2025-03-06 20:50:56 -05:00
  • aeed4232cf
    chore: Delete unused .gitmodules Yuan Tang 2025-03-06 20:38:34 -05:00
  • 01ef34d40b fix: Swap to AsyncOpenAI client in remote vllm provider Ben Browning 2025-03-06 18:27:36 -05:00
  • ebc8258038 Merge remote-tracking branch 'origin/main' into benchmark_eval Botao Chen 2025-03-06 15:59:03 -08:00
  • 81ff6ecff4 rag eval notebook Xi Yan 2025-03-06 15:50:43 -08:00
  • 2dc2101b73 eval concept Xi Yan 2025-03-06 15:43:24 -08:00
  • 39f0636c6c eval concept Xi Yan 2025-03-06 15:41:13 -08:00
  • 9e2aeae3ea refine Botao Chen 2025-03-06 15:25:37 -08:00
  • 198158a7fb docs: update test_agents to use new Agent SDK API Eric Huang 2025-03-06 15:12:52 -08:00
  • d342a53ae0 use library client throughout Xi Yan 2025-03-06 12:58:30 -08:00
  • b464575a1e more fix Xi Yan 2025-03-06 12:54:03 -08:00
  • 000569b003 benchmark Xi Yan 2025-03-06 12:43:25 -08:00
  • 275fdbc23f Fixed a few issues in the docling provider. ilya-kolchinsky 2025-03-06 20:51:37 +01:00
  • 47fea967a7 update doc Xi Yan 2025-03-06 11:48:40 -08:00
  • e53bdc929a test: use json only Eric Huang 2025-03-06 11:43:26 -08:00
  • 0db524cc26 add test cases for customizer Ubuntu 2025-03-06 19:34:02 +00:00
  • da2971005a chore: log exception Eric Huang 2025-03-06 11:25:00 -08:00
  • 103a3b1a4f add nvidia distribution Ubuntu 2025-03-06 18:26:53 +00:00
  • b5c6a80b2e linting fix for templates Ubuntu 2025-03-06 16:42:57 +00:00
  • a799d96a2c add latest code Ubuntu 2025-03-06 16:33:51 +00:00
  • f10a412898 Fixed multiple bugs. ilya-kolchinsky 2025-03-06 16:46:59 +01:00
  • fcb52fa3a4 fix: Import chardet and pypdf only when actually needed Ihar Hrachyshka 2025-03-06 10:25:24 -05:00
  • 5540c1a956 chore: update the config file name reidliu 2025-03-06 21:41:59 +08:00
  • 6cbc298edb Added the preprocessing chain parameter to the RAG tool insert API. ilya-kolchinsky 2025-03-06 14:22:19 +01:00
  • 4c81a72214 Added output type to PreprocessorResponse. ilya-kolchinsky 2025-03-06 14:05:05 +01:00
  • 5524210a2d docs: add information on how to set log level before running Charlie Doern 2025-03-05 16:41:01 -05:00
  • 3fc37b7be3
    fix: resolve pydantic warning on .dict() usage Sébastien Han 2025-03-06 11:41:03 +01:00
  • e622799f38
    fix: solve ruff B008 warnings Sébastien Han 2025-03-06 11:28:13 +01:00
  • 4599ee68cd
    fix: remove ruff N999 Sébastien Han 2025-03-05 09:56:50 +01:00
  • cde74939a9 ci: add Github workflow which runs unittests in PR Ashwin Bharambe 2025-03-05 18:17:23 -08:00
  • 72dee96300 merge Xi Yan 2025-03-05 17:40:32 -08:00
  • 9066b2ac12 fix eval Xi Yan 2025-03-05 17:37:19 -08:00
  • 20fc6d4267
    docs: Add CHANGELOG.md Yuan Tang 2025-03-05 20:36:52 -05:00
  • 62a844c614 fix eval Xi Yan 2025-03-05 17:36:37 -08:00
  • 2541dcc162 fix eval Xi Yan 2025-03-05 17:36:02 -08:00
  • 6e65b9282d work eval Xi Yan 2025-03-05 17:12:47 -08:00
  • fd68b0dc9a tmp eval Xi Yan 2025-03-05 16:41:37 -08:00
  • 54abeeebce default text model Xi Yan 2025-03-05 16:24:43 -08:00
  • 5d43b9157e fix scoring Xi Yan 2025-03-05 16:20:11 -08:00
  • 546a417b09 fix scoring Xi Yan 2025-03-05 16:05:39 -08:00
  • f1e4588b0a fix report to at least not barf Ashwin Bharambe 2025-03-05 15:55:25 -08:00
  • f2464050c7 add registeration test Xi Yan 2025-03-05 15:47:20 -08:00
  • 4f82d361a8 update README.md Ashwin Bharambe 2025-03-05 15:22:53 -08:00
  • c19350f4ed support multiple model ids for testing Ashwin Bharambe 2025-03-05 11:21:53 -08:00
  • 113b17679d kill safety conftest Ashwin Bharambe 2025-03-04 16:54:29 -08:00
  • 8d49a10c8e remove some code from report.py which has been disabled for now Ashwin Bharambe 2025-03-04 16:53:41 -08:00
  • cd9d278d12 refactor(test): introduce --stack-config and simplify options Ashwin Bharambe 2025-03-04 16:35:53 -08:00
  • 5b0ec561dc fix: don't import from llama_models Ihar Hrachyshka 2025-03-05 18:23:19 -05:00