Commit graph

  • 8332ea23ad Add run win command for stack (#890) Vladislav Bronzov 2025-01-28 17:04:28 +01:00
  • 09299e908e Add windows support for build execution (#889) Vladislav Bronzov 2025-01-28 16:41:41 +01:00
  • d123e9d3d7 Update docs for RAG and improve CONTRIBUTING.md Ashwin Bharambe 2025-01-28 06:08:14 -08:00
  • 229f0d5f7c Agent response format (#660) Zhonglin Han 2025-01-28 05:05:38 -08:00
  • e4865c3510 adding readme to docs folder for easier discoverability of notebooks … (#857) Justin Lee 2025-01-28 04:58:46 -08:00
  • ba453c3487 Report generation minor fixes (#884) Sixian Yi 2025-01-28 04:58:12 -08:00
  • 5b0d778871 Update index.md (#888) Chris Khanoyan 2025-01-28 07:55:41 -05:00
  • aa65610e75 Sambanova - LlamaGuard (#886) snova-edwardm 2025-01-27 15:46:30 -08:00
  • 724978339d ask user to type if not in colab allow_more_providers_guide Kai Wu 2025-01-27 14:46:00 -08:00
  • 0e69d71eb9 add instructions and code to support more providers in guide Kai Wu 2025-01-27 13:58:54 -08:00
  • 3c1a2c3d66 Fix telemetry init (#885) Dinesh Yeduguru 2025-01-27 11:20:28 -08:00
  • e5936a8df8 Update discriminator to have the correct mapping (#881) Ashwin Bharambe 2025-01-27 09:18:13 -08:00
  • a6d20e0f53 Update documentation (#865) Ashwin Bharambe 2025-01-27 09:17:51 -08:00
  • 891bf704eb Ensure llama stack build --config <> --image-type <> works (#879) Ashwin Bharambe 2025-01-25 11:13:36 -08:00
  • 7de46e40f9 Fixed multiple typos (#878) Bakunga Bronson 2025-01-25 06:45:43 +08:00
  • 33113139e8 Fixed typo (#877) Bakunga Bronson 2025-01-25 05:16:00 +08:00
  • 632e60439a Fix report generation for url endpoints (#876) Hardik Shah 2025-01-24 13:15:44 -08:00
  • 087a83f673 Bump key for faiss Ashwin Bharambe 2025-01-24 12:08:36 -08:00
  • d111bad2f2 Update GH action so it correctly queries for test.pypi, etc. (#875) Ashwin Bharambe 2025-01-24 11:56:29 -08:00
  • 2cebb24d3a Update doc templates for running safety on self-hosted templates (#874) Hardik Shah 2025-01-24 11:28:20 -08:00
  • eaba6a550a Point to 0.1.0 release notes in docs Ashwin Bharambe 2025-01-24 10:00:09 -08:00
  • 05d73dd4fd Bump version to 0.1.0 Ashwin Bharambe 2025-01-24 09:50:07 -08:00
  • 19521cb22e More doc updates v0.1.0 Ashwin Bharambe 2025-01-24 09:22:15 -08:00
  • 2118f37350 Doc updates Ashwin Bharambe 2025-01-23 20:43:10 -08:00
  • 9351a4b2d7 Update documentation Ashwin Bharambe 2025-01-23 15:33:04 -08:00
  • 2fefe8dacd Update 'first RAG agent' in gettingstarted doc (#867) ehhuang 2025-01-23 17:02:04 -08:00
  • cb11336886 remove logger handler only in notebook (#868) Dinesh Yeduguru 2025-01-23 16:58:17 -08:00
  • ebffa15f40 update python sdk reference (#866) Dinesh Yeduguru 2025-01-23 16:04:06 -08:00
  • c570a708bf update the client reference (#864) Dinesh Yeduguru 2025-01-23 15:32:16 -08:00
  • a78f1fc70d make default tool prompt format none in agent config (#863) Dinesh Yeduguru 2025-01-23 14:44:59 -08:00
  • 94ffaf468c More updates to ReadTheDocs (#861) Hardik Shah 2025-01-23 12:50:38 -08:00
  • 7df40da5fa sync readme.md to index.md (#860) Dinesh Yeduguru 2025-01-23 12:43:09 -08:00
  • a6a4270eef Updates to ReadTheDocs (#859) Hardik Shah 2025-01-23 12:42:15 -08:00
  • d78027f3b5 Move runpod provider to the correct directory Ashwin Bharambe 2025-01-23 12:25:12 -08:00
  • 22dc684da6 Sambanova inference provider (#555) snova-edwardm 2025-01-23 12:20:28 -08:00
  • e2b5456e48 Add Runpod Provider + Distribution (#362) Marut Pandya 2025-01-23 12:19:02 -08:00
  • 86466b71a9 update docs for adding new API providers (#855) Dinesh Yeduguru 2025-01-23 12:05:57 -08:00
  • d0be9288a3 Llama_Stack_Building_AI_Applications.ipynb -> getting_started.ipynb (#854) Dinesh Yeduguru 2025-01-23 12:04:06 -08:00
  • a10cdc7cdb Update README.md Hardik Shah 2025-01-23 12:00:01 -08:00
  • 74e933cbfd More Updates to Read the Docs (#856) Hardik Shah 2025-01-23 11:39:33 -08:00
  • 8a686270e9 remove getting started notebook (#853) Dinesh Yeduguru 2025-01-23 10:09:09 -08:00
  • 25a70ca4dc Fixed distro documentation (#852) Hardik Shah 2025-01-23 08:19:51 -08:00
  • e44a1a68f1 Delete docs/to_situate directory (#851) raghotham 2025-01-23 07:15:47 -08:00
  • bfbd773b54 remove test report Sixian Yi 2025-01-23 01:06:39 -08:00
  • 82a28f3a24 update doc for client-sdk testing (#849) Sixian Yi 2025-01-23 00:17:16 -08:00
  • 3d14a3d46f Kill colons Ashwin Bharambe 2025-01-22 22:54:13 -08:00
  • 910717c1fd Add vLLM raw completions API (#823) Aidan Do 2025-01-23 17:58:27 +11:00
  • 4d7c8c797f Kill colons Ashwin Bharambe 2025-01-22 22:54:13 -08:00
  • 28012c51bb update docs for tools and telemetry (#846) Dinesh Yeduguru 2025-01-22 22:50:29 -08:00
  • 35c71d5bbe Update OpenAPI generator to output discriminator (#848) Ashwin Bharambe 2025-01-22 22:15:23 -08:00
  • 65f07c3d63 Update Documentation (#838) Hardik Shah 2025-01-22 20:38:52 -08:00
  • 6c205e1d5a Fix tool tests Ashwin Bharambe 2025-01-22 20:31:18 -08:00
  • 0bff6e1658 Move tool_runtime.memory -> tool_runtime.rag Ashwin Bharambe 2025-01-22 20:25:02 -08:00
  • f3d8864c36 Rename builtin::memory -> builtin::rag Ashwin Bharambe 2025-01-22 20:22:51 -08:00
  • 597869a2aa add distro report (#847) Sixian Yi 2025-01-22 19:20:49 -08:00
  • 23f1980f9c Fix meta-reference GPU implementation for inference Ashwin Bharambe 2025-01-22 18:31:59 -08:00
  • f4b0f2af8b If initialization fails for library client, error the test Ashwin Bharambe 2025-01-22 18:11:42 -08:00
  • 72a1b27d01 nitpick Ashwin Bharambe 2025-01-22 18:09:46 -08:00
  • 09d2e04c5c Address feedback: v2 tools-doc Dinesh Yeduguru 2025-01-22 17:38:56 -08:00
  • 19dee34ebd Address feedback Dinesh Yeduguru 2025-01-22 17:12:28 -08:00
  • a5be81dac5 update link in telemetry Dinesh Yeduguru 2025-01-22 17:04:04 -08:00
  • 151174a9db update link Dinesh Yeduguru 2025-01-22 16:56:30 -08:00
  • a8345f5f76 Fix llama stack build docker creation to have correct entrypoint Ashwin Bharambe 2025-01-22 16:53:54 -08:00
  • 73243f1348 update index.md Dinesh Yeduguru 2025-01-22 16:21:30 -08:00
  • 08dcb9e31e Accept "query_config" params for the RAG tool Ashwin Bharambe 2025-01-22 16:42:36 -08:00
  • cb27cbd4b5 update docs for tools and telemetry Dinesh Yeduguru 2025-01-22 16:17:50 -08:00
  • f4f47970e5 [client sdk test] add options for inference_model, safety_shield, embedding_model (#843) Sixian Yi 2025-01-22 15:35:19 -08:00
  • 4dd4f09fc5 Rename a test and add some comments Ashwin Bharambe 2025-01-22 15:27:29 -08:00
  • 494e969f8d add a bunch of NBVAL SKIPs to unblock ugh Ashwin Bharambe 2025-01-22 14:22:10 -08:00
  • deab4f57dd Improved report generation for providers (#844) Hardik Shah 2025-01-22 15:27:09 -08:00
  • 8738c3e5a7 fix experimental-post-training template (#842) Botao Chen 2025-01-22 15:04:05 -08:00
  • 82d942b501 Foo v0.1.0rc12 v0.1.0rc11 Ashwin Bharambe 2025-01-22 13:58:17 -08:00
  • 55d01339c2 Update notebook Ashwin Bharambe 2025-01-22 13:31:11 -08:00
  • 07b87365ab [inference api] modify content types so they follow a more standard structure (#841) Ashwin Bharambe 2025-01-22 12:16:18 -08:00
  • caa8387dd2 Fix fireworks client sdk chat completion with images (#840) Hardik Shah 2025-01-22 11:25:10 -08:00
  • a63a43c646 [memory refactor][6/n] Update naming and routes (#839) Ashwin Bharambe 2025-01-22 10:39:13 -08:00
  • c9e5578151 [memory refactor][5/n] Migrate all vector_io providers (#835) Ashwin Bharambe 2025-01-22 10:17:59 -08:00
  • 63f37f9b7c [memory refactor][4/n] Update the client-sdk test for RAG (#834) Ashwin Bharambe 2025-01-22 10:15:19 -08:00
  • 1a7490470a [memory refactor][3/n] Introduce RAGToolRuntime as a specialized sub-protocol (#832) Ashwin Bharambe 2025-01-22 10:04:16 -08:00
  • 78a481bb22 [memory refactor][2/n] Update faiss and make it pass tests (#830) Ashwin Bharambe 2025-01-22 10:02:15 -08:00
  • 3ae8585b65 [memory refactor][1/n] Rename Memory -> VectorIO, MemoryBanks -> VectorDBs (#828) Ashwin Bharambe 2025-01-22 09:59:30 -08:00
  • 35a00d004a bug fix for distro report generation (#836) Sixian Yi 2025-01-21 21:44:06 -08:00
  • edf56884a7 add pytest option to generate a functional report for distribution (#833) Sixian Yi 2025-01-21 21:18:23 -08:00
  • e41873f268 [ez] structured output for /completion ollama & enable tests (#822) Sixian Yi 2025-01-21 21:10:24 -08:00
  • 7a4b382ae9 add section for mcp tool usage in notebook (#831) Dinesh Yeduguru 2025-01-21 13:10:42 -08:00
  • 75a2694daa Refactor the API enum to an independent file into llama_stack/apis/ Ashwin Bharambe 2025-01-19 12:22:40 -08:00
  • 74f6af8bbe [CICD] add simple test step for docker build workflow, fix prefix bug (#821) Xi Yan 2025-01-18 15:16:05 -08:00
  • 55067fa81d test report for v0.1 (#814) Sixian Yi 2025-01-18 07:50:45 -08:00
  • 5379eca9fd Fix incorrect image type in publish-to-docker workflow (#819) Yuan Tang 2025-01-18 00:33:03 -05:00
  • 5a63d0ff1d Fix incorrect RunConfigSettings due to the removal of conda_env (#801) Yuan Tang 2025-01-18 00:30:57 -05:00
  • 3a9468ce9b fix again vllm for non base64 (#818) Xi Yan 2025-01-17 18:33:40 -08:00
  • 3e7496e835 fix vllm base64 image inference (#815) Xi Yan 2025-01-17 17:07:28 -08:00
  • 3d4c53dfec add mcp runtime as default to all providers (#816) Dinesh Yeduguru 2025-01-17 16:40:58 -08:00
  • 6da3053c0e More generic image type for OCI-compliant container technologies (#802) Yuan Tang 2025-01-17 19:37:42 -05:00
  • 9d005154d7 fix vllm template (#813) Xi Yan 2025-01-17 15:34:29 -08:00
  • eb60f04f86 optional api dependencies (#793) Ashwin Bharambe 2025-01-17 15:26:53 -08:00
  • 1f60c0286d cannot import name 'GreedySamplingStrategy' (#806) Aidan Do 2025-01-18 09:34:29 +11:00
  • e1decaec9d Fixing small typo in quick start guide (#807) Paul McCarthy 2025-01-17 19:15:55 +00:00
  • 53b5f6b24a add json_schema_type to ParamType deps (#808) Dinesh Yeduguru 2025-01-17 11:02:25 -08:00
  • c2a072911d fix eval notebook & add test to workflow (#803) Xi Yan 2025-01-16 23:11:21 -08:00