Commit graph

  • e5936a8df8
    Update discriminator to have the correct mapping (#881) Ashwin Bharambe 2025-01-27 09:18:13 -08:00
  • a6d20e0f53
    Update documentation (#865) Ashwin Bharambe 2025-01-27 09:17:51 -08:00
  • 891bf704eb
    Ensure llama stack build --config <> --image-type <> works (#879) Ashwin Bharambe 2025-01-25 11:13:36 -08:00
  • 7de46e40f9
    Fixed multiple typos (#878) Bakunga Bronson 2025-01-25 06:45:43 +08:00
  • 33113139e8
    Fixed typo (#877) Bakunga Bronson 2025-01-25 05:16:00 +08:00
  • 632e60439a
    Fix report generation for url endpoints (#876) Hardik Shah 2025-01-24 13:15:44 -08:00
  • 087a83f673 Bump key for faiss Ashwin Bharambe 2025-01-24 12:08:36 -08:00
  • d111bad2f2
    Update GH action so it correctly queries for test.pypi, etc. (#875) Ashwin Bharambe 2025-01-24 11:56:29 -08:00
  • 2cebb24d3a
    Update doc templates for running safety on self-hosted templates (#874) Hardik Shah 2025-01-24 11:28:20 -08:00
  • eaba6a550a Point to 0.1.0 release notes in docs Ashwin Bharambe 2025-01-24 10:00:09 -08:00
  • 05d73dd4fd Bump version to 0.1.0 Ashwin Bharambe 2025-01-24 09:50:07 -08:00
  • 19521cb22e More doc updates v0.1.0 Ashwin Bharambe 2025-01-24 09:22:15 -08:00
  • 2118f37350 Doc updates Ashwin Bharambe 2025-01-23 20:43:10 -08:00
  • 9351a4b2d7 Update documentation Ashwin Bharambe 2025-01-23 15:33:04 -08:00
  • 2fefe8dacd
    Update 'first RAG agent' in gettingstarted doc (#867) ehhuang 2025-01-23 17:02:04 -08:00
  • cb11336886
    remove logger handler only in notebook (#868) Dinesh Yeduguru 2025-01-23 16:58:17 -08:00
  • ebffa15f40
    update python sdk reference (#866) Dinesh Yeduguru 2025-01-23 16:04:06 -08:00
  • c570a708bf
    update the client reference (#864) Dinesh Yeduguru 2025-01-23 15:32:16 -08:00
  • a78f1fc70d
    make default tool prompt format none in agent config (#863) Dinesh Yeduguru 2025-01-23 14:44:59 -08:00
  • 94ffaf468c
    More updates to ReadTheDocs (#861) Hardik Shah 2025-01-23 12:50:38 -08:00
  • 7df40da5fa
    sync readme.md to index.md (#860) Dinesh Yeduguru 2025-01-23 12:43:09 -08:00
  • a6a4270eef
    Updates to ReadTheDocs (#859) Hardik Shah 2025-01-23 12:42:15 -08:00
  • d78027f3b5 Move runpod provider to the correct directory Ashwin Bharambe 2025-01-23 12:25:12 -08:00
  • 22dc684da6
    Sambanova inference provider (#555) snova-edwardm 2025-01-23 12:20:28 -08:00
  • e2b5456e48
    Add Runpod Provider + Distribution (#362) Marut Pandya 2025-01-23 12:19:02 -08:00
  • 86466b71a9
    update docs for adding new API providers (#855) Dinesh Yeduguru 2025-01-23 12:05:57 -08:00
  • d0be9288a3
    Llama_Stack_Building_AI_Applications.ipynb -> getting_started.ipynb (#854) Dinesh Yeduguru 2025-01-23 12:04:06 -08:00
  • a10cdc7cdb
    Update README.md Hardik Shah 2025-01-23 12:00:01 -08:00
  • 74e933cbfd
    More Updates to Read the Docs (#856) Hardik Shah 2025-01-23 11:39:33 -08:00
  • 8a686270e9
    remove getting started notebook (#853) Dinesh Yeduguru 2025-01-23 10:09:09 -08:00
  • 25a70ca4dc
    Fixed distro documentation (#852) Hardik Shah 2025-01-23 08:19:51 -08:00
  • e44a1a68f1
    Delete docs/to_situate directory (#851) raghotham 2025-01-23 07:15:47 -08:00
  • bfbd773b54 remove test report Sixian Yi 2025-01-23 01:06:39 -08:00
  • 82a28f3a24
    update doc for client-sdk testing (#849) Sixian Yi 2025-01-23 00:17:16 -08:00
  • 3d14a3d46f Kill colons Ashwin Bharambe 2025-01-22 22:54:13 -08:00
  • 910717c1fd
    Add vLLM raw completions API (#823) Aidan Do 2025-01-23 17:58:27 +11:00
  • 4d7c8c797f Kill colons Ashwin Bharambe 2025-01-22 22:54:13 -08:00
  • 28012c51bb
    update docs for tools and telemetry (#846) Dinesh Yeduguru 2025-01-22 22:50:29 -08:00
  • 35c71d5bbe
    Update OpenAPI generator to output discriminator (#848) Ashwin Bharambe 2025-01-22 22:15:23 -08:00
  • 65f07c3d63
    Update Documentation (#838) Hardik Shah 2025-01-22 20:38:52 -08:00
  • 6c205e1d5a Fix tool tests Ashwin Bharambe 2025-01-22 20:31:18 -08:00
  • 0bff6e1658 Move tool_runtime.memory -> tool_runtime.rag Ashwin Bharambe 2025-01-22 20:25:02 -08:00
  • f3d8864c36 Rename builtin::memory -> builtin::rag Ashwin Bharambe 2025-01-22 20:22:51 -08:00
  • 597869a2aa
    add distro report (#847) Sixian Yi 2025-01-22 19:20:49 -08:00
  • 23f1980f9c Fix meta-reference GPU implementation for inference Ashwin Bharambe 2025-01-22 18:31:59 -08:00
  • f4b0f2af8b If initialization fails for library client, error the test Ashwin Bharambe 2025-01-22 18:11:42 -08:00
  • 72a1b27d01 nitpick Ashwin Bharambe 2025-01-22 18:09:46 -08:00
  • a8345f5f76 Fix llama stack build docker creation to have correct entrypoint Ashwin Bharambe 2025-01-22 16:53:54 -08:00
  • 08dcb9e31e Accept "query_config" params for the RAG tool Ashwin Bharambe 2025-01-22 16:42:36 -08:00
  • f4f47970e5
    [client sdk test] add options for inference_model, safety_shield, embedding_model (#843) Sixian Yi 2025-01-22 15:35:19 -08:00
  • 4dd4f09fc5 Rename a test and add some comments Ashwin Bharambe 2025-01-22 15:27:29 -08:00
  • 494e969f8d add a bunch of NBVAL SKIPs to unblock ugh Ashwin Bharambe 2025-01-22 14:22:10 -08:00
  • deab4f57dd
    Improved report generation for providers (#844) Hardik Shah 2025-01-22 15:27:09 -08:00
  • 8738c3e5a7
    fix experimental-post-training template (#842) Botao Chen 2025-01-22 15:04:05 -08:00
  • 82d942b501 Foo (tags: v0.1.0rc12, v0.1.0rc11) Ashwin Bharambe 2025-01-22 13:58:17 -08:00
  • 55d01339c2 Update notebook Ashwin Bharambe 2025-01-22 13:31:11 -08:00
  • 07b87365ab
    [inference api] modify content types so they follow a more standard structure (#841) Ashwin Bharambe 2025-01-22 12:16:18 -08:00
  • caa8387dd2
    Fix fireworks client sdk chat completion with images (#840) Hardik Shah 2025-01-22 11:25:10 -08:00
  • a63a43c646
    [memory refactor][6/n] Update naming and routes (#839) Ashwin Bharambe 2025-01-22 10:39:13 -08:00
  • c9e5578151
    [memory refactor][5/n] Migrate all vector_io providers (#835) Ashwin Bharambe 2025-01-22 10:17:59 -08:00
  • 63f37f9b7c
    [memory refactor][4/n] Update the client-sdk test for RAG (#834) Ashwin Bharambe 2025-01-22 10:15:19 -08:00
  • 1a7490470a
    [memory refactor][3/n] Introduce RAGToolRuntime as a specialized sub-protocol (#832) Ashwin Bharambe 2025-01-22 10:04:16 -08:00
  • 78a481bb22
    [memory refactor][2/n] Update faiss and make it pass tests (#830) Ashwin Bharambe 2025-01-22 10:02:15 -08:00
  • 3ae8585b65
    [memory refactor][1/n] Rename Memory -> VectorIO, MemoryBanks -> VectorDBs (#828) Ashwin Bharambe 2025-01-22 09:59:30 -08:00
  • 35a00d004a
    bug fix for distro report generation (#836) Sixian Yi 2025-01-21 21:44:06 -08:00
  • edf56884a7
    add pytest option to generate a functional report for distribution (#833) Sixian Yi 2025-01-21 21:18:23 -08:00
  • e41873f268
    [ez] structured output for /completion ollama & enable tests (#822) Sixian Yi 2025-01-21 21:10:24 -08:00
  • 7a4b382ae9
    add section for mcp tool usage in notebook (#831) Dinesh Yeduguru 2025-01-21 13:10:42 -08:00
  • 75a2694daa Refactor the API enum to an independent file into llama_stack/apis/ Ashwin Bharambe 2025-01-19 12:22:40 -08:00
  • 74f6af8bbe
    [CICD] add simple test step for docker build workflow, fix prefix bug (#821) Xi Yan 2025-01-18 15:16:05 -08:00
  • 55067fa81d
    test report for v0.1 (#814) Sixian Yi 2025-01-18 07:50:45 -08:00
  • 5379eca9fd
    Fix incorrect image type in publish-to-docker workflow (#819) Yuan Tang 2025-01-18 00:33:03 -05:00
  • 5a63d0ff1d
    Fix incorrect RunConfigSettings due to the removal of conda_env (#801) Yuan Tang 2025-01-18 00:30:57 -05:00
  • 3a9468ce9b
    fix again vllm for non base64 (#818) Xi Yan 2025-01-17 18:33:40 -08:00
  • 3e7496e835
    fix vllm base64 image inference (#815) Xi Yan 2025-01-17 17:07:28 -08:00
  • 3d4c53dfec
    add mcp runtime as default to all providers (#816) Dinesh Yeduguru 2025-01-17 16:40:58 -08:00
  • 6da3053c0e
    More generic image type for OCI-compliant container technologies (#802) Yuan Tang 2025-01-17 19:37:42 -05:00
  • 9d005154d7
    fix vllm template (#813) Xi Yan 2025-01-17 15:34:29 -08:00
  • eb60f04f86
    optional api dependencies (#793) Ashwin Bharambe 2025-01-17 15:26:53 -08:00
  • 1f60c0286d
    cannot import name 'GreedySamplingStrategy' (#806) Aidan Do 2025-01-18 09:34:29 +11:00
  • e1decaec9d
    Fixing small typo in quick start guide (#807) Paul McCarthy 2025-01-17 19:15:55 +00:00
  • 53b5f6b24a
    add json_schema_type to ParamType deps (#808) Dinesh Yeduguru 2025-01-17 11:02:25 -08:00
  • c2a072911d
    fix eval notebook & add test to workflow (#803) Xi Yan 2025-01-16 23:11:21 -08:00
  • 9d574f4aee
    fix playground for v1 (#799) Xi Yan 2025-01-16 19:32:07 -08:00
  • b2ac29b9da
    fix provider model list test (#800) Hardik Shah 2025-01-16 19:27:29 -08:00
  • 9f14382d82
    meta reference inference fixes (#797) Ashwin Bharambe 2025-01-16 18:17:46 -08:00
  • cb41848a2a disable version check optionally Ashwin Bharambe 2025-01-16 18:14:26 -08:00
  • 38009631bc
    Remove llama-guard in Cerebras template & improve agent test (#798) Xi Yan 2025-01-16 18:11:35 -08:00
  • 0fefd4390a
    Fix tgi adapter (#796) Xi Yan 2025-01-16 17:44:12 -08:00
  • 73215460ba
    add default toolgroups to all providers (#795) Dinesh Yeduguru 2025-01-16 16:54:59 -08:00
  • e88faa91e2
    fix the code execution test in sdk tests (#794) Dinesh Yeduguru 2025-01-16 16:42:25 -08:00
  • 35bf6ea75a
    Pin torchtune pkg version (#791) Botao Chen 2025-01-16 16:31:13 -08:00
  • d1f3b032c9
    cerebras template update for memory (#792) Xi Yan 2025-01-16 16:07:53 -08:00
  • 48b12b9777
    [Test automation] generate custom test report (#739) Sixian Yi 2025-01-16 15:33:50 -08:00
  • 03ac84a829 Update default port from 5000 -> 8321 Ashwin Bharambe 2025-01-16 15:26:48 -08:00
  • f1faa9c924 pop fix Hardik Shah 2025-01-16 14:09:59 -08:00
  • fcd1a57429 update notebook Dinesh Yeduguru 2025-01-16 14:00:48 -08:00
  • a6b9f2cec7
    fix cerebras template (#790) Xi Yan 2025-01-16 13:53:06 -08:00
  • 12c994b5b2
    REST API fixes (#789) Dinesh Yeduguru 2025-01-16 13:47:08 -08:00
  • cee3816609
    Make llama stack build not create a new conda by default (#788) Ashwin Bharambe 2025-01-16 13:44:53 -08:00