Commit graph

  • 86820d13cd Approach #2 Aidan Do 2024-12-16 11:47:45 +11:00
  • cb8a28c128
    Doc: Ollama command references non-existent file (#632) Aidan Do 2024-12-16 01:52:28 +11:00
  • a5d5850c37 Ollama command references non-existent file Aidan Do 2024-12-15 22:07:20 +11:00
  • d0854a48b2 . Aidan Do 2024-12-15 17:53:22 +11:00
  • d9db9a01bf Formatting Aidan Do 2024-12-15 13:48:55 +11:00
  • 7076e661b5 Remove print statements in unit tests Aidan Do 2024-12-15 12:05:52 +11:00
  • 3faa79d76c feat: Add tags field for models with dynamic and user-defined population Habeb Nawatha 2024-12-14 21:40:18 +02:00
  • cf87262e9c Add tool calls to groq inference adapter Aidan Do 2024-12-14 22:20:54 +11:00
  • 78912e663b
    Update llama_stack/providers/tests/inference/groq/test_groq_utils.py Aidan Do 2024-12-14 20:48:38 +11:00
  • 45c6453a8b Add Redis KVStore integration test Vishwanath Martur 2024-12-14 15:15:38 +05:30
  • 09ab5c8eab
    Update llama_stack/providers/remote/inference/groq/groq_utils.py Aidan Do 2024-12-14 20:44:30 +11:00
  • 687bc52b4b
    Update llama_stack/providers/tests/inference/groq/test_groq_utils.py Aidan Do 2024-12-14 20:43:35 +11:00
  • ee53eb8691
    Update llama_stack/providers/remote/inference/groq/groq_utils.py Aidan Do 2024-12-14 16:49:30 +11:00
  • b78b9ed4a1
    Update llama_stack/providers/remote/inference/groq/groq.py Aidan Do 2024-12-14 16:36:16 +11:00
  • aa9a9f18be
    Move model_id above so warning actually works Aidan Do 2024-12-14 16:35:24 +11:00
  • 815f4af6cf
    add colab notebook & update docs (#619) Xi Yan 2024-12-13 19:15:15 -08:00
  • 74bed24a68 fix typo Justin Lee 2024-12-14 08:55:38 +08:00
  • 3587f0894d Removing formatting changes Aidan Do 2024-12-14 11:41:25 +11:00
  • ab51a508b6 made changes to readme and pinning to v0.0.61 Justin Lee 2024-12-14 08:40:46 +08:00
  • 20383bfea5
    [3/n][torchtune integration] add validation logic (#600) Botao Chen 2024-12-13 16:35:06 -08:00
  • 378150e23c Add Groq provider - chat completions Aidan Do 2024-12-12 21:15:09 +11:00
  • 1bfef60590 refine Botao Chen 2024-12-13 16:12:56 -08:00
  • 018dce89ca Merge branch 'main' into post_training_v4 Botao Chen 2024-12-13 16:06:25 -08:00
  • a4a2e3ce32 colab tag Xi Yan 2024-12-13 15:34:27 -08:00
  • 25fa24a16e comments Xi Yan 2024-12-13 15:32:11 -08:00
  • c294a01c4b
    [2/n][torchtune integration] implement job management and return training artifacts (#593) Botao Chen 2024-12-13 15:00:04 -08:00
  • d0a72cc288 fix misc Botao Chen 2024-12-13 14:55:01 -08:00
  • f161440524 add notebooks Xi Yan 2024-12-13 14:48:33 -08:00
  • 5764a95912
    Add missing environments field for vLLM provider (#623) Yuan Tang 2024-12-13 17:06:27 -05:00
  • cd3cb8a1b7
    Add missing environments field for vLLM provider Yuan Tang 2024-12-13 16:59:59 -05:00
  • 932c09b35c restructure Xi Yan 2024-12-13 13:54:31 -08:00
  • d55a8343ea merge Botao Chen 2024-12-13 12:55:21 -08:00
  • 516e1a3e59
    add embedding model by default to distribution templates (#617) Dinesh Yeduguru 2024-12-13 12:48:00 -08:00
  • d73575dd9d add info to upgrade run.yaml when the default embedding model is not found Dinesh Yeduguru 2024-12-13 12:47:07 -08:00
  • ee5066806e return type fix Dinesh Yeduguru 2024-12-13 12:10:39 -08:00
  • e2a0dce8ad Merge branch 'main' into post_training_v3 Botao Chen 2024-12-13 12:09:01 -08:00
  • e893b22868 export LibraryClient Ashwin Bharambe 2024-12-13 12:07:42 -08:00
  • 6de92a6c33
    Reformat distributions table (#608) Yuan Tang 2024-12-13 14:45:17 -05:00
  • 4800247b5c minor Ashwin Bharambe 2024-12-13 11:44:08 -08:00
  • aeb76390fc
    [1/n] torchtune <> llama-stack integration skeleton (#540) Botao Chen 2024-12-13 11:05:35 -08:00
  • 29d0896ec8 address commit Botao Chen 2024-12-13 10:38:53 -08:00
  • 40d70864e7 do not mention sentence transformer provider in docs Dinesh Yeduguru 2024-12-13 09:32:42 -08:00
  • de44af1501 temp commit Botao Chen 2024-12-12 21:44:03 -08:00
  • b7b1670aba merge cookbooks w/ guides Xi Yan 2024-12-12 21:07:05 -08:00
  • d2e607a92d merge cookbooks w/ guides Xi Yan 2024-12-12 20:56:47 -08:00
  • 8aba8c1f08 delete readme Xi Yan 2024-12-12 18:00:58 -08:00
  • f09b8c607f add to docs Xi Yan 2024-12-12 17:57:12 -08:00
  • 73b6dd7af1 add link to colab Xi Yan 2024-12-12 17:54:41 -08:00
  • d145bb629d notebooks Xi Yan 2024-12-12 17:49:32 -08:00
  • c0f8452310
    Add PyPI URL Yuan Tang 2024-12-12 20:17:49 -05:00
  • 8efe33646d temp_commit Botao Chen 2024-12-12 17:15:05 -08:00
  • c24456a41a
    Add automatic PyPI release GitHub workflow Yuan Tang 2024-12-12 20:14:01 -05:00
  • fe2eb39da7 fix provider ids for together and fireworks Dinesh Yeduguru 2024-12-12 15:43:30 -08:00
  • 317e80dc2c refine Botao Chen 2024-12-12 15:41:39 -08:00
  • 8132b4e177 refine Botao Chen 2024-12-12 15:33:59 -08:00
  • 2f88006bd0 add embedding model by default Dinesh Yeduguru 2024-12-12 14:46:51 -08:00
  • 0f78a5fb2d misc Botao Chen 2024-12-12 14:17:26 -08:00
  • aad0dedc85 address comments Botao Chen 2024-12-12 14:13:17 -08:00
  • 3378c100f6 address comments Botao Chen 2024-12-12 14:05:40 -08:00
  • 53b3a1e345
    Update kotlin docs to 0.0.58 (#614) Riandy 2024-12-13 05:09:13 +08:00
  • 2f266c361b Update kotlin docs to 0.0.58 Riandy Riandy 2024-12-13 04:59:25 +08:00
  • 2a9b13dd52
    add test for completion logprobs (#532) Matthew Farrellee 2024-12-12 15:19:48 -05:00
  • 96e158eaac
    Make embedding generation go through inference (#606) Dinesh Yeduguru 2024-12-12 11:47:50 -08:00
  • ee3f0c6b55 fix failing memory tests Dinesh Yeduguru 2024-12-12 11:43:06 -08:00
  • c38d377eb7 precommit fixes Dinesh Yeduguru 2024-12-12 11:30:44 -08:00
  • d362d2d740 implement embedding generation in supported inference providers (#589) Dinesh Yeduguru 2024-12-12 11:17:39 -08:00
  • 6a23f24ee0 Revert "Revert "add model type to APIs" (#605)" Dinesh Yeduguru 2024-12-11 10:18:00 -08:00
  • 4f8b73b9e1
    Vector store inference api (#598) Dinesh Yeduguru 2024-12-12 11:16:54 -08:00
  • db7b26a8c9 remove unused check_model Dinesh Yeduguru 2024-12-12 11:15:38 -08:00
  • a14785af46
    [docs] add playground ui docs (#592) Xi Yan 2024-12-12 10:40:38 -08:00
  • 75066663f1 address comments Xi Yan 2024-12-12 10:38:29 -08:00
  • 8b45d147df
    [/datasetio] drop columns not specified by dataset schema for huggingface provider (#611) Xi Yan 2024-12-12 10:23:09 -08:00
  • 278e14ed85 typo Xi Yan 2024-12-12 09:35:42 -08:00
  • fced5ec6dd
    Merge branch 'meta-llama:main' into main Shrinit Goyal 2024-12-12 12:42:45 +05:30
  • f3073d9fb1 README cleanup Jeff Tang 2024-12-11 18:18:07 -08:00
  • cc75a8ce1b added missing file Jeff Tang 2024-12-11 18:15:45 -08:00
  • a7d29952b0 draft bug fix Jeff Tang 2024-12-11 18:14:44 -08:00
  • 61e837380c rename folder; code readme update Jeff Tang 2024-12-11 18:04:32 -08:00
  • e14493885b hf drop rows not specified by schema Xi Yan 2024-12-11 17:17:17 -08:00
  • 7a7f7d8118 upgrade faiss to new prefix Dinesh Yeduguru 2024-12-11 16:54:46 -08:00
  • b509d59dcd weaviate fixes Dinesh Yeduguru 2024-12-10 16:45:33 -08:00
  • 0e451525e5 remove mixin and test fixes Dinesh Yeduguru 2024-12-09 15:00:12 -08:00
  • 5bbeb985ca user inference api to generate embeddings in vector store Dinesh Yeduguru 2024-12-09 12:49:35 -08:00
  • 96accc1216 fix meta reference fixture Dinesh Yeduguru 2024-12-11 16:36:06 -08:00
  • 5821ec9ef3 address feedback Dinesh Yeduguru 2024-12-11 16:24:37 -08:00
  • b7cb06f004
    Allow using an "inline" version of Chroma using PersistentClient (#567) Ashwin Bharambe 2024-12-11 16:02:04 -08:00
  • 210e291c57 Undo the unnecessary revert for the client code path Ashwin Bharambe 2024-12-11 15:44:55 -08:00
  • f0e045d1c8 Create a separate inline::chromadb provider Ashwin Bharambe 2024-12-11 14:11:08 -08:00
  • b259ec9847
    center Yuan Tang 2024-12-11 15:24:03 -05:00
  • c0a83c32fb
    Reformat distributions table Yuan Tang 2024-12-11 15:21:52 -05:00
  • 41487e6ed1
    refactor scoring/eval pytests (#607) Xi Yan 2024-12-11 10:47:37 -08:00
  • 00658e02f8 fix eval tests model registration Xi Yan 2024-12-11 10:36:39 -08:00
  • e167e9eb93 implement embedding generation in supported inference providers Dinesh Yeduguru 2024-12-09 12:48:56 -08:00
  • b896be2311 add model type Dinesh Yeduguru 2024-12-09 12:45:11 -08:00
  • 3b5a33d921 parameterize judge_model Xi Yan 2024-12-11 10:31:05 -08:00
  • 310c15bada
    Revert "Revert "add model type to APIs" (#605)" Dinesh Yeduguru 2024-12-11 10:18:00 -08:00
  • 47b2dc8ae3
    Revert "add model type to APIs" (#605) Dinesh Yeduguru 2024-12-11 10:17:54 -08:00
  • 2e12c68a07
    Revert "add model type to APIs (#588)" Dinesh Yeduguru 2024-12-11 10:17:40 -08:00
  • 8e33db6015
    add model type to APIs (#588) Dinesh Yeduguru 2024-12-11 10:16:53 -08:00
  • 7e1d628864
    Fix some typos in distributions/providers docs (#603) Yuan Tang 2024-12-11 13:10:52 -05:00