llama-stack

phoenix-oss/llama-stack

forked from phoenix-oss/llama-stack-mirror

Commit graph

5d7b611336 Add JSON structured outputs to Ollama Provider (#680) Aidan Do 2025-01-03 04:05:51 +11:00
8146dce11e Add missing newlines before printing the Dockerfile content (#700) Yuan Tang 2025-01-02 11:04:29 -06:00
c1987d6143 Fix failing flake8 E226 check (#701) Yuan Tang 2025-01-02 11:04:07 -06:00
eee25db11d Add missing "inline::" prefix for providers in building_distro.md (#702) Yuan Tang 2025-01-02 11:03:30 -06:00
a6c206ea66 [bugfix] fix prompt_adapter interleaved_content_convert_to_raw (#696) Xi Yan 2024-12-30 16:40:36 -08:00
7c1e3daa75 [bugfix] fix meta-reference agents w/ safety multiple model loading pytest (#694) Xi Yan 2024-12-30 16:25:46 -08:00
8ba29b19f2 Minor Quick Start documentation updates. (#692) Derek Slager 2024-12-30 14:19:05 -08:00
694adb1501 [bugfix] fix broken vision inference, change serialization for bytes (#693) Xi Yan 2024-12-30 13:57:41 -08:00
79f8bc8416 Update index.md raghotham 2024-12-30 11:32:28 -08:00
54f8aab61e copy getting_started Xi Yan 2024-12-30 10:42:28 -08:00
0e098c483b link getting started Xi Yan 2024-12-30 09:47:10 -08:00
3c72c034e6 [remove import *] clean up import *'s (#689) Xi Yan 2024-12-27 15:45:44 -08:00
70db039ff4 fix client-sdk memory/safety test Xi Yan 2024-12-26 15:48:28 -08:00
b6aca4c8bb fix client-sdk agents/inference test Xi Yan 2024-12-26 15:44:34 -08:00
4e1d0a2fc5 update playground doc video Xi Yan 2024-12-26 14:50:19 -08:00
28ce511986 fix --endpoint docs Xi Yan 2024-12-26 14:32:07 -08:00
7ba95a8e74 docs: update evals_reference/index.md (#675) Ikko Eltociear Ashimine 2024-12-27 04:32:37 +09:00
21fb92d7cf Add 3.3 70B to Ollama inference provider (#681) Aidan Do 2024-12-26 17:15:58 +11:00
fa371fdc9e Removed unnecessary CONDA_PREFIX env var in installation guide (#683) Yuan Tang 2024-12-23 16:17:30 -05:00
987e651755 Add missing venv option in --image-type (#677) Yuan Tang 2024-12-22 00:10:13 -05:00
bae197c37e Fix post training apis broken by torchtune release (#674) Botao Chen 2024-12-20 16:12:02 -08:00
06cb0c837e [torchtune integration] post training + eval (#670) Botao Chen 2024-12-20 13:43:13 -08:00
c8be0bf1c9 Tools API with brave and MCP providers (#639) Dinesh Yeduguru 2024-12-19 21:25:17 -08:00
17fdb47e5e Add Llama 70B 3.3 to fireworks (#654) Aidan Do 2024-12-20 12:32:49 +11:00
8b8d1c1ef4 fix trace starting in library client (#655) Dinesh Yeduguru 2024-12-19 16:13:52 -08:00
ddf37ea467 Fixed imports for inference (#661) cdgamarose-nv 2024-12-19 14:19:36 -08:00
540fc4d717 Fix Meta reference GPU implementation (#663) Ashwin Bharambe 2024-12-19 14:09:45 -08:00
f19eb8eee3 Update types in parallel_utils for meta-refernece-gpu impl Ashwin Bharambe 2024-12-19 13:58:20 -08:00
b33086d632 Adding @vladimirivic to the owners file Vladimir Ivic 2024-12-19 11:32:05 -08:00
5be2ea37b1 fix context_retriever model->model_id Xi Yan 2024-12-19 12:52:00 -08:00
03607a68c7 remove unused telemetry related code for console (#659) Dinesh Yeduguru 2024-12-19 11:21:11 -08:00
36b4fe02cc [4/n][torchtune integration] support lazy load model during inference (#620) Botao Chen 2024-12-18 16:30:53 -08:00
3b4b2ea30c fix replace_env_vars bug Ashwin Bharambe 2024-12-18 13:48:30 -08:00
12cbed1617 Register Message and ResponseFormat Ashwin Bharambe 2024-12-18 10:32:25 -08:00
ceadaf1840 Dont include 3B / 1B models for bedrock since they arent ondemand Ashwin Bharambe 2024-12-18 06:30:02 -08:00
c39a3777b5 Make bedrock "just" work Ashwin Bharambe 2024-12-18 06:22:14 -08:00
d6fcdefec7 Bump version to 0.0.63 v0.0.63 Ashwin Bharambe 2024-12-17 23:15:27 -08:00
f1d6cb22d7 Update URL type to avoid string-ifying and creating complexity Ashwin Bharambe 2024-12-17 22:48:47 -08:00
75e72cf2fc model_type=llm for filering available models for playground Xi Yan 2024-12-17 19:42:38 -08:00
2f9fdb0ea7 Update notebook Ashwin Bharambe 2024-12-17 18:51:51 -08:00
0fb4b7de6f Add more debugging logs to when llama guard fails Ashwin Bharambe 2024-12-17 17:11:21 -08:00
eea478618d Bump version to 0.0.62 v0.0.62 Ashwin Bharambe 2024-12-17 18:19:47 -08:00
af8f1b3531 model selection playground fix Xi Yan 2024-12-17 18:12:59 -08:00
3700022d6f store attributes values in builtin types to avoid otel warnings (#649) Dinesh Yeduguru 2024-12-17 17:10:43 -08:00
0e2a99e223 Update Cerebras from Llama 3.1 to 3.3 (#645) Henry Tu 2024-12-17 19:28:24 -05:00
b7a7caa9a8 Fix conversion to RawMessage everywhere Ashwin Bharambe 2024-12-17 13:38:01 -08:00
fbca51d6da Fix to conda env build script Ashwin Bharambe 2024-12-17 12:19:34 -08:00
0452c6a0c7 add missing init file Ashwin Bharambe 2024-12-17 11:48:28 -08:00
8de8eb03c8 Update the "InterleavedTextMedia" type (#635) Ashwin Bharambe 2024-12-17 11:18:31 -08:00
10eb31badf docs: Update getting_started.ipynb link to correct jupyter notebook path in README.md (#636) Arun Brahma 2024-12-18 00:41:13 +05:30
99f331f5c8 [bugfix] no shield_call when there's no shields configured (#642) Xi Yan 2024-12-17 11:10:19 -08:00
c2f7905fa4 Fix bedrock inference impl Ashwin Bharambe 2024-12-16 14:22:34 -08:00
eb37fba9da Small fix to library client Ashwin Bharambe 2024-12-16 14:08:30 -08:00
5e08812bcb Add Dinesh to be a code owner Ashwin Bharambe 2024-12-16 13:00:50 -08:00
2e5bfcd42a Update Telemetry API so OpenAPI generation can work (#640) Ashwin Bharambe 2024-12-16 13:00:14 -08:00
78e2bfbe7a [tests] add client-sdk pytests & delete client.py (#638) Xi Yan 2024-12-16 12:04:56 -08:00
cb8a28c128 Doc: Ollama command references non-existent file (#632) Aidan Do 2024-12-16 01:52:28 +11:00
815f4af6cf add colab notebook & update docs (#619) Xi Yan 2024-12-13 19:15:15 -08:00
20383bfea5 [3/n][torchtune integration] add validation logic (#600) Botao Chen 2024-12-13 16:35:06 -08:00
c294a01c4b [2/n][torchtune integration] implement job management and return training artifacts (#593) Botao Chen 2024-12-13 15:00:04 -08:00
5764a95912 Add missing environments field for vLLM provider (#623) Yuan Tang 2024-12-13 17:06:27 -05:00
516e1a3e59 add embedding model by default to distribution templates (#617) Dinesh Yeduguru 2024-12-13 12:48:00 -08:00
e893b22868 export LibraryClient Ashwin Bharambe 2024-12-13 12:07:42 -08:00
6de92a6c33 Reformat distributions table (#608) Yuan Tang 2024-12-13 14:45:17 -05:00
4800247b5c minor Ashwin Bharambe 2024-12-13 11:44:08 -08:00
aeb76390fc [1/n] torchtune <> llama-stack integration skeleton (#540) Botao Chen 2024-12-13 11:05:35 -08:00
53b3a1e345 Update kotlin docs to 0.0.58 (#614) Riandy 2024-12-13 05:09:13 +08:00
2a9b13dd52 add test for completion logprobs (#532) Matthew Farrellee 2024-12-12 15:19:48 -05:00
96e158eaac Make embedding generation go through inference (#606) Dinesh Yeduguru 2024-12-12 11:47:50 -08:00
a14785af46 [docs] add playground ui docs (#592) Xi Yan 2024-12-12 10:40:38 -08:00
8b45d147df [/datasetio] drop columns not specified by dataset schema for huggingface provider (#611) Xi Yan 2024-12-12 10:23:09 -08:00
b7cb06f004 Allow using an "inline" version of Chroma using PersistentClient (#567) Ashwin Bharambe 2024-12-11 16:02:04 -08:00
41487e6ed1 refactor scoring/eval pytests (#607) Xi Yan 2024-12-11 10:47:37 -08:00
47b2dc8ae3 Revert "add model type to APIs" (#605) Dinesh Yeduguru 2024-12-11 10:17:54 -08:00
8e33db6015 add model type to APIs (#588) Dinesh Yeduguru 2024-12-11 10:16:53 -08:00
7e1d628864 Fix some typos in distributions/providers docs (#603) Yuan Tang 2024-12-11 13:10:52 -05:00
b52df5fe5b add completion api support to nvidia inference provider (#533) Matthew Farrellee 2024-12-11 13:08:38 -05:00
07c72c4256 Add vLLM to API providers and distributions tables (#604) Yuan Tang 2024-12-11 13:05:47 -05:00
a4bcfb8bba [/scoring] add ability to define aggregation functions for scoring functions & refactors (#597) Xi Yan 2024-12-11 10:03:42 -08:00
e128f2547a add tracing back to the lib cli (#595) Dinesh Yeduguru 2024-12-11 08:44:20 -08:00
1c03ba239e [#342] RAG - fix PDF format in vector database (#551) Aidan Do 2024-12-11 16:33:27 +11:00
f5c36c47ed Added support for llama 3.3 model (#601) varunfb 2024-12-10 20:03:31 -08:00
76eb558bde doc: llama-stack build --config help text references old directory (#596) Aidan Do 2024-12-11 12:42:02 +11:00
e0d5be41fe add nvidia nim inference provider to docs (#534) Matthew Farrellee 2024-12-10 16:23:56 -05:00
e2054d53e4 Fix issue 586 (#594) v0.0.61 Xi Yan 2024-12-10 10:22:04 -08:00
02b43be9d7 Bump version to 0.0.61 Ashwin Bharambe 2024-12-10 10:18:44 -08:00
fa68ded07c Remove the unnecessary message after llama stack build Ashwin Bharambe 2024-12-10 09:46:37 -08:00
885bb0900b memory retrival to print only the bytes injected Dinesh Yeduguru 2024-12-10 09:32:18 -08:00
2e3d3a62a5 Revert "add tracing to library client (#591)" Dinesh Yeduguru 2024-12-10 08:50:20 -08:00
16d103842a Revert "await end_trace in libcli" Dinesh Yeduguru 2024-12-10 08:47:32 -08:00
f969b561ea Revert "Disable telemetry in library client for now" Dinesh Yeduguru 2024-12-10 08:47:18 -08:00
686f8d5b8d remove info logging in agent instance Dinesh Yeduguru 2024-12-10 08:40:42 -08:00
1ad691bb04 Bump version to 0.0.60 Ashwin Bharambe 2024-12-09 22:19:51 -08:00
176ebddf47 Disable telemetry in library client for now Ashwin Bharambe 2024-12-09 22:17:25 -08:00
baae4f7b51 Bump version to 0.0.59 Ashwin Bharambe 2024-12-09 21:22:20 -08:00
a4d8a6009a Fixes for library client (#587) Ashwin Bharambe 2024-12-09 17:14:37 -08:00
7615da78b8 await end_trace in libcli Dinesh Yeduguru 2024-12-09 15:54:42 -08:00
bc1fddf1df add tracing to library client (#591) Dinesh Yeduguru 2024-12-09 15:46:26 -08:00
ab7145a04f minor refactor Xi Yan 2024-12-09 15:43:12 -08:00
cd40a5fdbf update template run.yaml to include openai api key for braintrust (#590) Xi Yan 2024-12-09 15:40:59 -08:00