Xi Yan
29138a5167
Update getting_started.md
2024-10-05 12:28:02 -07:00
Xi Yan
6d4013ac99
Update getting_started.md
2024-10-05 12:14:59 -07:00
Mindaugas
9d16129603
Add 'url' property to Redis KV config (#192)
2024-10-05 11:26:26 -07:00
Ashwin Bharambe
bfb0e92034
Bump version to 0.0.40
2024-10-04 09:33:43 -07:00
Ashwin Bharambe
dc75aab547
Add setuptools dependency
2024-10-04 09:30:54 -07:00
Dalton Flanagan
441052b0fd
avoid jq since non-standard on macOS
2024-10-04 10:11:43 -04:00
Dalton Flanagan
9bf2e354ae
CLI now requires jq
2024-10-04 10:05:59 -04:00
raghotham
00ed9a410b
Update getting_started.md
update discord invite link
2024-10-03 23:28:43 -07:00
AshleyT3
734f59d3b8
Check that the model is found before use. (#182)
2024-10-03 23:24:47 -07:00
Ashwin Bharambe
f913b57397
fix fp8 imports
2024-10-03 14:40:21 -07:00
Ashwin Bharambe
8d41e6caa9
Bump version to 0.0.39
2024-10-03 11:31:03 -07:00
Ashwin Bharambe
7f49315822
Kill a derpy import
2024-10-03 11:25:58 -07:00
Xi Yan
62d266f018
[CLI] avoid configure twice (#171)
* avoid configure twice
* cleanup tmp config
* update output msg
* address comment
* update msg
* script update
2024-10-03 11:20:54 -07:00
Russell Bryant
06db9213b1
inference: Add model option to client (#170)
I was running this client for testing purposes and being able to
specify which model to use is a convenient addition. This change makes
that possible.
2024-10-03 11:18:57 -07:00
Ashwin Bharambe
210b71b0ba
fix prompt guard (#177)
Several other fixes to configure. Add support for 1b/3b models in ollama.
2024-10-03 11:07:53 -07:00
Xi Yan
b9b1e8b08b
[bugfix] conda path lookup (#179)
* fix conda lookup
* comments
2024-10-03 10:45:16 -07:00
raghotham
d74501f75c
Update README.md
Added pypi package version
2024-10-03 10:21:16 -07:00
Ashwin Bharambe
c02a90e4c8
Bump version to 0.0.38
2024-10-03 05:42:47 -07:00
Ashwin Bharambe
e9f6150588
A bit of cleanup to avoid breakages
2024-10-02 21:31:09 -07:00
Ashwin Bharambe
988a9cada3
Don't ask for Api.inspect in stack build
2024-10-02 21:10:56 -07:00
Ashwin Bharambe
19ce6bf009
Don't validate prompt-guard anymore
2024-10-02 20:43:57 -07:00
Xi Yan
703ab9385f
fix routing table key list
2024-10-02 18:23:31 -07:00
Ashwin Bharambe
8d049000e3
Add an introspection "Api.inspect" API
2024-10-02 15:41:14 -07:00
Adrian Cole
01d93be948
Adds markdown-link-check and fixes a broken link (#165)
Signed-off-by: Adrian Cole <adrian.cole@elastic.co>
Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>
2024-10-02 14:26:20 -07:00
Ashwin Bharambe
fe4aabd690
provider_id => provider_type, adapter_id => adapter_type
2024-10-02 14:05:59 -07:00
Ashwin Bharambe
df68db644b
Refactoring distribution/distribution.py
This file was becoming too large, and it was unclear what it housed.
Split it into pieces.
2024-10-02 14:03:02 -07:00
Ashwin Bharambe
546f05bd3f
No automatic pager
2024-10-02 12:26:09 -07:00
Russell Bryant
204eb6d810
docker: Check for selinux before using --security-opt (#167)
Before using `--security-opt label=disable`, check that SELinux is
enabled. Otherwise, the option is not relevant.
This fixes errors on Mac.
Closes #166
Signed-off-by: Russell Bryant <rbryant@redhat.com>
2024-10-02 10:37:41 -07:00
Ashwin Bharambe
9b93ee2c2b
Bump version to 0.0.37
2024-10-02 10:15:08 -07:00
Ashwin Bharambe
227b69e6e6
Fix sample memory impl
2024-10-02 10:13:09 -07:00
Ashwin Bharambe
335dea849a
fix sample impls
2024-10-02 10:10:31 -07:00
Ashwin Bharambe
bf0d111c53
Fix build script
2024-10-02 10:04:23 -07:00
Ashwin Bharambe
4a75d922a9
Make Llama Guard 1B the default
2024-10-02 09:48:26 -07:00
Ashwin Bharambe
cc5029a716
Add special case for prompt guard
2024-10-02 08:43:12 -07:00
Ashwin Bharambe
a80b707ff8
Ensure we always ask for pydantic>=2
2024-10-02 06:29:06 -07:00
Ashwin Bharambe
eb2d8a31a5
Add a RoutableProvider protocol, support for multiple routing keys (#163)
* Update configure.py to use multiple routing keys for safety
* Refactor distribution/datatypes into a providers/datatypes
* Cleanup
2024-09-30 17:30:21 -07:00
Xi Yan
73decb3781
re-build from name
2024-09-30 16:22:52 -07:00
Xi Yan
4897bf2f85
allow --name to re-build from config
2024-09-30 16:18:12 -07:00
Xi Yan
d28c3dfe0f
[CLI] simplify docker run (#159)
* bake run.yaml inside docker, simplify run
* add docker template examples
* delete generated Dockerfile
* unique deps
* clean up debug
* default entrypoint
* address comments, update output msg
* update msg
* build output msg
* configure msg
* unique special_deps
* remove quotes in configure
2024-09-30 15:04:04 -07:00
Russell Bryant
8db49de961
docker: Install in editable mode for dev purposes (#160)
While rebuilding a stack using the `docker` image type and having
`LLAMA_STACK_DIR` set so it installs `llama_stack` from my local
source, I noticed that once built, it just used the image build cache
and didn't pull in changes to my source.
1. Install in editable mode (`pip install -e`) for dev purposes.
2. Mount the source into the container for `configure` and `run` so
that the editable install works.
Signed-off-by: Russell Bryant <rbryant@redhat.com>
2024-09-30 11:56:31 -07:00
Russell Bryant
cb36be320f
Fix podman+selinux compatibility (#132)
When I ran `llama stack configure` for my `docker`-based stack on my
system using podman + SELinux (CentOS Stream 9), the `podman run`
command failed due to SELinux blocking access to the volume mount.
As a simple fix, disable SELinux label checking.
Signed-off-by: Russell Bryant <rbryant@redhat.com>
2024-09-29 20:19:44 -07:00
moritalous
2bd785354d
fix broken bedrock inference provider (#151)
2024-09-29 20:17:58 -07:00
Byung Chun Kim
2f096ca509
accepts not model itself. (#153)
2024-09-29 20:16:50 -07:00
Ashwin Bharambe
5bf679cab6
Pull (extract) provider data from the provider instead of pushing from the top (#148)
2024-09-29 20:00:51 -07:00
Xi Yan
f6a6598d1a
[bugfix] fix #146 (#147)
* more robust image type
* lint
2024-09-28 17:47:00 -07:00
Xi Yan
b646167d94
Update README.md
2024-09-28 16:55:22 -07:00
Xi Yan
5ce759adc4
Update README.md
2024-09-28 16:55:08 -07:00
Xi Yan
6a8c2ae1df
[CLI] remove dependency on CONDA_PREFIX in CLI (#144)
* remove dependency on CONDA_PREFIX in CLI
* lint
* typo
* more robust
2024-09-28 16:46:47 -07:00
Ashwin Bharambe
fe460ba103
Avoid importing a lot of stuff
2024-09-28 16:06:10 -07:00
Xi Yan
4ae8c63a2b
pre-commit lint
2024-09-28 16:04:41 -07:00