Ashwin Bharambe
939056e265
More documentation fixes
2024-11-18 17:06:13 -08:00
Ashwin Bharambe
e40404625b
Update to docs
2024-11-18 16:52:48 -08:00
Ashwin Bharambe
afa4f0b19f
Update remote vllm docs
2024-11-18 16:34:33 -08:00
Ashwin Bharambe
47c37fd831
Fixes
2024-11-18 16:03:53 -08:00
Ashwin Bharambe
3aedde2ab4
Add a pre-commit for distro_codegen but it does not work yet
2024-11-18 15:21:13 -08:00
Ashwin Bharambe
2a31163178
Auto-generate distro yamls + docs ( #468 )
...
# What does this PR do?
Automatically generates
- build.yaml
- run.yaml
- run-with-safety.yaml
- parts of markdown docs
for the distributions.
## Test Plan
At this point, this only updates the YAMLs and the docs. Some testing
(especially with ollama and vllm) has been performed but needs to be
much more tested.
2024-11-18 14:57:06 -08:00
Dinesh Yeduguru
fdff24e77a
Inference to use provider resource id to register and validate ( #428 )
...
This PR changes the way model id gets translated to the final model name
that gets passed through the provider.
Major changes include:
1) Providers are responsible for registering an object and as part of
the registration returning the object with the correct provider specific
name of the model provider_resource_id
2) To help with the common look ups different names a new ModelLookup
class is created.
Tested all inference providers including together, fireworks, vllm,
ollama, meta reference and bedrock
2024-11-12 20:02:00 -08:00
Ashwin Bharambe
3d7561e55c
Rename all inline providers with an inline:: prefix ( #423 )
2024-11-11 22:19:16 -08:00
Xi Yan
b0b9c905b3
docs
2024-11-09 10:22:41 -08:00
Xi Yan
cc61fd8083
docs
2024-11-09 09:00:18 -08:00
Xi Yan
0c14761453
docs
2024-11-09 08:57:51 -08:00
Ashwin Bharambe
4986e46188
Distributions updates (slight updates to ollama, add inline-vllm and remote-vllm) ( #408 )
...
* remote vllm distro
* add inline-vllm details, fix things
* Write some docs
2024-11-08 18:09:39 -08:00
Xi Yan
bd0622ef10
update docs
2024-11-08 12:47:05 -08:00
Xi Yan
7ee9f8d8ac
rename
2024-11-08 10:34:48 -08:00
Xi Yan
b1d7376730
kill tgi/cpu
2024-11-08 10:33:45 -08:00
Xi Yan
8350f2df4c
[docs] refactor remote-hosted distro ( #402 )
...
* move docs
* docs
2024-11-07 19:16:38 -08:00
Ashwin Bharambe
994732e2e0
impls
-> inline
, adapters
-> remote
(#381 )
2024-11-06 14:54:05 -08:00
Dinesh Yeduguru
093c9f1987
add bedrock distribution code ( #358 )
...
* add bedrock distribution code
* fix linter error
* add bedrock shields support
* linter fixes
* working bedrock safety
* change to return only one violation
* remove env var reading
* refereshable boto credentials
* remove env vars
* address raghu's feedback
* fix session_ttl passing
---------
Co-authored-by: Dinesh Yeduguru <dineshyv@fb.com>
2024-11-06 14:39:11 -08:00
Xi Yan
db30809141
precommit
2024-11-05 15:26:13 -08:00
Xi Yan
0706f6c82f
add Llama3.2-3B-Instruct:int4-qlora-eo8
2024-11-05 15:22:26 -08:00
Xi Yan
16b7fa4614
quantized model docs
2024-11-05 15:21:13 -08:00
Xi Yan
c810a4184d
[docs] update documentations ( #356 )
...
* move docs -> source
* Add files via upload
* mv image
* Add files via upload
* colocate iOS setup doc
* delete image
* Add files via upload
* fix
* delete image
* Add files via upload
* Update developer_cookbook.md
* toctree
* wip subfolder
* docs update
* subfolder
* updates
* name
* updates
* index
* updates
* refactor structure
* depth
* docs
* content
* docs
* getting started
* distributions
* fireworks
* fireworks
* update
* theme
* theme
* theme
* pdj theme
* pytorch theme
* css
* theme
* agents example
* format
* index
* headers
* copy button
* test tabs
* test tabs
* fix
* tabs
* tab
* tabs
* sphinx_design
* quick start commands
* size
* width
* css
* css
* download models
* asthetic fix
* tab format
* update
* css
* width
* css
* docs
* tab based
* tab
* tabs
* docs
* style
* image
* css
* color
* typo
* update docs
* missing links
* list templates
* links
* links update
* troubleshooting
* fix
* distributions
* docs
* fix table
* kill llamastack-local-gpu/cpu
* Update index.md
* Update index.md
* mv ios_setup.md
* Update ios_setup.md
* Add remote_or_local.gif
* Update ios_setup.md
* release notes
* typos
* Add ios_setup to index
* nav bar
* hide torctree
* ios image
* links update
* rename
* rename
* docs
* rename
* links
* distributions
* distributions
* distributions
* distributions
* remove release
* remote
---------
Co-authored-by: dltn <6599399+dltn@users.noreply.github.com>
Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>
2024-11-04 16:52:38 -08:00