Ashwin Bharambe 
								
							 
						 
						
							
							
							
							
								
							
							
								e8d3eee095 
								
							 
						 
						
							
							
								
								Fix docs yet again  
							
							
							
						 
						
							2024-11-18 23:51:35 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Ashwin Bharambe 
								
							 
						 
						
							
							
							
							
								
							
							
								d463d68e1e 
								
							 
						 
						
							
							
								
								Update docs  
							
							
							
						 
						
							2024-11-18 23:21:25 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Ashwin Bharambe 
								
							 
						 
						
							
							
							
							
								
							
							
								7693786322 
								
							 
						 
						
							
							
								
								Use HF names for registering fireworks and together models  
							
							
							
						 
						
							2024-11-18 22:34:47 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Ashwin Bharambe 
								
							 
						 
						
							
							
							
							
								
							
							
								939056e265 
								
							 
						 
						
							
							
								
								More documentation fixes  
							
							
							
						 
						
							2024-11-18 17:06:13 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Ashwin Bharambe 
								
							 
						 
						
							
							
							
							
								
							
							
								e40404625b 
								
							 
						 
						
							
							
								
								Update to docs  
							
							
							
						 
						
							2024-11-18 16:52:48 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Ashwin Bharambe 
								
							 
						 
						
							
							
							
							
								
							
							
								afa4f0b19f 
								
							 
						 
						
							
							
								
								Update remote vllm docs  
							
							
							
						 
						
							2024-11-18 16:34:33 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Ashwin Bharambe 
								
							 
						 
						
							
							
							
							
								
							
							
								47c37fd831 
								
							 
						 
						
							
							
								
								Fixes  
							
							
							
						 
						
							2024-11-18 16:03:53 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Ashwin Bharambe 
								
							 
						 
						
							
							
							
							
								
							
							
								3aedde2ab4 
								
							 
						 
						
							
							
								
								Add a pre-commit for distro_codegen but it does not work yet  
							
							
							
						 
						
							2024-11-18 15:21:13 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Ashwin Bharambe 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								2a31163178 
								
							 
						 
						
							
							
								
								Auto-generate distro yamls + docs  ( #468 )  
							
							... 
							
							
							
							# What does this PR do?
Automatically generates
- build.yaml
- run.yaml
- run-with-safety.yaml
- parts of markdown docs
for the distributions.
## Test Plan
At this point, this only updates the YAMLs and the docs. Some testing
(especially with ollama and vllm) has been performed but needs to be
much more tested. 
							
						 
						
							2024-11-18 14:57:06 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Dinesh Yeduguru 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								fdff24e77a 
								
							 
						 
						
							
							
								
								Inference to use provider resource id to register and validate ( #428 )  
							
							... 
							
							
							
							This PR changes the way model id gets translated to the final model name
that gets passed through the provider.
Major changes include:
1) Providers are responsible for registering an object and as part of
the registration returning the object with the correct provider specific
name of the model provider_resource_id
2) To help with the common look ups different names a new ModelLookup
class is created.
Tested all inference providers including together, fireworks, vllm,
ollama, meta reference and bedrock 
							
						 
						
							2024-11-12 20:02:00 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Ashwin Bharambe 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								3d7561e55c 
								
							 
						 
						
							
							
								
								Rename all inline providers with an inline:: prefix ( #423 )  
							
							
							
						 
						
							2024-11-11 22:19:16 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Xi Yan 
								
							 
						 
						
							
							
							
							
								
							
							
								b0b9c905b3 
								
							 
						 
						
							
							
								
								docs  
							
							
							
						 
						
							2024-11-09 10:22:41 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Xi Yan 
								
							 
						 
						
							
							
							
							
								
							
							
								cc61fd8083 
								
							 
						 
						
							
							
								
								docs  
							
							
							
						 
						
							2024-11-09 09:00:18 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Xi Yan 
								
							 
						 
						
							
							
							
							
								
							
							
								0c14761453 
								
							 
						 
						
							
							
								
								docs  
							
							
							
						 
						
							2024-11-09 08:57:51 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Ashwin Bharambe 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								4986e46188 
								
							 
						 
						
							
							
								
								Distributions updates (slight updates to ollama, add inline-vllm and remote-vllm) ( #408 )  
							
							... 
							
							
							
							* remote vllm distro
* add inline-vllm details, fix things
* Write some docs 
							
						 
						
							2024-11-08 18:09:39 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Xi Yan 
								
							 
						 
						
							
							
							
							
								
							
							
								bd0622ef10 
								
							 
						 
						
							
							
								
								update docs  
							
							
							
						 
						
							2024-11-08 12:47:05 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Xi Yan 
								
							 
						 
						
							
							
							
							
								
							
							
								7ee9f8d8ac 
								
							 
						 
						
							
							
								
								rename  
							
							
							
						 
						
							2024-11-08 10:34:48 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Xi Yan 
								
							 
						 
						
							
							
							
							
								
							
							
								b1d7376730 
								
							 
						 
						
							
							
								
								kill tgi/cpu  
							
							
							
						 
						
							2024-11-08 10:33:45 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Xi Yan 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								8350f2df4c 
								
							 
						 
						
							
							
								
								[docs] refactor remote-hosted distro ( #402 )  
							
							... 
							
							
							
							* move docs
* docs 
							
						 
						
							2024-11-07 19:16:38 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Ashwin Bharambe 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								994732e2e0 
								
							 
						 
						
							
							
								
								impls -> inline, adapters -> remote (#381 )  
							
							
							
						 
						
							2024-11-06 14:54:05 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Dinesh Yeduguru 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								093c9f1987 
								
							 
						 
						
							
							
								
								add bedrock distribution code ( #358 )  
							
							... 
							
							
							
							* add bedrock distribution code
* fix linter error
* add bedrock shields support
* linter fixes
* working bedrock safety
* change to return only one violation
* remove env var reading
* refereshable boto credentials
* remove env vars
* address raghu's feedback
* fix session_ttl passing
---------
Co-authored-by: Dinesh Yeduguru <dineshyv@fb.com> 
							
						 
						
							2024-11-06 14:39:11 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Xi Yan 
								
							 
						 
						
							
							
							
							
								
							
							
								db30809141 
								
							 
						 
						
							
							
								
								precommit  
							
							
							
						 
						
							2024-11-05 15:26:13 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Xi Yan 
								
							 
						 
						
							
							
							
							
								
							
							
								0706f6c82f 
								
							 
						 
						
							
							
								
								add Llama3.2-3B-Instruct:int4-qlora-eo8  
							
							
							
						 
						
							2024-11-05 15:22:26 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Xi Yan 
								
							 
						 
						
							
							
							
							
								
							
							
								16b7fa4614 
								
							 
						 
						
							
							
								
								quantized model docs  
							
							
							
						 
						
							2024-11-05 15:21:13 -08:00 
							
								 
							
							
								 
							
						 
					 
				
					
						
							
								
								
									Xi Yan 
								
							 
						 
						
							
							
								
								
							
							
							
								
							
							
								c810a4184d 
								
							 
						 
						
							
							
								
								[docs] update documentations ( #356 )  
							
							... 
							
							
							
							* move docs -> source
* Add files via upload
* mv image
* Add files via upload
* colocate iOS setup doc
* delete image
* Add files via upload
* fix
* delete image
* Add files via upload
* Update developer_cookbook.md
* toctree
* wip subfolder
* docs update
* subfolder
* updates
* name
* updates
* index
* updates
* refactor structure
* depth
* docs
* content
* docs
* getting started
* distributions
* fireworks
* fireworks
* update
* theme
* theme
* theme
* pdj theme
* pytorch theme
* css
* theme
* agents example
* format
* index
* headers
* copy button
* test tabs
* test tabs
* fix
* tabs
* tab
* tabs
* sphinx_design
* quick start commands
* size
* width
* css
* css
* download models
* asthetic fix
* tab format
* update
* css
* width
* css
* docs
* tab based
* tab
* tabs
* docs
* style
* image
* css
* color
* typo
* update docs
* missing links
* list templates
* links
* links update
* troubleshooting
* fix
* distributions
* docs
* fix table
* kill llamastack-local-gpu/cpu
* Update index.md
* Update index.md
* mv ios_setup.md
* Update ios_setup.md
* Add remote_or_local.gif
* Update ios_setup.md
* release notes
* typos
* Add ios_setup to index
* nav bar
* hide torctree
* ios image
* links update
* rename
* rename
* docs
* rename
* links
* distributions
* distributions
* distributions
* distributions
* remove release
* remote
---------
Co-authored-by: dltn <6599399+dltn@users.noreply.github.com>
Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com> 
							
						 
						
							2024-11-04 16:52:38 -08:00