Kai Wu
9ed1c50990
add more to safety
2024-11-05 15:06:27 -08:00
Justin Lee
5f5220b8fc
git commit created a dir for zero-hero
2024-11-05 11:11:27 -08:00
Kai Wu
36d3520bf2
refactor notebooks
2024-11-05 11:10:08 -08:00
Sanyam Bhutani
6d3769aedd
Update tool_Calling101.ipynb
2024-11-04 19:35:31 -08:00
Sanyam Bhutani
800a9dc134
Create tool_Calling101.ipynb
2024-11-04 19:34:51 -08:00
Sanyam Bhutani
579cb110b4
add image
2024-11-04 19:28:37 -08:00
Sanyam Bhutani
d37853074b
Merge branch 'docs_improvement' of https://github.com/meta-llama/llama-stack into docs_improvement
2024-11-04 19:11:53 -08:00
Sanyam Bhutani
19fa3bde97
push
2024-11-04 19:11:37 -08:00
Kai Wu
08ccf47ff1
Merge branch 'main' into docs_improvement
2024-11-04 17:25:33 -08:00
Kai Wu
556ec85d63
change safety101 agent101 to notebook
2024-11-04 17:22:23 -08:00
Kai Wu
ddf9a2b62c
change safety101 agent101 to notebook
2024-11-04 17:19:21 -08:00
Sanyam Bhutani
af8453436e
Update Memory101.ipynb
2024-11-04 17:17:26 -08:00
Sanyam Bhutani
09cd99fc2d
Create Memory101.ipynb
2024-11-04 17:16:23 -08:00
Xi Yan
c810a4184d
[docs] update documentations ( #356 )
...
* move docs -> source
* Add files via upload
* mv image
* Add files via upload
* colocate iOS setup doc
* delete image
* Add files via upload
* fix
* delete image
* Add files via upload
* Update developer_cookbook.md
* toctree
* wip subfolder
* docs update
* subfolder
* updates
* name
* updates
* index
* updates
* refactor structure
* depth
* docs
* content
* docs
* getting started
* distributions
* fireworks
* fireworks
* update
* theme
* theme
* theme
* pdj theme
* pytorch theme
* css
* theme
* agents example
* format
* index
* headers
* copy button
* test tabs
* test tabs
* fix
* tabs
* tab
* tabs
* sphinx_design
* quick start commands
* size
* width
* css
* css
* download models
* asthetic fix
* tab format
* update
* css
* width
* css
* docs
* tab based
* tab
* tabs
* docs
* style
* image
* css
* color
* typo
* update docs
* missing links
* list templates
* links
* links update
* troubleshooting
* fix
* distributions
* docs
* fix table
* kill llamastack-local-gpu/cpu
* Update index.md
* Update index.md
* mv ios_setup.md
* Update ios_setup.md
* Add remote_or_local.gif
* Update ios_setup.md
* release notes
* typos
* Add ios_setup to index
* nav bar
* hide torctree
* ios image
* links update
* rename
* rename
* docs
* rename
* links
* distributions
* distributions
* distributions
* distributions
* remove release
* remote
---------
Co-authored-by: dltn <6599399+dltn@users.noreply.github.com>
Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>
2024-11-04 16:52:38 -08:00
Kai Wu
87904d329f
add more about safety and agent docs
2024-11-04 16:23:46 -08:00
Kai Wu
d61f328ffb
Merge branch 'docs_improvement' of github.com:meta-llama/llama-stack into docs_improvement
2024-11-04 16:22:44 -08:00
Sanyam Bhutani
f8da92900e
Create zero_to_getting_started.ipynb
2024-11-04 16:08:55 -08:00
Justin Lee
d200a6b002
beef up quickstart
2024-11-04 14:56:03 -08:00
Kai Wu
1422d631a8
Merge branch 'main' into docs_improvement
2024-11-04 12:40:18 -08:00
Kai Wu
2898a9bc9e
prompt guide added
2024-11-04 12:38:44 -08:00
Justin Lee
4f31f1b4cc
added few-shot-guide
2024-11-01 14:14:35 -07:00
Justin Lee
78083c4e0a
removed unnecessary files
2024-11-01 13:56:54 -07:00
Justin Lee
43289c36e1
added todo
2024-11-01 13:55:30 -07:00
Justin Lee
b41abff4fb
minor enhancement md
2024-11-01 13:50:24 -07:00
Justin Lee
1794ebc627
added cloud-local-inference-guide
2024-11-01 13:39:11 -07:00
Justin Lee
46763bc001
quick fix on title
2024-11-01 11:46:35 -07:00
Justin Lee
ed70e140eb
added streaming guide
2024-11-01 11:41:03 -07:00
Justin Lee
bf16d7729f
wrote guide for chat completion
2024-11-01 11:33:15 -07:00
Justin Lee
b514f1ec3a
quickstart guide (might be dated)
2024-10-31 15:49:05 -07:00
Justin Lee
703d7ebb6e
adding a few more inference examples
2024-10-31 15:46:47 -07:00
Justin Lee
626dffa0d9
added simple inferences
2024-10-31 15:45:47 -07:00
Ashwin Bharambe
37b330b4ef
add dynamic clients for all APIs ( #348 )
...
* add dynamic clients for all APIs
* fix openapi generator
* inference + memory + agents tests now pass with "remote" providers
* Add docstring which fixes openapi generator :/
2024-10-31 14:46:25 -07:00
Kai Wu
e4560a5e74
second draft
2024-10-31 13:37:55 -07:00
Xi Yan
3b1917d5ea
run openapi generator
2024-10-30 16:17:35 -07:00
Kai Wu
050b1ae718
agents101 draft
2024-10-30 14:01:28 -07:00
Kai Wu
384b31c4c2
first draft
2024-10-29 17:42:41 -07:00
Ashwin Bharambe
4aa1bf6a60
Kill --name from llama stack build ( #340 )
2024-10-28 23:07:32 -07:00
Xi Yan
8a74e400d6
Update getting_started.md
2024-10-25 13:30:33 -07:00
Xi Yan
f168752bba
Update getting_started.md
2024-10-25 13:27:43 -07:00
Xi Yan
abdf7cddf3
[Evals API][4/n] evals with generation meta-reference impl ( #303 )
...
* wip
* dataset validation
* test_scoring
* cleanup
* clean up test
* comments
* error checking
* dataset client
* test client:
* datasetio client
* clean up
* basic scoring function works
* scorer wip
* equality scorer
* score batch impl
* score batch
* update scoring test
* refactor
* validate scorer input
* address comments
* evals with generation
* add all rows scores to ScoringResult
* minor typing
* bugfix
* scoring function def rename
* rebase name
* refactor
* address comments
* Update iOS inference instructions for new quantization
* Small updates to quantization config
* Fix score threshold in faiss
* Bump version to 0.0.45
* Handle both ipv6 and ipv4 interfaces together
* update manifest for build templates
* Update getting_started.md
* chatcompletion & completion input type validation
* inclusion->subsetof
* error checking
* scoring_function -> scoring_fn rename, scorer -> scoring_fn rename
* address comments
* [Evals API][5/n] fixes to generate openapi spec (#323 )
* generate openapi
* typing comment, dataset -> dataset_id
* remove custom type
* sample eval run.yaml
---------
Co-authored-by: Dalton Flanagan <6599399+dltn@users.noreply.github.com>
Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>
2024-10-25 13:12:39 -07:00
Xi Yan
07f9bf723f
fix broken --list-templates with adding build.yaml files for packaging ( #327 )
...
* add build files to templates
* fix templates
* manifest
* symlink
* symlink
* precommit
* change everything to docker build.yaml
* remove image_type in templates
* fix build from templates CLI
* fix readmes
2024-10-25 12:51:22 -07:00
Justin Lee
b6d8246b82
added templates and enhanced readme ( #307 )
...
Co-authored-by: Justin Lee <justinai@fb.com>
2024-10-24 17:07:06 -07:00
Xi Yan
e70420a06e
Update getting_started.md
2024-10-24 14:19:35 -07:00
Suraj Subramanian
668a495aba
Add REST api example for chat_completion ( #286 )
2024-10-22 10:35:20 -07:00
raghotham
e2a5a2e10d
first version of readthedocs ( #278 )
2024-10-22 10:15:58 +05:30
Ashwin Bharambe
1944405dca
Update new_api_provider.md
2024-10-21 14:02:51 -07:00
Ashwin Bharambe
606c48309e
Small updates to encourage integration testing
2024-10-21 13:52:33 -07:00
Xi Yan
3a7884345a
Update new_api_provider.md
2024-10-21 13:41:56 -07:00
Xi Yan
25b37c9ff7
Update new_api_provider.md
2024-10-21 13:41:46 -07:00
Xi Yan
23210e8679
llama stack distributions / templates / docker refactor ( #266 )
...
* docker compose ollama
* comment
* update compose file
* readme for distributions
* readme
* move distribution folders
* move distribution/templates to distributions/
* rename
* kill distribution/templates
* readme
* readme
* build/developer cookbook/new api provider
* developer cookbook
* readme
* readme
* [bugfix] fix case for agent when memory bank registered without specifying provider_id (#264 )
* fix case where memory bank is registered without provider_id
* memory test
* agents unit test
* Add an option to not use elastic agents for meta-reference inference (#269 )
* Allow overridding checkpoint_dir via config
* Small rename
* Make all methods `async def` again; add completion() for meta-reference (#270 )
PR #201 had made several changes while trying to fix issues with getting the stream=False branches of inference and agents API working. As part of this, it made a change which was slightly gratuitous. Namely, making chat_completion() and brethren "def" instead of "async def".
The rationale was that this allowed the user (within llama-stack) of this to use it as:
```
async for chunk in api.chat_completion(params)
```
However, it causes unnecessary confusion for several folks. Given that clients (e.g., llama-stack-apps) anyway use the SDK methods (which are completely isolated) this choice was not ideal. Let's revert back so the call now looks like:
```
async for chunk in await api.chat_completion(params)
```
Bonus: Added a completion() implementation for the meta-reference provider. Technically should have been another PR :)
* Improve an important error message
* update ollama for llama-guard3
* Add vLLM inference provider for OpenAI compatible vLLM server (#178 )
This PR adds vLLM inference provider for OpenAI compatible vLLM server.
* Create .readthedocs.yaml
Trying out readthedocs
* Update event_logger.py (#275 )
spelling error
* vllm
* build templates
* delete templates
* tmp add back build to avoid merge conflicts
* vllm
* vllm
---------
Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>
Co-authored-by: Ashwin Bharambe <ashwin@meta.com>
Co-authored-by: Yuan Tang <terrytangyuan@gmail.com>
Co-authored-by: raghotham <rsm@meta.com>
Co-authored-by: nehal-a2z <nehal@coderabbit.ai>
2024-10-21 11:17:53 -07:00