Commit graph

112 commits

Author SHA1 Message Date
Justin Lee
ecad16b904 doc enhancements, converted md into jupyter, reorganize files 2024-11-05 13:12:30 -08:00
Justin Lee
5f5220b8fc git commit created a dir for zero-hero 2024-11-05 11:11:27 -08:00
Kai Wu
36d3520bf2 refactor notebooks 2024-11-05 11:10:08 -08:00
Sanyam Bhutani
6d3769aedd Update tool_Calling101.ipynb 2024-11-04 19:35:31 -08:00
Sanyam Bhutani
800a9dc134 Create tool_Calling101.ipynb 2024-11-04 19:34:51 -08:00
Sanyam Bhutani
579cb110b4 add image 2024-11-04 19:28:37 -08:00
Sanyam Bhutani
d37853074b Merge branch 'docs_improvement' of https://github.com/meta-llama/llama-stack into docs_improvement 2024-11-04 19:11:53 -08:00
Sanyam Bhutani
19fa3bde97 push 2024-11-04 19:11:37 -08:00
Kai Wu
08ccf47ff1 Merge branch 'main' into docs_improvement 2024-11-04 17:25:33 -08:00
Kai Wu
556ec85d63 change safety101 agent101 to notebook 2024-11-04 17:22:23 -08:00
Kai Wu
ddf9a2b62c change safety101 agent101 to notebook 2024-11-04 17:19:21 -08:00
Sanyam Bhutani
af8453436e Update Memory101.ipynb 2024-11-04 17:17:26 -08:00
Sanyam Bhutani
09cd99fc2d Create Memory101.ipynb 2024-11-04 17:16:23 -08:00
Xi Yan
c810a4184d
[docs] update documentations (#356)
* move docs -> source

* Add files via upload

* mv image

* Add files via upload

* colocate iOS setup doc

* delete image

* Add files via upload

* fix

* delete image

* Add files via upload

* Update developer_cookbook.md

* toctree

* wip subfolder

* docs update

* subfolder

* updates

* name

* updates

* index

* updates

* refactor structure

* depth

* docs

* content

* docs

* getting started

* distributions

* fireworks

* fireworks

* update

* theme

* theme

* theme

* pdj theme

* pytorch theme

* css

* theme

* agents example

* format

* index

* headers

* copy button

* test tabs

* test tabs

* fix

* tabs

* tab

* tabs

* sphinx_design

* quick start commands

* size

* width

* css

* css

* download models

* asthetic fix

* tab format

* update

* css

* width

* css

* docs

* tab based

* tab

* tabs

* docs

* style

* image

* css

* color

* typo

* update docs

* missing links

* list templates

* links

* links update

* troubleshooting

* fix

* distributions

* docs

* fix table

* kill llamastack-local-gpu/cpu

* Update index.md

* Update index.md

* mv ios_setup.md

* Update ios_setup.md

* Add remote_or_local.gif

* Update ios_setup.md

* release notes

* typos

* Add ios_setup to index

* nav bar

* hide torctree

* ios image

* links update

* rename

* rename

* docs

* rename

* links

* distributions

* distributions

* distributions

* distributions

* remove release

* remote

---------

Co-authored-by: dltn <6599399+dltn@users.noreply.github.com>
Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>
2024-11-04 16:52:38 -08:00
Kai Wu
87904d329f add more about safety and agent docs 2024-11-04 16:23:46 -08:00
Kai Wu
d61f328ffb Merge branch 'docs_improvement' of github.com:meta-llama/llama-stack into docs_improvement 2024-11-04 16:22:44 -08:00
Sanyam Bhutani
f8da92900e Create zero_to_getting_started.ipynb 2024-11-04 16:08:55 -08:00
Justin Lee
d200a6b002 beef up quickstart 2024-11-04 14:56:03 -08:00
Kai Wu
1422d631a8 Merge branch 'main' into docs_improvement 2024-11-04 12:40:18 -08:00
Kai Wu
2898a9bc9e prompt guide added 2024-11-04 12:38:44 -08:00
Justin Lee
4f31f1b4cc added few-shot-guide 2024-11-01 14:14:35 -07:00
Justin Lee
78083c4e0a removed unnecessary files 2024-11-01 13:56:54 -07:00
Justin Lee
43289c36e1 added todo 2024-11-01 13:55:30 -07:00
Justin Lee
b41abff4fb minor enhancement md 2024-11-01 13:50:24 -07:00
Justin Lee
1794ebc627 added cloud-local-inference-guide 2024-11-01 13:39:11 -07:00
Justin Lee
46763bc001 quick fix on title 2024-11-01 11:46:35 -07:00
Justin Lee
ed70e140eb added streaming guide 2024-11-01 11:41:03 -07:00
Justin Lee
bf16d7729f wrote guide for chat completion 2024-11-01 11:33:15 -07:00
Justin Lee
b514f1ec3a quickstart guide (might be dated) 2024-10-31 15:49:05 -07:00
Justin Lee
703d7ebb6e adding a few more inference examples 2024-10-31 15:46:47 -07:00
Justin Lee
626dffa0d9 added simple inferences 2024-10-31 15:45:47 -07:00
Ashwin Bharambe
37b330b4ef
add dynamic clients for all APIs (#348)
* add dynamic clients for all APIs

* fix openapi generator

* inference + memory + agents tests now pass with "remote" providers

* Add docstring which fixes openapi generator :/
2024-10-31 14:46:25 -07:00
Kai Wu
e4560a5e74 second draft 2024-10-31 13:37:55 -07:00
Xi Yan
3b1917d5ea run openapi generator 2024-10-30 16:17:35 -07:00
Kai Wu
050b1ae718 agents101 draft 2024-10-30 14:01:28 -07:00
Kai Wu
384b31c4c2 first draft 2024-10-29 17:42:41 -07:00
Ashwin Bharambe
4aa1bf6a60
Kill --name from llama stack build (#340) 2024-10-28 23:07:32 -07:00
Xi Yan
8a74e400d6
Update getting_started.md 2024-10-25 13:30:33 -07:00
Xi Yan
f168752bba
Update getting_started.md 2024-10-25 13:27:43 -07:00
Xi Yan
abdf7cddf3
[Evals API][4/n] evals with generation meta-reference impl (#303)
* wip

* dataset validation

* test_scoring

* cleanup

* clean up test

* comments

* error checking

* dataset client

* test client:

* datasetio client

* clean up

* basic scoring function works

* scorer wip

* equality scorer

* score batch impl

* score batch

* update scoring test

* refactor

* validate scorer input

* address comments

* evals with generation

* add all rows scores to ScoringResult

* minor typing

* bugfix

* scoring function def rename

* rebase name

* refactor

* address comments

* Update iOS inference instructions for new quantization

* Small updates to quantization config

* Fix score threshold in faiss

* Bump version to 0.0.45

* Handle both ipv6 and ipv4 interfaces together

* update manifest for build templates

* Update getting_started.md

* chatcompletion & completion input type validation

* inclusion->subsetof

* error checking

* scoring_function -> scoring_fn rename, scorer -> scoring_fn rename

* address comments

* [Evals API][5/n] fixes to generate openapi spec (#323)

* generate openapi

* typing comment, dataset -> dataset_id

* remove custom type

* sample eval run.yaml

---------

Co-authored-by: Dalton Flanagan <6599399+dltn@users.noreply.github.com>
Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>
2024-10-25 13:12:39 -07:00
Xi Yan
07f9bf723f
fix broken --list-templates with adding build.yaml files for packaging (#327)
* add build files to templates

* fix templates

* manifest

* symlink

* symlink

* precommit

* change everything to docker build.yaml

* remove image_type in templates

* fix build from templates CLI

* fix readmes
2024-10-25 12:51:22 -07:00
Justin Lee
b6d8246b82
added templates and enhanced readme (#307)
Co-authored-by: Justin Lee <justinai@fb.com>
2024-10-24 17:07:06 -07:00
Xi Yan
e70420a06e
Update getting_started.md 2024-10-24 14:19:35 -07:00
Suraj Subramanian
668a495aba
Add REST api example for chat_completion (#286) 2024-10-22 10:35:20 -07:00
raghotham
e2a5a2e10d
first version of readthedocs (#278) 2024-10-22 10:15:58 +05:30
Ashwin Bharambe
1944405dca
Update new_api_provider.md 2024-10-21 14:02:51 -07:00
Ashwin Bharambe
606c48309e Small updates to encourage integration testing 2024-10-21 13:52:33 -07:00
Xi Yan
3a7884345a
Update new_api_provider.md 2024-10-21 13:41:56 -07:00
Xi Yan
25b37c9ff7
Update new_api_provider.md 2024-10-21 13:41:46 -07:00
Xi Yan
23210e8679
llama stack distributions / templates / docker refactor (#266)
* docker compose ollama

* comment

* update compose file

* readme for distributions

* readme

* move distribution folders

* move distribution/templates to distributions/

* rename

* kill distribution/templates

* readme

* readme

* build/developer cookbook/new api provider

* developer cookbook

* readme

* readme

* [bugfix] fix case for agent when memory bank registered without specifying provider_id (#264)

* fix case where memory bank is registered without provider_id

* memory test

* agents unit test

* Add an option to not use elastic agents for meta-reference inference (#269)

* Allow overridding checkpoint_dir via config

* Small rename

* Make all methods `async def` again; add completion() for meta-reference (#270)

PR #201 had made several changes while trying to fix issues with getting the stream=False branches of inference and agents API working. As part of this, it made a change which was slightly gratuitous. Namely, making chat_completion() and brethren "def" instead of "async def".

The rationale was that this allowed the user (within llama-stack) of this to use it as:

```
async for chunk in api.chat_completion(params)
```

However, it causes unnecessary confusion for several folks. Given that clients (e.g., llama-stack-apps) anyway use the SDK methods (which are completely isolated) this choice was not ideal. Let's revert back so the call now looks like:

```
async for chunk in await api.chat_completion(params)
```

Bonus: Added a completion() implementation for the meta-reference provider. Technically should have been another PR :)

* Improve an important error message

* update ollama for llama-guard3

* Add vLLM inference provider for OpenAI compatible vLLM server (#178)

This PR adds vLLM inference provider for OpenAI compatible vLLM server.

* Create .readthedocs.yaml

Trying out readthedocs

* Update event_logger.py (#275)

spelling error

* vllm

* build templates

* delete templates

* tmp add back build to avoid merge conflicts

* vllm

* vllm

---------

Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>
Co-authored-by: Ashwin Bharambe <ashwin@meta.com>
Co-authored-by: Yuan Tang <terrytangyuan@gmail.com>
Co-authored-by: raghotham <rsm@meta.com>
Co-authored-by: nehal-a2z <nehal@coderabbit.ai>
2024-10-21 11:17:53 -07:00