Sanyam Bhutani
|
09cd99fc2d
|
Create Memory101.ipynb
|
2024-11-04 17:16:23 -08:00 |
|
Kai Wu
|
87904d329f
|
add more about safety and agent docs
|
2024-11-04 16:23:46 -08:00 |
|
Kai Wu
|
d61f328ffb
|
Merge branch 'docs_improvement' of github.com:meta-llama/llama-stack into docs_improvement
|
2024-11-04 16:22:44 -08:00 |
|
Sanyam Bhutani
|
f8da92900e
|
Create zero_to_getting_started.ipynb
|
2024-11-04 16:08:55 -08:00 |
|
Justin Lee
|
d200a6b002
|
beef up quickstart
|
2024-11-04 14:56:03 -08:00 |
|
Kai Wu
|
1422d631a8
|
Merge branch 'main' into docs_improvement
|
2024-11-04 12:40:18 -08:00 |
|
Kai Wu
|
2898a9bc9e
|
prompt guide added
|
2024-11-04 12:38:44 -08:00 |
|
Dinesh Yeduguru
|
ac93dd89cf
|
fix bedrock impl (#359)
* fix bedrock impl
* fix linter errors
* fix return type and remove debug print
|
2024-11-03 07:32:30 -08:00 |
|
Justin Lee
|
4f31f1b4cc
|
added few-shot-guide
|
2024-11-01 14:14:35 -07:00 |
|
Justin Lee
|
78083c4e0a
|
removed unnecessary files
|
2024-11-01 13:56:54 -07:00 |
|
Justin Lee
|
43289c36e1
|
added todo
|
2024-11-01 13:55:30 -07:00 |
|
Justin Lee
|
b41abff4fb
|
minor enhancement md
|
2024-11-01 13:50:24 -07:00 |
|
Justin Lee
|
1794ebc627
|
added cloud-local-inference-guide
|
2024-11-01 13:39:11 -07:00 |
|
Ashwin Bharambe
|
bf4f97a2e1
|
Fix vLLM adapter chat_completion signature
|
2024-11-01 13:09:03 -07:00 |
|
Justin Lee
|
46763bc001
|
quick fix on title
|
2024-11-01 11:46:35 -07:00 |
|
Justin Lee
|
ed70e140eb
|
added streaming guide
|
2024-11-01 11:41:03 -07:00 |
|
Dalton Flanagan
|
adecb2a2d3
|
update for message parsing on ios
|
2024-11-01 14:37:19 -04:00 |
|
Justin Lee
|
bf16d7729f
|
wrote guide for chat completion
|
2024-11-01 11:33:15 -07:00 |
|
Justin Lee
|
b514f1ec3a
|
quickstart guide (might be dated)
|
2024-10-31 15:49:05 -07:00 |
|
Justin Lee
|
703d7ebb6e
|
adding a few more inference examples
|
2024-10-31 15:46:47 -07:00 |
|
Justin Lee
|
626dffa0d9
|
added simple inferences
|
2024-10-31 15:45:47 -07:00 |
|
Ashwin Bharambe
|
37b330b4ef
|
add dynamic clients for all APIs (#348)
* add dynamic clients for all APIs
* fix openapi generator
* inference + memory + agents tests now pass with "remote" providers
* Add docstring which fixes openapi generator :/
|
2024-10-31 14:46:25 -07:00 |
|
Kai Wu
|
e4560a5e74
|
second draft
|
2024-10-31 13:37:55 -07:00 |
|
Steve Grubb
|
f04b566c5c
|
Do not cache pip (#349)
Pip has a 3.3GB cache of torch and friends. Do not keep this in the image.
|
2024-10-31 09:52:40 -07:00 |
|
Xi Yan
|
3b1917d5ea
|
run openapi generator
|
2024-10-30 16:17:35 -07:00 |
|
Kai Wu
|
050b1ae718
|
agents101 draft
|
2024-10-30 14:01:28 -07:00 |
|
Kai Wu
|
384b31c4c2
|
first draft
|
2024-10-29 17:42:41 -07:00 |
|
Ashwin Bharambe
|
4aa1bf6a60
|
Kill --name from llama stack build (#340)
|
2024-10-28 23:07:32 -07:00 |
|
Ashwin Bharambe
|
26d1668f7d
|
Revert "remove Field for return_type"
This reverts commit ffb3965ade .
|
2024-10-28 21:39:48 -07:00 |
|
Ashwin Bharambe
|
eccd7dc4a9
|
Avoid warnings from pydantic for overriding schema
Also fix structured output in completions
|
2024-10-28 21:39:48 -07:00 |
|
Xi Yan
|
ed833bb758
|
[Evals API][7/n] braintrust scoring provider (#333)
* wip scoring refactor
* llm as judge, move folders
* test full generation + eval
* extract score regex to llm context
* remove prints, cleanup braintrust in this branch
* braintrust skeleton
* datasetio test fix
* braintrust provider
* remove prints
* dependencies
* change json -> class
* json -> class
* remove initialize
* address nits
* check identifier prefix
* braintrust scoring identifier check, rebase
* udpate MANIFEST
* manifest
* remove braintrust scoring_fn
* remove comments
* tests
* imports fix
|
2024-10-28 18:59:35 -07:00 |
|
Xi Yan
|
ae671eaf7a
|
distro readmes with model serving instructions (#339)
* readme updates
* quantied compose
* dell tgi
* config update
* readme
* update model serving readmes
* update
* update
* config
|
2024-10-28 17:47:14 -07:00 |
|
Xi Yan
|
a70a4706fc
|
update distributions compose/readme (#338)
* readme updates
* quantied compose
* dell tgi
* config update
|
2024-10-28 16:34:43 -07:00 |
|
Xi Yan
|
985ff4d6ce
|
update distributions/readmes
|
2024-10-28 15:10:40 -07:00 |
|
Xi Yan
|
7b8748c53e
|
[Evals API][6/n] meta-reference llm as judge, registration for ScoringFnDefs (#330)
* wip scoring refactor
* llm as judge, move folders
* test full generation + eval
* extract score regex to llm context
* remove prints, cleanup braintrust in this branch
* change json -> class
* remove initialize
* address nits
* check identifier prefix
* udpate MANIFEST
|
2024-10-28 14:08:42 -07:00 |
|
Xi Yan
|
04a4784287
|
Update README.md
|
2024-10-28 13:25:44 -07:00 |
|
Xi Yan
|
3fa1eaf37d
|
Update README.md
|
2024-10-28 13:18:55 -07:00 |
|
Xi Yan
|
0d4215e125
|
Update README.md
|
2024-10-28 13:18:34 -07:00 |
|
Xi Yan
|
8f5a850de9
|
Update README.md
|
2024-10-28 13:16:23 -07:00 |
|
Xi Yan
|
ffb3965ade
|
remove Field for return_type
|
2024-10-28 13:04:41 -07:00 |
|
Ashwin Bharambe
|
b7d2b83d55
|
Allow passing provider_registry to resolve_impls()
|
2024-10-28 11:58:16 -07:00 |
|
Ashwin Bharambe
|
8a3b64d1be
|
Bump version to 0.0.47
|
2024-10-27 22:30:38 -07:00 |
|
Xi Yan
|
46bb8884a7
|
distributions readme typos
|
2024-10-27 11:57:21 -07:00 |
|
Dalton Flanagan
|
44c05c6e7d
|
add vision instruct models for fireworks
|
2024-10-27 17:54:54 +00:00 |
|
Dinesh Yeduguru
|
9b85d9a841
|
completion() for fireworks (#329)
|
2024-10-25 16:12:10 -07:00 |
|
Dinesh Yeduguru
|
7ec79f3b9d
|
completion() for together (#324)
* completion() for together
* test fixes
* fix client building
|
2024-10-25 14:21:12 -07:00 |
|
Xi Yan
|
8a74e400d6
|
Update getting_started.md
|
2024-10-25 13:30:33 -07:00 |
|
Xi Yan
|
f168752bba
|
Update getting_started.md
|
2024-10-25 13:27:43 -07:00 |
|
Xi Yan
|
abdf7cddf3
|
[Evals API][4/n] evals with generation meta-reference impl (#303)
* wip
* dataset validation
* test_scoring
* cleanup
* clean up test
* comments
* error checking
* dataset client
* test client:
* datasetio client
* clean up
* basic scoring function works
* scorer wip
* equality scorer
* score batch impl
* score batch
* update scoring test
* refactor
* validate scorer input
* address comments
* evals with generation
* add all rows scores to ScoringResult
* minor typing
* bugfix
* scoring function def rename
* rebase name
* refactor
* address comments
* Update iOS inference instructions for new quantization
* Small updates to quantization config
* Fix score threshold in faiss
* Bump version to 0.0.45
* Handle both ipv6 and ipv4 interfaces together
* update manifest for build templates
* Update getting_started.md
* chatcompletion & completion input type validation
* inclusion->subsetof
* error checking
* scoring_function -> scoring_fn rename, scorer -> scoring_fn rename
* address comments
* [Evals API][5/n] fixes to generate openapi spec (#323)
* generate openapi
* typing comment, dataset -> dataset_id
* remove custom type
* sample eval run.yaml
---------
Co-authored-by: Dalton Flanagan <6599399+dltn@users.noreply.github.com>
Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>
|
2024-10-25 13:12:39 -07:00 |
|
Ashwin Bharambe
|
426d821e7f
|
Bump version to 0.0.46
|
2024-10-25 13:10:55 -07:00 |
|