Commit graph

449 commits

Author SHA1 Message Date
Xi Yan
c5cf9c30be score batch 2024-11-06 17:30:46 -08:00
Xi Yan
0bce74402f scoring test pass 2024-11-06 17:27:55 -08:00
Xi Yan
0351072531 fix scoring register 2024-11-06 17:18:16 -08:00
Xi Yan
def6d5d8ad scoring resolve 2024-11-06 17:04:25 -08:00
Xi Yan
c53733d1a3 fixture 2024-11-06 16:41:17 -08:00
Xi Yan
00869799a1 Merge branch 'main' into eval_task_register 2024-11-06 16:34:22 -08:00
Xi Yan
8fc2d212a2
fix safety signature mismatch (#388)
* fix safety sig

* shield_type->identifier
2024-11-06 16:30:47 -08:00
Ashwin Bharambe
7c340f0236 rename test_inference -> test_text_inference 2024-11-06 16:12:50 -08:00
Xi Yan
10eda0af59 delete unused 2024-11-06 16:08:04 -08:00
Ashwin Bharambe
3b54ce3499 remote::vllm now works with vision models 2024-11-06 16:07:17 -08:00
Xi Yan
1fe4099bd0 datasetio test 2024-11-06 16:00:38 -08:00
Xi Yan
1b7e19d5d0 Merge branch 'main' into eval_task_register 2024-11-06 15:05:46 -08:00
Ashwin Bharambe
994732e2e0
impls -> inline, adapters -> remote (#381) 2024-11-06 14:54:05 -08:00
Ashwin Bharambe
b10e9f46bb
Enable remote::vllm (#384)
* Enable remote::vllm

* Kill the giant list of hard coded models
2024-11-06 14:42:44 -08:00
Dinesh Yeduguru
093c9f1987
add bedrock distribution code (#358)
* add bedrock distribution code

* fix linter error

* add bedrock shields support

* linter fixes

* working bedrock safety

* change to return only one violation

* remove env var reading

* refereshable boto credentials

* remove env vars

* address raghu's feedback

* fix session_ttl passing

---------

Co-authored-by: Dinesh Yeduguru <dineshyv@fb.com>
2024-11-06 14:39:11 -08:00
Dinesh Yeduguru
6ebd553da5
fix routing tables look up key for memory bank (#383)
Co-authored-by: Dinesh Yeduguru <dineshyv@fb.com>
2024-11-06 13:32:46 -08:00
Xi Yan
748606195b
Kill llama stack configure (#371)
* remove configure

* build msg

* wip

* build->run

* delete prints

* docs

* fix docs, kill configure

* precommit

* update fireworks build

* docs

* clean up build

* comments

* fix

* test

* remove baking build.yaml into docker

* fix msg, urls

* configure msg
2024-11-06 13:32:10 -08:00
Xi Yan
f778b907e4 wip 2024-11-06 12:52:02 -08:00
Ashwin Bharambe
d289afdbde Fix exception in server when client SSE connection closes 2024-11-06 11:00:34 -08:00
Xi Yan
683a370d23 wip tests 2024-11-06 10:03:49 -08:00
Xi Yan
be7b76ceac rename 2024-11-05 17:08:32 -08:00
Xi Yan
e5b4e4d569 api name 2024-11-05 17:01:05 -08:00
Xi Yan
4a64f98c82 separate benchmark / app eval 2024-11-05 16:54:31 -08:00
Ashwin Bharambe
cde9bc1388
Enable vision models for (Together, Fireworks, Meta-Reference, Ollama) (#376)
* Enable vision models for Together and Fireworks

* Works with ollama 0.4.0 pre-release with the vision model

* localize media for meta_reference inference

* Fix
2024-11-05 16:22:33 -08:00
Xi Yan
979cd4cd44 naming fix 2024-11-05 16:20:16 -08:00
Xi Yan
9759911e47 typo 2024-11-05 16:06:40 -08:00
Xi Yan
f3955d04d7 config field doc 2024-11-05 16:06:02 -08:00
Xi Yan
be0649d79d unwrap context -> config 2024-11-05 16:02:47 -08:00
Xi Yan
db30809141 precommit 2024-11-05 15:26:13 -08:00
Xi Yan
0706f6c82f add Llama3.2-3B-Instruct:int4-qlora-eo8 2024-11-05 15:22:26 -08:00
Xi Yan
16b7fa4614 quantized model docs 2024-11-05 15:21:13 -08:00
Dinesh Yeduguru
4dd01eeaa1
fix postgres config validation (#380)
* fix postgres config validation

* dont remove types

---------

Co-authored-by: Dinesh Yeduguru <dineshyv@fb.com>
2024-11-05 15:09:04 -08:00
Xi Yan
04eebd8a36 pre commit 2024-11-05 15:06:44 -08:00
Xi Yan
60fc191308 evaluate api update 2024-11-05 15:03:12 -08:00
Xi Yan
1b62188c30 eval task 2024-11-05 14:59:50 -08:00
Xi Yan
bca96b5b35 eval api 2024-11-05 14:55:59 -08:00
Dinesh Yeduguru
a2351bf2e9
add ability to persist memory banks created for faiss (#375)
* init

* add tests

* fix tests'

* more fixes

* add tests

* make the default path more faiss specific

* fix linter

---------

Co-authored-by: Dinesh Yeduguru <dineshyv@fb.com>
2024-11-05 14:50:23 -08:00
Xi Yan
fe91608321 scoring fn api 2024-11-05 14:34:56 -08:00
Dinesh Yeduguru
dcd8cfe0f3
add postgres kvstoreimpl (#374)
* add postgres kvstoreimpl

* make table name configurable

* add validator for table name

* linter fix

---------

Co-authored-by: Dinesh Yeduguru <dineshyv@fb.com>
2024-11-05 11:42:21 -08:00
Xi Yan
4fc92e52d7 wip 2024-11-05 11:23:24 -08:00
Ashwin Bharambe
8de845a96d Kill everything from tests/ 2024-11-04 22:10:16 -08:00
Ashwin Bharambe
f08efc23a6 Kill non-integration older tests 2024-11-04 22:06:34 -08:00
Steve Grubb
122793ab92
Correct a traceback in vllm (#366)
File "/usr/local/lib/python3.10/site-packages/llama_stack/providers/adapters/inference/vllm/vllm.py", line 136, in _stream_chat_completion
async for chunk in process_chat_completion_stream_response(
TypeError: process_chat_completion_stream_response() takes 2 positional arguments but 3 were given

This corrects the error by deleting the request variable
2024-11-04 20:49:35 -08:00
Ashwin Bharambe
3ca294c359 Bump version to 0.0.49 2024-11-04 20:38:00 -08:00
Ashwin Bharambe
a81178f1f5 The server now depends on SQLite by default 2024-11-04 20:35:53 -08:00
Ashwin Bharambe
9a57a009ee Need to await for get_object_from_identifier() now 2024-11-04 20:33:12 -08:00
Ashwin Bharambe
7cf4c905f3 add support for remote providers in tests 2024-11-04 20:30:46 -08:00
Ashwin Bharambe
0763a0b85f Fix for the fix! 2024-11-04 20:06:01 -08:00
Ashwin Bharambe
fb2678b134 Fix shield_type and routing table breakage 2024-11-04 19:57:15 -08:00
Xi Yan
657de08f04 precommit 2024-11-04 19:01:56 -08:00