Commit graph

  • 406c3b24d4
    upgrade llama_models (#55) Yufei (Benny) Chen 2024-09-06 12:03:13 -07:00
  • dd1e1ceb13 Add bubblewrap to the container Ashwin Bharambe 2024-09-05 16:45:58 -07:00
  • f6b5e394ab Remove dependence on os.environ["USER"] Ashwin Bharambe 2024-09-05 15:37:12 -07:00
  • 6c69e09c6a Bump version to 0.0.13 Ashwin Bharambe 2024-09-04 23:10:38 -07:00
  • 21bedc1596
    [inference] Add a TGI adapter (#52) Ashwin Bharambe 2024-09-04 22:49:33 -07:00
  • 6ad7365676 A little clean up for the Fireworks and Together adapters Ashwin Bharambe 2024-09-04 22:34:15 -07:00
  • 225cd75074
    Update cli_reference.md raghotham 2024-09-04 18:50:10 -07:00
  • bfee50aa83 A few more fixes to the OpenAPI generator Ashwin Bharambe 2024-09-04 10:29:20 -07:00
  • 0167953d2d Update OpenAPI generator for POST requests Ashwin Bharambe 2024-09-04 09:27:00 -07:00
  • 01d971bda6 Bump version to 0.0.12 Ashwin Bharambe 2024-09-03 23:24:02 -07:00
  • 1380d78c19 Fixes to the llama stack configure script + inference adapters Ashwin Bharambe 2024-09-03 23:22:21 -07:00
  • 4869f2b983 Update fireworks and together entries as adapters Ashwin Bharambe 2024-09-03 22:56:52 -07:00
  • f802d481d9 Bump version to 0.0.11 Ashwin Bharambe 2024-09-03 22:41:29 -07:00
  • 7bc7785b0d
    API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) Ashwin Bharambe 2024-09-03 22:39:39 -07:00
  • 35093c0b6f
    Add patch for SSE event endpoint responses (#50) Dalton Flanagan 2024-09-03 23:40:31 -04:00
  • 0af81776c7 fix for incomplete SSE type generation Dalton Flanagan 2024-09-03 13:11:40 -04:00
  • 70d557f793
    Update LICENSE (#47) raghotham 2024-08-29 07:39:50 -07:00
  • f2e18826b6
    Together AI basic integration (#43) Hassan El Mghari 2024-08-28 16:07:13 -07:00
  • a8b9541f19 Bump version to 0.0.10 Ashwin Bharambe 2024-08-27 04:19:27 -07:00
  • 117b95b38c
    Update RFC-0001-llama-stack.md raghotham 2024-08-26 20:56:09 -07:00
  • 870cd7bb8b Add blobfile for tiktoken Ashwin Bharambe 2024-08-26 14:50:20 -07:00
  • 40ca8e21bd
    Fireworks basic integration (#39) Yufei (Benny) Chen 2024-08-25 08:05:52 -07:00
  • f812648aca Bump version to 0.0.9 Ashwin Bharambe 2024-08-24 09:45:01 -07:00
  • c1a82ea8cd Add a script for install a pip wheel from a presigned url Ashwin Bharambe 2024-08-23 12:06:50 -07:00
  • 9777639a1c
    Updated URLs and addressed feedback (#37) varunfb 2024-08-22 13:34:46 -07:00
  • 4930616ec7
    Updated cli instructions with additonal details for each subcommands (#36) varunfb 2024-08-22 12:20:47 -07:00
  • 49f2bbbaeb
    fixed bug in download not enough disk space condition (#35) sisminnmaw 2024-08-23 00:10:47 +09:00
  • b4af8c0e00
    update cli ref doc: llama model template names related; separation of copy-and-pastable commands with their outputs (#34) Jeff Tang 2024-08-21 20:41:30 -07:00
  • 863bb915e1 Remove quantization_config from the APIs for now Ashwin Bharambe 2024-08-21 14:17:05 -07:00
  • ab0a24f333
    Add API keys to AgenticSystemConfig instead of relying on dotenv (#33) Ashwin Bharambe 2024-08-21 12:35:59 -07:00
  • face3ceff1 suppress warning in CLI Ashwin Bharambe 2024-08-21 12:25:13 -07:00
  • 270b5502d7 broaden URL match in download for older model families Dalton Flanagan 2024-08-21 12:11:11 -04:00
  • 2232bfa8b5
    RFC-0001-The-Llama-Stack (#8) raghotham 2024-08-20 19:01:18 -07:00
  • 57881c08c1 Bump version to 0.0.8 Ashwin Bharambe 2024-08-19 20:12:01 -07:00
  • e08e963f86 Add --manifest-file option to argparser Ashwin Bharambe 2024-08-19 18:26:30 -07:00
  • b3da6b8afb Bump version to 0.0.7 Ashwin Bharambe 2024-08-19 16:27:36 -07:00
  • 23de941424 Bump version to 0.0.6 Ashwin Bharambe 2024-08-19 14:12:18 -07:00
  • 38244c3161 llama_models.llama3_1 -> llama_models.llama3 Ashwin Bharambe 2024-08-19 10:55:37 -07:00
  • f502716cf7 Fix ShieldType Union equality bug dltn 2024-08-18 19:13:15 -07:00
  • 5e072d0780 Add a --manifest-file option to llama download Ashwin Bharambe 2024-08-17 10:08:00 -07:00
  • b8fc4d4dee
    Updates to prompt for tool calls (#29) Hardik Shah 2024-08-15 13:23:51 -07:00
  • 0d933ac4c5 No need for unnecessary $(conda run ...) to get python interpreter Ashwin Bharambe 2024-08-14 20:48:35 -07:00
  • 00f0e6d92b
    Avoid using nearly double the memory needed (#30) Ashwin Bharambe 2024-08-14 17:44:36 -07:00
  • b311dcd143 formatting Dalton Flanagan 2024-08-14 17:03:43 -04:00
  • 069d877210 Typo bugfix (rename variable x -> prompt) Ashwin Bharambe 2024-08-14 13:47:27 -07:00
  • b6ccaf1778 formatting Dalton Flanagan 2024-08-14 14:22:25 -04:00
  • 94dfa293a6 Bump version to 0.0.5 Hardik Shah 2024-08-13 15:23:57 -07:00
  • 432957d6b6 fix typo dltn 2024-08-13 11:39:57 -07:00
  • 7f13853e5e
    Update README.md Hardik Shah 2024-08-12 17:10:02 -07:00
  • 37da47ef8e upgrade pydantic to latest Hardik Shah 2024-08-12 15:14:21 -07:00
  • 2cd8b2ff5b Add simple validation for RemoteProviderConfig Ashwin Bharambe 2024-08-09 15:15:20 -07:00
  • 898cd5b352 Bump version to 0.0.4 dltn 2024-08-08 15:24:45 -07:00
  • 416097a9ea
    Rename inline -> local (#24) Dalton Flanagan 2024-08-08 17:39:03 -04:00
  • dd15671f7f Bump version to 0.0.3 Ashwin Bharambe 2024-08-08 13:40:03 -07:00
  • e830814399
    Introduce Llama stack distributions (#22) Ashwin Bharambe 2024-08-08 13:38:41 -07:00
  • da4645a27a
    hide non-featured (older) models from model list command without show-all flag (#23) Dalton Flanagan 2024-08-07 23:31:30 -04:00
  • 7664d5701d update tests and formatting Hardik Shah 2024-08-05 12:34:16 -07:00
  • d7a4cdd70d added options to ollama inference Hardik Shah 2024-08-02 14:44:22 -07:00
  • 09cf3fe78b Use new definitions of Model / SKU Ashwin Bharambe 2024-07-31 11:36:16 -07:00
  • 156bfa0e15
    Added Ollama as an inference impl (#20) Hardik Shah 2024-07-31 22:08:37 -07:00
  • c253c1c9ad Begin adding a /safety/run_shield API Ashwin Bharambe 2024-07-31 21:57:10 -07:00
  • 1bc81eae7b update toolchain to work with updated imports from llama_models Ashwin Bharambe 2024-07-30 17:52:57 -07:00
  • 23014ea4d1 Add hacks because Cloudfront config limits on the 405b model files Ashwin Bharambe 2024-07-30 13:46:20 -07:00
  • 404af06e02 Bump version to 0.0.2 Ashwin Bharambe 2024-07-29 23:56:41 -07:00
  • 7306e6b167 show sampling params in model describe Ashwin Bharambe 2024-07-29 23:44:07 -07:00
  • 040c30ee54 added resumable downloader for downloading models Ashwin Bharambe 2024-07-29 07:41:07 -07:00
  • 59574924de model template --template -> model template --name Ashwin Bharambe 2024-07-29 18:21:05 -07:00
  • 45b8a7ffcd Add model describe subcommand Ashwin Bharambe 2024-07-29 18:19:53 -07:00
  • 9d7f283722 Add model list subcommand Ashwin Bharambe 2024-07-29 16:39:53 -07:00
  • a789c47ec9
    Update cli_reference.md Dalton Flanagan 2024-07-29 16:31:56 -04:00
  • dd6c1f1e64
    Add links to shields Dalton Flanagan 2024-07-27 11:28:46 -04:00
  • b5d7cec11e
    Add shields to README Dalton Flanagan 2024-07-27 11:02:50 -04:00
  • 3583cf2d51 update model template output to be prettier, more consumable Ashwin Bharambe 2024-07-26 15:39:46 -07:00
  • 51f8049c7a Update fp8_requirements, we don't need nightly torch anymore Ashwin Bharambe 2024-07-26 08:25:44 -07:00
  • ec433448f2
    Add CLI reference docs (#14) Dalton Flanagan 2024-07-25 16:56:29 -04:00
  • b8aa99b034
    Update fbgemm version (#12) Jianyu Huang 2024-07-24 23:48:44 -07:00
  • 378a2077dd
    Update download command (#9) Lucain 2024-07-25 01:50:40 +02:00
  • 17bd1d876c Canonical package name for the dependency Ashwin Bharambe 2024-07-23 13:30:33 -07:00
  • f7e053e3ba Updates to setup and requirements for PyPI Ashwin Bharambe 2024-07-23 13:25:40 -07:00
  • d802d0f051 add requirements to MANIFEST.in Ashwin Bharambe 2024-07-23 12:59:28 -07:00
  • 5d5acc8ed5 Initial commit Ashwin Bharambe 2024-06-25 15:47:57 -07:00