llama-stack/llama_stack/providers/inline
Ashwin Bharambe 540fc4d717
Fix Meta reference GPU implementation (#663)
By performing in-place mutations, we lost. Never in life do that.
2024-12-19 14:09:45 -08:00
..
agents fix context_retriever model->model_id 2024-12-19 12:52:00 -08:00
datasetio Telemetry API redesign (#525) 2024-12-04 11:22:45 -08:00
eval Add ability to query and export spans to dataset (#574) 2024-12-05 21:07:30 -08:00
inference Fix Meta reference GPU implementation (#663) 2024-12-19 14:09:45 -08:00
ios/inference impls -> inline, adapters -> remote (#381) 2024-11-06 14:54:05 -08:00
memory Update the "InterleavedTextMedia" type (#635) 2024-12-17 11:18:31 -08:00
post_training/torchtune [3/n][torchtune integration] add validation logic (#600) 2024-12-13 16:35:06 -08:00
safety Add more debugging logs to when llama guard fails 2024-12-17 18:52:02 -08:00
scoring [/scoring] add ability to define aggregation functions for scoring functions & refactors (#597) 2024-12-11 10:03:42 -08:00
telemetry Update Telemetry API so OpenAPI generation can work (#640) 2024-12-16 13:00:14 -08:00
__init__.py impls -> inline, adapters -> remote (#381) 2024-11-06 14:54:05 -08:00