llama-stack-mirror/llama_stack/providers/inline/inference
2024-12-11 10:35:00 -08:00
meta_reference         implement embedding generation in supported inference providers     2024-12-11 10:35:00 -08:00
sentence_transformers  implement embedding generation in supported inference providers     2024-12-11 10:35:00 -08:00
vllm                   Update more distribution docs to be simpler and partially codegen'ed  2024-11-20 22:03:44 -08:00
__init__.py            precommit                                                           2024-11-08 17:58:58 -08:00