llama-stack-mirror/llama_stack/providers/inline/inference
2024-12-19 11:30:42 -08:00
..
meta_reference [4/n][torchtune integration] support lazy load model during inference (#620) 2024-12-18 16:30:53 -08:00
sentence_transformers add embedding model by default to distribution templates (#617) 2024-12-13 12:48:00 -08:00
vllm Merge branch 'vllm' into vllm-merge-1 2024-12-19 11:30:42 -08:00
__init__.py precommit 2024-11-08 17:58:58 -08:00