llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-17 23:17:15 +00:00

History

Fred Reiss 8e358ec6a8 Merge branch 'vllm' into vllm-merge-1		2024-12-19 11:30:42 -08:00
..
__init__.py	Remove "routing_table" and "routing_key" concepts for the user (#201 )	2024-10-10 10:24:13 -07:00
conftest.py	Make embedding generation go through inference (#606 )	2024-12-12 11:47:50 -08:00
fixtures.py	Merge branch 'vllm' into vllm-merge-1	2024-12-19 11:30:42 -08:00
pasta.jpeg	Enable vision models for (Together, Fireworks, Meta-Reference, Ollama) (#376 )	2024-11-05 16:22:33 -08:00
test_embeddings.py	add embedding model by default to distribution templates (#617 )	2024-12-13 12:48:00 -08:00
test_model_registration.py	[4/n][torchtune integration] support lazy load model during inference (#620 )	2024-12-18 16:30:53 -08:00
test_prompt_adapter.py	Added tests for persistence (#274 )	2024-10-22 19:41:46 -07:00
test_text_inference.py	Add inline vLLM provider to regression tests	2024-12-19 11:27:59 -08:00
test_vision_inference.py	Update the "InterleavedTextMedia" type (#635 )	2024-12-17 11:18:31 -08:00
utils.py	Enable vision models for (Together, Fireworks, Meta-Reference, Ollama) (#376 )	2024-11-05 16:22:33 -08:00