llama-stack-mirror/llama_stack/providers/tests/inference
2024-12-19 11:30:42 -08:00
__init__.py Remove "routing_table" and "routing_key" concepts for the user (#201) 2024-10-10 10:24:13 -07:00
conftest.py Make embedding generation go through inference (#606) 2024-12-12 11:47:50 -08:00
fixtures.py Merge branch 'vllm' into vllm-merge-1 2024-12-19 11:30:42 -08:00
pasta.jpeg Enable vision models for (Together, Fireworks, Meta-Reference, Ollama) (#376) 2024-11-05 16:22:33 -08:00
test_embeddings.py add embedding model by default to distribution templates (#617) 2024-12-13 12:48:00 -08:00
test_model_registration.py [4/n][torchtune integration] support lazy load model during inference (#620) 2024-12-18 16:30:53 -08:00
test_prompt_adapter.py Added tests for persistence (#274) 2024-10-22 19:41:46 -07:00
test_text_inference.py Add inline vLLM provider to regression tests 2024-12-19 11:27:59 -08:00
test_vision_inference.py Update the "InterleavedTextMedia" type (#635) 2024-12-17 11:18:31 -08:00
utils.py Enable vision models for (Together, Fireworks, Meta-Reference, Ollama) (#376) 2024-11-05 16:22:33 -08:00