llama-stack/llama_stack/providers
2024-10-17 10:03:27 -07:00
..
adapters Remove request arg from chat completion response processing (#240) 2024-10-15 13:03:17 -07:00
impls Fix fp8 implementation which had bit-rotten a bit 2024-10-15 13:57:01 -07:00
registry Split off meta-reference-quantized provider 2024-10-10 16:03:19 -07:00
tests Allow overriding MODEL_IDS for inference test 2024-10-17 10:03:27 -07:00
utils Remove request arg from chat completion response processing (#240) 2024-10-15 13:03:17 -07:00
__init__.py API Updates (#73) 2024-09-17 19:51:35 -07:00
datatypes.py Remove "routing_table" and "routing_key" concepts for the user (#201) 2024-10-10 10:24:13 -07:00