llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-04 12:07:34 +00:00

Ashwin Bharambe e84d4436b5 Since we are pushing for HF repos, we should accept them in inference configs (#497)	2024-11-20 16:14:37 -08:00

# What does this PR do?

As the title says.

## Test Plan

This needs `8752149f58` to also land. So the next package (0.0.54) will make this work properly.

The test is:

```bash
pytest -v -s -m "llama_3b and meta_reference" test_model_registration.py
```

quantization	precommit	2024-11-08 17:58:58 -08:00
__init__.py	Add provider deprecation support; change directory structure (#397)	2024-11-07 13:04:53 -08:00
config.py	Since we are pushing for HF repos, we should accept them in inference configs (#497)	2024-11-20 16:14:37 -08:00
generation.py	Auto-generate distro yamls + docs (#468)	2024-11-18 14:57:06 -08:00
inference.py	unregister for memory banks and remove update API (#458)	2024-11-14 17:12:11 -08:00
model_parallel.py	Add provider deprecation support; change directory structure (#397)	2024-11-07 13:04:53 -08:00
parallel_utils.py	precommit	2024-11-08 17:58:58 -08:00