|
api
|
update import for quantization format from models
|
2024-07-22 00:04:03 -07:00 |
|
quantization
|
rename toolchain/ --> llama_toolchain/
|
2024-07-21 23:48:38 -07:00 |
|
__init__.py
|
rename toolchain/ --> llama_toolchain/
|
2024-07-21 23:48:38 -07:00 |
|
api_instance.py
|
rename toolchain/ --> llama_toolchain/
|
2024-07-21 23:48:38 -07:00 |
|
client.py
|
rename toolchain/ --> llama_toolchain/
|
2024-07-21 23:48:38 -07:00 |
|
generation.py
|
rename toolchain/ --> llama_toolchain/
|
2024-07-21 23:48:38 -07:00 |
|
inference.py
|
rename toolchain/ --> llama_toolchain/
|
2024-07-21 23:48:38 -07:00 |
|
server.py
|
rename toolchain/ --> llama_toolchain/
|
2024-07-21 23:48:38 -07:00 |