Improve VLLM model discovery error handling

• Add comprehensive error handling in check_model_availability method
• Provide helpful error messages with actionable solutions for 404 errors
• Warn when API token is set but model discovery is disabled
This commit is contained in:
Akram Ben Aissi 2025-10-06 12:56:05 +02:00
parent e9214f9004
commit e28bc93635
15 changed files with 69 additions and 50 deletions

View file

@ -52,7 +52,7 @@ class InferenceImpl(Impl):
async def should_refresh_models(self):
return False
async def allow_listing_models(self):
async def enable_model_discovery(self):
return True
async def list_models(self):