mirror of
https://github.com/meta-llama/llama-stack.git
synced 2025-07-27 06:28:50 +00:00
- Add new Vertex AI remote inference provider with litellm integration - Support for Gemini models through Google Cloud Vertex AI platform - Uses Google Cloud Application Default Credentials (ADC) for authentication - Added VertexAI models: gemini-2.5-flash, gemini-2.5-pro, gemini-2.0-flash. - Updated provider registry to include vertexai provider - Updated starter template to support Vertex AI configuration - Added comprehensive documentation and sample configuration Signed-off-by: Eran Cohen <eranco@redhat.com> |
||
---|---|---|
.. | ||
__init__.py | ||
agents.py | ||
datasetio.py | ||
eval.py | ||
files.py | ||
inference.py | ||
post_training.py | ||
safety.py | ||
scoring.py | ||
telemetry.py | ||
tool_runtime.py | ||
vector_io.py |