## What does this PR do?

This PR implements a passthrough inference provider that works with any endpoint that respects the Llama Stack inference API definition.

## Test Plan

Configured an endpoint that respects the Llama Stack inference API definition and verified that inference requests return results successfully. A sketch of the provider's shape follows the screenshot below.

<img width="1268" alt="Screenshot 2025-02-19 at 8 52 51 PM" src="https://github.com/user-attachments/assets/447816e4-ea7a-4365-b90c-386dc7dcf4a1" />
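For context, here is a minimal sketch of what a passthrough adapter like this can look like, assuming the remote endpoint serves the same chat-completion route that Llama Stack exposes locally. The class name `PassthroughInferenceAdapter`, the `PassthroughConfig` fields, and the `/v1/inference/chat-completion` route are illustrative assumptions, not necessarily the exact names used in this PR.

```python
# Sketch of a passthrough inference adapter. Names and the route below are
# illustrative; the real provider in this PR may differ in its details.
from dataclasses import dataclass
from typing import Any

import httpx


@dataclass
class PassthroughConfig:
    url: str           # base URL of the remote Llama Stack-compatible endpoint
    api_key: str = ""  # optional bearer token for the remote endpoint


class PassthroughInferenceAdapter:
    """Forwards inference requests unchanged to a remote endpoint."""

    def __init__(self, config: PassthroughConfig) -> None:
        self.config = config

    async def chat_completion(
        self,
        model_id: str,
        messages: list[dict[str, Any]],
        **params: Any,
    ) -> dict[str, Any]:
        headers = {}
        if self.config.api_key:
            headers["Authorization"] = f"Bearer {self.config.api_key}"
        async with httpx.AsyncClient(base_url=self.config.url) as client:
            # Forward the request body as-is; assumed route name.
            resp = await client.post(
                "/v1/inference/chat-completion",
                json={"model_id": model_id, "messages": messages, **params},
                headers=headers,
            )
            resp.raise_for_status()
            # Return the remote response body without translation.
            return resp.json()
```

Because the adapter forwards the request body unchanged and returns the response body as-is, any server that speaks the Llama Stack inference API can sit behind it without per-provider translation logic.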