forked from phoenix/litellm-mirror
add arch diagram
This commit is contained in:
parent
009a1f7f86
commit
54db564529
3 changed files with 37 additions and 1 deletions
31
docs/my-website/docs/proxy/architecture.md
Normal file
31
docs/my-website/docs/proxy/architecture.md
Normal file
|
@ -0,0 +1,31 @@
|
|||
import Image from '@theme/IdealImage';
|
||||
import Tabs from '@theme/Tabs';
|
||||
import TabItem from '@theme/TabItem';
|
||||
|
||||
# Life of a Request
|
||||
|
||||
## High Level architecture
|
||||
|
||||
<Image img={require('../../img/litellm_gateway.png')} />
|
||||
|
||||
|
||||
### Request Flow
|
||||
|
||||
1. **User Sends Request**: The process begins when a user sends a request to the LiteLLM Proxy Server (Gateway).
|
||||
|
||||
2. [**Virtual Keys**](../virtual_keys): The request first passes through the Virtual Keys component
|
||||
|
||||
3. **Rate Limiting**: The MaxParallelRequestsHandler applies rate limiting to manage the flow of requests.
|
||||
|
||||
4. **Proxy Server Processing**: The request is then processed by the LiteLLM proxy_server.py, which handles the core logic of the proxy.
|
||||
|
||||
5. [**LiteLLM Router**](../routing): LiteLLM Router**: The LiteLLM Router determines where to send the request based on the configuration and request parameters.
|
||||
|
||||
6. **Model Interaction**: The request is sent to the appropriate model API (litellm.completion() or litellm.embedding()) for processing.
|
||||
|
||||
7. **Response**: The model's response is sent back through the same components to the user.
|
||||
|
||||
8. **Post-Request Processing**: After the response is sent, several asynchronous operations occur:
|
||||
- The _PROXY_track_cost_callback updates spend in the database.
|
||||
- Logging to LangFuse for analytics and monitoring.
|
||||
- The MaxParallelRequestsHandler updates virtual key usage and performs post-request cleanup.
|
BIN
docs/my-website/img/litellm_gateway.png
Normal file
BIN
docs/my-website/img/litellm_gateway.png
Normal file
Binary file not shown.
After Width: | Height: | Size: 96 KiB |
|
@ -31,7 +31,12 @@ const sidebars = {
|
|||
"proxy/quick_start",
|
||||
"proxy/docker_quick_start",
|
||||
"proxy/deploy",
|
||||
"proxy/prod",
|
||||
"proxy/prod",
|
||||
{
|
||||
type: "category",
|
||||
label: "Architecture",
|
||||
items: ["proxy/architecture"],
|
||||
},
|
||||
{
|
||||
type: "link",
|
||||
label: "📖 All Endpoints (Swagger)",
|
||||
|
|
Loading…
Add table
Add a link
Reference in a new issue