Ishaan Jaff
5de101ab7b
[Feat] Add GET, DELETE Responses endpoints on LiteLLM Proxy ( #10297 )
...
* add GET responses endpoints on router
* add GET responses endpoints on router
* add GET responses endpoints on router
* add DELETE responses endpoints on proxy
* fixes for testing GET, DELETE endpoints
* test_basic_responses api e2e
2025-04-24 17:34:26 -07:00
Ishaan Jaff
868cdd0226
[Feat] Add Support for DELETE /v1/responses/{response_id} on OpenAI, Azure OpenAI ( #10205 )
...
* add transform_delete_response_api_request to base responses config
* add transform_delete_response_api_request
* add delete_response_api_handler
* fixes for deleting responses, response API
* add adelete_responses
* add async test_basic_openai_responses_delete_endpoint
* test_basic_openai_responses_delete_endpoint
* working delete for streaming on responses API
* fixes azure transformation
* TestAnthropicResponsesAPITest
* fix code check
* fix linting
* fixes for get_complete_url
* test_basic_openai_responses_streaming_delete_endpoint
* streaming fixes
2025-04-22 18:27:03 -07:00
Ishaan Jaff
0c2f705417
[Feat] Add Responses API - Routing Affinity logic for sessions ( #10193 )
...
* test for test_responses_api_routing_with_previous_response_id
* test_responses_api_routing_with_previous_response_id
* add ResponsesApiDeploymentCheck
* ResponsesApiDeploymentCheck
* ResponsesApiDeploymentCheck
* fix ResponsesApiDeploymentCheck
* test_responses_api_routing_with_previous_response_id
* ResponsesApiDeploymentCheck
* test_responses_api_deployment_check.py
* docs routing affinity
* simplify ResponsesApiDeploymentCheck
* test response id
* fix code quality check
2025-04-21 20:00:27 -07:00
Ishaan Jaff
d3e04eac7f
[Feat] Unified Responses API - Add Azure Responses API support ( #10116 )
...
* initial commit for azure responses api support
* update get complete url
* fixes for responses API
* working azure responses API
* working responses API
* test suite for responses API
* azure responses API test suite
* fix test with complete url
* fix test refactor
* test fix metadata checks
* fix code quality check
2025-04-17 16:47:59 -07:00
Krish Dholakia
5ac61a7572
Add bedrock latency optimized inference support ( #9623 )
...
* fix(converse_transformation.py): add performanceConfig param support on bedrock
Closes https://github.com/BerriAI/litellm/issues/7606
* fix(converse_transformation.py): refactor to use more flexible single getter for params which are separate config blocks
* test(test_main.py): add e2e mock test for bedrock performance config
* build(model_prices_and_context_window.json): add versioned multimodal embedding
* refactor(multimodal_embeddings/): migrate to config pattern
* feat(vertex_ai/multimodalembeddings): calculate usage for multimodal embedding calls
Enables cost calculation for multimodal embeddings
* feat(vertex_ai/multimodalembeddings): get usage object for embedding calls
ensures accurate cost tracking for vertexai multimodal embedding calls
* fix(embedding_handler.py): remove unused imports
* fix: fix linting errors
* fix: handle response api usage calculation
* test(test_vertex_ai_multimodal_embedding_transformation.py): update tests
* test: mark flaky test
* feat(vertex_ai/multimodal_embeddings/transformation.py): support text+image+video input
* docs(vertex.md): document sending text + image to vertex multimodal embeddings
* test: remove incorrect file
* fix(multimodal_embeddings/transformation.py): fix linting error
* style: remove unused import
2025-03-29 00:23:09 -07:00
Ishaan Jaff
de473bee4b
fix mypy linting errors
2025-03-12 12:13:19 -07:00
Ishaan Jaff
ffa4978f8a
ResponsesAPIRequestUtils
2025-03-12 09:36:08 -07:00
Ishaan Jaff
24cb83b0e4
Response API cost tracking
2025-03-11 22:02:14 -07:00
Ishaan Jaff
f32968409e
working basic openai response api request
2025-03-11 17:37:19 -07:00
Ishaan Jaff
2c6774e3ee
get_optional_params_responses_api
2025-03-11 16:00:49 -07:00