Krish Dholakia | 5ac61a7572 | Add bedrock latency optimized inference support (#9623)
* fix(converse_transformation.py): add performanceConfig param support on bedrock
Closes https://github.com/BerriAI/litellm/issues/7606
* fix(converse_transformation.py): refactor to use more flexible single getter for params which are separate config blocks
* test(test_main.py): add e2e mock test for bedrock performance config
* build(model_prices_and_context_window.json): add versioned multimodal embedding
* refactor(multimodal_embeddings/): migrate to config pattern
* feat(vertex_ai/multimodalembeddings): calculate usage for multimodal embedding calls
Enables cost calculation for multimodal embeddings
* feat(vertex_ai/multimodalembeddings): get usage object for embedding calls
ensures accurate cost tracking for vertexai multimodal embedding calls
* fix(embedding_handler.py): remove unused imports
* fix: fix linting errors
* fix: handle response api usage calculation
* test(test_vertex_ai_multimodal_embedding_transformation.py): update tests
* test: mark flaky test
* feat(vertex_ai/multimodal_embeddings/transformation.py): support text+image+video input
* docs(vertex.md): document sending text + image to vertex multimodal embeddings
* test: remove incorrect file
* fix(multimodal_embeddings/transformation.py): fix linting error
* style: remove unused import
| 2025-03-29 00:23:09 -07:00 |
Ishaan Jaff | de473bee4b | fix mypy linting errors | 2025-03-12 12:13:19 -07:00 |
Ishaan Jaff | ffa4978f8a | ResponsesAPIRequestUtils | 2025-03-12 09:36:08 -07:00 |
Ishaan Jaff | 24cb83b0e4 | Response API cost tracking | 2025-03-11 22:02:14 -07:00 |
Ishaan Jaff | f32968409e | working basic openai response api request | 2025-03-11 17:37:19 -07:00 |
Ishaan Jaff | 2c6774e3ee | get_optional_params_responses_api | 2025-03-11 16:00:49 -07:00 |