Krish Dholakia
9b7ebb6a7d
build(pyproject.toml): add new dev dependencies - for type checking (#9631)
* build(pyproject.toml): add new dev dependencies - for type checking
* build: reformat files to fit black
* ci: reformat to fit black
* ci(test-litellm.yml): make tests run clear
* build(pyproject.toml): add ruff
* fix: fix ruff checks
* build(mypy/): fix mypy linting errors
* fix(hashicorp_secret_manager.py): fix passing cert for tls auth
* build(mypy/): resolve all mypy errors
* test: update test
* fix: fix black formatting
* build(pre-commit-config.yaml): use poetry run black
* fix(proxy_server.py): fix linting error
* fix: fix ruff safe representation error
2025-03-29 11:02:13 -07:00
Krish Dholakia
5ac61a7572
Add bedrock latency optimized inference support (#9623)
* fix(converse_transformation.py): add performanceConfig param support on bedrock
Closes https://github.com/BerriAI/litellm/issues/7606
* fix(converse_transformation.py): refactor to use more flexible single getter for params which are separate config blocks
* test(test_main.py): add e2e mock test for bedrock performance config
* build(model_prices_and_context_window.json): add versioned multimodal embedding
* refactor(multimodal_embeddings/): migrate to config pattern
* feat(vertex_ai/multimodalembeddings): calculate usage for multimodal embedding calls
Enables cost calculation for multimodal embeddings
* feat(vertex_ai/multimodalembeddings): get usage object for embedding calls
ensures accurate cost tracking for vertexai multimodal embedding calls
* fix(embedding_handler.py): remove unused imports
* fix: fix linting errors
* fix: handle response api usage calculation
* test(test_vertex_ai_multimodal_embedding_transformation.py): update tests
* test: mark flaky test
* feat(vertex_ai/multimodal_embeddings/transformation.py): support text+image+video input
* docs(vertex.md): document sending text + image to vertex multimodal embeddings
* test: remove incorrect file
* fix(multimodal_embeddings/transformation.py): fix linting error
* style: remove unused import
2025-03-29 00:23:09 -07:00
Ishaan Jaff
af8e917eb0
MockResponsesAPIStreamingIterator
2025-03-20 12:25:58 -07:00
Ishaan Jaff
c64f834272
add AsyncMockResponsesAPIStreamingIterator
2025-03-20 12:06:36 -07:00
Ishaan Jaff
0cd671785d
add fake_stream to llm http handler
2025-03-20 09:55:59 -07:00
Ishaan Jaff
15d618f5b1
Add exception mapping for responses API
2025-03-13 15:57:58 -07:00
Ishaan Jaff
de473bee4b
fix mypy linting errors
2025-03-12 12:13:19 -07:00
Ishaan Jaff
1f7c21fd1b
remove infinite loop for streaming
2025-03-12 11:55:17 -07:00
Ishaan Jaff
181072e15d
fix code quality checks
2025-03-12 11:47:10 -07:00
Ishaan Jaff
d6351c3433
test_basic_openai_responses_api
2025-03-12 10:07:03 -07:00
Ishaan Jaff
35e9bba154
_handle_logging_completed_response
2025-03-12 09:42:48 -07:00
Ishaan Jaff
d6a49f6b66
explicitly pass params to partial func
2025-03-12 09:37:38 -07:00
Ishaan Jaff
ffa4978f8a
ResponsesAPIRequestUtils
2025-03-12 09:36:08 -07:00
Ishaan Jaff
58acf23c3e
STREAM_SSE_DONE_STRING
2025-03-12 09:33:28 -07:00
Ishaan Jaff
aa250088b2
re-use base_llm_http_handler
2025-03-12 09:31:34 -07:00
Ishaan Jaff
047879c004
add aresponses
2025-03-12 09:22:44 -07:00
Ishaan Jaff
e4cda0a1b7
add SyncResponsesAPIStreamingIterator
2025-03-12 09:18:25 -07:00
Ishaan Jaff
fde75a068a
working streaming logging
2025-03-12 00:02:39 -07:00
Ishaan Jaff
b790f0a5c6
log input of response API
2025-03-11 22:34:18 -07:00
Ishaan Jaff
51dc24a405
_transform_response_api_usage_to_chat_usage
2025-03-11 22:26:44 -07:00
Ishaan Jaff
20e3398476
fix typing for aresponses
2025-03-11 22:03:48 -07:00
Ishaan Jaff
24cb83b0e4
Response API cost tracking
2025-03-11 22:02:14 -07:00
Ishaan Jaff
8fa313ab07
add async streaming support
2025-03-11 20:00:42 -07:00
Ishaan Jaff
aa40cb5b26
working ResponsesAPIStreamingIterator
2025-03-11 19:47:43 -07:00
Ishaan Jaff
f32968409e
working basic openai response api request
2025-03-11 17:37:19 -07:00
Ishaan Jaff
2c6774e3ee
get_optional_params_responses_api
2025-03-11 16:00:49 -07:00
Ishaan Jaff
8c4331638e
add async aresponses
2025-03-11 14:33:50 -07:00
Ishaan Jaff
4b1b87eb67
openai reasoning initial types
2025-03-11 14:28:47 -07:00