Ishaan Jaff
968ef4a299
fix response typing
2025-03-26 16:56:56 -07:00
Ishaan Jaff
957b7eb82c
define types for response form AnthropicMessagesResponse
2025-03-26 16:54:45 -07:00
Ishaan Jaff
e93f84b347
add type hints for AnthropicMessagesResponse
2025-03-26 16:53:33 -07:00
Krish Dholakia
37265c8b1c
Add Daily User Spend Aggregate view - allows UI Usage tab to work > 1m rows ( #9538 )
...
* ci: update github action
* build(schema.prisma): enable daily user spend table
allows storing aggregate view of user's daily spend
* build(schema.prisma): add new daily user spend table
* feat: working daily user spend tracking
maintains an aggregate view for easier querying in high traffic
* setup_google_dns
* ci: update ci yaml
---------
Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>
2025-03-26 16:36:36 -07:00
Ishaan Jaff
b7f4abd13a
Merge pull request #9542 from BerriAI/litellm_fix_vertex_ai_ft_models
...
[Feature]: Support for Fine-Tuned Vertex AI LLMs
2025-03-26 16:19:41 -07:00
Ishaan Jaff
23aa7f81b5
fix ProxyException
2025-03-26 15:53:33 -07:00
Ishaan Jaff
ff33ed020c
fix auth checks
2025-03-26 15:45:58 -07:00
Ishaan Jaff
8bd2081dec
fix get_key_object
2025-03-26 15:41:40 -07:00
Ishaan Jaff
f8caebc54b
is_database_connection_error
2025-03-26 14:55:40 -07:00
Ishaan Jaff
044c56c439
Merge branch 'main' into litellm_stability_fixes
2025-03-26 14:41:43 -07:00
Ishaan Jaff
0aae9aa24a
rename _is_model_gemini_spec_model
2025-03-26 14:28:26 -07:00
Ishaan Jaff
8eaf4c55c0
test_gemini_fine_tuned_model_request_consistency
2025-03-26 14:18:11 -07:00
Ishaan Jaff
93daf5cbac
_get_model_name_from_gemini_spec_model
2025-03-26 12:16:18 -07:00
Krish Dholakia
801ecb6517
Nova Canvas complete image generation tasks ( #9177 ) ( #9525 )
...
Read Version from pyproject.toml / read-version (push) Successful in 17s
Helm unit test / unit-test (push) Successful in 22s
* Nova Canvas complete image generation tasks (#9177 )
* add initial support for Amazon Nova Canvas model
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* adjust name to AmazonNovaCanvas and map function variables to config
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* tighten model name check
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* fix quality mapping
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* add premium quality in config
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* support all Amazon Nova Canvas tasks
* remove unused import
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* add tests for image generation tasks and fix payload
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* add missing util file
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* update model prices backup file
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* remove image tasks other than text->image
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* add color guided generation task for Nova Canvas
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* fix merge
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* add nova canvas image generation documentation
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
* add nova canvas unit tests
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
---------
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com>
* ci(config.yml): bump ci config
* test: fix test
---------
Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
Co-authored-by: omrishiv <327609+omrishiv@users.noreply.github.com>
2025-03-26 11:28:20 -07:00
Ishaan Jaff
793a920caa
rename _is_model_gemini_spec_model
2025-03-26 11:14:51 -07:00
Ishaan Jaff
baa9b34950
Merge branch 'main' into litellm_fix_vertex_ai_ft_models
2025-03-26 11:11:54 -07:00
Ishaan Jaff
a2ae9be6a2
undo changes to utils
2025-03-26 11:09:29 -07:00
Ishaan Jaff
fee20250a6
pick_cheapest_chat_models_from_llm_provider
2025-03-26 10:59:02 -07:00
Ishaan Jaff
8a72b67b18
undo code changes
2025-03-26 10:57:08 -07:00
Ishaan Jaff
3ee7962f9c
fix llm request utils
2025-03-26 10:55:56 -07:00
Ishaan Jaff
bbe69a47a9
_is_model_gemini_gemini_spec_model
2025-03-26 10:53:23 -07:00
Krrish Dholakia
a97d0563b4
build(model_prices_and_context_window.json): add commercial rate limits for gemini 2.0 flash lite
2025-03-26 08:58:35 -07:00
Ishaan Jaff
22dfc4dea9
fix user_api_key_auth example config
2025-03-26 08:36:11 -07:00
Ishaan Jaff
fafbaf6302
Merge branch 'main' into litellm_stability_fixes
2025-03-26 07:16:31 -07:00
Ishaan Jaff
fb31006cd8
select_model_for_request_transformation
2025-03-26 07:12:25 -07:00
Ishaan Jaff
27c085cc56
fix util vertex
2025-03-26 00:08:16 -07:00
Ishaan Jaff
2bef0481af
_transform_request_body
2025-03-26 00:05:45 -07:00
Ishaan Jaff
830ecbdb8c
fix get_optional_params
2025-03-25 23:54:57 -07:00
Ishaan Jaff
aa8ba9b8f2
fix base_model in param mapping
2025-03-25 23:54:29 -07:00
Krish Dholakia
6fd18651d1
Support litellm.api_base
for vertex_ai + gemini/ across completion, embedding, image_generation ( #9516 )
...
Read Version from pyproject.toml / read-version (push) Successful in 19s
Helm unit test / unit-test (push) Successful in 20s
* test(tests): add unit testing for litellm_proxy integration
* fix(cost_calculator.py): fix tracking cost in sdk when calling proxy
* fix(main.py): respect litellm.api_base on `vertex_ai/` and `gemini/` routes
* fix(main.py): consistently support custom api base across gemini + vertexai on embedding + completion
* feat(vertex_ai/): test
* fix: fix linting error
* test: set api base as None before starting loadtest
2025-03-25 23:46:20 -07:00
Ishaan Jaff
8657816477
fix gemini/gemini-2.0-flash-lite on model cost map
2025-03-25 23:20:43 -07:00
Ishaan Jaff
c61214dcf1
Merge pull request #9523 from BerriAI/litellm_add_gemini_flash_lite
...
[Feat - New Model] Add VertexAI `gemini-2.0-flash-lite` and Google AI Studio `gemini-2.0-flash-lite`
2025-03-25 23:11:44 -07:00
Nicholas Grabar
f68cc26f15
8864 Add support for anyOf union type while handling null fields
2025-03-25 22:37:28 -07:00
Ishaan Jaff
79ef184345
run ci/cd again
2025-03-25 21:57:45 -07:00
Ishaan Jaff
b4e745323a
add test config
2025-03-25 19:21:51 -07:00
Ishaan Jaff
ce49e27217
fixes for auth checks
2025-03-25 15:44:13 -07:00
Ishaan Jaff
59040167ac
fix ProxyErrorTypes
2025-03-25 14:40:11 -07:00
Ishaan Jaff
4c87084ff7
UserAPIKeyAuthExceptionHandler
2025-03-25 14:07:14 -07:00
Ishaan Jaff
0af9a5e8d0
add gemini/gemini-2.0-flash-lite
2025-03-25 07:51:42 -07:00
Ishaan Jaff
62bb7d6605
add vertex gemini-2.0-flash-lite
2025-03-25 07:48:33 -07:00
Krish Dholakia
92883560f0
fix vertex ai multimodal embedding translation ( #9471 )
...
Read Version from pyproject.toml / read-version (push) Successful in 20s
Helm unit test / unit-test (push) Successful in 24s
* remove data:image/jpeg;base64, prefix from base64 image input
vertex_ai's multimodal embeddings endpoint expects a raw base64 string without `data:image/jpeg;base64,` prefix.
* Add Vertex Multimodal Embedding Test
* fix(test_vertex.py): add e2e tests on multimodal embeddings
* test: unit testing
* test: remove sklearn dep
* test: update test with fixed route
* test: fix test
---------
Co-authored-by: Jonarod <jonrodd@gmail.com>
Co-authored-by: Emerson Gomes <emerson.gomes@thalesgroup.com>
2025-03-24 23:23:28 -07:00
Krrish Dholakia
75994d0bf0
test: improve flaky test
2025-03-24 23:15:04 -07:00
Krish Dholakia
a619580bf8
Add vertexai topLogprobs support ( #9518 )
...
* Added support for top_logprobs in vertex gemini models
* Testing for top_logprobs feature in vertexai
* Update litellm/llms/vertex_ai/gemini/vertex_and_google_ai_studio_gemini.py
Co-authored-by: Tom Matthews <tomukmatthews@gmail.com>
* refactor(tests/): refactor testing to be in correct repo
---------
Co-authored-by: Aditya Thaker <adityathaker28@gmail.com>
Co-authored-by: Tom Matthews <tomukmatthews@gmail.com>
2025-03-24 22:42:38 -07:00
Ishaan Jaff
12639b7ccf
fix sagemaker streaming error
2025-03-24 21:29:29 -07:00
Krish Dholakia
bd309a28c5
Merge pull request #9512 from BerriAI/litellm_dev_03_24_2025_p3
...
fix(invoke_handler.py): remove hard coded chunk on streaming usage
2025-03-24 21:21:36 -07:00
Ishaan Jaff
863fe3a4d2
fix import mcp router
2025-03-24 21:08:24 -07:00
Ishaan Jaff
fb3a67ba8d
fix code quality
2025-03-24 21:06:17 -07:00
Ishaan Jaff
dcc2edbd4d
fix mcp import
2025-03-24 21:02:54 -07:00
Ishaan Jaff
08a4ba1b7e
Merge branch 'main' into litellm_exp_mcp_server
2025-03-24 19:03:56 -07:00
Ishaan Jaff
002837f962
fix linting errors
2025-03-24 18:48:43 -07:00