Commit graph

4527 commits

Author SHA1 Message Date
Ishaan Jaff
3d4fc22eaf bug fix - allow pods to startup when DB is unavailable 2025-03-26 19:49:51 -07:00
Ishaan Jaff
6648e85c97 refactor PrismaDBExceptionHandler 2025-03-26 19:22:24 -07:00
Ishaan Jaff
cb44f8d64d Merge pull request #9533 from BerriAI/litellm_stability_fixes
[Reliability Fixes] - Gracefully handle exceptions when DB is having an outage
2025-03-26 18:57:38 -07:00
Krish Dholakia
eeab48ce5a Add Daily User Spend Aggregate view - allows UI Usage tab to work > 1m rows (#9538)
* ci: update github action

* build(schema.prisma): enable daily user spend table

allows storing aggregate view of user's daily spend

* build(schema.prisma): add new daily user spend table

* feat: working daily user spend tracking

maintains an aggregate view for easier querying in high traffic

* setup_google_dns

* ci: update ci yaml

---------

Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>
2025-03-26 16:36:36 -07:00
Ishaan Jaff
1274d27534 fix ProxyException 2025-03-26 15:53:33 -07:00
Ishaan Jaff
ba0d8a6d22 fix auth checks 2025-03-26 15:45:58 -07:00
Ishaan Jaff
427b549975 fix get_key_object 2025-03-26 15:41:40 -07:00
Ishaan Jaff
43c402b8b2 is_database_connection_error 2025-03-26 14:55:40 -07:00
Ishaan Jaff
f85c0dac4f Merge branch 'main' into litellm_stability_fixes 2025-03-26 14:41:43 -07:00
Ishaan Jaff
ad5d26f664 fix user_api_key_auth example config 2025-03-26 08:36:11 -07:00
Krish Dholakia
1ab5d49132 Support litellm.api_base for vertex_ai + gemini/ across completion, embedding, image_generation (#9516)
* test(tests): add unit testing for litellm_proxy integration

* fix(cost_calculator.py): fix tracking cost in sdk when calling proxy

* fix(main.py): respect litellm.api_base on `vertex_ai/` and `gemini/` routes

* fix(main.py): consistently support custom api base across gemini + vertexai on embedding + completion

* feat(vertex_ai/): test

* fix: fix linting error

* test: set api base as None before starting loadtest
2025-03-25 23:46:20 -07:00
Ishaan Jaff
fea5213517 add test config 2025-03-25 19:21:51 -07:00
Ishaan Jaff
6493a6422d fixes for auth checks 2025-03-25 15:44:13 -07:00
Ishaan Jaff
ea209661d5 fix ProxyErrorTypes 2025-03-25 14:40:11 -07:00
Ishaan Jaff
74e3600367 UserAPIKeyAuthExceptionHandler 2025-03-25 14:07:14 -07:00
Krish Dholakia
f09f4a6396 Merge pull request #9512 from BerriAI/litellm_dev_03_24_2025_p3
fix(invoke_handler.py): remove hard coded chunk on streaming usage
2025-03-24 21:21:36 -07:00
Ishaan Jaff
a61925448a fix import mcp router 2025-03-24 21:08:24 -07:00
Ishaan Jaff
dc899570ce fix code quality 2025-03-24 21:06:17 -07:00
Ishaan Jaff
a616375b3c fix mcp import 2025-03-24 21:02:54 -07:00
Ishaan Jaff
c6424d6246 Merge branch 'main' into litellm_exp_mcp_server 2025-03-24 19:03:56 -07:00
Ishaan Jaff
f600899ff7 fix linting errors 2025-03-24 18:48:43 -07:00
Krrish Dholakia
017a9b882a fix(invoke_handler.py): remove hard code 2025-03-24 17:58:26 -07:00
Krrish Dholakia
2f01caa919 fix(litellm_logging.py): always log the api base
Fixes issue where api base missing from spend logs due to refactor
2025-03-24 13:45:39 -07:00
Ishaan Jaff
d4b53d3fa9 fix mcp type imports 2025-03-24 11:01:29 -07:00
Ishaan Jaff
f133bb07d1 fix pydantic import error 2025-03-24 07:11:48 -07:00
Tyler Hutcherson
785f9714ca update redisvl dependency 2025-03-24 08:42:11 -04:00
Ishaan Jaff
d932206bfb Merge branch 'main' into litellm_exp_mcp_server 2025-03-22 18:51:25 -07:00
Krrish Dholakia
2e5146855e fix: fix linting error 2025-03-22 17:56:39 -07:00
Krrish Dholakia
c568952490 fix(router.py): fix get_model_list to return all wildcard models
enables viewing all wildcard models on `/model/info`
2025-03-22 15:39:23 -07:00
Krrish Dholakia
c45a61940d fix(llm_passthrough_endpoints.py): fix raising helpful debug error message 2025-03-22 15:06:16 -07:00
Krrish Dholakia
7bd9c5cd9e build: cleanup unused files 2025-03-22 12:54:11 -07:00
Krrish Dholakia
2d5c7f809d fix(llm_passthrough_endpoints.py): raise verbose error if credentials not found on proxy 2025-03-22 11:49:51 -07:00
Krrish Dholakia
851c993063 fix(vertex_ai/common_utils.py): fix handling constructed url with default vertex config 2025-03-22 11:32:01 -07:00
Krrish Dholakia
0145ebd343 feat(llm_passthrough_endpoints.py): base case passing for refactored vertex passthrough route 2025-03-22 11:06:52 -07:00
Krrish Dholakia
e99a22b9ac refactor(llm_passthrough_endpoints.py): refactor vertex passthrough to use common llm passthrough handler.py 2025-03-22 10:42:46 -07:00
Krrish Dholakia
12046a1a2e fix: fix linting error 2025-03-21 18:23:58 -07:00
Krrish Dholakia
4a88ba1bc8 fix: fix linting error 2025-03-21 18:08:00 -07:00
Krrish Dholakia
02b7299337 fix(proxy/_types.py): handle user_email=None 2025-03-21 17:46:18 -07:00
Krrish Dholakia
1dd9a433e3 fix: remove unused import 2025-03-21 17:25:57 -07:00
Krrish Dholakia
92d6e2fecb fix: remove unused import 2025-03-21 17:22:58 -07:00
Krrish Dholakia
c7f42747bf refactor(user_api_key_auth.py): move is_route_allowed to inside common_checks
ensures consistent behaviour inside api key + jwt routes
2025-03-21 17:21:07 -07:00
Krrish Dholakia
a23a7e1486 test: initial e2e testing to ensure non admin jwt token cannot create new teams 2025-03-21 16:40:18 -07:00
Ishaan Jaff
9760b030e3 fix naming 2025-03-20 21:57:28 -07:00
Ishaan Jaff
616c4db12d add litellm mcp endpoints 2025-03-20 21:12:56 -07:00
Krrish Dholakia
edf7eb867d fix(team_endpoints.py): consistently return 404 if team not found in DB
Fixes response on /team/delete
2025-03-20 20:28:03 -07:00
Krrish Dholakia
d415738860 feat(pass_through_endpoints.py): return api base on pass-through exception
enables easy debugging on backend api errors
2025-03-20 20:19:52 -07:00
Krrish Dholakia
45a65d71f3 feat(pass_through_endpoints.py): support returning api-base on pass-through endpoints
Make it easier to debug what the api base sent to provider was
2025-03-20 20:11:49 -07:00
Ishaan Jaff
d3decfc0f9 fix mcp router 2025-03-20 18:05:14 -07:00
Ishaan Jaff
7a93ec2b94 cleanup 2025-03-20 18:02:02 -07:00
Ishaan Jaff
d9c77da113 remove stale file 2025-03-20 17:59:19 -07:00