litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-24 18:24:20 +00:00

Author	SHA1	Message	Date
Ishaan Jaff	0edd4aa8a7	fix PodLockManager	2025-03-27 23:29:48 -07:00
Ishaan Jaff	1ba73f491a	working cron job updates	2025-03-27 23:13:01 -07:00
Ishaan Jaff	666cd8d4e3	handle commit updates to the DB	2025-03-27 22:55:36 -07:00
Ishaan Jaff	0cb6cb6c66	fix schema to work with lock acquisition	2025-03-27 22:54:46 -07:00
Ishaan Jaff	758182fc7f	fix typo on codebase	2025-03-27 22:36:00 -07:00
Ishaan Jaff	fc46f6b861	fix db spend update buffer	2025-03-27 22:34:15 -07:00
Ishaan Jaff	1bfffadd05	fix typo	2025-03-27 22:31:38 -07:00
Ishaan Jaff	9139230531	update the correct set of txs	2025-03-27 21:54:12 -07:00
Ishaan Jaff	aa6c34cb97	use lock manager for update spend job	2025-03-27 21:43:02 -07:00
Ishaan Jaff	dd00429ade	update schema	2025-03-27 21:36:22 -07:00
Ishaan Jaff	ea93a09b7a	add model CronJob	2025-03-27 21:10:26 -07:00
Ishaan Jaff	e65b708fa2	get_all_update_transactions_from_redis	2025-03-27 19:15:30 -07:00
Ishaan Jaff	963791bbb5	use redis update buffer class	2025-03-27 19:12:51 -07:00
Ishaan Jaff	ad72078167	basic structure for commit update txs to redis	2025-03-27 18:21:33 -07:00
Ishaan Jaff	894306141e	refactor, use commit_update_transactions_to_db	2025-03-27 18:07:23 -07:00
Ishaan Jaff	b721b2b4ac	use DBSpendUpdateWriter common function for	2025-03-27 17:04:50 -07:00
Ishaan Jaff	403f2ef68d	use simple static methods for updating spend	2025-03-27 16:57:12 -07:00
Ishaan Jaff	7995fd7c98	fix DBSpendUpdateWriter	2025-03-27 16:44:46 -07:00
Ishaan Jaff	a0fd508de4	DBSpendUpdateWriter	2025-03-27 16:43:18 -07:00
Ishaan Jaff	072be44a54	fix get_credentials job	2025-03-27 16:34:06 -07:00
Ishaan Jaff	21e3b764f5	use DBSpendUpdateWriter class for managing DB spend updates	2025-03-27 16:31:23 -07:00
Krrish Dholakia	f1f40eba3f	docs(index.md): document new team model flow All checks were successful Helm unit test / unit-test (push) Successful in 18s Details Read Version from pyproject.toml / read-version (push) Successful in 22s Details	2025-03-27 12:14:29 -07:00
Krrish Dholakia	f1fa0044cb	bump: version 1.64.1 → 1.65.0	2025-03-27 12:07:17 -07:00
Krish Dholakia	63c9f59373	Allow team admins to add/update/delete models on UI + show api base and model id on request logs (#9572 ) * feat(view_logs.tsx): show model id + api base in request logs easier debugging * fix(index.tsx): fix length of api base easier viewing * refactor(leftnav.tsx): show models tab to team admin * feat(model_dashboard.tsx): add explainer for what the 'models' page is for team admin helps them understand how they can use it * feat(model_management_endpoints.py): restrict model add by team to just team admin allow team admin to add models via non-team keys (e.g. ui token) * test(test_add_update_models.py): update unit testing for new behaviour * fix(model_dashboard.tsx): show user the models * feat(proxy_server.py): add new query param 'user_models_only' to `/v2/model/info` Allows user to retrieve just the models they've added Used in UI to show internal users just the models they've added * feat(model_dashboard.tsx): allow team admins to view their own models * fix: allow ui user to fetch model cost map * feat(add_model_tab.tsx): require team admins to specify team when onboarding models * fix(_types.py): add `/v1/model/info` to info route `/model/info` was already there * fix(model_info_view.tsx): allow user to edit a model they created * fix(model_management_endpoints.py): allow team admin to update team model * feat(model_managament_endpoints.py): allow team admin to delete team models * fix(model_management_endpoints.py): don't require team id to be set when adding a model * fix(proxy_server.py): fix linting error * fix: fix ui linting error * fix(model_management_endpoints.py): ensure consistent auth checks on all model calls * test: remove old test - function no longer exists in same form * test: add updated mock testing	2025-03-27 12:06:31 -07:00
Krrish Dholakia	3ca34a181c	docs(openai.md): add gpt-4o-transcribe to docs All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 17s Details Helm unit test / unit-test (push) Successful in 19s Details	2025-03-26 23:23:35 -07:00
Krish Dholakia	6ba31e1d84	Allow viewing keyinfo on request logs (#9568 ) * feat(view_logs.tsx): show model id + api base in request logs easier debugging * fix(index.tsx): fix length of api base easier viewing * build(ui/): initial commit allowing user to click into key from request logs allows easier debugging of 'what key is this?	2025-03-26 23:11:15 -07:00
Krish Dholakia	c0845fec1f	Add OpenAI gpt-4o-transcribe support (#9517 ) * refactor: introduce new transformation config for gpt-4o-transcribe models * refactor: expose new transformation configs for audio transcription * ci: fix config yml * feat(openai/transcriptions): support provider config transformation on openai audio transcriptions allows gpt-4o and whisper audio transformation to work as expected * refactor: migrate fireworks ai + deepgram to new transform request pattern * feat(openai/): working support for gpt-4o-audio-transcribe * build(model_prices_and_context_window.json): add gpt-4o-transcribe to model cost map * build(model_prices_and_context_window.json): specify what endpoints are supported for `/audio/transcriptions` * fix(get_supported_openai_params.py): fix return * refactor(deepgram/): migrate unit test to deepgram handler * refactor: cleanup unused imports * fix(get_supported_openai_params.py): fix linting error * test: update test	2025-03-26 23:10:25 -07:00
Krrish Dholakia	109add7946	build(model_prices_and_context_window.json): add gemini multimodal embedding cost	2025-03-26 23:04:24 -07:00
Ishaan Jaff	7097ce544e	Merge pull request #9569 from BerriAI/litellm_fix_db_unavailable Some checks failed Read Version from pyproject.toml / read-version (push) Successful in 19s Details Helm unit test / unit-test (push) Successful in 21s Details Publish Prisma Migrations / publish-migrations (push) Failing after 1m18s Details [Reliability Fix] - Allow Pods to startup + passing /health/readiness when `allow_requests_on_db_unavailable: True` and DB is down	2025-03-26 21:18:52 -07:00
Ishaan Jaff	05c38049fe	docs prod.md	2025-03-26 21:04:36 -07:00
Ishaan Jaff	b6506f7bda	test_db_health_readiness_check_with_prisma_error	2025-03-26 20:56:39 -07:00
Ishaan Jaff	87f0201f84	test_handle_db_exception_with_connection_error	2025-03-26 20:44:25 -07:00
Ishaan Jaff	34e58be36d	fix order of _setup_prisma_client	2025-03-26 20:42:01 -07:00
Ishaan Jaff	427580eff5	fix _setup_prisma_client	2025-03-26 20:35:27 -07:00
Ishaan Jaff	15c04da735	refactor tests	2025-03-26 20:00:10 -07:00
Ishaan Jaff	88ef97b9d1	allow proxy to startup on DB unavailable	2025-03-26 19:50:57 -07:00
Ishaan Jaff	497570b2a6	bug fix - allow pods to startup when DB is unavailable	2025-03-26 19:49:51 -07:00
Ishaan Jaff	7142b0b610	refactor PrismaDBExceptionHandler	2025-03-26 19:22:24 -07:00
Ishaan Jaff	0155b0eba2	Merge pull request #9533 from BerriAI/litellm_stability_fixes [Reliability Fixes] - Gracefully handle exceptions when DB is having an outage	2025-03-26 18:57:38 -07:00
Ishaan Jaff	485aa87e65	allow_requests_on_db_unavailable	2025-03-26 18:48:18 -07:00
Ishaan Jaff	763f853a9f	docs fix	2025-03-26 18:32:58 -07:00
Ishaan Jaff	5242c5fbab	test - auth exception handler	2025-03-26 18:28:28 -07:00
Ishaan Jaff	15b1a8afb0	test_is_database_connection_error_prisma_errors	2025-03-26 18:27:39 -07:00
Krish Dholakia	4351c77253	Support Gemini audio token cost tracking + fix openai audio input token cost tracking (#9535 ) * fix(vertex_and_google_ai_studio_gemini.py): log gemini audio tokens in usage object enables accurate cost tracking * refactor(vertex_ai/cost_calculator.py): refactor 128k+ token cost calculation to only run if model info has it Google has moved away from this for gemini-2.0 models * refactor(vertex_ai/cost_calculator.py): migrate to usage object for more flexible data passthrough * fix(llm_cost_calc/utils.py): support audio token cost tracking in generic cost per token enables vertex ai cost tracking to work with audio tokens * fix(llm_cost_calc/utils.py): default to total prompt tokens if text tokens field not set * refactor(llm_cost_calc/utils.py): move openai cost tracking to generic cost per token more consistent behaviour across providers * test: add unit test for gemini audio token cost calculation * ci: bump ci config * test: fix test	2025-03-26 17:26:25 -07:00
Krrish Dholakia	04490c99d7	test: fix test	2025-03-26 17:12:09 -07:00
Krrish Dholakia	72c0ad419f	build(migration.sql): add migration file for new dailyusertable documents prisma db changes	2025-03-26 17:11:25 -07:00
Krrish Dholakia	d4adc9764b	test(test_db_schema_migration.py): ci/cd test to enforce schema migrations are documented in .sql files	2025-03-26 16:59:50 -07:00
Krish Dholakia	37265c8b1c	Add Daily User Spend Aggregate view - allows UI Usage tab to work > 1m rows (#9538 ) * ci: update github action * build(schema.prisma): enable daily user spend table allows storing aggregate view of user's daily spend * build(schema.prisma): add new daily user spend table * feat: working daily user spend tracking maintains an aggregate view for easier querying in high traffic * setup_google_dns * ci: update ci yaml --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2025-03-26 16:36:36 -07:00
Krrish Dholakia	fb9abd40f3	ci(config.yml): add pytest-postgres to ci/cd	2025-03-26 16:25:08 -07:00
Krish Dholakia	132d3f7baa	feat(prisma-migrations): add baseline db migration file (#9565 ) adds initial baseline db migration file enables future schema changes to be documented via .sql files	2025-03-26 16:22:56 -07:00

1 2 3 4 5 ...

20959 commits