litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-26 03:04:13 +00:00

Author	SHA1	Message	Date
Ishaan Jaff	3a086cee06	(Feat) - Show Error Logs on LiteLLM UI (#8904 ) * fix test_moderations_bad_model * use async_post_call_failure_hook * basic logging errors in DB * show status on ui * show status on ui * ui show request / response side by side * stash fixes * working, track raw request * track error info in metadata * fix showing error / request / response logs * show traceback on error viewer * ui with traceback of error * fix async_post_call_failure_hook * fix(http_parsing_utils.py): orjson can throw errors on some emoji's in text, default to json.loads * test_get_error_information * fix code quality * rename proxy track cost callback test * _should_store_errors_in_spend_logs * feature flag error logs * Revert "_should_store_errors_in_spend_logs" This reverts commit `7f345df477`. * Revert "feature flag error logs" This reverts commit `0e90c022bb`. * test_spend_logs_payload * fix OTEL log_db_metrics * fix import json * fix ui linting error * test_async_post_call_failure_hook * test_chat_completion_bad_model_with_spend_logs --------- Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com>	2025-02-28 20:10:09 -08:00
Ishaan Jaff	55b938dd6e	(Infra/DB) - Allow running older litellm version when out of sync with current state of DB (#8695 ) * fix check migration * clean up should_update_prisma_schema * update test * db_migration_disable_update_check * Check container logs for expected message * db_migration_disable_update_check * test_check_migration_out_of_sync * test_should_update_prisma_schema * db_migration_disable_update_check * pip install aiohttp	2025-02-20 18:30:23 -08:00
Krish Dholakia	91ed05df29	Litellm dev contributor prs 01 31 2025 (#8168 ) * Add O3-Mini for Azure and Remove Vision Support (#8161) * Azure Released O3-mini at the same time as OAI, so i've added support here. Confirmed to work with Sweden Central. * [FIX] replace cgi for python 3.13 with email.Message as suggested in PEP 594 (#8160) * Update model_prices_and_context_window.json (#8120) codestral2501 pricing on vertex_ai * Fix/db view names (#8119) * Fix to case sensitive DB Views name * Fix to case sensitive DB View names * Added quotes to check query as well * Added quotes to create view query * test: handle server error for flaky test vertex ai has unstable endpoints --------- Co-authored-by: Wanis Elabbar <70503629+elabbarw@users.noreply.github.com> Co-authored-by: Honghua Dong <dhh1995@163.com> Co-authored-by: superpoussin22 <vincent.nadal@orange.fr> Co-authored-by: Miguel Armenta <37154380+ma-armenta@users.noreply.github.com>	2025-02-01 09:05:20 -08:00
Ishaan Jaff	51b17be3ab	use asyncio tasks for logging db metrics (#7663 )	2025-01-09 19:59:32 -08:00
Ishaan Jaff	c7f14e936a	(code quality) run ruff rule to ban unused imports (#7313 ) * remove unused imports * fix AmazonConverseConfig * fix test * fix import * ruff check fixes * test fixes * fix testing * fix imports	2024-12-19 12:33:42 -08:00
Ishaan Jaff	eb47117800	(feat) log error class, function_name on prometheus service failure hook + only log DB related failures on DB service hook (#6650 ) * log error on prometheus service failure hook * use a more accurate function name for wrapper that handles logging db metrics * fix log_db_metrics * test_log_db_metrics_failure_error_types * fix linting * fix auth checks	2024-11-07 17:01:18 -08:00
Ishaan Jaff	610974b4fc	(code quality) add ruff check PLR0915 for `too-many-statements` (#6309 ) * ruff add PLR0915 * add noqa for PLR0915 * fix noqa * add # noqa: PLR0915 * # noqa: PLR0915 * # noqa: PLR0915 * # noqa: PLR0915 * add # noqa: PLR0915 * # noqa: PLR0915 * # noqa: PLR0915 * # noqa: PLR0915 * # noqa: PLR0915	2024-10-18 15:36:49 +05:30
Krish Dholakia	9695c1af10	LiteLLM Minor Fixes & Improvements (10/08/2024) (#6119 ) * refactor(cost_calculator.py): move error line to debug - https://github.com/BerriAI/litellm/issues/5683#issuecomment-2398599498 * fix(migrate-hidden-params-to-read-from-standard-logging-payload): Fixes https://github.com/BerriAI/litellm/issues/5546#issuecomment-2399994026 * fix(types/utils.py): mark weight as a litellm param Fixes https://github.com/BerriAI/litellm/issues/5781 * feat(internal_user_endpoints.py): fix /user/info + show user max budget as default max budget Fixes https://github.com/BerriAI/litellm/issues/6117 * feat: support returning team member budget in `/user/info` Sets user max budget in team as max budget on ui Closes https://github.com/BerriAI/litellm/issues/6117 * bug fix for optional parameter passing to replicate (#6067) Signed-off-by: Mandana Vaziri <mvaziri@us.ibm.com> * fix(o1_transformation.py): handle o1 temperature=0 o1 doesn't support temp=0, allow admin to drop this param * test: fix test --------- Signed-off-by: Mandana Vaziri <mvaziri@us.ibm.com> Co-authored-by: Mandana Vaziri <mvaziri@us.ibm.com>	2024-10-08 21:57:03 -07:00
Krish Dholakia	04e5963b65	Litellm expose disable schema update flag (#6085 ) * fix: enable new 'disable_prisma_schema_update' flag * build(config.yml): remove setup remote docker step * ci(config.yml): give container time to start up * ci(config.yml): update test * build(config.yml): actually start docker * build(config.yml): simplify grep check * fix(prisma_client.py): support reading disable_schema_update via env vars * ci(config.yml): add test to check if all general settings are documented * build(test_General_settings.py): check available dir * ci: check ../ repo path * build: check ./ * build: fix test	2024-10-05 21:26:51 -04:00
Krish Dholakia	d57be47b0f	Litellm ruff linting enforcement (#5992 ) * ci(config.yml): add a 'check_code_quality' step Addresses https://github.com/BerriAI/litellm/issues/5991 * ci(config.yml): check why circle ci doesn't pick up this test * ci(config.yml): fix to run 'check_code_quality' tests * fix(__init__.py): fix unprotected import * fix(__init__.py): don't remove unused imports * build(ruff.toml): update ruff.toml to ignore unused imports * fix: fix: ruff + pyright - fix linting + type-checking errors * fix: fix linting errors * fix(lago.py): fix module init error * fix: fix linting errors * ci(config.yml): cd into correct dir for checks * fix(proxy_server.py): fix linting error * fix(utils.py): fix bare except causes ruff linting errors * fix: ruff - fix remaining linting errors * fix(clickhouse.py): use standard logging object * fix(__init__.py): fix unprotected import * fix: ruff - fix linting errors * fix: fix linting errors * ci(config.yml): cleanup code qa step (formatting handled in local_testing) * fix(_health_endpoints.py): fix ruff linting errors * ci(config.yml): just use ruff in check_code_quality pipeline for now * build(custom_guardrail.py): include missing file * style(embedding_handler.py): fix ruff check	2024-10-01 19:44:20 -04:00
Krish Dholakia	bd17424c4b	LiteLLM Minor Fixes & Improvements (09/26/2024) (#5925 ) (#5937 ) * LiteLLM Minor Fixes & Improvements (09/26/2024) (#5925) * fix(litellm_logging.py): don't initialize prometheus_logger if non premium user Prevents bad error messages in logs Fixes https://github.com/BerriAI/litellm/issues/5897 * Add Support for Custom Providers in Vision and Function Call Utils (#5688) * Add Support for Custom Providers in Vision and Function Call Utils Lookup * Remove parallel function call due to missing model info param * Add Unit Tests for Vision and Function Call Changes * fix-#5920: set header value to string to fix "'int' object has no att… (#5922) * LiteLLM Minor Fixes & Improvements (09/24/2024) (#5880) * LiteLLM Minor Fixes & Improvements (09/23/2024) (#5842) * feat(auth_utils.py): enable admin to allow client-side credentials to be passed Makes it easier for devs to experiment with finetuned fireworks ai models * feat(router.py): allow setting configurable_clientside_auth_params for a model Closes https://github.com/BerriAI/litellm/issues/5843 * build(model_prices_and_context_window.json): fix anthropic claude-3-5-sonnet max output token limit Fixes https://github.com/BerriAI/litellm/issues/5850 * fix(azure_ai/): support content list for azure ai Fixes https://github.com/BerriAI/litellm/issues/4237 * fix(litellm_logging.py): always set saved_cache_cost Set to 0 by default * fix(fireworks_ai/cost_calculator.py): add fireworks ai default pricing handles calling 405b+ size models * fix(slack_alerting.py): fix error alerting for failed spend tracking Fixes regression with slack alerting error monitoring * fix(vertex_and_google_ai_studio_gemini.py): handle gemini no candidates in streaming chunk error * docs(bedrock.md): add llama3-1 models * test: fix tests * fix(azure_ai/chat): fix transformation for azure ai calls * feat(azure_ai/embed): Add azure ai embeddings support Closes https://github.com/BerriAI/litellm/issues/5861 * fix(azure_ai/embed): enable async embedding * feat(azure_ai/embed): support azure ai multimodal embeddings * fix(azure_ai/embed): support async multi modal embeddings * feat(together_ai/embed): support together ai embedding calls * feat(rerank/main.py): log source documents for rerank endpoints to langfuse improves rerank endpoint logging * fix(langfuse.py): support logging `/audio/speech` input to langfuse * test(test_embedding.py): fix test * test(test_completion_cost.py): fix helper util * fix-#5920: set header value to string to fix "'int' object has no attribute 'encode'" --------- Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com> * Revert "fix-#5920: set header value to string to fix "'int' object has no att…" (#5926) This reverts commit `a554ae2695`. * build(model_prices_and_context_window.json): add azure ai cohere rerank model pricing Enables cost tracking for azure ai cohere rerank models * fix(litellm_logging.py): fix debug log to be clearer Closes https://github.com/BerriAI/litellm/issues/5909 * test(test_utils.py): fix test name * fix(azure_ai/cost_calculator.py): support cost tracking for azure ai rerank models * fix(azure_ai): fix azure ai base model cost tracking for rerank endpoints * fix(converse_handler.py): support new llama 3-2 models Fixes https://github.com/BerriAI/litellm/issues/5901 * fix(litellm_logging.py): ensure response is redacted for standard message logging Fixes https://github.com/BerriAI/litellm/issues/5890#issuecomment-2378242360 * fix(cost_calculator.py): use 'get_model_info' for cohere rerank cost calculation allows user to set custom cost for model * fix(config.yml): fix docker hub auht * build(config.yml): add docker auth to all tests * fix(db/create_views.py): fix linting error * fix(main.py): fix circular import * fix(azure_ai/__init__.py): fix circular import * fix(main.py): fix import * fix: fix linting errors * test: fix test * fix(proxy_server.py): pass premium user value on startup used for prometheus init --------- Co-authored-by: Cole Murray <colemurray.cs@gmail.com> Co-authored-by: bravomark <62681807+bravomark@users.noreply.github.com> * handle streaming for azure ai studio error * [Perf Proxy] parallel request limiter - use one cache update call (#5932) * fix parallel request limiter - use one cache update call * ci/cd run again * run ci/cd again * use docker username password * fix config.yml * fix config * fix config * fix config.yml * ci/cd run again * use correct typing for batch set cache * fix async_set_cache_pipeline * fix only check user id tpm / rpm limits when limits set * fix test_openai_azure_embedding_with_oidc_and_cf * test: fix test * test(test_rerank.py): fix test --------- Co-authored-by: Cole Murray <colemurray.cs@gmail.com> Co-authored-by: bravomark <62681807+bravomark@users.noreply.github.com> Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2024-09-27 17:54:13 -07:00
Krish Dholakia	a1d9e96b31	LiteLLM Minor Fixes & Improvements (09/25/2024) (#5893 ) * fix(langfuse.py): support new langfuse prompt_chat class init params * fix(langfuse.py): handle new init values on prompt chat + prompt text templates fixes error caused during langfuse logging * docs(openai_compatible.md): clarify `openai/` handles correct routing for `/v1/completions` route Fixes https://github.com/BerriAI/litellm/issues/5876 * fix(utils.py): handle unmapped gemini model optional param translation Fixes https://github.com/BerriAI/litellm/issues/5888 * fix(o1_transformation.py): fix o-1 validation, to not raise error if temperature=1 Fixes https://github.com/BerriAI/litellm/issues/5884 * fix(prisma_client.py): refresh iam token Fixes https://github.com/BerriAI/litellm/issues/5896 * fix: pass drop params where required * fix(utils.py): pass drop_params correctly * fix(types/vertex_ai.py): fix generation config * test(test_max_completion_tokens.py): fix test * fix(vertex_and_google_ai_studio_gemini.py): fix map openai params	2024-09-26 16:41:44 -07:00
Krish Dholakia	3933fba41f	LiteLLM Minor Fixes & Improvements (09/19/2024) (#5793 ) * fix(model_prices_and_context_window.json): add cost tracking for more vertex llama3.1 model 8b and 70b models * fix(proxy/utils.py): handle data being none on pre-call hooks * fix(proxy/): create views on initial proxy startup fixes base case, where user starts proxy for first time Fixes https://github.com/BerriAI/litellm/issues/5756 * build(config.yml): fix vertex version for test * feat(ui/): support enabling/disabling slack alerting Allows admin to turn on/off slack alerting through ui * feat(rerank/main.py): support langfuse logging * fix(proxy/utils.py): fix linting errors * fix(langfuse.py): log clean metadata * test(tests): replace deprecated openai model	2024-09-20 08:19:52 -07:00
Krish Dholakia	c76d2c6ade	Refactor 'check_view_exists' logic (#5659 ) * fix(proxy/utils.py): comment out auto-upsert logic in check_view_exists Prevents proxy from failing on startup due to faulty logic * fix(db/migration_scripts/create_views.py): fix 'DailyTagSpend' quotation on check * fix(create_views.py): mongly global spend time period should be 30d not 20d * fix(schema.prisma): index on startTime and endUser for efficient UI querying	2024-09-12 13:39:50 -07:00
Ishaan Jaff	7ff7028885	fix create script for pre-creating views	2024-09-09 11:03:27 -07:00
Ishaan Jaff	b0178a85cf	refactor get_secret	2024-09-03 10:42:12 -07:00
Ishaan Jaff	dad4bd58bc	(feat) stop eagerly evaluating fstring	2024-03-25 09:01:42 -07:00
ishaan-jaff	44202aa25b	(fix) dynamo db test - new model_spend params	2024-02-17 16:00:08 -08:00
Krrish Dholakia	34fce00960	fix: dynamo_db.py handle permissions row	2024-02-15 20:36:59 -08:00
Ishaan Jaff	003feda33f	Merge branch 'main' into litellm_dynamo_use_arn	2024-02-13 21:27:38 -08:00
ishaan-jaff	c4038e38e5	(fix) dyanmo fixes	2024-02-12 19:54:39 -08:00
Krrish Dholakia	551a4c60a7	fix(dynamo_db.py): eliminating unneccessary params	2024-02-07 22:52:15 -08:00
ishaan-jaff	9c13c65ce2	(dynamo) use assume_role_with_web_identity	2024-02-02 10:57:06 -08:00
ishaan-jaff	96c9fd3f02	(fix) patch -For DynamoDB Backwards Compatibility	2024-01-29 16:52:31 -08:00
ishaan-jaff	2e64ca9057	(fix) dynamoDB + hashed tokens bug	2024-01-29 11:17:27 -08:00
ishaan-jaff	2c79585e18	(fix) dynamo - use hashed_api keys in table	2024-01-26 14:32:00 -08:00
Krrish Dholakia	0752048b81	fix(dynamo_db.py): fix update bug	2024-01-24 21:29:56 -08:00
ishaan-jaff	9b39893980	v0 set arn number on dynamoDB /key/gen	2024-01-24 10:29:28 -08:00
ishaan-jaff	5b54bcc712	(feat) spendLogs table DynamoDB	2024-01-18 12:39:11 -08:00
Krrish Dholakia	cff9f7fee6	fix(proxy_server.py): handle empty insert_data response	2024-01-17 17:28:23 -08:00
Krrish Dholakia	98b83fa780	feat(proxy_server.py): support model access groups	2024-01-17 15:45:31 -08:00
ishaan-jaff	d59ac23e82	(fix) proxy, dynamo - allow users to set ssl_verify False	2024-01-16 16:13:55 -08:00
ishaan-jaff	340cca5ac3	Revert "Revert "(fix) dynamoDB import issues"" This reverts commit `6181b95ab3`.	2024-01-15 16:48:10 -08:00
ishaan-jaff	6181b95ab3	Revert "(fix) dynamoDB import issues" This reverts commit `b931841e60`.	2024-01-15 12:04:13 -08:00
ishaan-jaff	1f62637197	(fix) proxy + dynamo - add some debugging	2024-01-15 12:03:10 -08:00
ishaan-jaff	b931841e60	(fix) dynamoDB import issues	2024-01-13 11:38:42 -08:00
Krrish Dholakia	be1ebe36a1	fix(dynamo_db.py): allow user to specify table names	2024-01-13 18:35:19 +05:30
Krrish Dholakia	352f943dcf	fix(dynamo_db.py): don't auto-create tables, allow database_type == 'dynamodb'	2024-01-12 11:33:40 +05:30
Krrish Dholakia	f94a37a836	fix(dynamo_db.py): add cost tracking support for key + user	2024-01-11 23:56:41 +05:30
Krrish Dholakia	9b3d78c4f3	fix(dynamo_db.py): if table create fails, tell user what the table + hash key needs to be	2024-01-11 23:01:28 +05:30
Krrish Dholakia	befbcbb7ac	fix: fix linting issues	2024-01-09 22:55:18 +05:30
Krrish Dholakia	ba7026639d	fix: fix linting errors	2024-01-09 22:16:16 +05:30
Krrish Dholakia	35f9666dc2	feat(proxy_server.py): adds working dynamo db support for key gen	2024-01-09 18:23:34 +05:30

43 commits