Commit graph

4466 commits

Author SHA1 Message Date
Rena Lu
6833f37986 remove prints 2024-02-09 16:25:29 -05:00
Rena Lu
ae0ede4190
Merge branch 'BerriAI:main' into main 2024-02-09 16:20:14 -05:00
Rena Lu
0e8a0aefd5 add vertex ai private endpoint support 2024-02-09 16:19:26 -05:00
Krrish Dholakia
2a7e346144 fix(main.py): trigger new build 2024-02-08 23:03:03 -08:00
Krish Dholakia
51c07e294a
Merge pull request #1902 from BerriAI/litellm_mistral_message_list_fix
fix(factory.py): mistral message input fix
2024-02-08 23:01:39 -08:00
Krish Dholakia
6084e0b25a
Merge pull request #1901 from BerriAI/litellm_ui_usage_tiers
fix(proxy_server.py): enable aggregate queries via /spend/keys
2024-02-08 22:45:52 -08:00
Krrish Dholakia
b426fa55f4 test(test_completion.py): fix test 2024-02-08 22:04:22 -08:00
Krrish Dholakia
3a4ac8be79 fix: fixes 2024-02-08 21:54:48 -08:00
Krrish Dholakia
2756ba591c test(test_parallel_request_limiter.py): fix test 2024-02-08 21:49:58 -08:00
Krrish Dholakia
b9393fb769 fix(test_parallel_request_limiter.py): use mock responses for streaming 2024-02-08 21:45:38 -08:00
Ishaan Jaff
c54f21f9ec
Merge pull request #1904 from BerriAI/litellm_show_delete_confirmation
Admin UI - show delete confirmation when deleting keys
2024-02-08 21:39:17 -08:00
ishaan-jaff
ab3fe95810 (feat) update ui build 2024-02-08 21:38:04 -08:00
Krrish Dholakia
841639333b fix(bedrock.py): raise exception for amazon titan null response 2024-02-08 21:12:25 -08:00
Krrish Dholakia
c9e5c796ad fix(factory.py): mistral message input fix 2024-02-08 20:54:26 -08:00
Krrish Dholakia
e98437104d fix(proxy_server.py): enable aggregate queries via /spend/keys 2024-02-08 20:29:08 -08:00
Krish Dholakia
95bf684a8c
Merge pull request #1898 from BerriAI/litellm_langfuse_error_logging
Litellm langfuse error logging - log input
2024-02-08 17:38:46 -08:00
Krrish Dholakia
ff93609453 build(schema.prisma): support direct url on prisma schema 2024-02-08 17:37:37 -08:00
Krrish Dholakia
64fd1f7d21 fix(langfuse.py): langfuse success logging fix 2024-02-08 16:46:04 -08:00
Krrish Dholakia
bc23a9266e fix(langfuse.py): support passing input params for langfuse errors 2024-02-08 16:37:33 -08:00
David Leen
140d915adf Add support for AWS credentials from profile file
https://boto3.amazonaws.com/v1/documentation/api/latest/guide/credentials.html#aws-config-file
2024-02-08 15:10:50 -08:00
Ishaan Jaff
59465bd612
Merge pull request #1892 from BerriAI/litellm_speed_up_s3_logging
[FEAT] 76 % Faster s3 logging Proxy / litellm.acompletion / router.acompletion 🚀
2024-02-08 11:48:36 -08:00
ishaan-jaff
6eb17cd916 (test) s3 logging 2024-02-08 11:11:19 -08:00
ishaan-jaff
c2b948e6a9 (test) s3 logging time 2024-02-08 11:01:11 -08:00
ishaan-jaff
ac4d9a7542 (feat) speed up s3 logging 2024-02-08 10:59:54 -08:00
ishaan-jaff
4cee080e7c (fix) ui /login without sso 2024-02-08 10:47:45 -08:00
ishaan-jaff
99b9a7bfec (test) load test s3 cache 2024-02-08 10:07:03 -08:00
ishaan-jaff
8a615cd125 (test) async s3 cache 2024-02-08 10:04:10 -08:00
ishaan-jaff
9a23317461 (feat) fix s3 cache 2024-02-08 10:01:56 -08:00
ishaan-jaff
c59021d090 (cookbook) load test litellm router 2024-02-08 07:24:28 -08:00
Krrish Dholakia
0d803e1379 fix(proxy_cli.py): fix max connection limit issue on db 2024-02-07 22:57:44 -08:00
Krrish Dholakia
551a4c60a7 fix(dynamo_db.py): eliminating unneccessary params 2024-02-07 22:52:15 -08:00
ishaan-jaff
3837c77df9 (feat) slack alerting when track callback fails 2024-02-07 20:09:28 -08:00
ishaan-jaff
e8e6fc6123 (fix) remove extra statement 2024-02-07 19:26:31 -08:00
ishaan-jaff
8a328b4c6d (fix) track cost for semantic_caching, place on langfuse trace 2024-02-07 19:20:15 -08:00
ishaan-jaff
5dc26b11bf Merge remote-tracking branch 'origin/main' into litellm_aert_when_budget_tracking_fails 2024-02-07 18:50:00 -08:00
ishaan-jaff
8462e85792 (feat) alert for failing cost tracking 2024-02-07 18:49:45 -08:00
Ishaan Jaff
98b0ace2e9
Merge pull request #1874 from BerriAI/litellm_azure_base_model_pricing
[FEAT] Azure Pricing - based on base_model in model_info
2024-02-07 18:37:55 -08:00
ishaan-jaff
e143eac6b5 (feat) add azure/gpt-4-0125-preview 2024-02-07 18:22:31 -08:00
ishaan-jaff
cc7a690c9b (fix) azure_base_model cost calc 2024-02-07 18:18:15 -08:00
Krrish Dholakia
d2dceb3537 fix(proxy_server.py): check if prisma client is set before scheduling reset budget 2024-02-07 18:14:37 -08:00
ishaan-jaff
6969b25946 (fix) azure cost calc 2024-02-07 17:33:10 -08:00
ishaan-jaff
e914dfa940 (ci/cd) runn again 2024-02-07 17:13:13 -08:00
ishaan-jaff
bb469278c6 (fix) cost tracking 2024-02-07 17:06:05 -08:00
ishaan-jaff
bbbd37f0cb (ci/cd) run again 2024-02-07 16:55:38 -08:00
ishaan-jaff
9c597cbe0b (feat) use base_model for azure cost 2024-02-07 16:33:35 -08:00
ishaan-jaff
0764af4392 (feat) use base_model for azure response_cost 2024-02-07 16:33:07 -08:00
ishaan-jaff
705396240e (test) using base_model for cost_calc on router 2024-02-07 16:30:58 -08:00
ishaan-jaff
920d684da4 (feat) log model_info in router metadata 2024-02-07 15:44:28 -08:00
ishaan-jaff
68926c6524 (fix) model_prices_and_context_window.json error 2024-02-07 15:42:37 -08:00
Krrish Dholakia
655fcd4d79 fix(utils.py): fix ollama stop sequence mapping 2024-02-07 13:14:03 -08:00