Commit graph

7112 commits

Author SHA1 Message Date
ishaan-jaff
6eb17cd916 (test) s3 logging 2024-02-08 11:11:19 -08:00
ishaan-jaff
c2b948e6a9 (test) s3 logging time 2024-02-08 11:01:11 -08:00
ishaan-jaff
ac4d9a7542 (feat) speed up s3 logging 2024-02-08 10:59:54 -08:00
Krish Dholakia
7c83ba061b Update README.md 2024-02-08 10:06:27 -08:00
Krrish Dholakia
9be5e2f7e3 docs(enterprise.md): adding enterprise support to docs 2024-02-08 10:02:40 -08:00
ishaan-jaff
c59021d090 (cookbook) load test litellm router 2024-02-08 07:24:28 -08:00
Krrish Dholakia
0d803e1379 fix(proxy_cli.py): fix max connection limit issue on db 2024-02-07 22:57:44 -08:00
ishaan-jaff
3c54d8d1a6 bump: version 1.23.1 → 1.23.2 2024-02-07 20:13:50 -08:00
Ishaan Jaff
ed4751e2de Merge pull request #1877 from BerriAI/litellm_aert_when_budget_tracking_fails
[Feat] Slack Alert when budget tracking fails
2024-02-07 20:12:00 -08:00
ishaan-jaff
3837c77df9 (feat) slack alerting when track callback fails 2024-02-07 20:09:28 -08:00
ishaan-jaff
e8e6fc6123 (fix) remove extra statement 2024-02-07 19:26:31 -08:00
Ishaan Jaff
717dc78d53 Merge pull request #1878 from BerriAI/litellm_improve_semantic_cache_tracing
[Feat] Semantic Caching - Track Cost of using embedding, Use Langfuse Trace ID
2024-02-07 19:25:23 -08:00
ishaan-jaff
8197b3de0a (fix) remove extra statement 2024-02-07 19:24:27 -08:00
ishaan-jaff
8425a8ba22 (fix) track cost for semantic_caching, place on langfuse trace 2024-02-07 19:21:50 -08:00
ishaan-jaff
8a328b4c6d (fix) track cost for semantic_caching, place on langfuse trace 2024-02-07 19:20:15 -08:00
ishaan-jaff
5dc26b11bf Merge remote-tracking branch 'origin/main' into litellm_aert_when_budget_tracking_fails 2024-02-07 18:50:00 -08:00
ishaan-jaff
8462e85792 (feat) alert for failing cost tracking 2024-02-07 18:49:45 -08:00
Ishaan Jaff
98b0ace2e9 Merge pull request #1874 from BerriAI/litellm_azure_base_model_pricing
[FEAT] Azure Pricing - based on base_model in model_info
2024-02-07 18:37:55 -08:00
Ishaan Jaff
e17e78396e Merge pull request #1876 from BerriAI/litellm_add_azure_gpt_4_turbo
[Feat] add azure/gpt-4-0125-preview
2024-02-07 18:23:29 -08:00
ishaan-jaff
c98e247b0f (docs) azure/gpt-4-0125-preview 2024-02-07 18:22:38 -08:00
ishaan-jaff
e143eac6b5 (feat) add azure/gpt-4-0125-preview 2024-02-07 18:22:31 -08:00
ishaan-jaff
cc7a690c9b (fix) azure_base_model cost calc 2024-02-07 18:18:15 -08:00
Krrish Dholakia
4cb7759fcd bump: version 1.23.0 → 1.23.1 2024-02-07 18:14:46 -08:00
Krrish Dholakia
d2dceb3537 fix(proxy_server.py): check if prisma client is set before scheduling reset budget 2024-02-07 18:14:37 -08:00
ishaan-jaff
6969b25946 (fix) azure cost calc 2024-02-07 17:33:10 -08:00
ishaan-jaff
e914dfa940 (ci/cd) runn again 2024-02-07 17:13:13 -08:00
ishaan-jaff
bb469278c6 (fix) cost tracking 2024-02-07 17:06:05 -08:00
ishaan-jaff
bbbd37f0cb (ci/cd) run again 2024-02-07 16:55:38 -08:00
ishaan-jaff
206a6845ac (docs) pricing base_model 2024-02-07 16:54:24 -08:00
ishaan-jaff
9c597cbe0b (feat) use base_model for azure cost 2024-02-07 16:33:35 -08:00
ishaan-jaff
0764af4392 (feat) use base_model for azure response_cost 2024-02-07 16:33:07 -08:00
ishaan-jaff
705396240e (test) using base_model for cost_calc on router 2024-02-07 16:30:58 -08:00
ishaan-jaff
920d684da4 (feat) log model_info in router metadata 2024-02-07 15:44:28 -08:00
ishaan-jaff
68926c6524 (fix) model_prices_and_context_window.json error 2024-02-07 15:42:37 -08:00
Krish Dholakia
0300512743 Update model_prices_and_context_window.json 2024-02-07 13:25:02 -08:00
Krish Dholakia
bb3bf122cc Update model_prices_and_context_window.json 2024-02-07 13:23:38 -08:00
Krrish Dholakia
655fcd4d79 fix(utils.py): fix ollama stop sequence mapping 2024-02-07 13:14:03 -08:00
Krrish Dholakia
66913222a1 docs(langfuse_integration.md): docs for showing how to log errors to langfuse 2024-02-07 12:07:11 -08:00
Krrish Dholakia
048acc8e68 docs(ui.md): add more help to docs 2024-02-07 11:46:57 -08:00
Krrish Dholakia
2fcbe06af8 docs(ui.md): add proxy admin view to docs 2024-02-07 11:38:27 -08:00
Krrish Dholakia
e165ec40d4 docs(ui.md): ui doc updates 2024-02-07 11:38:27 -08:00
ishaan-jaff
258fe63e7d (fix) ui - when request body is None 2024-02-07 11:33:43 -08:00
Krrish Dholakia
8939593826 fix(proxy_server.py): fix merge errors 2024-02-07 00:04:52 -08:00
Krrish Dholakia
184e78772b refactor(proxy_server.py): fix merge error 2024-02-06 23:44:23 -08:00
Krrish Dholakia
9e138b9e4e bump: version 1.22.11 → 1.23.0 2024-02-06 23:39:28 -08:00
Krrish Dholakia
46dd08c207 refactor(main.py): trigger rebuild 2024-02-06 23:39:28 -08:00
Krish Dholakia
f785aee0df Merge pull request #1860 from BerriAI/litellm_spend_logging_high_traffic
fix(proxy_server.py): prisma client fixes for high traffic
2024-02-06 23:37:43 -08:00
Krish Dholakia
df60edfa07 Merge branch 'main' into litellm_spend_logging_high_traffic 2024-02-06 23:36:58 -08:00
Krrish Dholakia
fd9c7a90af fix(proxy_server.py): update user cache to with new spend 2024-02-06 23:06:05 -08:00
Krrish Dholakia
73d8e3e640 fix(ollama_chat.py): fix token counting 2024-02-06 22:18:46 -08:00