Commit graph

7089 commits

Author SHA1 Message Date
ishaan-jaff
cc7a690c9b (fix) azure_base_model cost calc 2024-02-07 18:18:15 -08:00
ishaan-jaff
6969b25946 (fix) azure cost calc 2024-02-07 17:33:10 -08:00
ishaan-jaff
e914dfa940 (ci/cd) runn again 2024-02-07 17:13:13 -08:00
ishaan-jaff
bb469278c6 (fix) cost tracking 2024-02-07 17:06:05 -08:00
ishaan-jaff
bbbd37f0cb (ci/cd) run again 2024-02-07 16:55:38 -08:00
ishaan-jaff
206a6845ac (docs) pricing base_model 2024-02-07 16:54:24 -08:00
ishaan-jaff
9c597cbe0b (feat) use base_model for azure cost 2024-02-07 16:33:35 -08:00
ishaan-jaff
0764af4392 (feat) use base_model for azure response_cost 2024-02-07 16:33:07 -08:00
ishaan-jaff
705396240e (test) using base_model for cost_calc on router 2024-02-07 16:30:58 -08:00
ishaan-jaff
920d684da4 (feat) log model_info in router metadata 2024-02-07 15:44:28 -08:00
ishaan-jaff
68926c6524 (fix) model_prices_and_context_window.json error 2024-02-07 15:42:37 -08:00
Krish Dholakia
0300512743
Update model_prices_and_context_window.json 2024-02-07 13:25:02 -08:00
Krish Dholakia
bb3bf122cc
Update model_prices_and_context_window.json 2024-02-07 13:23:38 -08:00
Krrish Dholakia
655fcd4d79 fix(utils.py): fix ollama stop sequence mapping 2024-02-07 13:14:03 -08:00
Krrish Dholakia
66913222a1 docs(langfuse_integration.md): docs for showing how to log errors to langfuse 2024-02-07 12:07:11 -08:00
Krrish Dholakia
048acc8e68 docs(ui.md): add more help to docs 2024-02-07 11:46:57 -08:00
Krrish Dholakia
2fcbe06af8 docs(ui.md): add proxy admin view to docs 2024-02-07 11:38:27 -08:00
Krrish Dholakia
e165ec40d4 docs(ui.md): ui doc updates 2024-02-07 11:38:27 -08:00
ishaan-jaff
258fe63e7d (fix) ui - when request body is None 2024-02-07 11:33:43 -08:00
Krrish Dholakia
8939593826 fix(proxy_server.py): fix merge errors 2024-02-07 00:04:52 -08:00
Krrish Dholakia
184e78772b refactor(proxy_server.py): fix merge error 2024-02-06 23:44:23 -08:00
Krrish Dholakia
9e138b9e4e bump: version 1.22.11 → 1.23.0 2024-02-06 23:39:28 -08:00
Krrish Dholakia
46dd08c207 refactor(main.py): trigger rebuild 2024-02-06 23:39:28 -08:00
Krish Dholakia
f785aee0df
Merge pull request #1860 from BerriAI/litellm_spend_logging_high_traffic
fix(proxy_server.py): prisma client fixes for high traffic
2024-02-06 23:37:43 -08:00
Krish Dholakia
df60edfa07
Merge branch 'main' into litellm_spend_logging_high_traffic 2024-02-06 23:36:58 -08:00
Krrish Dholakia
fd9c7a90af fix(proxy_server.py): update user cache to with new spend 2024-02-06 23:06:05 -08:00
Krrish Dholakia
73d8e3e640 fix(ollama_chat.py): fix token counting 2024-02-06 22:18:46 -08:00
Krrish Dholakia
4174471dac fix(proxy_server.py): fix endpoint 2024-02-06 22:09:30 -08:00
Krish Dholakia
2bc710d8e9
Merge pull request #1843 from BerriAI/litellm_admin_ui_view_all_keys
feat(ui): enable admin to view all valid keys created on the proxy
2024-02-06 22:06:46 -08:00
Krrish Dholakia
0874c17a31 fix: export npm build into proxy 2024-02-06 20:12:50 -08:00
Krrish Dholakia
4a0df3cb4f fix(proxy_cli.py-&&-proxy_server.py): bump reset budget intervals and fix pool limits for prisma connections 2024-02-06 19:39:49 -08:00
ishaan-jaff
5f4b06fb19 (docs) caching 2024-02-06 19:39:32 -08:00
ishaan-jaff
5a29f362ee (fix) allow litellm_settings to be None 2024-02-06 19:29:39 -08:00
ishaan-jaff
e9bf16bbda bump: version 1.22.10 → 1.22.11 2024-02-06 19:23:57 -08:00
ishaan-jaff
c69eaebfd8 (fix) dockerfile for semantic caching 2024-02-06 19:23:27 -08:00
ishaan-jaff
7b26b3b789 (ci/cd) run again 2024-02-06 18:25:15 -08:00
Krrish Dholakia
b6adeec347 fix(proxy_server.py): prisma client fixes for high traffic 2024-02-06 17:30:36 -08:00
ishaan-jaff
83628938ab bump: version 1.22.9 → 1.22.10 2024-02-06 17:12:46 -08:00
Ishaan Jaff
73c6ce890b
Merge pull request #1859 from BerriAI/litellm_allow_using_budgets_without_keys
[Feat] Budgets for 'user' param passed to /chat/completions, /embeddings etc
2024-02-06 16:32:25 -08:00
ishaan-jaff
6369424629 (ci/cd) run again 2024-02-06 16:08:25 -08:00
Krish Dholakia
0fd64bc906
Merge pull request #1839 from BerriAI/litellm_slack_langfuse_alerting
fix(proxy/utils.py): if langfuse trace id passed in, include in slack alert
2024-02-06 15:49:00 -08:00
Krish Dholakia
9e9fb747ce
Merge branch 'main' into litellm_slack_langfuse_alerting 2024-02-06 15:48:52 -08:00
ishaan-jaff
8208ebd9db (docs) budget per end_user 2024-02-06 15:39:45 -08:00
ishaan-jaff
196787359f (test) track_cost_ for end users 2024-02-06 15:25:51 -08:00
ishaan-jaff
52b864976b (feat) support max_user_budget 2024-02-06 15:19:36 -08:00
Krrish Dholakia
be81183782 refactor(main.py): trigger deploy
n
2024-02-06 15:17:40 -08:00
ishaan-jaff
78f75647da (fix) redisvl requirements.txt issue 2024-02-06 15:17:40 -08:00
ishaan-jaff
8ba2c8dbf7 (fix) langfuse show semantic-similarity in tags 2024-02-06 15:17:40 -08:00
ishaan-jaff
eb3b68a2f0 (fix) dockerfile requirements.txt 2024-02-06 15:17:40 -08:00
ishaan-jaff
325ca43946 (feat) show semantic-cache on health/readiness 2024-02-06 15:17:40 -08:00