ishaan-jaff
|
e8e6fc6123
|
(fix) remove extra statement
|
2024-02-07 19:26:31 -08:00 |
|
ishaan-jaff
|
8a328b4c6d
|
(fix) track cost for semantic_caching, place on langfuse trace
|
2024-02-07 19:20:15 -08:00 |
|
ishaan-jaff
|
5dc26b11bf
|
Merge remote-tracking branch 'origin/main' into litellm_aert_when_budget_tracking_fails
|
2024-02-07 18:50:00 -08:00 |
|
ishaan-jaff
|
8462e85792
|
(feat) alert for failing cost tracking
|
2024-02-07 18:49:45 -08:00 |
|
Ishaan Jaff
|
98b0ace2e9
|
Merge pull request #1874 from BerriAI/litellm_azure_base_model_pricing
[FEAT] Azure Pricing - based on base_model in model_info
|
2024-02-07 18:37:55 -08:00 |
|
ishaan-jaff
|
e143eac6b5
|
(feat) add azure/gpt-4-0125-preview
|
2024-02-07 18:22:31 -08:00 |
|
ishaan-jaff
|
cc7a690c9b
|
(fix) azure_base_model cost calc
|
2024-02-07 18:18:15 -08:00 |
|
Krrish Dholakia
|
d2dceb3537
|
fix(proxy_server.py): check if prisma client is set before scheduling reset budget
|
2024-02-07 18:14:37 -08:00 |
|
ishaan-jaff
|
6969b25946
|
(fix) azure cost calc
|
2024-02-07 17:33:10 -08:00 |
|
ishaan-jaff
|
e914dfa940
|
(ci/cd) runn again
|
2024-02-07 17:13:13 -08:00 |
|
ishaan-jaff
|
bb469278c6
|
(fix) cost tracking
|
2024-02-07 17:06:05 -08:00 |
|
ishaan-jaff
|
bbbd37f0cb
|
(ci/cd) run again
|
2024-02-07 16:55:38 -08:00 |
|
ishaan-jaff
|
9c597cbe0b
|
(feat) use base_model for azure cost
|
2024-02-07 16:33:35 -08:00 |
|
ishaan-jaff
|
0764af4392
|
(feat) use base_model for azure response_cost
|
2024-02-07 16:33:07 -08:00 |
|
ishaan-jaff
|
705396240e
|
(test) using base_model for cost_calc on router
|
2024-02-07 16:30:58 -08:00 |
|
ishaan-jaff
|
920d684da4
|
(feat) log model_info in router metadata
|
2024-02-07 15:44:28 -08:00 |
|
ishaan-jaff
|
68926c6524
|
(fix) model_prices_and_context_window.json error
|
2024-02-07 15:42:37 -08:00 |
|
Krrish Dholakia
|
655fcd4d79
|
fix(utils.py): fix ollama stop sequence mapping
|
2024-02-07 13:14:03 -08:00 |
|
ishaan-jaff
|
258fe63e7d
|
(fix) ui - when request body is None
|
2024-02-07 11:33:43 -08:00 |
|
Krrish Dholakia
|
8939593826
|
fix(proxy_server.py): fix merge errors
|
2024-02-07 00:04:52 -08:00 |
|
Krrish Dholakia
|
184e78772b
|
refactor(proxy_server.py): fix merge error
|
2024-02-06 23:44:23 -08:00 |
|
Krrish Dholakia
|
46dd08c207
|
refactor(main.py): trigger rebuild
|
2024-02-06 23:39:28 -08:00 |
|
Krish Dholakia
|
df60edfa07
|
Merge branch 'main' into litellm_spend_logging_high_traffic
|
2024-02-06 23:36:58 -08:00 |
|
Krrish Dholakia
|
fd9c7a90af
|
fix(proxy_server.py): update user cache to with new spend
|
2024-02-06 23:06:05 -08:00 |
|
Krrish Dholakia
|
73d8e3e640
|
fix(ollama_chat.py): fix token counting
|
2024-02-06 22:18:46 -08:00 |
|
Krrish Dholakia
|
4174471dac
|
fix(proxy_server.py): fix endpoint
|
2024-02-06 22:09:30 -08:00 |
|
Krish Dholakia
|
2bc710d8e9
|
Merge pull request #1843 from BerriAI/litellm_admin_ui_view_all_keys
feat(ui): enable admin to view all valid keys created on the proxy
|
2024-02-06 22:06:46 -08:00 |
|
Krrish Dholakia
|
0874c17a31
|
fix: export npm build into proxy
|
2024-02-06 20:12:50 -08:00 |
|
Krrish Dholakia
|
4a0df3cb4f
|
fix(proxy_cli.py-&&-proxy_server.py): bump reset budget intervals and fix pool limits for prisma connections
|
2024-02-06 19:39:49 -08:00 |
|
ishaan-jaff
|
5a29f362ee
|
(fix) allow litellm_settings to be None
|
2024-02-06 19:29:39 -08:00 |
|
ishaan-jaff
|
7b26b3b789
|
(ci/cd) run again
|
2024-02-06 18:25:15 -08:00 |
|
Krrish Dholakia
|
b6adeec347
|
fix(proxy_server.py): prisma client fixes for high traffic
|
2024-02-06 17:30:36 -08:00 |
|
Ishaan Jaff
|
73c6ce890b
|
Merge pull request #1859 from BerriAI/litellm_allow_using_budgets_without_keys
[Feat] Budgets for 'user' param passed to /chat/completions, /embeddings etc
|
2024-02-06 16:32:25 -08:00 |
|
ishaan-jaff
|
6369424629
|
(ci/cd) run again
|
2024-02-06 16:08:25 -08:00 |
|
Krish Dholakia
|
9e9fb747ce
|
Merge branch 'main' into litellm_slack_langfuse_alerting
|
2024-02-06 15:48:52 -08:00 |
|
ishaan-jaff
|
196787359f
|
(test) track_cost_ for end users
|
2024-02-06 15:25:51 -08:00 |
|
ishaan-jaff
|
52b864976b
|
(feat) support max_user_budget
|
2024-02-06 15:19:36 -08:00 |
|
Krrish Dholakia
|
be81183782
|
refactor(main.py): trigger deploy
n
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
8ba2c8dbf7
|
(fix) langfuse show semantic-similarity in tags
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
325ca43946
|
(feat) show semantic-cache on health/readiness
|
2024-02-06 15:17:40 -08:00 |
|
Krrish Dholakia
|
0d03b28a3b
|
test(test_completion.py): fix test
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
b5db630dba
|
(ci/cd) run again
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
43061d612d
|
(fix) mark semantic caching as beta test
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
e32c2beddd
|
(fix) semantic caching
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
f3de05cc54
|
(fix) test-semantic caching
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
f8248b2c79
|
(feat) redis-semantic cache on proxy
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
58f47c9e29
|
(fix) use semantic cache on proxy
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
91424b66d7
|
allow setting redis_semantic cache_embedding model
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
e2c88ce154
|
(feat) log semantic_sim to langfuse
|
2024-02-06 15:17:40 -08:00 |
|
ishaan-jaff
|
e0d5c953d6
|
(feat) working semantic cache on proxy
|
2024-02-06 15:17:40 -08:00 |
|