ishaan-jaff
|
6eb17cd916
|
(test) s3 logging
|
2024-02-08 11:11:19 -08:00 |
|
ishaan-jaff
|
c2b948e6a9
|
(test) s3 logging time
|
2024-02-08 11:01:11 -08:00 |
|
ishaan-jaff
|
ac4d9a7542
|
(feat) speed up s3 logging
|
2024-02-08 10:59:54 -08:00 |
|
Krish Dholakia
|
7c83ba061b
|
Update README.md
|
2024-02-08 10:06:27 -08:00 |
|
Krrish Dholakia
|
9be5e2f7e3
|
docs(enterprise.md): adding enterprise support to docs
|
2024-02-08 10:02:40 -08:00 |
|
ishaan-jaff
|
c59021d090
|
(cookbook) load test litellm router
|
2024-02-08 07:24:28 -08:00 |
|
Krrish Dholakia
|
0d803e1379
|
fix(proxy_cli.py): fix max connection limit issue on db
|
2024-02-07 22:57:44 -08:00 |
|
ishaan-jaff
|
3c54d8d1a6
|
bump: version 1.23.1 → 1.23.2
|
2024-02-07 20:13:50 -08:00 |
|
Ishaan Jaff
|
ed4751e2de
|
Merge pull request #1877 from BerriAI/litellm_aert_when_budget_tracking_fails
[Feat] Slack Alert when budget tracking fails
|
2024-02-07 20:12:00 -08:00 |
|
ishaan-jaff
|
3837c77df9
|
(feat) slack alerting when track callback fails
|
2024-02-07 20:09:28 -08:00 |
|
ishaan-jaff
|
e8e6fc6123
|
(fix) remove extra statement
|
2024-02-07 19:26:31 -08:00 |
|
Ishaan Jaff
|
717dc78d53
|
Merge pull request #1878 from BerriAI/litellm_improve_semantic_cache_tracing
[Feat] Semantic Caching - Track Cost of using embedding, Use Langfuse Trace ID
|
2024-02-07 19:25:23 -08:00 |
|
ishaan-jaff
|
8197b3de0a
|
(fix) remove extra statement
|
2024-02-07 19:24:27 -08:00 |
|
ishaan-jaff
|
8425a8ba22
|
(fix) track cost for semantic_caching, place on langfuse trace
|
2024-02-07 19:21:50 -08:00 |
|
ishaan-jaff
|
8a328b4c6d
|
(fix) track cost for semantic_caching, place on langfuse trace
|
2024-02-07 19:20:15 -08:00 |
|
ishaan-jaff
|
5dc26b11bf
|
Merge remote-tracking branch 'origin/main' into litellm_aert_when_budget_tracking_fails
|
2024-02-07 18:50:00 -08:00 |
|
ishaan-jaff
|
8462e85792
|
(feat) alert for failing cost tracking
|
2024-02-07 18:49:45 -08:00 |
|
Ishaan Jaff
|
98b0ace2e9
|
Merge pull request #1874 from BerriAI/litellm_azure_base_model_pricing
[FEAT] Azure Pricing - based on base_model in model_info
|
2024-02-07 18:37:55 -08:00 |
|
Ishaan Jaff
|
e17e78396e
|
Merge pull request #1876 from BerriAI/litellm_add_azure_gpt_4_turbo
[Feat] add azure/gpt-4-0125-preview
|
2024-02-07 18:23:29 -08:00 |
|
ishaan-jaff
|
c98e247b0f
|
(docs) azure/gpt-4-0125-preview
|
2024-02-07 18:22:38 -08:00 |
|
ishaan-jaff
|
e143eac6b5
|
(feat) add azure/gpt-4-0125-preview
|
2024-02-07 18:22:31 -08:00 |
|
ishaan-jaff
|
cc7a690c9b
|
(fix) azure_base_model cost calc
|
2024-02-07 18:18:15 -08:00 |
|
Krrish Dholakia
|
4cb7759fcd
|
bump: version 1.23.0 → 1.23.1
|
2024-02-07 18:14:46 -08:00 |
|
Krrish Dholakia
|
d2dceb3537
|
fix(proxy_server.py): check if prisma client is set before scheduling reset budget
|
2024-02-07 18:14:37 -08:00 |
|
ishaan-jaff
|
6969b25946
|
(fix) azure cost calc
|
2024-02-07 17:33:10 -08:00 |
|
ishaan-jaff
|
e914dfa940
|
(ci/cd) runn again
|
2024-02-07 17:13:13 -08:00 |
|
ishaan-jaff
|
bb469278c6
|
(fix) cost tracking
|
2024-02-07 17:06:05 -08:00 |
|
ishaan-jaff
|
bbbd37f0cb
|
(ci/cd) run again
|
2024-02-07 16:55:38 -08:00 |
|
ishaan-jaff
|
206a6845ac
|
(docs) pricing base_model
|
2024-02-07 16:54:24 -08:00 |
|
ishaan-jaff
|
9c597cbe0b
|
(feat) use base_model for azure cost
|
2024-02-07 16:33:35 -08:00 |
|
ishaan-jaff
|
0764af4392
|
(feat) use base_model for azure response_cost
|
2024-02-07 16:33:07 -08:00 |
|
ishaan-jaff
|
705396240e
|
(test) using base_model for cost_calc on router
|
2024-02-07 16:30:58 -08:00 |
|
ishaan-jaff
|
920d684da4
|
(feat) log model_info in router metadata
|
2024-02-07 15:44:28 -08:00 |
|
ishaan-jaff
|
68926c6524
|
(fix) model_prices_and_context_window.json error
|
2024-02-07 15:42:37 -08:00 |
|
Krish Dholakia
|
0300512743
|
Update model_prices_and_context_window.json
|
2024-02-07 13:25:02 -08:00 |
|
Krish Dholakia
|
bb3bf122cc
|
Update model_prices_and_context_window.json
|
2024-02-07 13:23:38 -08:00 |
|
Krrish Dholakia
|
655fcd4d79
|
fix(utils.py): fix ollama stop sequence mapping
|
2024-02-07 13:14:03 -08:00 |
|
Krrish Dholakia
|
66913222a1
|
docs(langfuse_integration.md): docs for showing how to log errors to langfuse
|
2024-02-07 12:07:11 -08:00 |
|
Krrish Dholakia
|
048acc8e68
|
docs(ui.md): add more help to docs
|
2024-02-07 11:46:57 -08:00 |
|
Krrish Dholakia
|
2fcbe06af8
|
docs(ui.md): add proxy admin view to docs
|
2024-02-07 11:38:27 -08:00 |
|
Krrish Dholakia
|
e165ec40d4
|
docs(ui.md): ui doc updates
|
2024-02-07 11:38:27 -08:00 |
|
ishaan-jaff
|
258fe63e7d
|
(fix) ui - when request body is None
|
2024-02-07 11:33:43 -08:00 |
|
Krrish Dholakia
|
8939593826
|
fix(proxy_server.py): fix merge errors
|
2024-02-07 00:04:52 -08:00 |
|
Krrish Dholakia
|
184e78772b
|
refactor(proxy_server.py): fix merge error
|
2024-02-06 23:44:23 -08:00 |
|
Krrish Dholakia
|
9e138b9e4e
|
bump: version 1.22.11 → 1.23.0
|
2024-02-06 23:39:28 -08:00 |
|
Krrish Dholakia
|
46dd08c207
|
refactor(main.py): trigger rebuild
|
2024-02-06 23:39:28 -08:00 |
|
Krish Dholakia
|
f785aee0df
|
Merge pull request #1860 from BerriAI/litellm_spend_logging_high_traffic
fix(proxy_server.py): prisma client fixes for high traffic
|
2024-02-06 23:37:43 -08:00 |
|
Krish Dholakia
|
df60edfa07
|
Merge branch 'main' into litellm_spend_logging_high_traffic
|
2024-02-06 23:36:58 -08:00 |
|
Krrish Dholakia
|
fd9c7a90af
|
fix(proxy_server.py): update user cache to with new spend
|
2024-02-06 23:06:05 -08:00 |
|
Krrish Dholakia
|
73d8e3e640
|
fix(ollama_chat.py): fix token counting
|
2024-02-06 22:18:46 -08:00 |
|