litellm-mirror

Mirror of https://github.com/BerriAI/litellm.git, synced 2025-04-26 19:24:27 +00:00

Author	SHA1	Message	Date
Krish Dholakia	4c8b620352	Merge pull request #3685 from BerriAI/litellm_lago_integration feat(lago.py): Enable Usage-based billing with lago	2024-05-16 13:09:48 -07:00
Krrish Dholakia	d167a9ea99	feat(lago.py): adding support for usage-based billing with lago Closes https://github.com/BerriAI/litellm/issues/3639	2024-05-16 10:54:18 -07:00
Krish Dholakia	07a1cf39e8	Merge branch 'main' into litellm_end_user_cost_tracking	2024-05-15 22:27:04 -07:00
Ishaan Jaff	d257518e47	fix deep copy logic in during_call_hook	2024-05-15 17:34:54 -07:00
Ishaan Jaff	38a539baf4	show debug info on slack alerts	2024-05-15 13:23:44 -07:00
Krrish Dholakia	9945387d7a	feat(proxy_server.py): new `/end_user/info` endpoint get spend for a specific end-user	2024-05-15 09:50:52 -07:00
Krish Dholakia	64ca2fde53	Merge branch 'main' into litellm_region_based_routing	2024-05-08 22:19:51 -07:00
Krrish Dholakia	7c99808529	feat(proxy_server.py): add CRUD endpoints for 'end_user' management allow admin to specify region + default models for end users	2024-05-08 18:50:36 -07:00
Krrish Dholakia	5f93cae3ff	feat(proxy_server.py): return litellm version in response headers	2024-05-08 16:00:08 -07:00
Krrish Dholakia	e5c57cdddb	feat(slack_alerting.py): reintegrate langfuse trace url for slack alerts this ensures langfuse trace url returned in llm api exception err	2024-05-07 12:58:49 -07:00
Krrish Dholakia	2170e1723d	feat(proxy_server.py): schedule slack daily report if enabled if user enabled daily_reports, send them a slack report every 12 hours	2024-05-06 18:25:48 -07:00
Krrish Dholakia	0c1fc34e9f	feat(slack_alerting.py): support sending daily reports on deployments allow admin to easily know slow + failing deployments Closes https://github.com/BerriAI/litellm/issues/3483	2024-05-06 17:18:42 -07:00
Krrish Dholakia	4a22948ce4	fix(proxy/utils.py): fix slack alerting to only raise alerts for llm api exceptions don't spam for bad user requests. Closes https://github.com/BerriAI/litellm/issues/3395	2024-05-02 17:18:21 -07:00
Krrish Dholakia	c1ffa7d409	fix(proxy/utils.py): fix retry logic for generic data request	2024-05-02 14:50:50 -07:00
Krrish Dholakia	e7b3ac8e06	feat(openmeter.py): add support for user billing open-meter supports user based billing. Closes https://github.com/BerriAI/litellm/issues/1268	2024-05-01 17:23:48 -07:00
Krrish Dholakia	0b2c3f78d4	fix(proxy/utils.py): emit number of spend transactions for keys being written to db in a batch	2024-05-01 08:25:04 -07:00
Krrish Dholakia	462724feef	fix(proxy/utils.py): log rejected proxy requests to langfuse	2024-04-25 19:26:27 -07:00
Ishaan Jaff	2190b039fd	pass alert type on alerting handle	2024-04-25 13:05:34 -07:00
Ishaan Jaff	c5a0b3a8d4	fix - refactor slack alerting	2024-04-23 18:34:11 -07:00
Ishaan Jaff	a3109db4e1	fix langfuse project id	2024-04-23 11:37:17 -07:00
Ishaan Jaff	d076aed9d0	fix - dont send alert on fail request	2024-04-22 16:07:58 -07:00
Ishaan Jaff	8fb9c8d833	ui - find all teams	2024-04-22 14:15:09 -07:00
Ishaan Jaff	aa365c5c4a	feat - show langfuse trace in alerts	2024-04-22 08:51:46 -07:00
Ishaan Jaff	d4c9439cc0	fix - slack alerting show input in the api_base	2024-04-20 13:16:47 -07:00
Ishaan Jaff	828645137c	feat - log team_alias to langfuse	2024-04-19 10:29:42 -07:00
Ishaan Jaff	532e252559	fix - show api_base in hanging requests	2024-04-18 21:01:26 -07:00
Ishaan Jaff	ecc770be00	fix - show api base on hanging requests	2024-04-18 20:57:22 -07:00
Ishaan Jaff	977b030dd9	ui - show all alert types when getting all callbacks	2024-04-18 20:08:13 -07:00
Krish Dholakia	741a18a040	Merge pull request #3144 from BerriAI/litellm_prometheus_latency_tracking feat(prometheus_services.py): emit proxy latency for successful llm api requests	2024-04-18 19:10:58 -07:00
Ishaan Jaff	27333d17e2	fix order by spend	2024-04-18 17:33:38 -07:00
Ishaan Jaff	03b4652af1	fix return key aliases on /user/info	2024-04-18 17:16:52 -07:00
Krrish Dholakia	51cc8dd95b	fix(proxy/utils.py): add prometheus failed db request tracking	2024-04-18 16:30:29 -07:00
Krrish Dholakia	cdfd873713	fix(proxy/utils.py): add call type and duration to proxy_logging failure calls this is for tracking failed db requests on prometheus	2024-04-18 16:24:36 -07:00
Ishaan Jaff	bb07c5fdc5	Merge pull request #3112 from BerriAI/litellm_add_alert_types [Feat] Allow user to select slack alert types to Opt In to	2024-04-18 16:21:33 -07:00
Krrish Dholakia	7f5bcf38b7	feat(prometheus_services.py): emit proxy latency for successful llm api requests uses prometheus histogram for this	2024-04-18 16:04:35 -07:00
Ishaan Jaff	d6e3f587fe	fix trim messages to first 100 chars	2024-04-18 15:21:31 -07:00
Ishaan Jaff	d178916048	fix - test alerting	2024-04-18 11:40:40 -07:00
Ishaan Jaff	58eea0f330	feat return alert types on /config/get/callback	2024-04-17 21:02:10 -07:00
Ishaan Jaff	a97f8a40c1	fix - user based alerting	2024-04-17 20:35:29 -07:00
Ishaan Jaff	2e62b0059c	v0 add types of alerts to slack alerting	2024-04-17 18:16:19 -07:00
Ishaan Jaff	39488780e0	litellm_add_proxy_base_url in slack alerts	2024-04-17 17:42:28 -07:00
Krrish Dholakia	d75cfc5e32	fix(utils.py): return vertex api base for request hanging alerts	2024-04-16 17:53:28 -07:00
Krrish Dholakia	aa5da4346a	fix(proxy_server.py): support tracking org spend currently works when org set for jwt auth	2024-04-11 23:01:21 -07:00
Krrish Dholakia	07798af50d	fix(proxy/utils.py): fix error message	2024-04-08 20:47:13 -07:00
Krrish Dholakia	da216c6915	fix(proxy_server.py): allow mapping a user to an org	2024-04-08 20:45:11 -07:00
Krrish Dholakia	0dad78b53c	feat(proxy/utils.py): return api base for request hanging alerts	2024-04-06 15:58:53 -07:00
Krrish Dholakia	ece37a4b7f	feat(ui): add models via ui adds ability to add models via ui to the proxy. also fixes additional bugs around new /model/new endpoint	2024-04-04 18:56:20 -07:00
Krrish Dholakia	129bb52e9d	fix(proxy_server.py): persist models added via `/model/new` to db allows models to be used across instances https://github.com/BerriAI/litellm/issues/2319 , https://github.com/BerriAI/litellm/issues/2329	2024-04-03 20:16:41 -07:00
Krrish Dholakia	029ee15951	perf(proxy_server.py): batch write spend logs reduces prisma client errors, by batch writing spend logs - max 1k logs at a time	2024-04-02 18:46:55 -07:00
Krrish Dholakia	e06d43dc90	fix(tpm_rpm_limiter.py): fix cache init logic	2024-04-01 18:01:38 -07:00

503 commits