Commit graph

946 commits

Author SHA1 Message Date
Ishaan Jaff
1588453532
Merge pull request #2516 from marcklingen/patch-1
Add langfuse `sdk_integration`
2024-05-01 20:01:47 -07:00
Krrish Dholakia
dbf999f56e refactor(langfuse.py): remove print statements 2024-05-01 19:11:59 -07:00
Ishaan Jaff
4e3ca5a26e fix - errant prints on langfuse 2024-05-01 19:07:56 -07:00
Krrish Dholakia
2a9651b3ca feat(openmeter.py): add support for user billing
open-meter supports user based billing. Closes https://github.com/BerriAI/litellm/issues/1268
2024-05-01 17:23:48 -07:00
Krrish Dholakia
0ab6b4bb22 fix(langfuse.py): fix trace param overwriting when existing trace id is given 2024-05-01 08:44:46 -07:00
Krrish Dholakia
abdae87ba2 fix(langfuse.py): don't overwrite trace details if existing trace id passed in 2024-05-01 08:15:17 -07:00
Marc Klingen
adf5e61f2e
Merge branch 'main' into patch-1 2024-05-01 15:19:25 +02:00
Krrish Dholakia
b46db8b891 feat(utils.py): json logs for raw request sent by litellm
make it easier to view verbose logs in datadog
2024-04-29 19:21:19 -07:00
Krrish Dholakia
853b70aba9 fix(langfuse.py): support 'existing_trace_id' param
allow user to call out a trace as pre-existing, this prevents creating a default trace name, and potentially overwriting past traces
2024-04-29 16:39:17 -07:00
Krrish Dholakia
2cf069befb fix(langfuse.py): don't set default trace_name if trace_id given 2024-04-29 16:39:17 -07:00
Ishaan Jaff
d42c3309ec
Merge pull request #3338 from BerriAI/litellm_alerting_sort_latencies
Fix - slack alerting show deployment latencies in sorted order
2024-04-27 18:40:07 -07:00
Ishaan Jaff
cc51db1999 fix slack alerting show deployment latencies 2024-04-27 15:44:30 -07:00
Ishaan Jaff
d6827c3574 fix - link to langfuse traces on slack alerts 2024-04-27 15:23:18 -07:00
Ishaan Jaff
1b586e5042 fix - allow langfuse init with flush interval 2024-04-27 14:53:18 -07:00
Krish Dholakia
40b6b4794b
Merge pull request #3310 from BerriAI/litellm_langfuse_error_logging_2
fix(proxy/utils.py): log rejected proxy requests to langfuse
2024-04-25 19:49:59 -07:00
Ishaan Jaff
a881d16101
Merge pull request #3301 from BerriAI/debug_lowest_latency
[Fix] sending deployment latencies to slack alerting - lowest_latency
2024-04-25 19:43:25 -07:00
Ishaan Jaff
24e918b10f fix clean up prints from slack alerting 2024-04-25 19:36:42 -07:00
Krrish Dholakia
885de2e3c6 fix(proxy/utils.py): log rejected proxy requests to langfuse 2024-04-25 19:26:27 -07:00
Ishaan Jaff
1d5e70f7a0 pass alert type on alerting handle 2024-04-25 13:05:34 -07:00
Ishaan Jaff
984259d420 temp - show better debug logs for lowest latency 2024-04-25 11:22:52 -07:00
Ishaan Jaff
2c7f4695d9
Merge pull request #3283 from BerriAI/litellm_debug_lowest_latency
[Fix] Add better observability for debugging lowest latency routing
2024-04-24 20:42:52 -07:00
Ishaan Jaff
ffc277c2bb temp fix _add_langfuse_trace_id_to_alert 2024-04-24 20:19:21 -07:00
Ishaan Jaff
24b0dfee29 fix - add _get_deployment_latencies_to_alert 2024-04-24 20:09:02 -07:00
Krrish Dholakia
1f7abb8d92 fix(langfuse.py): fix langfuse environ check 2024-04-24 13:21:00 -07:00
Krish Dholakia
263439ee4a
Merge pull request #3098 from greenscale-ai/main
Support for Greenscale AI logging
2024-04-24 13:09:03 -07:00
Krrish Dholakia
52d5e02d8b fix(prometheus.py): fix user-id get for prometheus 2024-04-24 08:08:42 -07:00
Krrish Dholakia
9e3514fd24 fix(langsmith.py): fix streaming langsmith calls 2024-04-23 20:47:19 -07:00
Krish Dholakia
4acdde988f
Merge pull request #3250 from BerriAI/litellm_caching_no_cache_fix
fix(utils.py): fix 'no-cache': true when caching is turned on
2024-04-23 19:57:07 -07:00
Ishaan Jaff
bfc84a7e81 fix - slack alerting use one httpx client for alerts 2024-04-23 18:47:39 -07:00
Ishaan Jaff
37069f45cb fix - use one slack class 2024-04-23 18:31:15 -07:00
Krrish Dholakia
161e836427 fix(utils.py): fix 'no-cache': true when caching is turned on 2024-04-23 12:58:30 -07:00
Ishaan Jaff
5118c96b4d fix langfuse project id 2024-04-23 11:37:17 -07:00
Krrish Dholakia
6ac0dba5c2 fix(prometheus.py): add user tracking to prometheus 2024-04-22 15:14:38 -07:00
Krrish Dholakia
9379e3d047 fix(lowest_tpm_rpm_v2.py): use a combined tpm+rpm query in async get cache, to reduce redis client calls in high traffic 2024-04-20 16:13:11 -07:00
Krish Dholakia
6d9f0f1839
Merge branch 'main' into litellm_ssl_caching_fix 2024-04-19 17:20:27 -07:00
Krish Dholakia
cc7f27252b
Merge pull request #3179 from BerriAI/litellm_dual_caching_fix
fix(caching.py): dual cache async_batch_get_cache fix + testing
2024-04-19 17:02:47 -07:00
Ishaan Jaff
788394b1ec fix - logging proxy base url to langfuse 2024-04-19 16:20:46 -07:00
Krrish Dholakia
0d9c96bebf test(test_prometheus_services.py): fix testing to handle caching ping in init 2024-04-19 16:15:29 -07:00
Ishaan Jaff
c59051895b fix - allow users to opt into langfuse default tags 2024-04-19 16:05:31 -07:00
Ishaan Jaff
c0befe9bbb feat - log base_url to langfuse as a tag 2024-04-19 15:48:00 -07:00
Ishaan Jaff
48e5de4f0e
Merge pull request #3175 from BerriAI/litellm_log_team_alias_langfuse
[FEAT] Log team alias to langfuse
2024-04-19 15:30:59 -07:00
Krrish Dholakia
01a1a8f731 fix(caching.py): dual cache async_batch_get_cache fix + testing
this fixes a bug in usage-based-routing-v2 which was caused b/c of how the result was being returned from dual cache async_batch_get_cache. it also adds unit testing for that function (and it's sync equivalent)
2024-04-19 15:03:25 -07:00
Ishaan Jaff
3167c9da9f fix - use user_api_key_team_alias as the default tag on langfuse 2024-04-19 14:43:41 -07:00
Ishaan Jaff
410d1f2d2c langfuse - log team alias 2024-04-19 14:26:58 -07:00
Krrish Dholakia
08cf77623b fix(caching.py): remove url parsing logic - causing redis ssl connections to fail
this reverts a change that was causing redis url w/ ssl to fail. this also adds unit testing for this sc
enario, to prevent future regressions
2024-04-19 14:01:13 -07:00
Ishaan Jaff
b2bdc99474
Merge pull request #2961 from Manouchehri/langfuse-add-region-2937
(feat) Langfuse - Add location logging, and add cache_hit to metadata.
2024-04-19 10:06:34 -07:00
Krrish Dholakia
81573b2dd9 fix(test_lowest_tpm_rpm_routing_v2.py): unit testing for usage-based-routing-v2 2024-04-18 21:38:00 -07:00
Krrish Dholakia
0f95a824c4 feat(prometheus_services.py): emit proxy latency for successful llm api requests
uses prometheus histogram for this
2024-04-18 16:04:35 -07:00
Krrish Dholakia
d8d78bfb4f fix(prometheus.py): fix metric name to be more accurate
change metric name from litellm_failed_requests_metric to litellm_llm_api_failed_requests_metric
2024-04-18 12:30:44 -07:00
Krrish Dholakia
28edb77350 fix(utils.py): support prometheus failed call metrics 2024-04-18 12:29:15 -07:00