Krish Dholakia
ca4e746545
LiteLLM minor fixes + improvements (31/08/2024) ( #5464 )
...
* fix(vertex_endpoints.py): fix vertex ai pass through endpoints
* test(test_streaming.py): skip model due to end of life
* feat(custom_logger.py): add special callback for model hitting tpm/rpm limits
Closes https://github.com/BerriAI/litellm/issues/4096
2024-09-01 13:31:42 -07:00
Ishaan Jaff
3fae5eb94e
feat prometheus add metric for failure / model
2024-08-31 10:05:23 -07:00
Ishaan Jaff
c60125d7be
add gcs bucket base
2024-08-30 10:41:39 -07:00
Krish Dholakia
fe2a3c02e5
- merge - fix TypeError: 'CompletionUsage' object is not subscriptable #5441 ( #5448 )
...
* fix TypeError: 'CompletionUsage' object is not subscriptable (#5441 )
* test(test_team_logging.py): mark flaky test
---------
Co-authored-by: yafei lee <yafei@dao42.com>
2024-08-30 08:54:42 -07:00
Ishaan Jaff
fddf10eeb8
prometheus - safe update start / end time
2024-08-28 16:13:56 -07:00
Ishaan Jaff
359a003ac8
v0 add rerank on litellm proxy
2024-08-27 17:28:39 -07:00
Ishaan Jaff
a8e192a868
fix use guardrail for pre call hook
2024-08-23 09:34:08 -07:00
Ishaan Jaff
be853d93da
fix prom latency metrics
2024-08-23 06:59:19 -07:00
Ishaan Jaff
9476582fb7
update prometheus metric names
2024-08-22 14:03:00 -07:00
Ishaan Jaff
c719c375f7
track litellm_request_latency_metric
2024-08-22 13:58:10 -07:00
Ishaan Jaff
0ccb1c17f7
fix init correct prometheus metrics
2024-08-22 13:29:35 -07:00
Krish Dholakia
41835d9397
Merge pull request #5323 from MarkRx/feature/langsmith-ids
...
Support LangSmith parent_run_id, trace_id, session_id
2024-08-21 15:38:50 -07:00
MarkRx
58529e2c9c
Support LangSmith parent_run_id, trace_id, session_id
2024-08-21 16:09:30 -04:00
Ishaan Jaff
cdbd245c3d
working lakera ai during call hook
2024-08-20 14:39:04 -07:00
Ishaan Jaff
2f618d08be
fix _get_spend_report_for_time_range
2024-08-19 20:53:39 -07:00
Ishaan Jaff
319690ab5e
feat - guardrails v2
2024-08-19 18:24:20 -07:00
Ishaan Jaff
b4bca8db82
feat - allow accessing data post success call
2024-08-19 11:35:33 -07:00
Ishaan Jaff
db8f789318
Merge pull request #5259 from BerriAI/litellm_return_remaining_tokens_in_header
...
[Feat] return `x-litellm-key-remaining-requests-{model}`: 1, `x-litellm-key-remaining-tokens-{model}`: None in response headers
2024-08-17 12:41:16 -07:00
Ishaan Jaff
a62277a6aa
feat - use common helper for getting model group
2024-08-17 10:46:04 -07:00
Ishaan Jaff
2dd098f384
show correct metric
2024-08-17 10:12:23 -07:00
Ishaan Jaff
03196742d2
add litellm-key-remaining-tokens on prometheus
2024-08-17 10:02:20 -07:00
Krrish Dholakia
2874b94fb1
refactor: replace .error() with .exception() logging for better debugging on sentry
2024-08-16 09:22:47 -07:00
Krish Dholakia
ca07898fbb
Merge pull request #5235 from BerriAI/litellm_fix_s3_logs
...
fix(s3.py): fix s3 logging payload to have valid json values
2024-08-15 23:00:18 -07:00
Krrish Dholakia
b08492bc29
fix(s3.py): fix s3 logging payload to have valid json values
...
Previously pydantic objects were being stringified, making them unparsable
2024-08-15 17:09:02 -07:00
Ishaan Jaff
17857ecc2b
litellm always log cache_key on hits/misses
2024-08-15 09:59:58 -07:00
Ishaan Jaff
539fcad7e7
fix langfuse log_provider_specific_information_as_span
2024-08-14 17:54:18 -07:00
Ishaan Jaff
794055660d
Merge pull request #5202 from BerriAI/litellm_prom_prefix_litellm
...
[Fix] Prometheus use 'litellm_' prefix for new deployment metrics
2024-08-14 09:50:36 -07:00
Ishaan Jaff
38868a0a45
use litellm_ prefix for new deployment metrics
2024-08-14 09:08:14 -07:00
Ishaan Jaff
e1c70a6954
log failure calls on gcs + testing
2024-08-14 08:55:51 -07:00
Ishaan Jaff
da61511a8e
feat log fail events on gcs
2024-08-14 08:39:16 -07:00
Krrish Dholakia
a181af3d2e
fix(langsmith.py): support langsmith 'extra' field object
...
Closes https://github.com/BerriAI/litellm/issues/5179
2024-08-13 15:20:50 -07:00
Ishaan Jaff
e5ccd3bdaf
allow using langfuse_default_tags
2024-08-13 12:26:37 -07:00
Ishaan Jaff
c5515513a9
feat allow controlling logged tags on langfuse
2024-08-13 12:24:01 -07:00
Ishaan Jaff
f043de98e3
feat log responses in folders
2024-08-12 16:28:12 -07:00
Ishaan Jaff
eaf338aa5f
feat gcs log user api key metadata
2024-08-12 16:06:10 -07:00
Ishaan Jaff
b0629e5947
Merge pull request #5166 from BerriAI/litellm_log_key_created_slack
...
[Feat-Security] Send Slack Alert when CRUD ops done on Virtual Keys, Teams, Internal Users
2024-08-12 12:18:04 -07:00
Ishaan Jaff
0f7d575992
send alert on all key events
2024-08-12 11:39:24 -07:00
Krrish Dholakia
3fa00408f1
fix(proxy_server.py): add info log when spend logs is skipped because disable_spend_logs=True
.
2024-08-12 11:20:30 -07:00
Ishaan Jaff
2e2da04024
v0 log KeyCreatedEvent
2024-08-12 10:56:11 -07:00
Ishaan Jaff
7059615fb1
allow setting PROMETHEUS_SELECTED_INSTANCE
2024-08-10 17:31:05 -07:00
Ishaan Jaff
00443aa0f9
Merge pull request #5154 from BerriAI/litellm_send_prometheus_fallbacks_from_slack
...
[Feat-Proxy] send prometheus fallbacks stats to slack
2024-08-10 17:14:01 -07:00
Ishaan Jaff
ecec37e220
doc new prometheus metrics
2024-08-10 17:13:36 -07:00
Ishaan Jaff
fe26412f03
feat - use api to get prometheus api metrics
2024-08-10 16:36:06 -07:00
Ishaan Jaff
f324ca7b86
add fallback_reports in slack alert types
2024-08-10 16:08:36 -07:00
Ishaan Jaff
328a78a0f8
feat add prometheus api to get data from endpoint
2024-08-10 16:07:08 -07:00
Ishaan Jaff
3ecf4db741
prometheus log_success_fallback_event
2024-08-10 14:05:18 -07:00
Ishaan Jaff
3ca503cc8c
v0 add helper for logging success/fail fallback events
2024-08-10 13:26:39 -07:00
Ishaan Jaff
5765baa5b2
feat - track latency per llm deployment
2024-08-10 12:53:56 -07:00
Ishaan Jaff
e086479fd7
track llm_deployment_success_responses
2024-08-10 10:05:33 -07:00
Ishaan Jaff
c3c570ac7e
feat - refactor prometheus metrics
2024-08-10 09:14:38 -07:00