Krrish Dholakia
01a1a8f731
fix(caching.py): dual cache async_batch_get_cache fix + testing
...
this fixes a bug in usage-based-routing-v2 which was caused b/c of how the result was being returned from dual cache async_batch_get_cache. it also adds unit testing for that function (and it's sync equivalent)
2024-04-19 15:03:25 -07:00
Krrish Dholakia
3c6b6355c7
fix(ollama_chat.py): accept api key as a param for ollama calls
...
allows user to call hosted ollama endpoint using bearer token for auth
2024-04-19 13:02:13 -07:00
Ishaan Jaff
b2bdc99474
Merge pull request #2961 from Manouchehri/langfuse-add-region-2937
...
(feat) Langfuse - Add location logging, and add cache_hit to metadata.
2024-04-19 10:06:34 -07:00
Krrish Dholakia
717abfef98
build(interpret_load_test.py): add hosted proxy info on release notes
2024-04-18 22:40:10 -07:00
Krrish Dholakia
d10743a609
bump: version 1.35.14 → 1.35.15
2024-04-18 22:17:33 -07:00
Krrish Dholakia
9e91541b8a
refactor(main.py): trigger new build
2024-04-18 22:17:19 -07:00
Krish Dholakia
f1340b52dc
Merge pull request #3153 from BerriAI/litellm_usage_based_routing_v2_improvements
...
usage based routing v2 improvements - unit testing + *NEW* async + sync 'pre_call_checks'
2024-04-18 22:16:16 -07:00
Krrish Dholakia
9c42c847a5
fix(router.py): instrument pre-call-checks for all openai endpoints
2024-04-18 21:54:25 -07:00
Krrish Dholakia
3b9e2a58e2
fix(lowest_tpm_rpm_v2.py): ensure backwards compatibility for python 3.8
2024-04-18 21:42:35 -07:00
Krrish Dholakia
81573b2dd9
fix(test_lowest_tpm_rpm_routing_v2.py): unit testing for usage-based-routing-v2
2024-04-18 21:38:00 -07:00
Ishaan Jaff
c903b44849
(ci/cd) vertexai - skip 429 errors
2024-04-18 21:27:22 -07:00
Ishaan Jaff
cb053398eb
(ci/cd) run again
2024-04-18 21:04:07 -07:00
Ishaan Jaff
c720c8ffe9
Merge pull request #3152 from BerriAI/litellm_show_api_base_hanging_req
...
[Fix] show api base hanging request alerts
2024-04-18 21:02:30 -07:00
Ishaan Jaff
6f948cd559
fix - show api_base in hanging requests
2024-04-18 21:01:26 -07:00
Ishaan Jaff
67d356b933
fix - show api base on hanging requests
2024-04-18 20:58:02 -07:00
Ishaan Jaff
f04604910b
fix - show api base on hanging requests
2024-04-18 20:57:22 -07:00
Ishaan Jaff
8c830e031d
(ci/cd) run again
2024-04-18 20:35:21 -07:00
Ishaan Jaff
7d2fb6da04
bump: version 1.35.13 → 1.35.14
2024-04-18 20:23:58 -07:00
Ishaan Jaff
5b55900d8e
fix - view spend per user
2024-04-18 20:23:04 -07:00
Ishaan Jaff
e95e7fde65
ui - new build
2024-04-18 20:18:50 -07:00
Ishaan Jaff
0ed5cfc9de
Merge pull request #3151 from BerriAI/litellm_allow_set_get_alerts
...
[UI] View all alert types
2024-04-18 20:12:54 -07:00
Ishaan Jaff
9073960c00
Merge pull request #3148 from BerriAI/ui_show_keys_in_users_page
...
ui - show key_aliases on `Users` Tab
2024-04-18 20:11:47 -07:00
Ishaan Jaff
01b1136631
fix - GetLLMProvider excepton error raise
2024-04-18 20:10:37 -07:00
Ishaan Jaff
554c83fdaf
ui - show all alert types when getting all callbacks
2024-04-18 20:08:13 -07:00
Ishaan Jaff
48e53984c0
ui - show all alert types
2024-04-18 20:04:34 -07:00
Krrish Dholakia
a05f148c17
fix(tpm_rpm_routing_v2.py): fix tpm rpm routing
2024-04-18 20:01:22 -07:00
Ishaan Jaff
825622d120
Merge pull request #3149 from BerriAI/ui_clean_up_models
...
[UI] - Models Page - Place litellm params in an accordion
2024-04-18 19:49:00 -07:00
Ishaan Jaff
dd92c94788
ui - place litellm params in an accordion
2024-04-18 19:22:34 -07:00
Krish Dholakia
77a353d484
Merge pull request #3144 from BerriAI/litellm_prometheus_latency_tracking
...
feat(prometheus_services.py): emit proxy latency for successful llm api requests
2024-04-18 19:10:58 -07:00
Ishaan Jaff
b5e8557fa4
ui - show keys user owns on ui
2024-04-18 18:27:41 -07:00
Ishaan Jaff
0569974a3a
Merge pull request #3146 from BerriAI/litellm_improve_users_pg
...
[UI]- show user_emails in Users Tab
2024-04-18 17:34:48 -07:00
Ishaan Jaff
d9091dcf97
fix order by spend
2024-04-18 17:33:38 -07:00
Krrish Dholakia
f65005f273
fix(caching.py): fix error logging
2024-04-18 17:21:10 -07:00
Ishaan Jaff
b669e2987b
fix return key aliases on /user/info
2024-04-18 17:16:52 -07:00
Krrish Dholakia
cf1cc71b87
fix(factory.py): add mistral config to known tokenizer configs
2024-04-18 17:11:52 -07:00
Krrish Dholakia
0afb5d6573
fix(caching.py): add call type to service logger object
2024-04-18 16:53:34 -07:00
Ishaan Jaff
8f7bf3ad52
UI - show user_emails
2024-04-18 16:41:06 -07:00
Krrish Dholakia
919a2876f1
fix(proxy/utils.py): add prometheus failed db request tracking
2024-04-18 16:30:29 -07:00
Krrish Dholakia
d61250109e
fix(proxy/utils.py): add call type and duration to proxy_logging failure calls
...
this is for tracking failed db requests on prometheus
2024-04-18 16:24:36 -07:00
Ishaan Jaff
eb04a929e6
Merge pull request #3112 from BerriAI/litellm_add_alert_types
...
[Feat] Allow user to select slack alert types to Opt In to
2024-04-18 16:21:33 -07:00
Ishaan Jaff
92630cd9da
Merge pull request #3136 from Manouchehri/cohere-chat-seed-1
...
(feat) - Add seed to Cohere Chat.
2024-04-18 16:20:48 -07:00
Ishaan Jaff
8958bbeac9
Merge pull request #3142 from BerriAI/litellm_slack_alerting_show_model_passed
...
[Fix] Show `model` passed on `"400: {'error': 'Invalid model name passed in mode` errors 👻
2024-04-18 16:18:17 -07:00
Krrish Dholakia
48d3b563d8
docs(prometheus.md): add litellm service metrics to prometheus docs
2024-04-18 16:06:12 -07:00
Krrish Dholakia
0f95a824c4
feat(prometheus_services.py): emit proxy latency for successful llm api requests
...
uses prometheus histogram for this
2024-04-18 16:04:35 -07:00
Ishaan Jaff
6cfa80c42a
Merge pull request #3140 from BerriAI/litellm_trim_messages
...
[Fix] Slack Alerting - trim messages to first 100 chars
2024-04-18 16:01:39 -07:00
Ishaan Jaff
1031ffdae5
test - invalid model name passed test
2024-04-18 15:46:49 -07:00
Ishaan Jaff
b308f8c079
fix - show model passed in on Invalid model name passed in error
2024-04-18 15:43:30 -07:00
Ishaan Jaff
e20b05d6dd
fix trim messages to first 100 chars
2024-04-18 15:21:31 -07:00
David Manouchehri
f65c02d43a
(feat) - Add seed to Cohere Chat.
2024-04-18 20:57:06 +00:00
Ishaan Jaff
f610061a79
Merge pull request #3130 from BerriAI/litellm_show_vertex_project_exceptions
...
[FIX] - show vertex_project, vertex_location in Vertex AI exceptions
2024-04-18 13:18:20 -07:00