Commit graph

1041 commits

Author SHA1 Message Date
Krrish Dholakia
4b64e506f4 fix(proxy_server.py): fix argon import 2024-03-11 11:33:34 -07:00
Krish Dholakia
c7d0af0a2e
Merge pull request #2426 from BerriAI/litellm_whisper_cost_tracking
feat: add cost tracking + caching for `/audio/transcription` calls
2024-03-09 19:12:06 -08:00
Krish Dholakia
c022568a3a
Merge branch 'main' into litellm_faster_api_key_checking 2024-03-09 18:45:03 -08:00
Krrish Dholakia
7a29fe9525 fix(proxy_server.py): check if master key is str before hashing 2024-03-09 16:51:11 -08:00
Krrish Dholakia
03f0c968f9 fix(proxy_server.py): fix argon exceptions 2024-03-09 16:16:40 -08:00
Krrish Dholakia
fa45c569fd feat: add cost tracking + caching for transcription calls 2024-03-09 15:43:38 -08:00
Krish Dholakia
caa99f43bf
Merge branch 'main' into litellm_load_balancing_transcription_endpoints 2024-03-08 23:08:47 -08:00
Krrish Dholakia
5ffbcf79d3 fix(proxy_server.py): fix argon cfi checking 2024-03-08 22:01:44 -08:00
Ishaan Jaff
8036b48f14
Merge pull request #2408 from BerriAI/litellm_no_store_reqs
[FEAT-liteLLM Proxy] Incognito Requests -  Don't log anything
2024-03-08 21:11:43 -08:00
Krrish Dholakia
0fb7afe820 feat(proxy_server.py): working /audio/transcription endpoint 2024-03-08 18:20:27 -08:00
ishaan-jaff
d6dc28f0ed (fix) proxy setting success callbacks 2024-03-08 16:27:53 -08:00
Krrish Dholakia
cc0294b2f2 fix(proxy_server.py): fix tagging of endpoints 2024-03-08 14:29:31 -08:00
Krrish Dholakia
8c6d5b7f16 feat(proxy_server.py): supports /audio/transcription endpoint on proxy 2024-03-08 14:28:07 -08:00
ishaan-jaff
2aafbe390b (feat) read passed api_version 2024-03-08 13:16:12 -08:00
Krrish Dholakia
0cf056f493 fix(proxy_server.py): use argon2 for faster api key checking
0.04s latency boost on load test
2024-03-07 21:48:18 -08:00
Krrish Dholakia
dd78a1956a fix(proxy_server.py): fix model alias map + add back testing 2024-03-07 07:56:51 -08:00
Krrish Dholakia
a8bc10170a fix(proxy_server.py): support cost tracking if general_settings is none
works if database_url is in env
2024-03-06 21:27:41 -08:00
Krish Dholakia
38612ddd34
Merge pull request #2377 from BerriAI/litellm_team_level_model_groups
feat(proxy_server.py): team based model aliases
2024-03-06 21:03:53 -08:00
Krrish Dholakia
d1d8adfb11 fix(proxy_server.py): fix sql query 2024-03-06 19:41:12 -08:00
Krish Dholakia
cb8b30970b
Merge pull request #2347 from BerriAI/litellm_retry_rate_limited_requests
feat(proxy_server.py): retry if virtual key is rate limited
2024-03-06 19:23:11 -08:00
Krrish Dholakia
ca97ea8acd feat(proxy_server.py): team based model aliases
allow setting model aliases at a team level (e.g. route all 'gpt-3.5-turbo' requests from team-1 to model-deployment-group-2)
2024-03-06 17:42:08 -08:00
ishaan-jaff
6aadfb3472 (fix) remove unuse endpoint 2024-03-06 15:40:22 -08:00
ishaan-jaff
ac5137f442 (fix) admin UI swagger 2024-03-06 14:01:39 -08:00
ishaan-jaff
c4e7c45c3a (fix) update team_id 2024-03-05 19:09:19 -08:00
ishaan-jaff
92753f558c (fix) _update_team_db 2024-03-05 19:03:27 -08:00
Krrish Dholakia
ad55f4dbb5 feat(proxy_server.py): retry if virtual key is rate limited
currently for chat completions
2024-03-05 19:00:03 -08:00
ishaan-jaff
3f7bf5c6b1 (fix) fix batch update user db 2024-03-05 16:46:58 -08:00
ishaan-jaff
3a61df9995 (feat) show /model/metrics on Admin UI 2024-03-04 16:25:35 -08:00
ishaan-jaff
2479712807 (feat) show model metrics on admin panel 2024-03-04 13:44:13 -08:00
Krrish Dholakia
873ddde924 fix(huggingface_restapi.py): fix huggingface streaming error raising 2024-03-04 09:32:41 -08:00
Krrish Dholakia
37ad5efc61 fix(proxy/utils.py): fix resetting budget logic 2024-03-02 20:52:54 -08:00
Krrish Dholakia
f5f12d204e fix(proxy_server.py): fix track cost callback 2024-03-02 19:52:45 -08:00
Krish Dholakia
530b454ff4
Merge branch 'main' into litellm_slack_budget_alerting 2024-03-02 19:13:57 -08:00
Krrish Dholakia
679c47d196 fix(proxy_server.py): fix budget creation 2024-03-02 19:11:37 -08:00
Krish Dholakia
ce84bce8e6
Merge pull request #2300 from BerriAI/litellm_organization_table
feat(proxy_server.py): enable `/organizations/new`, `/organization/info` and `/budget/info` endpoints
2024-03-02 18:37:36 -08:00
Krrish Dholakia
ac085a4643 fix(proxy_server.py): actual implementation of slack soft budget alerting 2024-03-02 18:34:18 -08:00
ishaan-jaff
aa565dc990 (feat) improve test slack alert 2024-03-02 17:33:31 -08:00
ishaan-jaff
7e94fcc9a8 (fix) prisma test 2024-03-02 17:15:27 -08:00
ishaan-jaff
9bc1f5f664 (proxy) test budgets 2024-03-02 16:57:35 -08:00
Ishaan Jaff
9fea8e7152
Merge branch 'main' into litellm_test_slack_alerts 2024-03-02 16:49:12 -08:00
ishaan-jaff
ebaf2eef1f (feat) improve error for testing slack 2024-03-02 16:46:20 -08:00
Krish Dholakia
eaccbf26b7
Merge branch 'main' into litellm_organization_table 2024-03-02 16:09:28 -08:00
Krrish Dholakia
b30cbd0d55 refactor(proxy_server.py): format the message for slack budget alerts 2024-03-02 16:04:36 -08:00
Krrish Dholakia
cbd0851257 fix(proxy_server.py): raise 422 error if no slack connection setup when calling /health/services 2024-03-02 15:56:42 -08:00
Krrish Dholakia
1ef19fbc9c feat: enable user to test slack budget alerting when creating a key 2024-03-02 15:54:46 -08:00
Krrish Dholakia
8bb6897b46 feat(proxy_server.py): exposes /organization/info and /budget/info endpoints 2024-03-02 15:07:33 -08:00
ishaan-jaff
1bb8263c92 (feat) set soft_budget with /key/generate 2024-03-02 14:43:01 -08:00
Krrish Dholakia
8d22ed762e fix(proxy_server.py): enable admin to create new budget if none set for org 2024-03-02 14:38:42 -08:00
ishaan-jaff
fd9f8b7010 (docs) setting soft budgets 2024-03-02 13:05:00 -08:00
ishaan-jaff
eb4f90115d (feat) create soft budget 2024-03-02 12:52:09 -08:00