ishaan-jaff
|
b59e472fcd
|
(fix) patch dynamoDB team_model_alias bug
|
2024-03-12 11:17:12 -07:00 |
|
Krrish Dholakia
|
60219254a9
|
fix(proxy_server.py): fix /user/info for non-existent user id
|
2024-03-12 11:05:41 -07:00 |
|
Krrish Dholakia
|
5b5d3b10df
|
test(test_key_generate_prisma.py): fix test to only let admin delete a key
|
2024-03-11 14:18:01 -07:00 |
|
Krrish Dholakia
|
366aaf744e
|
fix(proxy_server.py): bug fix
|
2024-03-11 13:54:58 -07:00 |
|
Krrish Dholakia
|
1dcaba83d2
|
fix(proxy_server.py): prevent user from deleting non-user owned keys when they use ui
|
2024-03-11 12:13:30 -07:00 |
|
Krrish Dholakia
|
4b67e2ce10
|
fix(proxy_server.py): fix argon import
|
2024-03-11 11:33:34 -07:00 |
|
Krish Dholakia
|
774ceb741c
|
Merge pull request #2426 from BerriAI/litellm_whisper_cost_tracking
feat: add cost tracking + caching for `/audio/transcription` calls
|
2024-03-09 19:12:06 -08:00 |
|
Krish Dholakia
|
ba9d96e65e
|
Merge branch 'main' into litellm_faster_api_key_checking
|
2024-03-09 18:45:03 -08:00 |
|
Krrish Dholakia
|
ca496a8892
|
fix(proxy_server.py): check if master key is str before hashing
|
2024-03-09 16:51:11 -08:00 |
|
Krrish Dholakia
|
eda9f2ba81
|
fix(proxy_server.py): fix argon exceptions
|
2024-03-09 16:16:40 -08:00 |
|
Krrish Dholakia
|
b2ce963498
|
feat: add cost tracking + caching for transcription calls
|
2024-03-09 15:43:38 -08:00 |
|
Krish Dholakia
|
f461352908
|
Merge branch 'main' into litellm_load_balancing_transcription_endpoints
|
2024-03-08 23:08:47 -08:00 |
|
Krrish Dholakia
|
f9233fa5b9
|
fix(proxy_server.py): fix argon cfi checking
|
2024-03-08 22:01:44 -08:00 |
|
Ishaan Jaff
|
0f19e35f22
|
Merge pull request #2408 from BerriAI/litellm_no_store_reqs
[FEAT-liteLLM Proxy] Incognito Requests - Don't log anything
|
2024-03-08 21:11:43 -08:00 |
|
Krrish Dholakia
|
93615682fe
|
feat(proxy_server.py): working /audio/transcription endpoint
|
2024-03-08 18:20:27 -08:00 |
|
ishaan-jaff
|
8b5d255bee
|
(fix) proxy setting success callbacks
|
2024-03-08 16:27:53 -08:00 |
|
Krrish Dholakia
|
8c6a4df3d4
|
fix(proxy_server.py): fix tagging of endpoints
|
2024-03-08 14:29:31 -08:00 |
|
Krrish Dholakia
|
29d0fcf34a
|
feat(proxy_server.py): supports /audio/transcription endpoint on proxy
|
2024-03-08 14:28:07 -08:00 |
|
ishaan-jaff
|
b316b49c32
|
(feat) read passed api_version
|
2024-03-08 13:16:12 -08:00 |
|
Krrish Dholakia
|
b81c94e94f
|
fix(proxy_server.py): use argon2 for faster api key checking
0.04s latency boost on load test
|
2024-03-07 21:48:18 -08:00 |
|
Krrish Dholakia
|
3c414c6357
|
fix(proxy_server.py): fix model alias map + add back testing
|
2024-03-07 07:56:51 -08:00 |
|
Krish Dholakia
|
ede9647e49
|
Merge pull request #2377 from BerriAI/litellm_team_level_model_groups
feat(proxy_server.py): team based model aliases
|
2024-03-06 21:03:53 -08:00 |
|
Krrish Dholakia
|
df0eb170e6
|
fix(proxy_server.py): fix sql query
|
2024-03-06 19:41:12 -08:00 |
|
Krish Dholakia
|
050a056e09
|
Merge pull request #2347 from BerriAI/litellm_retry_rate_limited_requests
feat(proxy_server.py): retry if virtual key is rate limited
|
2024-03-06 19:23:11 -08:00 |
|
Krrish Dholakia
|
7bfadc258e
|
feat(proxy_server.py): team based model aliases
allow setting model aliases at a team level (e.g. route all 'gpt-3.5-turbo' requests from team-1 to model-deployment-group-2)
|
2024-03-06 17:42:08 -08:00 |
|
ishaan-jaff
|
b6f3eb1434
|
(fix) remove unused endpoint
|
2024-03-06 15:40:22 -08:00 |
|
ishaan-jaff
|
b0575bdcf0
|
(fix) admin UI swagger
|
2024-03-06 14:01:39 -08:00 |
|
ishaan-jaff
|
086de85625
|
(fix) update team_id
|
2024-03-05 19:09:19 -08:00 |
|
ishaan-jaff
|
3ca856fe39
|
(fix) _update_team_db
|
2024-03-05 19:03:27 -08:00 |
|
Krrish Dholakia
|
c963e2761b
|
feat(proxy_server.py): retry if virtual key is rate limited
currently for chat completions
|
2024-03-05 19:00:03 -08:00 |
|
ishaan-jaff
|
a513a05a7f
|
(fix) fix batch update user db
|
2024-03-05 16:46:58 -08:00 |
|
ishaan-jaff
|
7a72aa8796
|
(feat) show /model/metrics on Admin UI
|
2024-03-04 16:25:35 -08:00 |
|
ishaan-jaff
|
d1a3899882
|
(feat) show model metrics on admin panel
|
2024-03-04 13:44:13 -08:00 |
|
Krrish Dholakia
|
a1ce24c5f0
|
fix(huggingface_restapi.py): fix huggingface streaming error raising
|
2024-03-04 09:32:41 -08:00 |
|
Krrish Dholakia
|
82bced7183
|
fix(proxy/utils.py): fix resetting budget logic
|
2024-03-02 20:52:54 -08:00 |
|
Krrish Dholakia
|
49ed001cdf
|
fix(proxy_server.py): fix track cost callback
|
2024-03-02 19:52:45 -08:00 |
|
Krish Dholakia
|
cbeb65a442
|
Merge branch 'main' into litellm_slack_budget_alerting
|
2024-03-02 19:13:57 -08:00 |
|
Krrish Dholakia
|
6126deb7dd
|
fix(proxy_server.py): fix budget creation
|
2024-03-02 19:11:37 -08:00 |
|
Krish Dholakia
|
46ec94ec3c
|
Merge pull request #2300 from BerriAI/litellm_organization_table
feat(proxy_server.py): enable `/organizations/new`, `/organization/info` and `/budget/info` endpoints
|
2024-03-02 18:37:36 -08:00 |
|
Krrish Dholakia
|
5e37a24a36
|
fix(proxy_server.py): actual implementation of slack soft budget alerting
|
2024-03-02 18:34:18 -08:00 |
|
ishaan-jaff
|
32aa2b0168
|
(feat) improve test slack alert
|
2024-03-02 17:33:31 -08:00 |
|
ishaan-jaff
|
31108d3cde
|
(fix) prisma test
|
2024-03-02 17:15:27 -08:00 |
|
ishaan-jaff
|
c64775302f
|
(proxy) test budgets
|
2024-03-02 16:57:35 -08:00 |
|
Ishaan Jaff
|
76c07a41e7
|
Merge branch 'main' into litellm_test_slack_alerts
|
2024-03-02 16:49:12 -08:00 |
|
ishaan-jaff
|
8cb342ab6a
|
(feat) improve error for testing slack
|
2024-03-02 16:46:20 -08:00 |
|
Krish Dholakia
|
84e45b385d
|
Merge branch 'main' into litellm_organization_table
|
2024-03-02 16:09:28 -08:00 |
|
Krrish Dholakia
|
52efd985ed
|
refactor(proxy_server.py): format the message for slack budget alerts
|
2024-03-02 16:04:36 -08:00 |
|
Krrish Dholakia
|
1bba73bb6c
|
fix(proxy_server.py): raise 422 error if no slack connection setup when calling /health/services
|
2024-03-02 15:56:42 -08:00 |
|
Krrish Dholakia
|
6d87c9bb7d
|
feat: enable user to test slack budget alerting when creating a key
|
2024-03-02 15:54:46 -08:00 |
|
Krrish Dholakia
|
56f1d5454b
|
feat(proxy_server.py): exposes /organization/info and /budget/info endpoints
|
2024-03-02 15:07:33 -08:00 |
|