Krrish Dholakia
|
4b64e506f4
|
fix(proxy_server.py): fix argon import
|
2024-03-11 11:33:34 -07:00 |
|
Krish Dholakia
|
c7d0af0a2e
|
Merge pull request #2426 from BerriAI/litellm_whisper_cost_tracking
feat: add cost tracking + caching for `/audio/transcription` calls
|
2024-03-09 19:12:06 -08:00 |
|
Krish Dholakia
|
c022568a3a
|
Merge branch 'main' into litellm_faster_api_key_checking
|
2024-03-09 18:45:03 -08:00 |
|
Krrish Dholakia
|
7a29fe9525
|
fix(proxy_server.py): check if master key is str before hashing
|
2024-03-09 16:51:11 -08:00 |
|
Ishaan Jaff
|
a1784284bb
|
Merge pull request #2416 from BerriAI/litellm_use_consistent_port
(docs) LiteLLM Proxy - use port 4000 in examples
|
2024-03-09 16:32:08 -08:00 |
|
Krrish Dholakia
|
03f0c968f9
|
fix(proxy_server.py): fix argon exceptions
|
2024-03-09 16:16:40 -08:00 |
|
Krrish Dholakia
|
fa45c569fd
|
feat: add cost tracking + caching for transcription calls
|
2024-03-09 15:43:38 -08:00 |
|
Krrish Dholakia
|
4b60bea975
|
fix(proxy/utils.py): add more logging for prisma client get_data error
|
2024-03-09 13:06:30 -08:00 |
|
Krish Dholakia
|
caa99f43bf
|
Merge branch 'main' into litellm_load_balancing_transcription_endpoints
|
2024-03-08 23:08:47 -08:00 |
|
Krrish Dholakia
|
5ffbcf79d3
|
fix(proxy_server.py): fix argon cfi checking
|
2024-03-08 22:01:44 -08:00 |
|
ishaan-jaff
|
ea6f42216c
|
(docs) use port 4000
|
2024-03-08 21:59:00 -08:00 |
|
Ishaan Jaff
|
8036b48f14
|
Merge pull request #2408 from BerriAI/litellm_no_store_reqs
[FEAT-liteLLM Proxy] Incognito Requests - Don't log anything
|
2024-03-08 21:11:43 -08:00 |
|
Ishaan Jaff
|
0fc7b273e5
|
Merge pull request #2403 from BerriAI/litellm_api_version_client_side
[FEAT] AzureOpenAI - Pass `api_version` to litellm per request
|
2024-03-08 20:33:45 -08:00 |
|
Krrish Dholakia
|
0fb7afe820
|
feat(proxy_server.py): working /audio/transcription endpoint
|
2024-03-08 18:20:27 -08:00 |
|
ishaan-jaff
|
d6dc28f0ed
|
(fix) proxy setting success callbacks
|
2024-03-08 16:27:53 -08:00 |
|
ishaan-jaff
|
9ed51e791b
|
(fix) default num workers
|
2024-03-08 15:24:08 -08:00 |
|
Krrish Dholakia
|
cc0294b2f2
|
fix(proxy_server.py): fix tagging of endpoints
|
2024-03-08 14:29:31 -08:00 |
|
Krrish Dholakia
|
8c6d5b7f16
|
feat(proxy_server.py): supports /audio/transcription endpoint on proxy
|
2024-03-08 14:28:07 -08:00 |
|
ishaan-jaff
|
2aafbe390b
|
(feat) read passed api_version
|
2024-03-08 13:16:12 -08:00 |
|
Krrish Dholakia
|
0cf056f493
|
fix(proxy_server.py): use argon2 for faster api key checking
0.04s latency boost on load test
|
2024-03-07 21:48:18 -08:00 |
|
Ishaan Jaff
|
90d7b61ec2
|
Merge pull request #2393 from BerriAI/litellm_router_debug_get_dep
[Feat] Load Balancing - View Metrics about selected deployments in server logs
|
2024-03-07 20:44:18 -08:00 |
|
ishaan-jaff
|
6f0faca85b
|
(feat) print debug info per deployment
|
2024-03-07 18:33:09 -08:00 |
|
ishaan-jaff
|
8c3ec15856
|
(fix) improve improve prisma alerting/debug
|
2024-03-07 15:44:03 -08:00 |
|
ishaan-jaff
|
5217e800c4
|
(fix) better prisma debug logs
|
2024-03-07 14:04:02 -08:00 |
|
ishaan-jaff
|
b64e4c3e6f
|
(feat) better debugging when reading prisma
|
2024-03-07 13:42:54 -08:00 |
|
Krrish Dholakia
|
dd78a1956a
|
fix(proxy_server.py): fix model alias map + add back testing
|
2024-03-07 07:56:51 -08:00 |
|
Krish Dholakia
|
38612ddd34
|
Merge pull request #2377 from BerriAI/litellm_team_level_model_groups
feat(proxy_server.py): team based model aliases
|
2024-03-06 21:03:53 -08:00 |
|
Krrish Dholakia
|
d1d8adfb11
|
fix(proxy_server.py): fix sql query
|
2024-03-06 19:41:12 -08:00 |
|
Krish Dholakia
|
cb8b30970b
|
Merge pull request #2347 from BerriAI/litellm_retry_rate_limited_requests
feat(proxy_server.py): retry if virtual key is rate limited
|
2024-03-06 19:23:11 -08:00 |
|
Krrish Dholakia
|
fc16b6650e
|
build(schema.prisma): add support for team-based model aliases
|
2024-03-06 18:55:44 -08:00 |
|
Krrish Dholakia
|
ca97ea8acd
|
feat(proxy_server.py): team based model aliases
allow setting model aliases at a team level (e.g. route all 'gpt-3.5-turbo' requests from team-1 to model-deployment-group-2)
|
2024-03-06 17:42:08 -08:00 |
|
ishaan-jaff
|
6aadfb3472
|
(fix) remove unuse endpoint
|
2024-03-06 15:40:22 -08:00 |
|
ishaan-jaff
|
ac5137f442
|
(fix) admin UI swagger
|
2024-03-06 14:01:39 -08:00 |
|
ishaan-jaff
|
ee468a4e05
|
(feat) circular ref error on prisa
|
2024-03-06 12:08:22 -08:00 |
|
ishaan-jaff
|
c17789f170
|
(fix) circular ref error h
|
2024-03-06 12:02:44 -08:00 |
|
ishaan-jaff
|
2313f5f90a
|
(feat) handle litellm circular ref error
|
2024-03-06 10:21:25 -08:00 |
|
Ishaan Jaff
|
2679e2baf3
|
Merge pull request #2345 from BerriAI/litellm_fix_update_team_db
(fix) Update Team DB
|
2024-03-05 20:37:58 -08:00 |
|
Ishaan Jaff
|
a3ece3fd71
|
Merge pull request #2341 from BerriAI/litellm_fixes_proxy_db
[Fix] Better debugging with alerts
|
2024-03-05 20:37:16 -08:00 |
|
ishaan-jaff
|
c4e7c45c3a
|
(fix) update team_id
|
2024-03-05 19:09:19 -08:00 |
|
ishaan-jaff
|
92753f558c
|
(fix) _update_team_db
|
2024-03-05 19:03:27 -08:00 |
|
Krrish Dholakia
|
ad55f4dbb5
|
feat(proxy_server.py): retry if virtual key is rate limited
currently for chat completions
|
2024-03-05 19:00:03 -08:00 |
|
ishaan-jaff
|
52e59e050c
|
(fix) use str(e)
|
2024-03-05 18:57:26 -08:00 |
|
ishaan-jaff
|
00666566c2
|
(fix) chore
|
2024-03-05 18:54:04 -08:00 |
|
Ishaan Jaff
|
03fa322b38
|
Merge branch 'main' into litellm_fixes_proxy_db
|
2024-03-05 18:51:55 -08:00 |
|
ishaan-jaff
|
a7094b0a3c
|
(v0) fix pydantic warnings
|
2024-03-05 18:29:09 -08:00 |
|
ishaan-jaff
|
735728c4dd
|
(feat) add better debugging when DB read/write fails
|
2024-03-05 18:16:59 -08:00 |
|
Ishaan Jaff
|
e8c475b925
|
Merge pull request #2340 from BerriAI/litellm_fix_batch_update_users
[Fix] Fix Batch Updating User DB
|
2024-03-05 18:06:08 -08:00 |
|
ishaan-jaff
|
3df618bf04
|
(fix) cleanup
|
2024-03-05 17:05:01 -08:00 |
|
ishaan-jaff
|
d0953a1349
|
(fix) chore
|
2024-03-05 17:04:25 -08:00 |
|
ishaan-jaff
|
3f7bf5c6b1
|
(fix) fix batch update user db
|
2024-03-05 16:46:58 -08:00 |
|