Krrish Dholakia
|
4b56f08cbe
|
test(test_models.py): fix delete model test
|
2024-04-04 08:46:08 -07:00 |
|
Krrish Dholakia
|
d7601a4844
|
perf(proxy_server.py): batch write spend logs
reduces prisma client errors, by batch writing spend logs - max 1k logs at a time
|
2024-04-02 18:46:55 -07:00 |
|
Krrish Dholakia
|
60f89faf1c
|
test: cleanup
|
2024-03-30 21:40:43 -07:00 |
|
Krrish Dholakia
|
c0204310ee
|
fix(main.py): fix translation to text_completions format for async text completion calls
|
2024-03-30 09:02:51 -07:00 |
|
Krrish Dholakia
|
af2eabba91
|
fix(proxy_server.py): fix /key/update endpoint to update key duration
also adds a test for this to our ci/cd
|
2024-03-29 21:47:10 -07:00 |
|
Ishaan Jaff
|
6b08be4c8e
|
Update README.MD
|
2024-03-29 14:56:41 -07:00 |
|
Ishaan Jaff
|
a951738849
|
Update README.MD
|
2024-03-29 14:56:04 -07:00 |
|
Krrish Dholakia
|
58bdbecbc6
|
fix(proxy_server.py): allow updating tpm/rpm limits via /team/update
|
2024-03-28 15:59:35 -07:00 |
|
Ishaan Jaff
|
75614bc29c
|
(fix) img gen endpoints unstable
|
2024-03-28 12:43:46 -07:00 |
|
Ishaan Jaff
|
2495587e7c
|
(fix) test img gen proxy
|
2024-03-28 12:33:08 -07:00 |
|
Krrish Dholakia
|
49e8cdbff9
|
fix(router.py): check for context window error when handling 400 status code errors
was causing proxy context window fallbacks to not work as expected
|
2024-03-26 08:08:15 -07:00 |
|
Krrish Dholakia
|
c81c9c2583
|
fix(proxy_server.py): fix model info check
|
2024-03-23 15:59:17 -07:00 |
|
Krrish Dholakia
|
066346b8c8
|
test: handle flaky tests locally - causing db timeout issues on circle ci
|
2024-03-19 20:39:53 -07:00 |
|
Krrish Dholakia
|
2dfdc8dd69
|
Revert "Merge pull request #2593 from BerriAI/litellm_reset_budget_fix"
This reverts commit afd363129f , reversing
changes made to c94bc94ad5 .
|
2024-03-19 20:25:41 -07:00 |
|
Krrish Dholakia
|
b9f222f8fb
|
test(test_users.py): skip flaky circle ci test
|
2024-03-19 20:09:38 -07:00 |
|
Krrish Dholakia
|
97e7113b87
|
test(test_users.py): fix test
|
2024-03-19 19:48:44 -07:00 |
|
Krrish Dholakia
|
9140453d0c
|
test: make test call more reliable
|
2024-03-19 19:47:18 -07:00 |
|
Krrish Dholakia
|
0822f6283f
|
test(test_users.py): reposition test
trying to identify why endpoint isn't being called
|
2024-03-19 19:40:41 -07:00 |
|
Krrish Dholakia
|
f588bff69b
|
fix(proxy_server.py): fix spend log update
|
2024-03-18 20:26:28 -07:00 |
|
ishaan-jaff
|
992aba2a77
|
(fix) predict/spend/logs test
|
2024-03-13 14:09:51 -07:00 |
|
ishaan-jaff
|
ba5cc19edf
|
(test) using get_predict_spend_logs
|
2024-03-13 14:00:04 -07:00 |
|
Krrish Dholakia
|
fa45c569fd
|
feat: add cost tracking + caching for transcription calls
|
2024-03-09 15:43:38 -08:00 |
|
Krish Dholakia
|
caa99f43bf
|
Merge branch 'main' into litellm_load_balancing_transcription_endpoints
|
2024-03-08 23:08:47 -08:00 |
|
Krrish Dholakia
|
7ff8fa09d6
|
test(test_whisper.py): hardcode api base
|
2024-03-08 22:51:17 -08:00 |
|
Krrish Dholakia
|
c0c76707a1
|
test(test_whisper.py): cleanup test
|
2024-03-08 22:44:22 -08:00 |
|
Krrish Dholakia
|
0432c85bf7
|
test(test_whisper.py): add debugging for circle ci error
|
2024-03-08 22:43:07 -08:00 |
|
Krrish Dholakia
|
fac01f8481
|
fix(azure.py): add pre call logging for transcription calls
|
2024-03-08 22:23:21 -08:00 |
|
Krrish Dholakia
|
0fb7afe820
|
feat(proxy_server.py): working /audio/transcription endpoint
|
2024-03-08 18:20:27 -08:00 |
|
Krrish Dholakia
|
fe125a5131
|
test(test_whisper.py): add testing for load balancing whisper endpoints on router
|
2024-03-08 14:19:37 -08:00 |
|
Krrish Dholakia
|
9274245a0b
|
test(test_whisper.py): fix getting path for audio file in test
|
2024-03-08 14:10:19 -08:00 |
|
Krrish Dholakia
|
6fa585d001
|
test(test_whisper.py): fix test
|
2024-03-08 14:02:34 -08:00 |
|
Krrish Dholakia
|
6b1049217e
|
feat(azure.py): add support for calling whisper endpoints on azure
|
2024-03-08 13:48:38 -08:00 |
|
Krrish Dholakia
|
696eb54455
|
feat(main.py): support openai transcription endpoints
enable user to load balance between openai + azure transcription endpoints
|
2024-03-08 10:25:19 -08:00 |
|
Krrish Dholakia
|
dd78a1956a
|
fix(proxy_server.py): fix model alias map + add back testing
|
2024-03-07 07:56:51 -08:00 |
|
Krrish Dholakia
|
b9854a99d2
|
test: increase time before checking budget reset - avoid deadlocking
|
2024-03-06 22:16:59 -08:00 |
|
Krrish Dholakia
|
ad55f4dbb5
|
feat(proxy_server.py): retry if virtual key is rate limited
currently for chat completions
|
2024-03-05 19:00:03 -08:00 |
|
Krrish Dholakia
|
2b595bfdc9
|
test(test_keys.py): add more duration for test
|
2024-03-05 15:42:19 -08:00 |
|
Krrish Dholakia
|
4f3d1ee11f
|
test(test_keys.py): extend time for test
|
2024-03-02 21:19:34 -08:00 |
|
Krrish Dholakia
|
37ad5efc61
|
fix(proxy/utils.py): fix resetting budget logic
|
2024-03-02 20:52:54 -08:00 |
|
Krrish Dholakia
|
6fb19c5d42
|
test(test_organizations.py): add testing for /organization/new endpoint
|
2024-03-02 12:13:54 -08:00 |
|
Krrish Dholakia
|
6b8b0f40cf
|
feat(proxy_server.py): add new team_member delete endpoint
|
2024-03-01 09:14:08 -08:00 |
|
Krrish Dholakia
|
f7a2d3faef
|
test: skip sagemaker test - aws account suspended
|
2024-02-28 15:00:53 -08:00 |
|
Krrish Dholakia
|
5265d9042d
|
test: testing fixes
|
2024-02-27 20:35:42 -08:00 |
|
Krrish Dholakia
|
20def3805c
|
test(test_openai_endpoints.py): fix test
|
2024-02-23 22:57:20 -08:00 |
|
Krrish Dholakia
|
a7229c9253
|
fix(proxy_server.py): enable proxy /team/delete endpoint
|
2024-02-21 16:53:12 -08:00 |
|
Krrish Dholakia
|
dca9103b09
|
refactor(test_team.py): trigger new devrelease
|
2024-02-16 09:57:51 -08:00 |
|
Krrish Dholakia
|
faa7892b4c
|
test(test_team.py): trigger new build
|
2024-02-16 08:56:08 -08:00 |
|
Krrish Dholakia
|
3af2fb25be
|
test(test_team.py): adding testing for team endpoints
|
2024-02-16 08:55:21 -08:00 |
|
Krrish Dholakia
|
83d43809a7
|
fix(proxy_server.py): fix proxy server team id bug
|
2024-02-13 22:33:56 -08:00 |
|
Krrish Dholakia
|
4a0df3cb4f
|
fix(proxy_cli.py-&&-proxy_server.py): bump reset budget intervals and fix pool limits for prisma connections
|
2024-02-06 19:39:49 -08:00 |
|