Commit graph

77 commits

Author SHA1 Message Date
Krrish Dholakia
4b56f08cbe test(test_models.py): fix delete model test 2024-04-04 08:46:08 -07:00
Krrish Dholakia
d7601a4844 perf(proxy_server.py): batch write spend logs
reduces prisma client errors, by batch writing spend logs - max 1k logs at a time
2024-04-02 18:46:55 -07:00
Krrish Dholakia
60f89faf1c test: cleanup 2024-03-30 21:40:43 -07:00
Krrish Dholakia
c0204310ee fix(main.py): fix translation to text_completions format for async text completion calls 2024-03-30 09:02:51 -07:00
Krrish Dholakia
af2eabba91 fix(proxy_server.py): fix /key/update endpoint to update key duration
also adds a test for this to our ci/cd
2024-03-29 21:47:10 -07:00
Ishaan Jaff
6b08be4c8e
Update README.MD 2024-03-29 14:56:41 -07:00
Ishaan Jaff
a951738849
Update README.MD 2024-03-29 14:56:04 -07:00
Krrish Dholakia
58bdbecbc6 fix(proxy_server.py): allow updating tpm/rpm limits via /team/update 2024-03-28 15:59:35 -07:00
Ishaan Jaff
75614bc29c (fix) img gen endpoints unstable 2024-03-28 12:43:46 -07:00
Ishaan Jaff
2495587e7c (fix) test img gen proxy 2024-03-28 12:33:08 -07:00
Krrish Dholakia
49e8cdbff9 fix(router.py): check for context window error when handling 400 status code errors
was causing proxy context window fallbacks to not work as expected
2024-03-26 08:08:15 -07:00
Krrish Dholakia
c81c9c2583 fix(proxy_server.py): fix model info check 2024-03-23 15:59:17 -07:00
Krrish Dholakia
066346b8c8 test: handle flaky tests locally - causing db timeout issues on circle ci 2024-03-19 20:39:53 -07:00
Krrish Dholakia
2dfdc8dd69 Revert "Merge pull request #2593 from BerriAI/litellm_reset_budget_fix"
This reverts commit afd363129f, reversing
changes made to c94bc94ad5.
2024-03-19 20:25:41 -07:00
Krrish Dholakia
b9f222f8fb test(test_users.py): skip flaky circle ci test 2024-03-19 20:09:38 -07:00
Krrish Dholakia
97e7113b87 test(test_users.py): fix test 2024-03-19 19:48:44 -07:00
Krrish Dholakia
9140453d0c test: make test call more reliable 2024-03-19 19:47:18 -07:00
Krrish Dholakia
0822f6283f test(test_users.py): reposition test
trying to identify why endpoint isn't being called
2024-03-19 19:40:41 -07:00
Krrish Dholakia
f588bff69b fix(proxy_server.py): fix spend log update 2024-03-18 20:26:28 -07:00
ishaan-jaff
992aba2a77 (fix) predict/spend/logs test 2024-03-13 14:09:51 -07:00
ishaan-jaff
ba5cc19edf (test) using get_predict_spend_logs 2024-03-13 14:00:04 -07:00
Krrish Dholakia
fa45c569fd feat: add cost tracking + caching for transcription calls 2024-03-09 15:43:38 -08:00
Krish Dholakia
caa99f43bf
Merge branch 'main' into litellm_load_balancing_transcription_endpoints 2024-03-08 23:08:47 -08:00
Krrish Dholakia
7ff8fa09d6 test(test_whisper.py): hardcode api base 2024-03-08 22:51:17 -08:00
Krrish Dholakia
c0c76707a1 test(test_whisper.py): cleanup test 2024-03-08 22:44:22 -08:00
Krrish Dholakia
0432c85bf7 test(test_whisper.py): add debugging for circle ci error 2024-03-08 22:43:07 -08:00
Krrish Dholakia
fac01f8481 fix(azure.py): add pre call logging for transcription calls 2024-03-08 22:23:21 -08:00
Krrish Dholakia
0fb7afe820 feat(proxy_server.py): working /audio/transcription endpoint 2024-03-08 18:20:27 -08:00
Krrish Dholakia
fe125a5131 test(test_whisper.py): add testing for load balancing whisper endpoints on router 2024-03-08 14:19:37 -08:00
Krrish Dholakia
9274245a0b test(test_whisper.py): fix getting path for audio file in test 2024-03-08 14:10:19 -08:00
Krrish Dholakia
6fa585d001 test(test_whisper.py): fix test 2024-03-08 14:02:34 -08:00
Krrish Dholakia
6b1049217e feat(azure.py): add support for calling whisper endpoints on azure 2024-03-08 13:48:38 -08:00
Krrish Dholakia
696eb54455 feat(main.py): support openai transcription endpoints
enable user to load balance between openai + azure transcription endpoints
2024-03-08 10:25:19 -08:00
Krrish Dholakia
dd78a1956a fix(proxy_server.py): fix model alias map + add back testing 2024-03-07 07:56:51 -08:00
Krrish Dholakia
b9854a99d2 test: increase time before checking budget reset - avoid deadlocking 2024-03-06 22:16:59 -08:00
Krrish Dholakia
ad55f4dbb5 feat(proxy_server.py): retry if virtual key is rate limited
currently for chat completions
2024-03-05 19:00:03 -08:00
Krrish Dholakia
2b595bfdc9 test(test_keys.py): add more duration for test 2024-03-05 15:42:19 -08:00
Krrish Dholakia
4f3d1ee11f test(test_keys.py): extend time for test 2024-03-02 21:19:34 -08:00
Krrish Dholakia
37ad5efc61 fix(proxy/utils.py): fix resetting budget logic 2024-03-02 20:52:54 -08:00
Krrish Dholakia
6fb19c5d42 test(test_organizations.py): add testing for /organization/new endpoint 2024-03-02 12:13:54 -08:00
Krrish Dholakia
6b8b0f40cf feat(proxy_server.py): add new team_member delete endpoint 2024-03-01 09:14:08 -08:00
Krrish Dholakia
f7a2d3faef test: skip sagemaker test - aws account suspended 2024-02-28 15:00:53 -08:00
Krrish Dholakia
5265d9042d test: testing fixes 2024-02-27 20:35:42 -08:00
Krrish Dholakia
20def3805c test(test_openai_endpoints.py): fix test 2024-02-23 22:57:20 -08:00
Krrish Dholakia
a7229c9253 fix(proxy_server.py): enable proxy /team/delete endpoint 2024-02-21 16:53:12 -08:00
Krrish Dholakia
dca9103b09 refactor(test_team.py): trigger new devrelease 2024-02-16 09:57:51 -08:00
Krrish Dholakia
faa7892b4c test(test_team.py): trigger new build 2024-02-16 08:56:08 -08:00
Krrish Dholakia
3af2fb25be test(test_team.py): adding testing for team endpoints 2024-02-16 08:55:21 -08:00
Krrish Dholakia
83d43809a7 fix(proxy_server.py): fix proxy server team id bug 2024-02-13 22:33:56 -08:00
Krrish Dholakia
4a0df3cb4f fix(proxy_cli.py-&&-proxy_server.py): bump reset budget intervals and fix pool limits for prisma connections 2024-02-06 19:39:49 -08:00