Commit graph

667 commits

Author SHA1 Message Date
ishaan-jaff
13eb40e7bd v0 using custom_key_generate 2024-01-20 08:39:52 -08:00
Krrish Dholakia
7cf0bb475f fix(proxy_server.py): run all endpoints through custom auth 2024-01-19 20:24:33 -08:00
Krrish Dholakia
2c2163e4e5 fix(proxy_server.py): fix key info to handle pydantic v1 2024-01-19 18:52:09 -08:00
Krrish Dholakia
f05aba1f85 fix(utils.py): add metadata to logging obj on setup, if exists 2024-01-19 17:29:47 -08:00
Krrish Dholakia
f5ced089d6 test(tests/): add unit testing for proxy server endpoints 2024-01-19 14:54:29 -08:00
Ishaan Jaff
650036071f
Merge pull request #1519 from BerriAI/litellm_proxy_make_success_handler_non_blocking
[Feat] litellm.acompletion() make Langfuse success handler non blocking
2024-01-19 11:41:00 -08:00
Krrish Dholakia
1a29272b47 fix(parallel_request_limiter.py): handle tpm/rpm limits being null 2024-01-19 10:22:27 -08:00
ishaan-jaff
e6b5152e63 (chore) update load test 2024-01-19 08:52:17 -08:00
Krrish Dholakia
c5e144af23 docs(health.md): add /health/readiness and /health/liveliness to docs 2024-01-19 08:45:23 -08:00
Ishaan Jaff
79e261f533
Merge pull request #1509 from BerriAI/litellm_track_cost_user_id_chat_completions
[Feat] Proxy - Track Cost Per User (Using `user` passed to requests)
2024-01-18 20:44:02 -08:00
Krrish Dholakia
f7694bc193 Merge branch 'main' into litellm_tpm_rpm_rate_limits 2024-01-18 19:10:07 -08:00
ishaan-jaff
5698be0df1 (fix) safe access litellm_params, proxy_server_request 2024-01-18 18:05:51 -08:00
ishaan-jaff
16f3d7e0ed (feat) use user_id passed to request - cost track 2024-01-18 17:51:48 -08:00
ishaan-jaff
ddd9ca86a7 (feat) proxy - track cost for user_ids that do not exist 2024-01-18 17:44:39 -08:00
Krrish Dholakia
1e5efdfa37 fix(proxy_server.py): support setting tpm/rpm limits per user / per key 2024-01-18 17:03:18 -08:00
Ishaan Jaff
a8ba5df90e
Merge pull request #1500 from BerriAI/litellm_create_keys_with_team_id
[Feat] /key/generate - create keys with`team_id`
2024-01-18 16:35:14 -08:00
Krrish Dholakia
5dac2402ef test(test_parallel_request_limiter.py): unit testing for tpm/rpm rate limits 2024-01-18 15:28:28 -08:00
ishaan-jaff
340706565f (fix) add team_id to doc string 2024-01-18 15:23:05 -08:00
ishaan-jaff
2b6972111e (feat) write team_id to User Table 2024-01-18 14:42:46 -08:00
Ishaan Jaff
a26267851f
Merge pull request #1498 from BerriAI/litellm_spend_tracking_logs
[Feat] Proxy - Add Spend tracking logs
2024-01-18 14:21:51 -08:00
ishaan-jaff
90509a159a (fix) write team_id to key table 2024-01-18 13:54:08 -08:00
Krrish Dholakia
aef59c554f feat(parallel_request_limiter.py): add support for tpm/rpm limits 2024-01-18 13:52:15 -08:00
ishaan-jaff
42ad12b2bd (fix) support team_id for /key/generate 2024-01-18 13:48:52 -08:00
ishaan-jaff
4294657b99 (fix) use get_logging_payload 2024-01-18 13:40:48 -08:00
ishaan-jaff
ea32a8757b (feat) set team_id on virtual_keys 2024-01-18 13:34:51 -08:00
ishaan-jaff
73938080f2 (feat) track - api_key in spendLogs 2024-01-18 13:16:25 -08:00
Krrish Dholakia
1ea3833ef7 fix(parallel_request_limiter.py): decrement count for failed llm calls
https://github.com/BerriAI/litellm/issues/1477
2024-01-18 12:42:14 -08:00
ishaan-jaff
5b54bcc712 (feat) spendLogs table DynamoDB 2024-01-18 12:39:11 -08:00
ishaan-jaff
88cdfedf84 (feat) track cost streaming 2024-01-18 12:21:56 -08:00
ishaan-jaff
d14d36af9a (v0 ) working - writing /chat/completion spend tracking 2024-01-18 11:54:15 -08:00
Krrish Dholakia
c8dd36db9e fix(proxy_server.py): show all models user has access to in /models 2024-01-18 10:56:37 -08:00
ishaan-jaff
4a5f987512 (feat) insert_data to spend table 2024-01-18 10:09:02 -08:00
ishaan-jaff
4821fa9201 (v0) add schema.prisma 2024-01-18 10:04:34 -08:00
Ishaan Jaff
143e225194
Merge pull request #1496 from BerriAI/litellm_unit_test_key_endpoints
[Test+Fix] /Key/Info, /Key/Update - Litellm unit test key endpoints
2024-01-18 09:55:30 -08:00
ishaan-jaff
fc1eb36f24 (fix) /key/update overwriting metadata 2024-01-18 09:32:56 -08:00
Krrish Dholakia
96122a4f88 fix(proxy/utils.py): fix isoformat to string logic 2024-01-18 09:32:30 -08:00
Krrish Dholakia
71034099c9 fix(proxy/utils.py): prisma client fix get data to handle list return 2024-01-18 07:49:13 -08:00
ishaan-jaff
85b5395692 (test) use os.environ/ for azure vision enhance 2024-01-17 21:26:47 -08:00
ishaan-jaff
0414e40d4a (docs) also test gpt-4 vision enhancements 2024-01-17 18:46:41 -08:00
Krish Dholakia
e9ac001005
Merge pull request #1483 from BerriAI/litellm_model_access_groups_feature
feat(proxy_server.py): support model access groups
2024-01-17 18:16:53 -08:00
ishaan-jaff
f3a45ea044 (fix) cleanup 2024-01-17 17:54:18 -08:00
ishaan-jaff
8df3a86178 (feat) proxy - set endpoint called in callback 2024-01-17 17:44:28 -08:00
Krrish Dholakia
73daee7e07 fix(proxy_cli.py): ensure proxy always retries if db push fails to connect to db 2024-01-17 17:37:59 -08:00
Krrish Dholakia
cff9f7fee6 fix(proxy_server.py): handle empty insert_data response 2024-01-17 17:28:23 -08:00
ishaan-jaff
00dfb5918c (feat) proxy - log key metadata in calback 2024-01-17 16:42:49 -08:00
Krrish Dholakia
98b83fa780 feat(proxy_server.py): support model access groups 2024-01-17 15:45:31 -08:00
ishaan-jaff
66bcd431f6 (docs) add doc string for /key/delete 2024-01-17 15:27:48 -08:00
ishaan-jaff
0250492d95 (fix) /key/delete 2024-01-17 14:54:29 -08:00
ishaan-jaff
a0eec51ee6 (test) expired key prisma 2024-01-17 13:24:15 -08:00
ishaan-jaff
399f0ba620 (fix) prisma - non expiring keys 2024-01-17 12:56:26 -08:00