Commit graph

929 commits

Author SHA1 Message Date
Krrish Dholakia
074ea17325 fix: support streaming custom cost completion tracking 2024-01-22 15:15:34 -08:00
Ishaan Jaff
5700d60e1a
Merge pull request #1538 from BerriAI/litellm_use_custom_key_gen
[Feat] Proxy Auth - Use custom_key_generate
2024-01-22 14:21:19 -08:00
ishaan-jaff
9d34281025 (fix) spend tracking per key - when no cache hit 2024-01-22 13:18:33 -08:00
ishaan-jaff
6120e1c36d (fix) dynamo max_budget per key 2024-01-22 12:50:14 -08:00
ishaan-jaff
de7663ead3 (fix) make max_budget optional for LiteLLM_VerificationToken 2024-01-22 12:20:38 -08:00
ishaan-jaff
8979b74d49 (feat) working budgets per key 2024-01-22 12:13:19 -08:00
ishaan-jaff
7ed3141c7d (feat) working create budgets per key 2024-01-22 11:58:20 -08:00
ishaan-jaff
8b599d4398 v0 max_budget per key 2024-01-22 11:33:34 -08:00
Krrish Dholakia
e2831e9c80 fix: fix proxy logging 2024-01-20 18:22:45 -08:00
Krrish Dholakia
09b7235b31 fix: support info level logging on pkg + proxy 2024-01-20 17:45:47 -08:00
Krrish Dholakia
3e5b743b89 fix(caching.py): add logging module support for caching 2024-01-20 17:34:29 -08:00
ishaan-jaff
13eb40e7bd v0 using custom_key_generate 2024-01-20 08:39:52 -08:00
Krrish Dholakia
7cf0bb475f fix(proxy_server.py): run all endpoints through custom auth 2024-01-19 20:24:33 -08:00
Krrish Dholakia
2c2163e4e5 fix(proxy_server.py): fix key info to handle pydantic v1 2024-01-19 18:52:09 -08:00
Krrish Dholakia
f05aba1f85 fix(utils.py): add metadata to logging obj on setup, if exists 2024-01-19 17:29:47 -08:00
Krrish Dholakia
f5ced089d6 test(tests/): add unit testing for proxy server endpoints 2024-01-19 14:54:29 -08:00
Ishaan Jaff
650036071f
Merge pull request #1519 from BerriAI/litellm_proxy_make_success_handler_non_blocking
[Feat] litellm.acompletion() make Langfuse success handler non blocking
2024-01-19 11:41:00 -08:00
Krrish Dholakia
1a29272b47 fix(parallel_request_limiter.py): handle tpm/rpm limits being null 2024-01-19 10:22:27 -08:00
ishaan-jaff
e6b5152e63 (chore) update load test 2024-01-19 08:52:17 -08:00
Krrish Dholakia
c5e144af23 docs(health.md): add /health/readiness and /health/liveliness to docs 2024-01-19 08:45:23 -08:00
Ishaan Jaff
79e261f533
Merge pull request #1509 from BerriAI/litellm_track_cost_user_id_chat_completions
[Feat] Proxy - Track Cost Per User (Using `user` passed to requests)
2024-01-18 20:44:02 -08:00
Krrish Dholakia
f7694bc193 Merge branch 'main' into litellm_tpm_rpm_rate_limits 2024-01-18 19:10:07 -08:00
ishaan-jaff
5698be0df1 (fix) safe access litellm_params, proxy_server_request 2024-01-18 18:05:51 -08:00
ishaan-jaff
16f3d7e0ed (feat) use user_id passed to request - cost track 2024-01-18 17:51:48 -08:00
ishaan-jaff
ddd9ca86a7 (feat) proxy - track cost for user_ids that do not exist 2024-01-18 17:44:39 -08:00
Krrish Dholakia
1e5efdfa37 fix(proxy_server.py): support setting tpm/rpm limits per user / per key 2024-01-18 17:03:18 -08:00
Ishaan Jaff
a8ba5df90e
Merge pull request #1500 from BerriAI/litellm_create_keys_with_team_id
[Feat] /key/generate - create keys with`team_id`
2024-01-18 16:35:14 -08:00
Krrish Dholakia
5dac2402ef test(test_parallel_request_limiter.py): unit testing for tpm/rpm rate limits 2024-01-18 15:28:28 -08:00
ishaan-jaff
340706565f (fix) add team_id to doc string 2024-01-18 15:23:05 -08:00
ishaan-jaff
2b6972111e (feat) write team_id to User Table 2024-01-18 14:42:46 -08:00
Ishaan Jaff
a26267851f
Merge pull request #1498 from BerriAI/litellm_spend_tracking_logs
[Feat] Proxy - Add Spend tracking logs
2024-01-18 14:21:51 -08:00
ishaan-jaff
90509a159a (fix) write team_id to key table 2024-01-18 13:54:08 -08:00
Krrish Dholakia
aef59c554f feat(parallel_request_limiter.py): add support for tpm/rpm limits 2024-01-18 13:52:15 -08:00
ishaan-jaff
42ad12b2bd (fix) support team_id for /key/generate 2024-01-18 13:48:52 -08:00
ishaan-jaff
4294657b99 (fix) use get_logging_payload 2024-01-18 13:40:48 -08:00
ishaan-jaff
ea32a8757b (feat) set team_id on virtual_keys 2024-01-18 13:34:51 -08:00
ishaan-jaff
73938080f2 (feat) track - api_key in spendLogs 2024-01-18 13:16:25 -08:00
Krrish Dholakia
1ea3833ef7 fix(parallel_request_limiter.py): decrement count for failed llm calls
https://github.com/BerriAI/litellm/issues/1477
2024-01-18 12:42:14 -08:00
ishaan-jaff
5b54bcc712 (feat) spendLogs table DynamoDB 2024-01-18 12:39:11 -08:00
ishaan-jaff
88cdfedf84 (feat) track cost streaming 2024-01-18 12:21:56 -08:00
ishaan-jaff
d14d36af9a (v0 ) working - writing /chat/completion spend tracking 2024-01-18 11:54:15 -08:00
Krrish Dholakia
c8dd36db9e fix(proxy_server.py): show all models user has access to in /models 2024-01-18 10:56:37 -08:00
ishaan-jaff
4a5f987512 (feat) insert_data to spend table 2024-01-18 10:09:02 -08:00
ishaan-jaff
4821fa9201 (v0) add schema.prisma 2024-01-18 10:04:34 -08:00
Ishaan Jaff
143e225194
Merge pull request #1496 from BerriAI/litellm_unit_test_key_endpoints
[Test+Fix] /Key/Info, /Key/Update - Litellm unit test key endpoints
2024-01-18 09:55:30 -08:00
ishaan-jaff
fc1eb36f24 (fix) /key/update overwriting metadata 2024-01-18 09:32:56 -08:00
Krrish Dholakia
96122a4f88 fix(proxy/utils.py): fix isoformat to string logic 2024-01-18 09:32:30 -08:00
Krrish Dholakia
71034099c9 fix(proxy/utils.py): prisma client fix get data to handle list return 2024-01-18 07:49:13 -08:00
ishaan-jaff
85b5395692 (test) use os.environ/ for azure vision enhance 2024-01-17 21:26:47 -08:00
ishaan-jaff
0414e40d4a (docs) also test gpt-4 vision enhancements 2024-01-17 18:46:41 -08:00