Commit graph

688 commits

Author SHA1 Message Date
ishaan-jaff
f0e17a4322 (fix) raise ProxyException 2024-01-23 13:59:35 -08:00
ishaan-jaff
d2fa9ac4c3 (fix) select_data_generator 2024-01-23 13:47:37 -08:00
ishaan-jaff
00d18cbc86 (fix) select_data_generator - sagemaker 2024-01-23 13:47:37 -08:00
ishaan-jaff
67dddc94d9 (fix) proxy - streaming sagemaker 2024-01-23 13:47:37 -08:00
ishaan-jaff
278cc75603 (docs) use proxy + Sagemaker stream test 2024-01-23 13:47:37 -08:00
Krrish Dholakia
616fe11145 test: fix tests 2024-01-23 13:47:37 -08:00
Krrish Dholakia
d57e57234e fix(openai.py): fix linting issue 2024-01-23 13:47:37 -08:00
Krrish Dholakia
e4fda7c840 feat(utils.py): emit response cost as part of logs 2024-01-23 13:47:37 -08:00
ishaan-jaff
b0e642d3d9 (fix) spend tracking per key - when no cache hit 2024-01-23 13:47:21 -08:00
ishaan-jaff
10060bf051 (fix) dynamo max_budget per key 2024-01-23 13:47:21 -08:00
ishaan-jaff
8d3c30b01f (fix) make max_budget optional for LiteLLM_VerificationToken 2024-01-23 13:47:21 -08:00
ishaan-jaff
2539ed9b0a (feat) working budgets per key 2024-01-23 13:47:21 -08:00
ishaan-jaff
1640a9d58e (feat) working create budgets per key 2024-01-23 13:47:21 -08:00
ishaan-jaff
770aa2e5c6 v0 max_budget per key 2024-01-23 13:47:21 -08:00
ishaan-jaff
16246f3ea2 (chore) docs 2024-01-22 16:59:52 -08:00
ishaan-jaff
b03a751418 (feat) move Proxy Auth Exception to be OpenAI 2024-01-22 16:24:03 -08:00
Krrish Dholakia
074ea17325 fix: support streaming custom cost completion tracking 2024-01-22 15:15:34 -08:00
Ishaan Jaff
5700d60e1a
Merge pull request #1538 from BerriAI/litellm_use_custom_key_gen
[Feat] Proxy Auth - Use custom_key_generate
2024-01-22 14:21:19 -08:00
Krrish Dholakia
e2831e9c80 fix: fix proxy logging 2024-01-20 18:22:45 -08:00
Krrish Dholakia
09b7235b31 fix: support info level logging on pkg + proxy 2024-01-20 17:45:47 -08:00
Krrish Dholakia
3e5b743b89 fix(caching.py): add logging module support for caching 2024-01-20 17:34:29 -08:00
ishaan-jaff
13eb40e7bd v0 using custom_key_generate 2024-01-20 08:39:52 -08:00
Krrish Dholakia
7cf0bb475f fix(proxy_server.py): run all endpoints through custom auth 2024-01-19 20:24:33 -08:00
Krrish Dholakia
2c2163e4e5 fix(proxy_server.py): fix key info to handle pydantic v1 2024-01-19 18:52:09 -08:00
Krrish Dholakia
f05aba1f85 fix(utils.py): add metadata to logging obj on setup, if exists 2024-01-19 17:29:47 -08:00
Krrish Dholakia
f5ced089d6 test(tests/): add unit testing for proxy server endpoints 2024-01-19 14:54:29 -08:00
Ishaan Jaff
650036071f
Merge pull request #1519 from BerriAI/litellm_proxy_make_success_handler_non_blocking
[Feat] litellm.acompletion() make Langfuse success handler non blocking
2024-01-19 11:41:00 -08:00
Krrish Dholakia
1a29272b47 fix(parallel_request_limiter.py): handle tpm/rpm limits being null 2024-01-19 10:22:27 -08:00
ishaan-jaff
e6b5152e63 (chore) update load test 2024-01-19 08:52:17 -08:00
Krrish Dholakia
c5e144af23 docs(health.md): add /health/readiness and /health/liveliness to docs 2024-01-19 08:45:23 -08:00
Ishaan Jaff
79e261f533
Merge pull request #1509 from BerriAI/litellm_track_cost_user_id_chat_completions
[Feat] Proxy - Track Cost Per User (Using `user` passed to requests)
2024-01-18 20:44:02 -08:00
Krrish Dholakia
f7694bc193 Merge branch 'main' into litellm_tpm_rpm_rate_limits 2024-01-18 19:10:07 -08:00
ishaan-jaff
5698be0df1 (fix) safe access litellm_params, proxy_server_request 2024-01-18 18:05:51 -08:00
ishaan-jaff
16f3d7e0ed (feat) use user_id passed to request - cost track 2024-01-18 17:51:48 -08:00
ishaan-jaff
ddd9ca86a7 (feat) proxy - track cost for user_ids that do not exist 2024-01-18 17:44:39 -08:00
Krrish Dholakia
1e5efdfa37 fix(proxy_server.py): support setting tpm/rpm limits per user / per key 2024-01-18 17:03:18 -08:00
Ishaan Jaff
a8ba5df90e
Merge pull request #1500 from BerriAI/litellm_create_keys_with_team_id
[Feat] /key/generate - create keys with`team_id`
2024-01-18 16:35:14 -08:00
Krrish Dholakia
5dac2402ef test(test_parallel_request_limiter.py): unit testing for tpm/rpm rate limits 2024-01-18 15:28:28 -08:00
ishaan-jaff
340706565f (fix) add team_id to doc string 2024-01-18 15:23:05 -08:00
ishaan-jaff
2b6972111e (feat) write team_id to User Table 2024-01-18 14:42:46 -08:00
Ishaan Jaff
a26267851f
Merge pull request #1498 from BerriAI/litellm_spend_tracking_logs
[Feat] Proxy - Add Spend tracking logs
2024-01-18 14:21:51 -08:00
ishaan-jaff
90509a159a (fix) write team_id to key table 2024-01-18 13:54:08 -08:00
Krrish Dholakia
aef59c554f feat(parallel_request_limiter.py): add support for tpm/rpm limits 2024-01-18 13:52:15 -08:00
ishaan-jaff
42ad12b2bd (fix) support team_id for /key/generate 2024-01-18 13:48:52 -08:00
ishaan-jaff
4294657b99 (fix) use get_logging_payload 2024-01-18 13:40:48 -08:00
ishaan-jaff
ea32a8757b (feat) set team_id on virtual_keys 2024-01-18 13:34:51 -08:00
ishaan-jaff
73938080f2 (feat) track - api_key in spendLogs 2024-01-18 13:16:25 -08:00
Krrish Dholakia
1ea3833ef7 fix(parallel_request_limiter.py): decrement count for failed llm calls
https://github.com/BerriAI/litellm/issues/1477
2024-01-18 12:42:14 -08:00
ishaan-jaff
5b54bcc712 (feat) spendLogs table DynamoDB 2024-01-18 12:39:11 -08:00
ishaan-jaff
88cdfedf84 (feat) track cost streaming 2024-01-18 12:21:56 -08:00