Commit graph

3426 commits

Author SHA1 Message Date
ishaan-jaff
d0442ae0f2 (feat) router - usage based routing - consider input_tokens 2024-01-19 13:59:49 -08:00
ishaan-jaff
57a20100e4 (test) usage based routing 2024-01-19 13:58:08 -08:00
ishaan-jaff
8c0b7b1015 (feat) - improve router logging/debugging messages 2024-01-19 13:57:33 -08:00
ishaan-jaff
912e2de364 (test) usage based router + fallbacks 2024-01-19 12:34:08 -08:00
ishaan-jaff
7b2c15aa51 (feat) improve litellm.Router logging 2024-01-19 12:28:51 -08:00
Tim Asp
9d6a8b5de3 Cleaner fallback 2024-01-19 11:56:45 -08:00
Ishaan Jaff
650036071f
Merge pull request #1519 from BerriAI/litellm_proxy_make_success_handler_non_blocking
[Feat] litellm.acompletion() make Langfuse success handler non blocking
2024-01-19 11:41:00 -08:00
ishaan-jaff
8cf8da1378 (test) langfuse_latency_test_user 2024-01-19 11:38:44 -08:00
ishaan-jaff
cb99cd18e7 (feat) log cache_hit as langfuse tags 2024-01-19 11:38:18 -08:00
Tim Asp
27087e5662 fix error when metadata is none, prevents logging silently 2024-01-19 11:37:45 -08:00
Tim Asp
141d2219e2 Add env for project wide langsmith default project settings 2024-01-19 11:37:15 -08:00
ishaan-jaff
2f429f37b7 (test) test latency added with langfuse call 2024-01-19 11:28:10 -08:00
ishaan-jaff
cb40f58cd3 (fix) return usage in mock_completion 2024-01-19 11:25:47 -08:00
ishaan-jaff
6a695477ba (fix) async langfuse logger 2024-01-19 10:44:51 -08:00
Krrish Dholakia
1a29272b47 fix(parallel_request_limiter.py): handle tpm/rpm limits being null 2024-01-19 10:22:27 -08:00
ishaan-jaff
f2cfb76920 (fix) use asyncio run_in_executor 2024-01-19 09:52:51 -08:00
ishaan-jaff
e6b5152e63 (chore) update load test 2024-01-19 08:52:17 -08:00
ishaan-jaff
cd08a02764 (test) add blocking callback test 2024-01-19 08:51:52 -08:00
ishaan-jaff
a9c5b02303 (v0) fix 2024-01-19 08:51:14 -08:00
Krrish Dholakia
c5e144af23 docs(health.md): add /health/readiness and /health/liveliness to docs 2024-01-19 08:45:23 -08:00
Ishaan Jaff
8fef1a1b69
Merge pull request #1499 from duarteocarmo/patch-2
Adds s3_path prefix so that we can save objects in predifined location in s3 bucket
2024-01-18 21:22:15 -08:00
Ishaan Jaff
8338d04d1a
Merge pull request #1506 from HaruHunab1320/main
add headers to budget manager
2024-01-18 21:20:45 -08:00
Ishaan Jaff
79e261f533
Merge pull request #1509 from BerriAI/litellm_track_cost_user_id_chat_completions
[Feat] Proxy - Track Cost Per User (Using `user` passed to requests)
2024-01-18 20:44:02 -08:00
Keegan McCallum
3b719b2afd
Allow overriding headers for anthropic 2024-01-18 20:12:59 -08:00
Krrish Dholakia
f7694bc193 Merge branch 'main' into litellm_tpm_rpm_rate_limits 2024-01-18 19:10:07 -08:00
ishaan-jaff
5698be0df1 (fix) safe access litellm_params, proxy_server_request 2024-01-18 18:05:51 -08:00
Krrish Dholakia
94ce524c63 test(test_completion.py): handle together ai timeout 2024-01-18 17:54:16 -08:00
ishaan-jaff
16f3d7e0ed (feat) use user_id passed to request - cost track 2024-01-18 17:51:48 -08:00
ishaan-jaff
697c511e76 (feat) support user param for all providers 2024-01-18 17:45:59 -08:00
ishaan-jaff
ddd9ca86a7 (feat) proxy - track cost for user_ids that do not exist 2024-01-18 17:44:39 -08:00
Krrish Dholakia
e0aaa94f28 fix(main.py): read azure ad token from optional params extra body 2024-01-18 17:14:03 -08:00
Krrish Dholakia
1e5efdfa37 fix(proxy_server.py): support setting tpm/rpm limits per user / per key 2024-01-18 17:03:18 -08:00
Ishaan Jaff
a8ba5df90e
Merge pull request #1500 from BerriAI/litellm_create_keys_with_team_id
[Feat] /key/generate - create keys with`team_id`
2024-01-18 16:35:14 -08:00
Jakob
7ba7272a9a add headers to budget manager 2024-01-18 16:10:45 -08:00
Krrish Dholakia
5dac2402ef test(test_parallel_request_limiter.py): unit testing for tpm/rpm rate limits 2024-01-18 15:28:28 -08:00
Krrish Dholakia
22c3321208 test(test_parallel_request_limiter.py): unit testing for tpm/rpm rate limits 2024-01-18 15:25:29 -08:00
ishaan-jaff
340706565f (fix) add team_id to doc string 2024-01-18 15:23:05 -08:00
ishaan-jaff
2b6972111e (feat) write team_id to User Table 2024-01-18 14:42:46 -08:00
ishaan-jaff
5beef6dbcd (test) setting team_id 2024-01-18 14:33:13 -08:00
Ishaan Jaff
a26267851f
Merge pull request #1498 from BerriAI/litellm_spend_tracking_logs
[Feat] Proxy - Add Spend tracking logs
2024-01-18 14:21:51 -08:00
ishaan-jaff
90509a159a (fix) write team_id to key table 2024-01-18 13:54:08 -08:00
Krrish Dholakia
aef59c554f feat(parallel_request_limiter.py): add support for tpm/rpm limits 2024-01-18 13:52:15 -08:00
ishaan-jaff
42ad12b2bd (fix) support team_id for /key/generate 2024-01-18 13:48:52 -08:00
ishaan-jaff
4294657b99 (fix) use get_logging_payload 2024-01-18 13:40:48 -08:00
ishaan-jaff
ea32a8757b (feat) set team_id on virtual_keys 2024-01-18 13:34:51 -08:00
ishaan-jaff
7bdf0a0a63 (chore) cleanup utils.py 2024-01-18 13:21:51 -08:00
ishaan-jaff
73938080f2 (feat) track - api_key in spendLogs 2024-01-18 13:16:25 -08:00
Duarte OC
daa399bc60
adds s3 folder prefix to cache 2024-01-18 21:57:47 +01:00
Krrish Dholakia
1ea3833ef7 fix(parallel_request_limiter.py): decrement count for failed llm calls
https://github.com/BerriAI/litellm/issues/1477
2024-01-18 12:42:14 -08:00
ishaan-jaff
5b54bcc712 (feat) spendLogs table DynamoDB 2024-01-18 12:39:11 -08:00