ishaan-jaff
|
d0442ae0f2
|
(feat) router - usage based routing - consider input_tokens
|
2024-01-19 13:59:49 -08:00 |
|
ishaan-jaff
|
57a20100e4
|
(test) usage based routing
|
2024-01-19 13:58:08 -08:00 |
|
ishaan-jaff
|
8c0b7b1015
|
(feat) - improve router logging/debugging messages
|
2024-01-19 13:57:33 -08:00 |
|
ishaan-jaff
|
912e2de364
|
(test) usage based router + fallbacks
|
2024-01-19 12:34:08 -08:00 |
|
ishaan-jaff
|
7b2c15aa51
|
(feat) improve litellm.Router logging
|
2024-01-19 12:28:51 -08:00 |
|
Tim Asp
|
9d6a8b5de3
|
Cleaner fallback
|
2024-01-19 11:56:45 -08:00 |
|
Ishaan Jaff
|
650036071f
|
Merge pull request #1519 from BerriAI/litellm_proxy_make_success_handler_non_blocking
[Feat] litellm.acompletion() make Langfuse success handler non blocking
|
2024-01-19 11:41:00 -08:00 |
|
ishaan-jaff
|
8cf8da1378
|
(test) langfuse_latency_test_user
|
2024-01-19 11:38:44 -08:00 |
|
ishaan-jaff
|
cb99cd18e7
|
(feat) log cache_hit as langfuse tags
|
2024-01-19 11:38:18 -08:00 |
|
Tim Asp
|
27087e5662
|
fix error when metadata is none, prevents logging silently
|
2024-01-19 11:37:45 -08:00 |
|
Tim Asp
|
141d2219e2
|
Add env for project wide langsmith default project settings
|
2024-01-19 11:37:15 -08:00 |
|
ishaan-jaff
|
2f429f37b7
|
(test) test latency added with langfuse call
|
2024-01-19 11:28:10 -08:00 |
|
ishaan-jaff
|
cb40f58cd3
|
(fix) return usage in mock_completion
|
2024-01-19 11:25:47 -08:00 |
|
ishaan-jaff
|
6a695477ba
|
(fix) async langfuse logger
|
2024-01-19 10:44:51 -08:00 |
|
Krrish Dholakia
|
1a29272b47
|
fix(parallel_request_limiter.py): handle tpm/rpm limits being null
|
2024-01-19 10:22:27 -08:00 |
|
ishaan-jaff
|
f2cfb76920
|
(fix) use asyncio run_in_executor
|
2024-01-19 09:52:51 -08:00 |
|
ishaan-jaff
|
e6b5152e63
|
(chore) update load test
|
2024-01-19 08:52:17 -08:00 |
|
ishaan-jaff
|
cd08a02764
|
(test) add blocking callback test
|
2024-01-19 08:51:52 -08:00 |
|
ishaan-jaff
|
a9c5b02303
|
(v0) fix
|
2024-01-19 08:51:14 -08:00 |
|
Krrish Dholakia
|
c5e144af23
|
docs(health.md): add /health/readiness and /health/liveliness to docs
|
2024-01-19 08:45:23 -08:00 |
|
Ishaan Jaff
|
8fef1a1b69
|
Merge pull request #1499 from duarteocarmo/patch-2
Adds s3_path prefix so that we can save objects in predifined location in s3 bucket
|
2024-01-18 21:22:15 -08:00 |
|
Ishaan Jaff
|
8338d04d1a
|
Merge pull request #1506 from HaruHunab1320/main
add headers to budget manager
|
2024-01-18 21:20:45 -08:00 |
|
Ishaan Jaff
|
79e261f533
|
Merge pull request #1509 from BerriAI/litellm_track_cost_user_id_chat_completions
[Feat] Proxy - Track Cost Per User (Using `user` passed to requests)
|
2024-01-18 20:44:02 -08:00 |
|
Keegan McCallum
|
3b719b2afd
|
Allow overriding headers for anthropic
|
2024-01-18 20:12:59 -08:00 |
|
Krrish Dholakia
|
f7694bc193
|
Merge branch 'main' into litellm_tpm_rpm_rate_limits
|
2024-01-18 19:10:07 -08:00 |
|
ishaan-jaff
|
5698be0df1
|
(fix) safe access litellm_params, proxy_server_request
|
2024-01-18 18:05:51 -08:00 |
|
Krrish Dholakia
|
94ce524c63
|
test(test_completion.py): handle together ai timeout
|
2024-01-18 17:54:16 -08:00 |
|
ishaan-jaff
|
16f3d7e0ed
|
(feat) use user_id passed to request - cost track
|
2024-01-18 17:51:48 -08:00 |
|
ishaan-jaff
|
697c511e76
|
(feat) support user param for all providers
|
2024-01-18 17:45:59 -08:00 |
|
ishaan-jaff
|
ddd9ca86a7
|
(feat) proxy - track cost for user_ids that do not exist
|
2024-01-18 17:44:39 -08:00 |
|
Krrish Dholakia
|
e0aaa94f28
|
fix(main.py): read azure ad token from optional params extra body
|
2024-01-18 17:14:03 -08:00 |
|
Krrish Dholakia
|
1e5efdfa37
|
fix(proxy_server.py): support setting tpm/rpm limits per user / per key
|
2024-01-18 17:03:18 -08:00 |
|
Ishaan Jaff
|
a8ba5df90e
|
Merge pull request #1500 from BerriAI/litellm_create_keys_with_team_id
[Feat] /key/generate - create keys with`team_id`
|
2024-01-18 16:35:14 -08:00 |
|
Jakob
|
7ba7272a9a
|
add headers to budget manager
|
2024-01-18 16:10:45 -08:00 |
|
Krrish Dholakia
|
5dac2402ef
|
test(test_parallel_request_limiter.py): unit testing for tpm/rpm rate limits
|
2024-01-18 15:28:28 -08:00 |
|
Krrish Dholakia
|
22c3321208
|
test(test_parallel_request_limiter.py): unit testing for tpm/rpm rate limits
|
2024-01-18 15:25:29 -08:00 |
|
ishaan-jaff
|
340706565f
|
(fix) add team_id to doc string
|
2024-01-18 15:23:05 -08:00 |
|
ishaan-jaff
|
2b6972111e
|
(feat) write team_id to User Table
|
2024-01-18 14:42:46 -08:00 |
|
ishaan-jaff
|
5beef6dbcd
|
(test) setting team_id
|
2024-01-18 14:33:13 -08:00 |
|
Ishaan Jaff
|
a26267851f
|
Merge pull request #1498 from BerriAI/litellm_spend_tracking_logs
[Feat] Proxy - Add Spend tracking logs
|
2024-01-18 14:21:51 -08:00 |
|
ishaan-jaff
|
90509a159a
|
(fix) write team_id to key table
|
2024-01-18 13:54:08 -08:00 |
|
Krrish Dholakia
|
aef59c554f
|
feat(parallel_request_limiter.py): add support for tpm/rpm limits
|
2024-01-18 13:52:15 -08:00 |
|
ishaan-jaff
|
42ad12b2bd
|
(fix) support team_id for /key/generate
|
2024-01-18 13:48:52 -08:00 |
|
ishaan-jaff
|
4294657b99
|
(fix) use get_logging_payload
|
2024-01-18 13:40:48 -08:00 |
|
ishaan-jaff
|
ea32a8757b
|
(feat) set team_id on virtual_keys
|
2024-01-18 13:34:51 -08:00 |
|
ishaan-jaff
|
7bdf0a0a63
|
(chore) cleanup utils.py
|
2024-01-18 13:21:51 -08:00 |
|
ishaan-jaff
|
73938080f2
|
(feat) track - api_key in spendLogs
|
2024-01-18 13:16:25 -08:00 |
|
Duarte OC
|
daa399bc60
|
adds s3 folder prefix to cache
|
2024-01-18 21:57:47 +01:00 |
|
Krrish Dholakia
|
1ea3833ef7
|
fix(parallel_request_limiter.py): decrement count for failed llm calls
https://github.com/BerriAI/litellm/issues/1477
|
2024-01-18 12:42:14 -08:00 |
|
ishaan-jaff
|
5b54bcc712
|
(feat) spendLogs table DynamoDB
|
2024-01-18 12:39:11 -08:00 |
|