Commit graph

3353 commits

Author SHA1 Message Date
ishaan-jaff
ccfcc39681 (test) custom key/generate 2024-01-20 10:13:32 -08:00
ishaan-jaff
480f237fb5 (test) add custom_key_generate test 2024-01-20 09:59:04 -08:00
ishaan-jaff
13eb40e7bd v0 using custom_key_generate 2024-01-20 08:39:52 -08:00
Ishaan Jaff
435d4b9279
Merge pull request #1534 from BerriAI/litellm_custom_cooldown_times
[Feat] Litellm.Router set custom cooldown times
2024-01-19 20:49:17 -08:00
ishaan-jaff
ccbc47122c (fix) test caching 2024-01-19 20:45:10 -08:00
ishaan-jaff
84684c50fa (fix) router - timeout exception mapping 2024-01-19 20:30:41 -08:00
Krrish Dholakia
7cf0bb475f fix(proxy_server.py): run all endpoints through custom auth 2024-01-19 20:24:33 -08:00
Krrish Dholakia
b07677c6be fix(gemini.py): support streaming 2024-01-19 20:21:34 -08:00
Ishaan Jaff
b59e67f099
Merge pull request #1529 from BerriAI/litellm_router_add_type_hints
[Feat] Add typehints for litellm.Router
2024-01-19 19:45:54 -08:00
ishaan-jaff
16b688d1ff (feat) router - set custom cooldown times 2024-01-19 19:43:41 -08:00
ishaan-jaff
0fa9d8af82 (test) custom cooldown times - router 2024-01-19 19:41:52 -08:00
Krrish Dholakia
2c2163e4e5 fix(proxy_server.py): fix key info to handle pydantic v1 2024-01-19 18:52:09 -08:00
Krrish Dholakia
f2a8ceddc2 fix(utils.py): revert exception mapping change 2024-01-19 17:39:35 -08:00
Krrish Dholakia
f05aba1f85 fix(utils.py): add metadata to logging obj on setup, if exists 2024-01-19 17:29:47 -08:00
Krrish Dholakia
1ec5083542 fix(anthropic.py): fix pr for anthropic headers 2024-01-19 17:06:24 -08:00
Krrish Dholakia
1be4f7d2d1 fix(test_key_generate_prisma.py): fix testing 2024-01-19 16:53:46 -08:00
ishaan-jaff
91e57bd039 (fix) add router typehints 2024-01-19 16:32:43 -08:00
ishaan-jaff
a9cf6cec80 (feat) add typehints for litellm.acompletion 2024-01-19 16:05:26 -08:00
ishaan-jaff
b730482aaf v0 2024-01-19 15:49:37 -08:00
Ishaan Jaff
6134b655e8
Merge pull request #1513 from costly-ai/main
Allow overriding headers for anthropic
2024-01-19 15:21:45 -08:00
ishaan-jaff
6bc7cc46b4 (docs) router debugging 2024-01-19 15:18:00 -08:00
Ishaan Jaff
73684bc93f
Merge pull request #1525 from BerriAI/litellm_router_improvements
[Feat] Router improvements
2024-01-19 15:02:05 -08:00
ishaan-jaff
8a4f2c198b (test) test_usage_based_routing 2024-01-19 14:59:56 -08:00
Krrish Dholakia
f5ced089d6 test(tests/): add unit testing for proxy server endpoints 2024-01-19 14:54:29 -08:00
ishaan-jaff
f408162449 (test) debug logs test - router 2024-01-19 14:22:18 -08:00
ishaan-jaff
0196ac6376 (test) router-fallbacks 2024-01-19 14:12:09 -08:00
Ishaan Jaff
b2b41727ce
Merge pull request #1524 from timothyasp/langsmith-project-env-bug-fix
Langsmith: Add envs for project/run names; fix bug with None metadata
2024-01-19 14:04:16 -08:00
ishaan-jaff
d0442ae0f2 (feat) router - usage based routing - consider input_tokens 2024-01-19 13:59:49 -08:00
ishaan-jaff
57a20100e4 (test) usage based routing 2024-01-19 13:58:08 -08:00
ishaan-jaff
8c0b7b1015 (feat) - improve router logging/debugging messages 2024-01-19 13:57:33 -08:00
ishaan-jaff
912e2de364 (test) usage based router + fallbacks 2024-01-19 12:34:08 -08:00
ishaan-jaff
7b2c15aa51 (feat) improve litellm.Router logging 2024-01-19 12:28:51 -08:00
Tim Asp
9d6a8b5de3 Cleaner fallback 2024-01-19 11:56:45 -08:00
Ishaan Jaff
650036071f
Merge pull request #1519 from BerriAI/litellm_proxy_make_success_handler_non_blocking
[Feat] litellm.acompletion() make Langfuse success handler non blocking
2024-01-19 11:41:00 -08:00
ishaan-jaff
8cf8da1378 (test) langfuse_latency_test_user 2024-01-19 11:38:44 -08:00
ishaan-jaff
cb99cd18e7 (feat) log cache_hit as langfuse tags 2024-01-19 11:38:18 -08:00
Tim Asp
27087e5662 fix error when metadata is none, prevents logging silently 2024-01-19 11:37:45 -08:00
Tim Asp
141d2219e2 Add env for project wide langsmith default project settings 2024-01-19 11:37:15 -08:00
ishaan-jaff
2f429f37b7 (test) test latency added with langfuse call 2024-01-19 11:28:10 -08:00
ishaan-jaff
cb40f58cd3 (fix) return usage in mock_completion 2024-01-19 11:25:47 -08:00
ishaan-jaff
6a695477ba (fix) async langfuse logger 2024-01-19 10:44:51 -08:00
Krrish Dholakia
1a29272b47 fix(parallel_request_limiter.py): handle tpm/rpm limits being null 2024-01-19 10:22:27 -08:00
ishaan-jaff
f2cfb76920 (fix) use asyncio run_in_executor 2024-01-19 09:52:51 -08:00
ishaan-jaff
e6b5152e63 (chore) update load test 2024-01-19 08:52:17 -08:00
ishaan-jaff
cd08a02764 (test) add blocking callback test 2024-01-19 08:51:52 -08:00
ishaan-jaff
a9c5b02303 (v0) fix 2024-01-19 08:51:14 -08:00
Krrish Dholakia
c5e144af23 docs(health.md): add /health/readiness and /health/liveliness to docs 2024-01-19 08:45:23 -08:00
Ishaan Jaff
8fef1a1b69
Merge pull request #1499 from duarteocarmo/patch-2
Adds s3_path prefix so that we can save objects in predifined location in s3 bucket
2024-01-18 21:22:15 -08:00
Ishaan Jaff
8338d04d1a
Merge pull request #1506 from HaruHunab1320/main
add headers to budget manager
2024-01-18 21:20:45 -08:00
Ishaan Jaff
79e261f533
Merge pull request #1509 from BerriAI/litellm_track_cost_user_id_chat_completions
[Feat] Proxy - Track Cost Per User (Using `user` passed to requests)
2024-01-18 20:44:02 -08:00