ishaan-jaff
|
db002315e3
|
(feat) print debug info per deployment
|
2024-03-07 18:33:09 -08:00 |
|
Ishaan Jaff
|
7bb515a75b
|
Merge pull request #2388 from BerriAI/litellm_docs_show_how_to_set_lb_config
(docs) setting load balancing config
|
2024-03-07 12:23:40 -08:00 |
|
ishaan-jaff
|
9430301ae4
|
(docs) setting load balancing config
|
2024-03-07 12:22:39 -08:00 |
|
Krish Dholakia
|
228f6ea06f
|
Merge pull request #2386 from BerriAI/litellm_reintegrate_s3_testing
test: reintegrate s3 testing
|
2024-03-07 09:16:02 -08:00 |
|
Krrish Dholakia
|
807bf854c3
|
test: reintegrate s3 testing
|
2024-03-07 08:56:59 -08:00 |
|
Krrish Dholakia
|
6f7d3080af
|
test(test_router_caching.py): add more verbose logs
|
2024-03-07 08:37:00 -08:00 |
|
Krrish Dholakia
|
e0e1364cb6
|
docs(team_based_routing.md): add docs on team based routing
|
2024-03-07 08:14:46 -08:00 |
|
Krrish Dholakia
|
b035222a6a
|
bump: version 1.30.0 → 1.30.1
|
2024-03-07 07:57:13 -08:00 |
|
Krrish Dholakia
|
3c414c6357
|
fix(proxy_server.py): fix model alias map + add back testing
|
2024-03-07 07:56:51 -08:00 |
|
Krrish Dholakia
|
4185a262ed
|
test: increase time before checking budget reset - avoid deadlocking
|
2024-03-06 22:16:59 -08:00 |
|
Krrish Dholakia
|
fd5f0ed27c
|
fix(lowest_tpm_rpm.py): handle async scenarios
|
2024-03-06 21:38:30 -08:00 |
|
Krrish Dholakia
|
badd8cf7ef
|
fix(utils.py): fix google ai studio timeout error raising
|
2024-03-06 21:12:04 -08:00 |
|
Krrish Dholakia
|
c2ba09a1ae
|
bump: version 1.29.7 → 1.30.0
|
2024-03-06 21:04:46 -08:00 |
|
Krish Dholakia
|
ede9647e49
|
Merge pull request #2377 from BerriAI/litellm_team_level_model_groups
feat(proxy_server.py): team based model aliases
|
2024-03-06 21:03:53 -08:00 |
|
Krrish Dholakia
|
995c31db84
|
fix(utils.py): fix get optional param embeddings
|
2024-03-06 20:47:05 -08:00 |
|
ishaan-jaff
|
e1e9c6dbd2
|
(test) ci/cd run again
|
2024-03-06 20:40:27 -08:00 |
|
ishaan-jaff
|
60b2e3c7e6
|
(fix) vertex_ai test_vertex_projects optional params embedding
|
2024-03-06 20:33:25 -08:00 |
|
Krrish Dholakia
|
9acdae4349
|
test(test_completion.py): fix test
|
2024-03-06 20:13:09 -08:00 |
|
Krrish Dholakia
|
df0eb170e6
|
fix(proxy_server.py): fix sql query
|
2024-03-06 19:41:12 -08:00 |
|
Krish Dholakia
|
d0dec7fc71
|
Merge pull request #2379 from BerriAI/litellm_s3_bucket_folder_path
fix(caching.py): add s3 path as a top-level param
|
2024-03-06 19:35:46 -08:00 |
|
Krish Dholakia
|
050a056e09
|
Merge pull request #2347 from BerriAI/litellm_retry_rate_limited_requests
feat(proxy_server.py): retry if virtual key is rate limited
|
2024-03-06 19:23:11 -08:00 |
|
Krrish Dholakia
|
6346839574
|
test(test_parallel_request_limiter.py): add more verbose logging
|
2024-03-06 19:21:57 -08:00 |
|
Krrish Dholakia
|
6029be00d1
|
test(test_completion.py): temporary patch for wikipedia get image issue
|
2024-03-06 19:07:38 -08:00 |
|
Krrish Dholakia
|
ff279ec77b
|
test(test_completion.py): handle gemini timeout error
|
2024-03-06 19:05:39 -08:00 |
|
Krrish Dholakia
|
82b8a227ed
|
build(schema.prisma): add support for team-based model aliases
|
2024-03-06 18:55:44 -08:00 |
|
Krrish Dholakia
|
66a8dc850f
|
fix(factory.py): retry failed get request
|
2024-03-06 18:53:30 -08:00 |
|
ishaan-jaff
|
45d03f1901
|
(test) fix replicate test
|
2024-03-06 18:12:24 -08:00 |
|
Krrish Dholakia
|
12d663d693
|
fix(caching.py): add s3 path as a top-level param
|
2024-03-06 18:07:28 -08:00 |
|
Ishaan Jaff
|
7163db789d
|
Merge pull request #2375 from BerriAI/litellm_default_docker
[Fix] Switch off detailed_debug in default docker
|
2024-03-06 18:00:38 -08:00 |
|
Ishaan Jaff
|
7fb673ee46
|
Merge pull request #2378 from BerriAI/litellm_fix_dict_changed_size_during_iteration
[FIX] 🐛 embedding - "Dictionary changed size during iteration" Debug Log
|
2024-03-06 17:58:51 -08:00 |
|
ishaan-jaff
|
47174c106c
|
(fix) dict changed size during iteration
|
2024-03-06 17:53:01 -08:00 |
|
Krrish Dholakia
|
7bfadc258e
|
feat(proxy_server.py): team based model aliases
allow setting model aliases at a team level (e.g. route all 'gpt-3.5-turbo' requests from team-1 to model-deployment-group-2)
|
2024-03-06 17:42:08 -08:00 |
|
ishaan-jaff
|
ed04124802
|
(docs) best practices for high traffic
|
2024-03-06 16:36:35 -08:00 |
|
ishaan-jaff
|
5a91fafcc5
|
(feat) don't use --detailed_debug on all default litellm images
|
2024-03-06 16:31:32 -08:00 |
|
ishaan-jaff
|
a531aa998b
|
bump: version 1.29.6 → 1.29.7
|
2024-03-06 15:42:16 -08:00 |
|
ishaan-jaff
|
6e8d135bdb
|
bump: version 1.29.5 → 1.29.6
|
2024-03-06 15:42:16 -08:00 |
|
Ishaan Jaff
|
bc0cef53c5
|
Merge pull request #2371 from BerriAI/litellm_fix_user_new_swagger
(fix) admin UI swagger
|
2024-03-06 15:41:34 -08:00 |
|
ishaan-jaff
|
b6f3eb1434
|
(fix) remove unused endpoint
|
2024-03-06 15:40:22 -08:00 |
|
ishaan-jaff
|
b0575bdcf0
|
(fix) admin UI swagger
|
2024-03-06 14:01:39 -08:00 |
|
Krrish Dholakia
|
7f4dd734c1
|
fix(vertex_ai.py): correctly parse optional params and pass vertex ai project
|
2024-03-06 14:00:50 -08:00 |
|
Ishaan Jaff
|
894f2f7a91
|
Merge pull request #2367 from BerriAI/litellm_admin_ui_fixes
(feat) admin UI show model avg latency, num requests
|
2024-03-06 13:30:59 -08:00 |
|
ishaan-jaff
|
ddad27e71b
|
(feat) admin UI show model avg latency, num requests
|
2024-03-06 12:59:09 -08:00 |
|
Ishaan Jaff
|
b721ef9152
|
Merge pull request #2363 from BerriAI/litellm_handle_circular_ref
(Fix) High Traffic Fix - handle litellm circular ref error
|
2024-03-06 12:38:01 -08:00 |
|
ishaan-jaff
|
0ea0566f20
|
(fix) high traffic langfuse, s3
|
2024-03-06 12:22:52 -08:00 |
|
ishaan-jaff
|
ffd23ef161
|
(fix) high traffic langfuse logging
|
2024-03-06 12:17:59 -08:00 |
|
ishaan-jaff
|
48f6189760
|
(feat) circular ref error on prisma
|
2024-03-06 12:08:22 -08:00 |
|
ishaan-jaff
|
8a75c4c3a3
|
(fix) circular ref error h
|
2024-03-06 12:02:44 -08:00 |
|
Krrish Dholakia
|
411f21e1f5
|
fix(factory.py): support image url requests for anthropic
|
2024-03-06 11:09:50 -08:00 |
|
ishaan-jaff
|
596f415f6b
|
(feat) handle litellm circular ref error
|
2024-03-06 10:21:25 -08:00 |
|
Krrish Dholakia
|
43c0d31ea6
|
fix(utils.py): set status code for api error
|
2024-03-05 21:37:59 -08:00 |
|