David Leen
|
a674de8f36
|
improve bedrock exception granularity
|
2024-01-12 16:38:55 +01:00 |
|
David Leen
|
8b021fc4cd
|
[bug] unbound variable in bedrock
note: the code was written as `json.dumps({})` even though it is more verbose in order to facilitate easier refactoring in the future
fixes #1428
|
2024-01-12 12:33:00 +01:00 |
|
XD3000/高瑞雲
|
574d042655
|
#1424:gunicorn can't run in windows
|
2024-01-12 14:30:23 +08:00 |
|
Krrish Dholakia
|
352f943dcf
|
fix(dynamo_db.py): don't auto-create tables, allow database_type == 'dynamodb'
|
2024-01-12 11:33:40 +05:30 |
|
Krrish Dholakia
|
51110bfb62
|
fix(main.py): support text completion routing
|
2024-01-12 11:24:31 +05:30 |
|
Krrish Dholakia
|
d895979065
|
test(test_health_check.py): fix test
|
2024-01-12 00:21:11 +05:30 |
|
Krrish Dholakia
|
0cbdec563b
|
refactor(main.py): trigger new release
|
2024-01-12 00:14:12 +05:30 |
|
Krrish Dholakia
|
a7f182b8ec
|
fix(azure.py): support health checks to text completion endpoints
|
2024-01-12 00:13:01 +05:30 |
|
Krrish Dholakia
|
f94a37a836
|
fix(dynamo_db.py): add cost tracking support for key + user
|
2024-01-11 23:56:41 +05:30 |
|
ishaan-jaff
|
b7567865de
|
(test) caching for bedrock/embedding str inputs
|
2024-01-11 23:12:57 +05:30 |
|
ishaan-jaff
|
276d11946e
|
(test) bedrock - embedding with strings
|
2024-01-11 23:04:41 +05:30 |
|
ishaan-jaff
|
a9d812eb8d
|
(fix) bedrock - embedding - support str input
|
2024-01-11 23:02:12 +05:30 |
|
Krrish Dholakia
|
9b3d78c4f3
|
fix(dynamo_db.py): if table create fails, tell user what the table + hash key needs to be
|
2024-01-11 23:01:28 +05:30 |
|
ishaan-jaff
|
a876748bf5
|
v0
|
2024-01-11 22:56:18 +05:30 |
|
Ishaan Jaff
|
d181bd22a7
|
Merge pull request #1422 from dleen/httpx
(fix) create httpx.Request instead of httpx.request
|
2024-01-11 22:31:55 +05:30 |
|
David Leen
|
6b87c13b9d
|
(fix) create httpx.Request instead of httpx.request
fixes #1420
|
2024-01-11 16:22:26 +01:00 |
|
Krrish Dholakia
|
43533812a7
|
fix(proxy_cli.py): read db url from config, not just environment
|
2024-01-11 19:19:29 +05:30 |
|
ishaan-jaff
|
1fb3547e48
|
(feat) improve litellm verbose logs
|
2024-01-11 18:13:08 +05:30 |
|
ishaan-jaff
|
f297a4d174
|
(feat) show args passed to litellm.completion, acompletion on call
|
2024-01-11 17:56:27 +05:30 |
|
ishaan-jaff
|
bb8eac0597
|
(test) improve s3 logging test
|
2024-01-11 16:57:51 +05:30 |
|
Ishaan Jaff
|
e5b491b39f
|
Merge pull request #1413 from BerriAI/litellm_log_cache_hits
[Feat] Proxy - Log Cache Hits on success callbacks + Testing
|
2024-01-11 16:39:22 +05:30 |
|
ishaan-jaff
|
1d9dad4af4
|
(feat) s3 logging - log cache hits
|
2024-01-11 15:57:54 +05:30 |
|
ishaan-jaff
|
4a1541c485
|
(fix) retry gemini-pro-vision 3 times
|
2024-01-11 14:39:08 +05:30 |
|
ishaan-jaff
|
f89385eed8
|
(fix) acompletion kwargs type hints
|
2024-01-11 14:22:37 +05:30 |
|
Krish Dholakia
|
40054f89b5
|
Merge pull request #1415 from BerriAI/litellm_bump_httpx_pool_limits
fix(router.py): bump httpx pool limits
|
2024-01-11 13:46:31 +05:30 |
|
Krrish Dholakia
|
40c7400894
|
fix(router.py): bump httpx pool limits
|
2024-01-11 12:51:29 +05:30 |
|
ishaan-jaff
|
bd5a14daf6
|
(fix) acompletion typehints - pass kwargs
|
2024-01-11 11:49:55 +05:30 |
|
ishaan-jaff
|
cc78e003bf
|
(test) s3 log cache hits
|
2024-01-11 11:44:48 +05:30 |
|
ishaan-jaff
|
ce426f8b07
|
(fix) s3 log cache hits
|
2024-01-11 11:44:20 +05:30 |
|
ishaan-jaff
|
cf86af46a8
|
(fix) litellm.acompletion with type hints
|
2024-01-11 10:47:12 +05:30 |
|
Ishaan Jaff
|
2433d6c613
|
Merge pull request #1200 from MateoCamara/explicit-args-acomplete
feat: added explicit args to acomplete
|
2024-01-11 10:39:05 +05:30 |
|
ishaan-jaff
|
0b20ab7d2b
|
(feat) proxy - support s3_callback_params
|
2024-01-11 09:57:47 +05:30 |
|
ishaan-jaff
|
f263cf51ea
|
(test) s3 logs for /chat/completions
|
2024-01-11 09:16:06 +05:30 |
|
ishaan-jaff
|
df0f689027
|
(test) s3 logging
|
2024-01-11 08:58:03 +05:30 |
|
ishaan-jaff
|
f61d8596e1
|
(fix) working s3 logging
|
2024-01-11 08:57:32 +05:30 |
|
ishaan-jaff
|
e04f76ad65
|
v0
|
2024-01-11 08:25:40 +05:30 |
|
Krrish Dholakia
|
65928cd5f2
|
test(test_tpm_rpm_routing.py): add more logging for the test
|
2024-01-11 00:43:14 +05:30 |
|
Krrish Dholakia
|
969594a4b1
|
test(test_router.py): handle rate limiting error
|
2024-01-11 00:00:17 +05:30 |
|
Krrish Dholakia
|
61f2fe5837
|
fix(main.py): fix streaming completion token counting error
|
2024-01-10 23:44:35 +05:30 |
|
Krrish Dholakia
|
3080f27b54
|
fix(utils.py): raise correct error for azure content blocked error
|
2024-01-10 23:31:51 +05:30 |
|
Krrish Dholakia
|
6a8d518e44
|
test(test_lowest_latency_routing.py): use the correct cache key
|
2024-01-10 22:15:01 +05:30 |
|
Krrish Dholakia
|
5bc44353e0
|
feat(proxy_cli.py): move print statements to show actually deployed port
|
2024-01-10 22:09:58 +05:30 |
|
ishaan-jaff
|
1ff9785c6b
|
(fix) test - moved to circe ci dockerfile
|
2024-01-10 21:54:13 +05:30 |
|
Krrish Dholakia
|
954d1b071c
|
test: remove invalid arg
|
2024-01-10 21:53:29 +05:30 |
|
Ishaan Jaff
|
58d0366447
|
Merge pull request #1399 from BerriAI/litellm_default_use_gunicorn
LiteLLM Proxy - Use Gunicorn with Uvicorn workers
|
2024-01-10 21:46:04 +05:30 |
|
Krrish Dholakia
|
9a829ff956
|
refactor: cleanup duplicates
|
2024-01-10 21:42:20 +05:30 |
|
Krrish Dholakia
|
31917176ff
|
fix(lowest_latency.py): fix merge issue
|
2024-01-10 21:37:46 +05:30 |
|
Krish Dholakia
|
298e937586
|
Merge branch 'main' into litellm_latency_routing_updates
|
2024-01-10 21:33:54 +05:30 |
|
Krrish Dholakia
|
7f269e92c5
|
test(test_completion_with_retries.py): remove duplicate test
|
2024-01-10 21:17:30 +05:30 |
|
Krrish Dholakia
|
14a65eb730
|
test(test_proxy_server_keys.py): removing as this is now tested via the docker build job
|
2024-01-10 21:05:12 +05:30 |
|