Commit graph

5570 commits

Author SHA1 Message Date
David Leen
d14099f9b4 Add explicit dependency on requests library 2024-01-11 16:20:50 +01:00
ishaan-jaff
1e80c1fd00 bump: version 1.17.0 → 1.17.1 2024-01-11 17:17:16 +05:30
ishaan-jaff
bb8eac0597 (test) improve s3 logging test 2024-01-11 16:57:51 +05:30
Ishaan Jaff
e5b491b39f
Merge pull request #1413 from BerriAI/litellm_log_cache_hits
[Feat] Proxy - Log Cache Hits on success callbacks + Testing
2024-01-11 16:39:22 +05:30
ishaan-jaff
1d9dad4af4 (feat) s3 logging - log cache hits 2024-01-11 15:57:54 +05:30
ishaan-jaff
c46a370919 (docs) logging proxy input / output 2024-01-11 15:37:03 +05:30
ishaan-jaff
4a1541c485 (fix) retry gemini-pro-vision 3 times 2024-01-11 14:39:08 +05:30
ishaan-jaff
f89385eed8 (fix) acompletion kwargs type hints 2024-01-11 14:22:37 +05:30
Krish Dholakia
40054f89b5
Merge pull request #1415 from BerriAI/litellm_bump_httpx_pool_limits
fix(router.py): bump httpx pool limits
2024-01-11 13:46:31 +05:30
Krrish Dholakia
40c7400894 fix(router.py): bump httpx pool limits 2024-01-11 12:51:29 +05:30
ishaan-jaff
bd5a14daf6 (fix) acompletion typehints - pass kwargs 2024-01-11 11:49:55 +05:30
ishaan-jaff
cc78e003bf (test) s3 log cache hits 2024-01-11 11:44:48 +05:30
ishaan-jaff
ce426f8b07 (fix) s3 log cache hits 2024-01-11 11:44:20 +05:30
ishaan-jaff
cf86af46a8 (fix) litellm.acompletion with type hints 2024-01-11 10:47:12 +05:30
Ishaan Jaff
2433d6c613
Merge pull request #1200 from MateoCamara/explicit-args-acomplete
feat: added explicit args to acomplete
2024-01-11 10:39:05 +05:30
Ishaan Jaff
a7371ba58d
Merge pull request #1408 from BerriAI/litellm_s3_logging_proxy
LiteLLM Proxy Add s3 Logging
2024-01-11 10:12:16 +05:30
ishaan-jaff
aef2dfbf55 (docs) proxy - s3 logging 2024-01-11 10:01:52 +05:30
ishaan-jaff
0b20ab7d2b (feat) proxy - support s3_callback_params 2024-01-11 09:57:47 +05:30
ishaan-jaff
cf8dd063cf (docs) add s3 logging to proxy 2024-01-11 09:45:42 +05:30
ishaan-jaff
f263cf51ea (test) s3 logs for /chat/completions 2024-01-11 09:16:06 +05:30
Ishaan Jaff
59d8abd42c
Update README.md 2024-01-11 09:00:33 +05:30
ishaan-jaff
df0f689027 (test) s3 logging 2024-01-11 08:58:03 +05:30
ishaan-jaff
f61d8596e1 (fix) working s3 logging 2024-01-11 08:57:32 +05:30
ishaan-jaff
e04f76ad65 v0 2024-01-11 08:25:40 +05:30
Ishaan Jaff
b103ca3960
Update ghcr_deploy.yml 2024-01-11 08:10:34 +05:30
Krrish Dholakia
8394315173 docs(deploy.md): update docker version tags to main-latest 2024-01-11 02:36:25 +05:30
Krrish Dholakia
65928cd5f2 test(test_tpm_rpm_routing.py): add more logging for the test 2024-01-11 00:43:14 +05:30
Krrish Dholakia
969594a4b1 test(test_router.py): handle rate limiting error 2024-01-11 00:00:17 +05:30
Krrish Dholakia
61f2fe5837 fix(main.py): fix streaming completion token counting error 2024-01-10 23:44:35 +05:30
Krrish Dholakia
3080f27b54 fix(utils.py): raise correct error for azure content blocked error 2024-01-10 23:31:51 +05:30
Krrish Dholakia
b8de5636d4 docs(quick_start.md): update docs to use correct docker image 2024-01-10 23:31:51 +05:30
ishaan-jaff
9e4449a072 (docs) bedrock - show bedrock/ prefix 2024-01-10 23:07:05 +05:30
ishaan-jaff
c9510ce3bf (fix) ghcr deploy action to use latest tag 2024-01-10 22:28:00 +05:30
ishaan-jaff
4d380a9f7d (fix) alpine Docker image 2024-01-10 22:18:37 +05:30
ishaan-jaff
6e19bb87e2 (docs) proxy config - show how to set seed, temp on config.yaml 2024-01-10 22:16:04 +05:30
Krrish Dholakia
6a8d518e44 test(test_lowest_latency_routing.py): use the correct cache key 2024-01-10 22:15:01 +05:30
Krrish Dholakia
5bc44353e0 feat(proxy_cli.py): move print statements to show actually deployed port 2024-01-10 22:09:58 +05:30
ishaan-jaff
03a0e04b0d (docs) proxy - we now use gunicorn default 2024-01-10 22:09:25 +05:30
ishaan-jaff
59669b4c2a (docs) key/gen link to Deploy instructions 2024-01-10 22:07:14 +05:30
ishaan-jaff
0d56115336 (fix) Dockerfile use same entrypoint as Dockerfile.database 2024-01-10 21:56:34 +05:30
ishaan-jaff
1ff9785c6b (fix) test - moved to circe ci dockerfile 2024-01-10 21:54:13 +05:30
Krrish Dholakia
954d1b071c test: remove invalid arg 2024-01-10 21:53:29 +05:30
Ishaan Jaff
58d0366447
Merge pull request #1399 from BerriAI/litellm_default_use_gunicorn
LiteLLM Proxy - Use Gunicorn with Uvicorn workers
2024-01-10 21:46:04 +05:30
Krrish Dholakia
9a829ff956 refactor: cleanup duplicates 2024-01-10 21:42:20 +05:30
Krrish Dholakia
31917176ff fix(lowest_latency.py): fix merge issue 2024-01-10 21:37:46 +05:30
ishaan-jaff
fc9af5e900 (fix) use Dockerfile from main 2024-01-10 21:36:31 +05:30
Krrish Dholakia
60229eff57 bump: version 1.16.22 → 1.17.0 2024-01-10 21:35:37 +05:30
Krish Dholakia
9e97227625
Merge pull request #1403 from BerriAI/litellm_latency_routing_updates
fix(lowest_latency.py): add back tpm/rpm checks, configurable time window support, improved latency tracking
2024-01-10 21:34:05 +05:30
Krish Dholakia
298e937586
Merge branch 'main' into litellm_latency_routing_updates 2024-01-10 21:33:54 +05:30
Krrish Dholakia
e44d3e51aa ci(config.yml): run prisma generate before testing 2024-01-10 21:26:38 +05:30