| Author | Commit | Message | Date |
|---|---|---|---|
| ishaan-jaff | ce426f8b07 | (fix) s3 log cache hits | 2024-01-11 11:44:20 +05:30 |
| ishaan-jaff | cf86af46a8 | (fix) litellm.acompletion with type hints | 2024-01-11 10:47:12 +05:30 |
| Ishaan Jaff | 2433d6c613 | Merge pull request #1200 from MateoCamara/explicit-args-acomplete: feat: added explicit args to acomplete | 2024-01-11 10:39:05 +05:30 |
| Ishaan Jaff | a7371ba58d | Merge pull request #1408 from BerriAI/litellm_s3_logging_proxy: LiteLLM Proxy Add s3 Logging | 2024-01-11 10:12:16 +05:30 |
| ishaan-jaff | aef2dfbf55 | (docs) proxy - s3 logging | 2024-01-11 10:01:52 +05:30 |
| ishaan-jaff | 0b20ab7d2b | (feat) proxy - support s3_callback_params | 2024-01-11 09:57:47 +05:30 |
| ishaan-jaff | cf8dd063cf | (docs) add s3 logging to proxy | 2024-01-11 09:45:42 +05:30 |
| ishaan-jaff | f263cf51ea | (test) s3 logs for /chat/completions | 2024-01-11 09:16:06 +05:30 |
| Ishaan Jaff | 59d8abd42c | Update README.md | 2024-01-11 09:00:33 +05:30 |
| ishaan-jaff | df0f689027 | (test) s3 logging | 2024-01-11 08:58:03 +05:30 |
| ishaan-jaff | f61d8596e1 | (fix) working s3 logging | 2024-01-11 08:57:32 +05:30 |
| ishaan-jaff | e04f76ad65 | v0 | 2024-01-11 08:25:40 +05:30 |
| Ishaan Jaff | b103ca3960 | Update ghcr_deploy.yml | 2024-01-11 08:10:34 +05:30 |
| Krrish Dholakia | 8394315173 | docs(deploy.md): update docker version tags to main-latest | 2024-01-11 02:36:25 +05:30 |
| Krrish Dholakia | 65928cd5f2 | test(test_tpm_rpm_routing.py): add more logging for the test | 2024-01-11 00:43:14 +05:30 |
| Krrish Dholakia | 969594a4b1 | test(test_router.py): handle rate limiting error | 2024-01-11 00:00:17 +05:30 |
| Krrish Dholakia | 61f2fe5837 | fix(main.py): fix streaming completion token counting error | 2024-01-10 23:44:35 +05:30 |
| Krrish Dholakia | 3080f27b54 | fix(utils.py): raise correct error for azure content blocked error | 2024-01-10 23:31:51 +05:30 |
| Krrish Dholakia | b8de5636d4 | docs(quick_start.md): update docs to use correct docker image | 2024-01-10 23:31:51 +05:30 |
| ishaan-jaff | 9e4449a072 | (docs) bedrock - show bedrock/ prefix | 2024-01-10 23:07:05 +05:30 |
| ishaan-jaff | c9510ce3bf | (fix) ghcr deploy action to use latest tag | 2024-01-10 22:28:00 +05:30 |
| ishaan-jaff | 4d380a9f7d | (fix) alpine Docker image | 2024-01-10 22:18:37 +05:30 |
| ishaan-jaff | 6e19bb87e2 | (docs) proxy config - show how to set seed, temp on config.yaml | 2024-01-10 22:16:04 +05:30 |
| Krrish Dholakia | 6a8d518e44 | test(test_lowest_latency_routing.py): use the correct cache key | 2024-01-10 22:15:01 +05:30 |
| Krrish Dholakia | 5bc44353e0 | feat(proxy_cli.py): move print statements to show actually deployed port | 2024-01-10 22:09:58 +05:30 |
| ishaan-jaff | 03a0e04b0d | (docs) proxy - we now use gunicorn default | 2024-01-10 22:09:25 +05:30 |
| ishaan-jaff | 59669b4c2a | (docs) key/gen link to Deploy instructions | 2024-01-10 22:07:14 +05:30 |
| ishaan-jaff | 0d56115336 | (fix) Dockerfile use same entrypoint as Dockerfile.database | 2024-01-10 21:56:34 +05:30 |
| ishaan-jaff | 1ff9785c6b | (fix) test - moved to circe ci dockerfile | 2024-01-10 21:54:13 +05:30 |
| Krrish Dholakia | 954d1b071c | test: remove invalid arg | 2024-01-10 21:53:29 +05:30 |
| Ishaan Jaff | 58d0366447 | Merge pull request #1399 from BerriAI/litellm_default_use_gunicorn: LiteLLM Proxy - Use Gunicorn with Uvicorn workers | 2024-01-10 21:46:04 +05:30 |
| Krrish Dholakia | 9a829ff956 | refactor: cleanup duplicates | 2024-01-10 21:42:20 +05:30 |
| Krrish Dholakia | 31917176ff | fix(lowest_latency.py): fix merge issue | 2024-01-10 21:37:46 +05:30 |
| ishaan-jaff | fc9af5e900 | (fix) use Dockerfile from main | 2024-01-10 21:36:31 +05:30 |
| Krrish Dholakia | 60229eff57 | bump: version 1.16.22 → 1.17.0 | 2024-01-10 21:35:37 +05:30 |
| Krish Dholakia | 9e97227625 | Merge pull request #1403 from BerriAI/litellm_latency_routing_updates: fix(lowest_latency.py): add back tpm/rpm checks, configurable time window support, improved latency tracking | 2024-01-10 21:34:05 +05:30 |
| Krish Dholakia | 298e937586 | Merge branch 'main' into litellm_latency_routing_updates | 2024-01-10 21:33:54 +05:30 |
| Krrish Dholakia | e44d3e51aa | ci(config.yml): run prisma generate before testing | 2024-01-10 21:26:38 +05:30 |
| Krrish Dholakia | 7f269e92c5 | test(test_completion_with_retries.py): remove duplicate test | 2024-01-10 21:17:30 +05:30 |
| Krrish Dholakia | 14a65eb730 | test(test_proxy_server_keys.py): removing as this is now tested via the docker build job | 2024-01-10 21:05:12 +05:30 |
| Krrish Dholakia | 162f6f1ed3 | refactor: refactor key tests | 2024-01-10 20:58:29 +05:30 |
| Krrish Dholakia | 0d86c4ce5b | refactor: move proxy server key testing | 2024-01-10 20:56:52 +05:30 |
| Krrish Dholakia | 990c32a5d6 | test(test_router.py): handle image gen timeouts | 2024-01-10 20:56:52 +05:30 |
| Krrish Dholakia | fe632c08a4 | fix(router.py): allow user to control the latency routing time window | 2024-01-10 20:56:52 +05:30 |
| Ishaan Jaff | 2b3fc15fa9 | (ghcr) always add -latest tag on ghcr deploys | 2024-01-10 20:55:28 +05:30 |
| Krrish Dholakia | bb04a340a5 | fix(lowest_latency.py): add back tpm/rpm checks, configurable time window | 2024-01-10 20:52:01 +05:30 |
| Krish Dholakia | 5e811cfe80 | Merge pull request #1402 from BerriAI/litellm_dockerfile_testing: Litellm dockerfile testing | 2024-01-10 19:53:27 +05:30 |
| Krrish Dholakia | 186e6aaff5 | build(dockerfile.database): clean up | 2024-01-10 19:51:39 +05:30 |
| Krrish Dholakia | 33916cb87c | ci(config.yml): bring back local_testing and publish_to_pypi | 2024-01-10 19:50:42 +05:30 |
| Krrish Dholakia | 81846ee563 | build(config.yml): test fixes | 2024-01-10 19:43:11 +05:30 |