litellm

Author	SHA1	Message	Date
David Leen	d14099f9b4	Add explicit dependency on requests library	2024-01-11 16:20:50 +01:00
ishaan-jaff	1e80c1fd00	bump: version 1.17.0 → 1.17.1	2024-01-11 17:17:16 +05:30
ishaan-jaff	bb8eac0597	(test) improve s3 logging test	2024-01-11 16:57:51 +05:30
Ishaan Jaff	e5b491b39f	Merge pull request #1413 from BerriAI/litellm_log_cache_hits [Feat] Proxy - Log Cache Hits on success callbacks + Testing	2024-01-11 16:39:22 +05:30
ishaan-jaff	1d9dad4af4	(feat) s3 logging - log cache hits	2024-01-11 15:57:54 +05:30
ishaan-jaff	c46a370919	(docs) logging proxy input / output	2024-01-11 15:37:03 +05:30
ishaan-jaff	4a1541c485	(fix) retry gemini-pro-vision 3 times	2024-01-11 14:39:08 +05:30
ishaan-jaff	f89385eed8	(fix) acompletion kwargs type hints	2024-01-11 14:22:37 +05:30
Krish Dholakia	40054f89b5	Merge pull request #1415 from BerriAI/litellm_bump_httpx_pool_limits fix(router.py): bump httpx pool limits	2024-01-11 13:46:31 +05:30
Krrish Dholakia	40c7400894	fix(router.py): bump httpx pool limits	2024-01-11 12:51:29 +05:30
ishaan-jaff	bd5a14daf6	(fix) acompletion typehints - pass kwargs	2024-01-11 11:49:55 +05:30
ishaan-jaff	cc78e003bf	(test) s3 log cache hits	2024-01-11 11:44:48 +05:30
ishaan-jaff	ce426f8b07	(fix) s3 log cache hits	2024-01-11 11:44:20 +05:30
ishaan-jaff	cf86af46a8	(fix) litellm.acompletion with type hints	2024-01-11 10:47:12 +05:30
Ishaan Jaff	2433d6c613	Merge pull request #1200 from MateoCamara/explicit-args-acomplete feat: added explicit args to acomplete	2024-01-11 10:39:05 +05:30
Ishaan Jaff	a7371ba58d	Merge pull request #1408 from BerriAI/litellm_s3_logging_proxy LiteLLM Proxy Add s3 Logging	2024-01-11 10:12:16 +05:30
ishaan-jaff	aef2dfbf55	(docs) proxy - s3 logging	2024-01-11 10:01:52 +05:30
ishaan-jaff	0b20ab7d2b	(feat) proxy - support s3_callback_params	2024-01-11 09:57:47 +05:30
ishaan-jaff	cf8dd063cf	(docs) add s3 logging to proxy	2024-01-11 09:45:42 +05:30
ishaan-jaff	f263cf51ea	(test) s3 logs for /chat/completions	2024-01-11 09:16:06 +05:30
Ishaan Jaff	59d8abd42c	Update README.md	2024-01-11 09:00:33 +05:30
ishaan-jaff	df0f689027	(test) s3 logging	2024-01-11 08:58:03 +05:30
ishaan-jaff	f61d8596e1	(fix) working s3 logging	2024-01-11 08:57:32 +05:30
ishaan-jaff	e04f76ad65	v0	2024-01-11 08:25:40 +05:30
Ishaan Jaff	b103ca3960	Update ghcr_deploy.yml	2024-01-11 08:10:34 +05:30
Krrish Dholakia	8394315173	docs(deploy.md): update docker version tags to main-latest	2024-01-11 02:36:25 +05:30
Krrish Dholakia	65928cd5f2	test(test_tpm_rpm_routing.py): add more logging for the test	2024-01-11 00:43:14 +05:30
Krrish Dholakia	969594a4b1	test(test_router.py): handle rate limiting error	2024-01-11 00:00:17 +05:30
Krrish Dholakia	61f2fe5837	fix(main.py): fix streaming completion token counting error	2024-01-10 23:44:35 +05:30
Krrish Dholakia	3080f27b54	fix(utils.py): raise correct error for azure content blocked error	2024-01-10 23:31:51 +05:30
Krrish Dholakia	b8de5636d4	docs(quick_start.md): update docs to use correct docker image	2024-01-10 23:31:51 +05:30
ishaan-jaff	9e4449a072	(docs) bedrock - show bedrock/ prefix	2024-01-10 23:07:05 +05:30
ishaan-jaff	c9510ce3bf	(fix) ghcr deploy action to use latest tag	2024-01-10 22:28:00 +05:30
ishaan-jaff	4d380a9f7d	(fix) alpine Docker image	2024-01-10 22:18:37 +05:30
ishaan-jaff	6e19bb87e2	(docs) proxy config - show how to set seed, temp on config.yaml	2024-01-10 22:16:04 +05:30
Krrish Dholakia	6a8d518e44	test(test_lowest_latency_routing.py): use the correct cache key	2024-01-10 22:15:01 +05:30
Krrish Dholakia	5bc44353e0	feat(proxy_cli.py): move print statements to show actually deployed port	2024-01-10 22:09:58 +05:30
ishaan-jaff	03a0e04b0d	(docs) proxy - we now use gunicorn default	2024-01-10 22:09:25 +05:30
ishaan-jaff	59669b4c2a	(docs) key/gen link to Deploy instructions	2024-01-10 22:07:14 +05:30
ishaan-jaff	0d56115336	(fix) Dockerfile use same entrypoint as Dockerfile.database	2024-01-10 21:56:34 +05:30
ishaan-jaff	1ff9785c6b	(fix) test - moved to circe ci dockerfile	2024-01-10 21:54:13 +05:30
Krrish Dholakia	954d1b071c	test: remove invalid arg	2024-01-10 21:53:29 +05:30
Ishaan Jaff	58d0366447	Merge pull request #1399 from BerriAI/litellm_default_use_gunicorn LiteLLM Proxy - Use Gunicorn with Uvicorn workers	2024-01-10 21:46:04 +05:30
Krrish Dholakia	9a829ff956	refactor: cleanup duplicates	2024-01-10 21:42:20 +05:30
Krrish Dholakia	31917176ff	fix(lowest_latency.py): fix merge issue	2024-01-10 21:37:46 +05:30
ishaan-jaff	fc9af5e900	(fix) use Dockerfile from main	2024-01-10 21:36:31 +05:30
Krrish Dholakia	60229eff57	bump: version 1.16.22 → 1.17.0	2024-01-10 21:35:37 +05:30
Krish Dholakia	9e97227625	Merge pull request #1403 from BerriAI/litellm_latency_routing_updates fix(lowest_latency.py): add back tpm/rpm checks, configurable time window support, improved latency tracking	2024-01-10 21:34:05 +05:30
Krish Dholakia	298e937586	Merge branch 'main' into litellm_latency_routing_updates	2024-01-10 21:33:54 +05:30
Krrish Dholakia	e44d3e51aa	ci(config.yml): run prisma generate before testing	2024-01-10 21:26:38 +05:30

1 2 3 4 5 ...

5570 commits