litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-25 02:34:29 +00:00

Author	SHA1	Message	Date
Krrish Dholakia	b523a43ba0	fix(caching.py): add more debug statements for caching	2024-02-06 15:12:57 -08:00
Krrish Dholakia	d1549cb2f3	refactor(main.py): trigger deploy n	2024-02-06 13:55:51 -08:00
ishaan-jaff	4233e2b850	(fix) redisvl requirements.txt issue	2024-02-06 13:52:32 -08:00
ishaan-jaff	a6afba8cf2	(fix) langfuse show semantic-similarity in tags	2024-02-06 13:41:22 -08:00
ishaan-jaff	f19b2becac	(fix) dockerfile requirements.txt	2024-02-06 13:36:35 -08:00
ishaan-jaff	3d0ece828a	(feat) show semantic-cache on health/readiness	2024-02-06 13:35:34 -08:00
Krrish Dholakia	fa5f4b9774	test(test_completion.py): fix test	2024-02-06 13:29:47 -08:00
ishaan-jaff	79c225a60f	(ci/cd) run again	2024-02-06 13:26:48 -08:00
Ishaan Jaff	8a8f538329	Merge pull request #1829 from BerriAI/litellm_add_semantic_cache [Feat] Add Semantic Caching to litellm💰	2024-02-06 13:18:59 -08:00
Ishaan Jaff	8119f547ef	Merge pull request #1857 from BerriAI/litellm_improve_logging_langfuse_cache_hits [FEAT] show langfuse logging / cache tags better through proxy	2024-02-06 13:15:09 -08:00
Krrish Dholakia	4e6f0e566d	bump: version 1.22.8 → 1.22.9	2024-02-06 13:10:52 -08:00
Krrish Dholakia	22913945b0	fix(utils.py): round max tokens to be int always	2024-02-06 13:10:52 -08:00
ishaan-jaff	3da30383f5	(feat) show langfuse logging tags better through proxy	2024-02-06 13:09:48 -08:00
ishaan-jaff	b28de5e329	(ci/cd) run again	2024-02-06 13:02:36 -08:00
ishaan-jaff	8c0f912780	(ci/cd) run again	2024-02-06 12:53:47 -08:00
ishaan-jaff	a1c34ac9ec	(ci/cd) fix test_config_no_auth	2024-02-06 12:47:19 -08:00
ishaan-jaff	1489fd369b	(fix) test_normal_router_tpm_limit	2024-02-06 12:44:30 -08:00
ishaan-jaff	13fe72d6d5	(fix) parallel_request_limiter debug	2024-02-06 12:43:28 -08:00
ishaan-jaff	506c14b896	(ci/cd) run again	2024-02-06 12:22:24 -08:00
ishaan-jaff	f939407305	(ci/cd) run pytest without -s	2024-02-06 12:22:16 -08:00
ishaan-jaff	29303e979e	(fix) proxy_startup test	2024-02-06 11:38:57 -08:00
ishaan-jaff	4099340ecb	(fix) rename proxy startup test	2024-02-06 11:27:24 -08:00
Ishaan Jaff	7cb69c72c8	Merge branch 'main' into litellm_add_semantic_cache	2024-02-06 11:18:43 -08:00
ishaan-jaff	8175fb4deb	(fix) mark semantic caching as beta test	2024-02-06 11:04:19 -08:00
ishaan-jaff	405a44727c	(ci/cd) run in verbose mode	2024-02-06 10:57:20 -08:00
ishaan-jaff	1afdf5cf36	(fix) semantic caching	2024-02-06 10:55:15 -08:00
ishaan-jaff	54c920c299	(docs) litellm semantic caching	2024-02-06 10:54:55 -08:00
ishaan-jaff	93504915d7	(docs) redis cache	2024-02-06 10:53:28 -08:00
ishaan-jaff	c8a83bb745	(fix) test-semantic caching	2024-02-06 10:39:44 -08:00
ishaan-jaff	2732c47b70	(feat) redis-semantic cache on proxy	2024-02-06 10:35:21 -08:00
ishaan-jaff	bdc2091838	(docs) using semantic caching on proxy	2024-02-06 10:32:07 -08:00
ishaan-jaff	a1fc1e49c7	(fix) use semantic cache on proxy	2024-02-06 10:27:33 -08:00
ishaan-jaff	05f379234d	allow setting redis_semantic cache_embedding model	2024-02-06 10:22:02 -08:00
Krrish Dholakia	e542aa10cc	bump: version 1.22.7 → 1.22.8	2024-02-06 10:12:13 -08:00
Krrish Dholakia	d1db67890c	fix(ollama.py): support format for ollama	2024-02-06 10:11:52 -08:00
ishaan-jaff	751fb1af89	(feat) log semantic_sim to langfuse	2024-02-06 09:31:57 -08:00
ishaan-jaff	c4e73768cf	(fix) add redisvl==0.0.7	2024-02-06 09:30:45 -08:00
ishaan-jaff	70a895329e	(feat) working semantic cache on proxy	2024-02-06 08:55:25 -08:00
ishaan-jaff	a3b1e3bc84	(feat) redis-semantic cache	2024-02-06 08:54:36 -08:00
ishaan-jaff	6249a97098	(feat) working semantic-cache on litellm proxy	2024-02-06 08:52:57 -08:00
ishaan-jaff	a125ffe190	(test) async semantic cache	2024-02-06 08:14:54 -08:00
ishaan-jaff	76def20ffe	(feat) RedisSemanticCache - async	2024-02-06 08:13:12 -08:00
Krish Dholakia	dd87386cad	Merge pull request #1846 from BerriAI/litellm_proxy_update_requirements_txt build(requirements.txt): update the proxy requirements.txt	2024-02-06 07:51:48 -08:00
Krrish Dholakia	f2ef32bcee	bump: version 1.22.6 → 1.22.7	2024-02-06 07:44:04 -08:00
Krrish Dholakia	9e091a0624	fix(ollama_chat.py): explicitly state if ollama call is streaming or not	2024-02-06 07:43:47 -08:00
Krrish Dholakia	4a5b635e35	build(requirements.txt): update the proxy requirements.txt	2024-02-06 07:35:46 -08:00
Krrish Dholakia	c2a523b954	fix(utils.py): use print_verbose for statements, so debug can be seen when running sdk	2024-02-06 07:30:26 -08:00
Krrish Dholakia	2e3748e6eb	fix(ollama_chat.py): fix ollama chat completion token counting	2024-02-06 07:30:26 -08:00
ishaan-jaff	47bed68c7f	(fix) test_normal_router_tpm_limit	2024-02-06 06:46:49 -08:00
ishaan-jaff	9a8abdb1ae	(ci/cd) print debug info for test_proxy_gunicorn_startup_config_dict	2024-02-05 22:53:31 -08:00

6834 commits