litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-28 04:04:31 +00:00

Author	SHA1	Message	Date
Krrish Dholakia	866259f95f	feat(prometheus_services.py): monitor health of proxy adjacent services (redis / postgres / etc.)	2024-04-13 18:15:02 -07:00
Ishaan Jaff	abac496706	(ci/cd) run again	2024-04-12 20:48:26 -07:00
Ishaan Jaff	966356721b	fix - test caching atext_completion	2024-04-12 20:37:56 -07:00
Ishaan Jaff	204573f3f2	test - atext_completion + caching	2024-04-12 12:32:21 -07:00
Krrish Dholakia	9d9426c291	test(test_jwt.py): add testing to make sure user api key auth returns the expected values	2024-04-11 20:18:55 -07:00
Krrish Dholakia	0863c10b0b	fix(router.py): make sure pre call rpm check runs even when model not in model cost map	2024-04-11 09:27:46 -07:00
Krish Dholakia	e48cc9f1e4	Merge pull request #2942 from BerriAI/litellm_fix_router_loading Router Async Improvements	2024-04-10 20:16:53 -07:00
Ishaan Jaff	4058327b43	test -base64 cache hits	2024-04-10 16:46:56 -07:00
Krrish Dholakia	f5206d592a	fix(router.py): generate consistent model id's having the same id for a deployment, lets redis usage caching work across multiple instances	2024-04-10 15:23:57 -07:00
Krrish Dholakia	67b3a05e1a	test(test_completion.py): handle api instability	2024-04-09 21:58:48 -07:00
Krrish Dholakia	f0d61fd6dd	test(test_completion.py): change model	2024-04-09 21:38:17 -07:00
Krish Dholakia	8334e3cd6e	Merge pull request #2924 from BerriAI/litellm_text_completion_response fix(main.py): handle translating text completion openai to chat completion for async requests	2024-04-09 21:13:30 -07:00
Krrish Dholakia	ce9ea28e42	test(test_key_generate_prisma.py): add better unit testing for spend logs on proxy server	2024-04-09 17:58:18 -07:00
Krrish Dholakia	70fd803a6e	fix(main.py): handle translating text completion openai to chat completion for async requests also adds testing for this, to prevent future regressions	2024-04-09 16:47:49 -07:00
Krrish Dholakia	dab219d4d7	test(test_key_generate_prisma.py): fix check for raising error if key generate max budget > key upperbound	2024-04-09 13:07:57 -07:00
Krrish Dholakia	d98c5a1797	test(test_llm_guard.py): fix test	2024-04-08 21:55:35 -07:00
Krrish Dholakia	aa4acaf06b	fix(llm_guard.py): enable request-specific llm guard flag	2024-04-08 21:15:33 -07:00
Krrish Dholakia	075c96a408	fix(utils.py): fix reordering of items for cached embeddings ensures cached embedding item is returned in correct order	2024-04-08 12:18:24 -07:00
Ishaan Jaff	d1d3d932ca	Merge pull request #2879 from BerriAI/litellm_async_anthropic_api [Feat] Async Anthropic API 97.5% lower median latency	2024-04-07 09:56:52 -07:00
Krrish Dholakia	ad409d67f3	test(test_alerting.py): remove duplicate test	2024-04-06 19:33:01 -07:00
Ishaan Jaff	9b8555e482	undo adding next-asyncio	2024-04-06 19:28:51 -07:00
Krish Dholakia	1060244a7f	Merge pull request #2880 from BerriAI/litellm_api_base_alerting feat(proxy/utils.py): return api base for request hanging alerts	2024-04-06 19:17:18 -07:00
Krrish Dholakia	4bb070c0e4	test(test_custom_callback_input.py): fix assert	2024-04-06 18:48:33 -07:00
Krrish Dholakia	0d067fb65d	test(test_streaming.py): fix vertex project name	2024-04-06 18:45:17 -07:00
Ishaan Jaff	c37383192f	fix - test streaming	2024-04-06 18:28:07 -07:00
Ishaan Jaff	aca692282a	(ci/cd) run again	2024-04-06 18:26:52 -07:00
Ishaan Jaff	ccab3cdfb5	Revert "fix test streaming" This reverts commit `f10aba1007`.	2024-04-06 18:24:00 -07:00
Ishaan Jaff	9e92b9b8ad	fix test streaming	2024-04-06 18:23:10 -07:00
Ishaan Jaff	436133b4b3	ci/cd run again	2024-04-06 18:07:41 -07:00
Krrish Dholakia	a236cd22fc	test(test_amazing_vertex_completion.py): fix project name	2024-04-06 17:59:49 -07:00
Ishaan Jaff	3978a17ff9	test - async claude streaming	2024-04-06 17:55:26 -07:00
Krrish Dholakia	0dad78b53c	feat(proxy/utils.py): return api base for request hanging alerts	2024-04-06 15:58:53 -07:00
Ishaan Jaff	d23d6068ff	Merge pull request #2877 from BerriAI/litellm_fix_text_completion [Feat] Text-Completion-OpenAI - Re-use OpenAI Client	2024-04-06 12:15:52 -07:00
Krrish Dholakia	37c0c883a8	ci(vertex_key.json): switch gcloud accounts	2024-04-06 12:14:46 -07:00
Ishaan Jaff	ff0b021949	test - setting org for openai text completion	2024-04-06 11:53:26 -07:00
Ishaan Jaff	d8788bbe9b	test - router re-use openai client	2024-04-06 11:33:17 -07:00
Krish Dholakia	b8d285d120	Merge pull request #2798 from CLARKBENHAM/main add test for rate limits - Router isn't coroutine safe	2024-04-06 08:47:40 -07:00
Ishaan Jaff	6757b42ce9	(test) completion	2024-04-05 21:03:04 -07:00
Ishaan Jaff	8436b611b4	(ci/cd) run again	2024-04-05 20:44:51 -07:00
Ishaan Jaff	aa0c01dcf3	fix gemini test	2024-04-05 20:38:28 -07:00
Ishaan Jaff	a27e488248	(ci/cd) run again	2024-04-05 17:26:02 -07:00
Ishaan Jaff	d4ca3f38c7	fix testing yaml	2024-04-05 16:17:53 -07:00
Ishaan Jaff	72fddabf84	Merge pull request #2868 from BerriAI/litellm_add_command_r_on_proxy Add Azure Command-r-plus on litellm proxy	2024-04-05 15:13:47 -07:00
Ishaan Jaff	f65828db26	Merge pull request #2861 from BerriAI/litellm_add_azure_command_r_plust [FEAT] add azure command-r-plus	2024-04-05 15:13:35 -07:00
Ishaan Jaff	5c1a662caa	proxy - add azure/command r	2024-04-05 14:35:31 -07:00
Krrish Dholakia	6b7d253aa2	test(test_amazing_vertex_completion.py): handle vertex ai rate limit errors	2024-04-05 14:04:29 -07:00
Krrish Dholakia	7a0143a653	fix(test_amazing_vertex_completion.py): handle vertex ai rate limit error	2024-04-05 14:03:38 -07:00
Ishaan Jaff	bfaa2bd3df	catch gemini 429 errors	2024-04-05 14:02:01 -07:00
Ishaan Jaff	5e369473d3	fix gemini rate limit errors	2024-04-05 14:01:32 -07:00
Krrish Dholakia	00c0a0c209	fix(vertex_ai_anthropic.py): fix import	2024-04-05 13:38:05 -07:00

1 2 3 4 5 ...

2362 commits