litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-27 03:34:10 +00:00

Author	SHA1	Message	Date
Chen Wang	ebc889d77a	Use `max_input_token` for `trim_messages`	2024-04-16 13:36:25 +01:00
Krrish Dholakia	40d9167cae	test(test_amazing_vertex_completion.py): fix vertex anthropic streaming text	2024-04-15 14:37:22 -07:00
Krrish Dholakia	1ec7118e1f	fix(vertex_ai_anthropic.py): set vertex_credentials for vertex ai anthropic calls allows setting vertex credentials as a json string for vertex ai anthropic calls	2024-04-15 14:16:28 -07:00
Krrish Dholakia	50081479f9	fix(main.py): accept vertex service account credentials as json string allows us to dynamically set vertex ai credentials	2024-04-15 13:28:59 -07:00
Krish Dholakia	72b54eaad7	Merge pull request #3028 from BerriAI/litellm_anthropic_text_completion_fix fix(anthropic_text.py): add support for async text completion calls	2024-04-15 09:26:28 -07:00
Ishaan Jaff	c8832489d6	test groq tool calling	2024-04-15 08:41:53 -07:00
Ishaan Jaff	ea575ef62d	fix test groq function call	2024-04-15 08:40:39 -07:00
Krrish Dholakia	26286a54b8	fix(anthropic_text.py): add support for async text completion calls	2024-04-15 08:15:00 -07:00
Ishaan Jaff	017127a704	test - groq tool calling	2024-04-15 08:13:05 -07:00
Stefan Dobrev	1cfea955aa	Fix Anthropic system message handling	2024-04-14 16:23:59 +03:00
Krrish Dholakia	bef24cd4ab	fix(caching.py): don't decode a string	2024-04-13 18:48:03 -07:00
Krrish Dholakia	4e81acf2c6	feat(prometheus_services.py): monitor health of proxy adjacent services (redis / postgres / etc.)	2024-04-13 18:15:02 -07:00
Ishaan Jaff	5856ec03c6	(ci/cd) run again	2024-04-12 20:48:26 -07:00
Ishaan Jaff	f617f5ebb5	fix - test caching atext_completion	2024-04-12 20:37:56 -07:00
Ishaan Jaff	11cd1ec6cf	test - atext_completion + caching	2024-04-12 12:32:21 -07:00
Krrish Dholakia	77097f8e7d	test(test_jwt.py): add testing to make sure user api key auth returns the expected values	2024-04-11 20:18:55 -07:00
Krrish Dholakia	84d43484c6	fix(router.py): make sure pre call rpm check runs even when model not in model cost map	2024-04-11 09:27:46 -07:00
Krish Dholakia	83e7ed94ce	Merge pull request #2942 from BerriAI/litellm_fix_router_loading Router Async Improvements	2024-04-10 20:16:53 -07:00
Ishaan Jaff	8bc02b34c2	test -base64 cache hits	2024-04-10 16:46:56 -07:00
Krrish Dholakia	a47a719caa	fix(router.py): generate consistent model id's having the same id for a deployment, lets redis usage caching work across multiple instances	2024-04-10 15:23:57 -07:00
Krrish Dholakia	a311788f0d	test(test_completion.py): handle api instability	2024-04-09 21:58:48 -07:00
Krrish Dholakia	a6b004f10b	test(test_completion.py): change model	2024-04-09 21:38:17 -07:00
Krish Dholakia	2fa47cf416	Merge pull request #2924 from BerriAI/litellm_text_completion_response fix(main.py): handle translating text completion openai to chat completion for async requests	2024-04-09 21:13:30 -07:00
Krrish Dholakia	8857c9b978	test(test_key_generate_prisma.py): add better unit testing for spend logs on proxy server	2024-04-09 17:58:18 -07:00
Krrish Dholakia	855e7ed9d2	fix(main.py): handle translating text completion openai to chat completion for async requests also adds testing for this, to prevent future regressions	2024-04-09 16:47:49 -07:00
Krrish Dholakia	4495f6c058	test(test_key_generate_prisma.py): fix check for raising error if key generate max budget > key upperbound	2024-04-09 13:07:57 -07:00
Krrish Dholakia	ef4c85522c	test(test_llm_guard.py): fix test	2024-04-08 21:55:35 -07:00
Krrish Dholakia	b6cd200676	fix(llm_guard.py): enable request-specific llm guard flag	2024-04-08 21:15:33 -07:00
Krrish Dholakia	48bfc45cb0	fix(utils.py): fix reordering of items for cached embeddings ensures cached embedding item is returned in correct order	2024-04-08 12:18:24 -07:00
Ishaan Jaff	a5aef6ec00	Merge pull request #2879 from BerriAI/litellm_async_anthropic_api [Feat] Async Anthropic API 97.5% lower median latency	2024-04-07 09:56:52 -07:00
Krrish Dholakia	6773341a08	test(test_alerting.py): remove duplicate test	2024-04-06 19:33:01 -07:00
Ishaan Jaff	d51e853b60	undo adding next-asyncio	2024-04-06 19:28:51 -07:00
Krish Dholakia	83f608dc5d	Merge pull request #2880 from BerriAI/litellm_api_base_alerting feat(proxy/utils.py): return api base for request hanging alerts	2024-04-06 19:17:18 -07:00
Krrish Dholakia	6234982fd9	test(test_custom_callback_input.py): fix assert	2024-04-06 18:48:33 -07:00
Krrish Dholakia	5f910c1811	test(test_streaming.py): fix vertex project name	2024-04-06 18:45:17 -07:00
Ishaan Jaff	f08486448c	fix - test streaming	2024-04-06 18:28:07 -07:00
Ishaan Jaff	2622f0351b	(ci/cd) run again	2024-04-06 18:26:52 -07:00
Ishaan Jaff	4512510d37	Revert "fix test streaming" This reverts commit `f10aba1007`.	2024-04-06 18:24:00 -07:00
Ishaan Jaff	f10aba1007	fix test streaming	2024-04-06 18:23:10 -07:00
Ishaan Jaff	9be6b7ec7c	ci/cd run again	2024-04-06 18:07:41 -07:00
Krrish Dholakia	a6f987e065	test(test_amazing_vertex_completion.py): fix project name	2024-04-06 17:59:49 -07:00
Ishaan Jaff	548b2b6861	test - async claude streaming	2024-04-06 17:55:26 -07:00
Krrish Dholakia	6110d32b1c	feat(proxy/utils.py): return api base for request hanging alerts	2024-04-06 15:58:53 -07:00
Ishaan Jaff	a2c63075ef	Merge pull request #2877 from BerriAI/litellm_fix_text_completion [Feat] Text-Completion-OpenAI - Re-use OpenAI Client	2024-04-06 12:15:52 -07:00
Krrish Dholakia	b49e47b634	ci(vertex_key.json): switch gcloud accounts	2024-04-06 12:14:46 -07:00
Ishaan Jaff	3b622556be	test - setting org for openai text completion	2024-04-06 11:53:26 -07:00
Ishaan Jaff	a55f3cdace	test - router re-use openai client	2024-04-06 11:33:17 -07:00
Krish Dholakia	9119858f4a	Merge pull request #2798 from CLARKBENHAM/main add test for rate limits - Router isn't coroutine safe	2024-04-06 08:47:40 -07:00
Ishaan Jaff	3d298fc549	(test) completion	2024-04-05 21:03:04 -07:00
Ishaan Jaff	83580dc00a	(ci/cd) run again	2024-04-05 20:44:51 -07:00

... 5 6 7 8 9 ...

2623 commits