litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-25 10:44:24 +00:00

Author	SHA1	Message	Date
Ishaan Jaff	e28b240a5b	fix don't retry errors when no healthy deployments available	2024-08-20 12:17:05 -07:00
Ishaan Jaff	19c3a82d1b	test + never retry on 404 errors	2024-08-20 11:59:43 -07:00
Ishaan Jaff	08db691dec	use model access groups for teams	2024-08-17 16:45:53 -07:00
Krrish Dholakia	61f4b71ef7	refactor: replace .error() with .exception() logging for better debugging on sentry	2024-08-16 09:22:47 -07:00
Ishaan Jaff	0238ab077d	v0 track fallback events	2024-08-10 13:31:00 -07:00
Krrish Dholakia	7b6db63d30	fix(router.py): fallback on 400-status code requests	2024-08-09 12:16:49 -07:00
Krrish Dholakia	400653992c	feat(router.py): allow using .acompletion() for request prioritization allows /chat/completion endpoint to work for request prioritization calls	2024-08-07 16:43:12 -07:00
Ishaan Jaff	9cd437135b	fix getting provider_specific_deployment	2024-08-07 15:20:59 -07:00
Ishaan Jaff	f1ffa82062	fix use provider specific routing	2024-08-07 14:37:20 -07:00
Ishaan Jaff	5d7a1b2ec6	router use provider specific wildcard routing	2024-08-07 14:12:10 -07:00
Ishaan Jaff	18305b23f4	add + test provider specific routing	2024-08-07 13:49:46 -07:00
Krrish Dholakia	f0f900d69e	fix(router.py): add reason for fallback failure to client-side exception string make it easier to debug why a fallback failed to occur	2024-08-07 13:02:47 -07:00
Ishaan Jaff	d1e519afd1	use router_cooldown_handler	2024-08-07 10:40:55 -07:00
Krrish Dholakia	ce39649b2a	fix: fix test to specify allowed_fails	2024-08-05 21:39:59 -07:00
Krrish Dholakia	7a0792c918	fix(router.py): move deployment cooldown list message to error log, not client-side don't show user all deployments	2024-08-03 12:49:39 -07:00
Krrish Dholakia	6b8806b45f	feat(router.py): add flag for mock testing loadbalancing for rate limit errors	2024-08-03 12:34:11 -07:00
Krrish Dholakia	c65a438de2	fix(utils.py): fix linting errors	2024-07-30 18:38:10 -07:00
Krrish Dholakia	ec6db03c41	fix(router.py): gracefully handle scenario where completion response doesn't have total tokens Closes https://github.com/BerriAI/litellm/issues/4968	2024-07-30 15:14:03 -07:00
Krrish Dholakia	b25d4a8cb3	feat(ollama_chat.py): support ollama tool calling Closes https://github.com/BerriAI/litellm/issues/4812	2024-07-26 21:51:54 -07:00
Krrish Dholakia	84482703b8	docs(config.md): update wildcard docs	2024-07-26 08:59:53 -07:00
Ishaan Jaff	8f4c5437b8	router support setting pass_through_all_models	2024-07-25 18:34:12 -07:00
Krrish Dholakia	711496e260	fix(router.py): add support for diskcache to router	2024-07-25 14:30:46 -07:00
Ishaan Jaff	28bb2919b6	fix - test router debug logs	2024-07-20 18:45:31 -07:00
Ishaan Jaff	4038b3dcea	router - use verbose logger when using litellm.Router	2024-07-20 17:36:25 -07:00
Ishaan Jaff	08adda7091	control using enable_tag_filtering	2024-07-18 19:39:04 -07:00
Ishaan Jaff	4d0fbfea83	router - refactor to tag based routing	2024-07-18 19:22:09 -07:00
Ishaan Jaff	4b96cd46b2	Merge pull request #4786 from BerriAI/litellm_use_model_tier_keys [Feat-Enterprise] Use free/paid tiers for Virtual Keys	2024-07-18 18:07:09 -07:00
Krrish Dholakia	b23a633cf1	fix(utils.py): fix status code in exception mapping	2024-07-18 18:04:59 -07:00
Ishaan Jaff	64e38562d9	router - use free paid tier routing	2024-07-18 17:09:42 -07:00
Krrish Dholakia	0a94953896	fix(router.py): check for request_timeout in acompletion support 'request_timeout' param in router acompletion	2024-07-17 17:19:06 -07:00
Ishaan Jaff	e65daef572	router return get_deployment_by_model_group_name	2024-07-15 19:27:12 -07:00
Krish Dholakia	dacce3d78b	Merge pull request #4635 from BerriAI/litellm_anthropic_adapter Anthropic `/v1/messages` endpoint support	2024-07-10 22:41:53 -07:00
Krrish Dholakia	31829855c0	feat(proxy_server.py): working `/v1/messages` with config.yaml Adds async router support for adapter_completion call	2024-07-10 18:53:54 -07:00
Ishaan Jaff	62f475919b	feat - add DELETE assistants endpoint	2024-07-10 11:37:37 -07:00
Ishaan Jaff	f5eb862635	router - add acreate_assistants	2024-07-09 09:46:28 -07:00
Krish Dholakia	8661da1980	Merge branch 'main' into litellm_fix_httpx_transport	2024-07-06 19:12:06 -07:00
Ishaan Jaff	2609de43d0	use helper for init client + check if we should init sync clients	2024-07-06 12:52:41 -07:00
Krrish Dholakia	86632f6da0	fix(types/router.py): add custom pricing info to 'model_info' Fixes https://github.com/BerriAI/litellm/issues/4542	2024-07-04 16:07:58 -07:00
Krrish Dholakia	3d61a316cb	fix(router.py): bump azure default api version Allows 'tool_choice' to be passed to azure	2024-07-03 12:00:00 -07:00
Krrish Dholakia	892ba62730	fix(router.py): fix mounting logic	2024-07-02 17:54:32 -07:00
Krish Dholakia	21d3a28e51	Merge branch 'main' into litellm_support_dynamic_rpm_limiting	2024-07-02 17:51:18 -07:00
Krrish Dholakia	0647278a69	refactor: remove custom transport logic Not needed after azure dall-e-2 refactor	2024-07-02 17:35:27 -07:00
Krish Dholakia	d38f01e956	Merge branch 'main' into litellm_fix_httpx_transport	2024-07-02 17:17:43 -07:00
Krrish Dholakia	f23b17091d	fix(dynamic_rate_limiter.py): support dynamic rate limiting on rpm	2024-07-01 17:45:10 -07:00
Krrish Dholakia	ea74e01813	fix(router.py): disable cooldowns allow admin to disable model cooldowns	2024-07-01 15:03:10 -07:00
Krrish Dholakia	c9a424d28d	fix(router.py): fix get_router_model_info for azure models	2024-06-28 22:13:29 -07:00
Ishaan Jaff	d172a3ef6b	fix python3.8 install	2024-06-28 16:58:57 -07:00
Krrish Dholakia	aa6f7665c4	fix(router.py): only return 'max_tokens', 'input_cost_per_token', etc. in 'get_router_model_info' if base_model is set	2024-06-28 10:45:31 -07:00
Krrish Dholakia	98daedaf60	fix(router.py): fix setting httpx mounts	2024-06-26 17:22:04 -07:00
Krrish Dholakia	d98e00d1e0	fix(router.py): set `cooldown_time:` per model	2024-06-25 16:51:55 -07:00

681 commits