fix(health.md): add rerank model health check information (#7295)

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-27 11:43:54 +00:00

* fix(health.md): add rerank model health check information

* build(model_prices_and_context_window.json): add gemini 2.0 for google ai studio - pricing + commercial rate limits

* build(model_prices_and_context_window.json): add gemini-2.0 supports audio output = true

* docs(team_model_add.md): clarify allowing teams to add models is an enterprise feature

* fix(o1_transformation.py): add support for 'n', 'response_format' and 'stop' params for o1 and 'stream_options' param for o1-mini

* build(model_prices_and_context_window.json): add 'supports_system_message' to supporting openai models

needed as o1-preview, and o1-mini models don't support 'system message

* fix(o1_transformation.py): translate system message based on if o1 model supports it

* fix(o1_transformation.py): return 'stream' param support if o1-mini/o1-preview

o1 currently doesn't support streaming, but the other model versions do

Fixes https://github.com/BerriAI/litellm/issues/7292

* fix(o1_transformation.py): return tool calling/response_format in supported params if model map says so

Fixes https://github.com/BerriAI/litellm/issues/7292

* fix: fix linting errors

* fix: update '_transform_messages'

* fix(o1_transformation.py): fix provider passed for supported param checks

* test(base_llm_unit_tests.py): skip test if api takes >5s to respond

* fix(utils.py): return false in 'supports_factory' if can't find value

* fix(o1_transformation.py): always return stream + stream_options as supported params + handle stream options being passed in for azure o1

* feat(openai.py): support stream faking natively in openai handler

Allows o1 calls to be faked for just the "o1" model, allows native streaming for o1-mini, o1-preview

 Fixes https://github.com/BerriAI/litellm/issues/7292

* fix(openai.py): use inference param instead of original optional param

This commit is contained in:

Krish Dholakia

2024-12-18 19:18:10 -08:00

• committed by

GitHub

parent e95820367f

commit 1a4910f6c0

34 changed files with 800 additions and 515 deletions

									
										5

litellm/llms/replicate/chat/transformation.py
									
										View file
										
				@ -130,11 +130,6 @@ class ReplicateConfig(BaseConfig):

				            return split_model[1]

				        return model

				    def _transform_messages(

				        self, messages: List[AllMessageValues]

				    ) -> List[AllMessageValues]:

				        return messages

				    def get_error_class(

				        self, error_message: str, status_code: int, headers: Union[dict, httpx.Headers]

				    ) -> BaseLLMException:

Rows
Columns

fix(health.md): add rerank model health check information (#7295)

5 litellm/llms/replicate/chat/transformation.py Unescape Escape View file

5

litellm/llms/replicate/chat/transformation.py

View file