LiteLLM Minor Fixes and Improvements (09/13/2024) (#5689)

* refactor: cleanup unused variables + fix pyright errors

* feat(health_check.py): Closes https://github.com/BerriAI/litellm/issues/5686

* fix(o1_reasoning.py): add stricter check for o-1 reasoning model

* refactor(mistral/): make it easier to see mistral transformation logic

* fix(openai.py): fix openai o-1 model param mapping

Fixes https://github.com/BerriAI/litellm/issues/5685

* feat(main.py): infer finetuned gemini model from base model

Fixes https://github.com/BerriAI/litellm/issues/5678

* docs(vertex.md): update docs to call finetuned gemini models

* feat(proxy_server.py): allow admin to hide proxy model aliases

Closes https://github.com/BerriAI/litellm/issues/5692

* docs(load_balancing.md): add docs on hiding alias models from proxy config

* fix(base.py): don't raise notimplemented error

* fix(user_api_key_auth.py): fix model max budget check

* fix(router.py): fix elif

* fix(user_api_key_auth.py): don't set team_id to empty str

* fix(team_endpoints.py): fix response type

* test(test_completion.py): handle predibase error

* test(test_proxy_server.py): fix test

* fix(o1_transformation.py): fix max_completion_token mapping

* test(test_image_generation.py): mark flaky test
This commit is contained in:
Krish Dholakia 2024-09-14 10:02:55 -07:00 committed by GitHub
parent 60c5d3ebec
commit 713d762411
35 changed files with 1020 additions and 539 deletions

View file

@ -66,22 +66,24 @@ class BaseLLM:
return _aclient_session
def __exit__(self):
if hasattr(self, "_client_session"):
if hasattr(self, "_client_session") and self._client_session is not None:
self._client_session.close()
async def __aexit__(self, exc_type, exc_val, exc_tb):
if hasattr(self, "_aclient_session"):
await self._aclient_session.aclose()
await self._aclient_session.aclose() # type: ignore
def validate_environment(self): # set up the environment required to run the model
pass
def validate_environment(
self, *args, **kwargs
) -> Optional[Any]: # set up the environment required to run the model
return None
def completion(
self, *args, **kwargs
): # logic for parsing in - calling - parsing out model completion calls
pass
) -> Any: # logic for parsing in - calling - parsing out model completion calls
return None
def embedding(
self, *args, **kwargs
): # logic for parsing in - calling - parsing out model embedding calls
pass
) -> Any: # logic for parsing in - calling - parsing out model embedding calls
return None