Krish Dholakia
1ea046cc61
test: update tests to new deployment model ( #10142 )
...
* test: update tests to new deployment model
* test: update model name
* test: skip cohere rbac issue test
* test: update test - replace gpt-4o model
2025-04-18 14:22:12 -07:00
Krish Dholakia
3a7b13efa2
feat(health_check.py): set upperbound for api when making health check call ( #7865 )
...
Read Version from pyproject.toml / read-version (push) Successful in 10s
* feat(health_check.py): set upperbound for api when making health check call
prevent bad model from health check to hang and cause pod restarts
* fix(health_check.py): cleanup task once completed
* fix(constants.py): bump default health check timeout to 1min
* docs(health.md): add 'health_check_timeout' to health docs on litellm
* build(proxy_server_config.yaml): add bad model to health check
2025-01-18 19:47:43 -08:00
Ishaan Jaff
d510f1d517
(fix) health check - allow setting health_check_model
( #7752 )
...
* use _update_litellm_params_for_health_check
* fix Wildcard Routes
* test_update_litellm_params_for_health_check
* test_perform_health_check_with_health_check_model
* fix doc string
* huggingface/mistralai/Mistral-7B-Instruct-v0.3
2025-01-13 20:16:44 -08:00
Ishaan Jaff
3eb962c594
update - new test for test_text_completion_health_check
2024-12-28 19:36:23 -08:00
Ishaan Jaff
1e06ee3162
(Refactor) - Re use litellm.completion/litellm.embedding etc for health checks ( #7455 )
...
* add mode: realtime
* add _realtime_health_check
* test_realtime_health_check
* azure _realtime_health_check
* _realtime_health_check
* Realtime Models
* fix code quality
* delete OAI / Azure custom health check code
* simplest version of ahealth check
* update tests
* working health check post refactor
* working aspeech health check
* fix realtime health checks
* test_audio_transcription_health_check
* use get_audio_file_for_health_check
* test_text_completion_health_check
* ahealth_check
* simplify health check code
* update ahealth_check
* fix import
* fix unused imports
* fix ahealth_check
* fix local testing
* test_async_realtime_health_check
2024-12-28 18:38:54 -08:00
Ishaan Jaff
4e65722a00
(Bug Fix) Add health check support for realtime models ( #7453 )
...
* add mode: realtime
* add _realtime_health_check
* test_realtime_health_check
* azure _realtime_health_check
* _realtime_health_check
* Realtime Models
* fix code quality
2024-12-28 18:15:00 -08:00
Krish Dholakia
481645e49c
fix(acompletion): support fallbacks on acompletion ( #7184 )
...
Read Version from pyproject.toml / read-version (push) Successful in 45s
* fix(acompletion): support fallbacks on acompletion
allows health checks for wildcard routes to use fallback models
* test: update cohere generate api testing
* add max tokens to health check (#7000 )
* fix: fix health check test
* test: update testing
---------
Co-authored-by: Cameron <561860+wallies@users.noreply.github.com>
2024-12-11 19:20:54 -08:00
Krrish Dholakia
3560f0ef2c
refactor: move all testing to top-level of repo
...
Closes https://github.com/BerriAI/litellm/issues/486
2024-09-28 21:08:14 -07:00