litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-27 19:54:13 +00:00

History

Krish Dholakia 9d82ff4793 Litellm dev 12 26 2024 p3 (#7434 ) * build(model_prices_and_context_window.json): update groq models to specify 'supports_vision' parameter Closes https://github.com/BerriAI/litellm/issues/7433 * docs(groq.md): add groq vision example to docs Closes https://github.com/BerriAI/litellm/issues/7433 * fix(prometheus.py): refactor self.litellm_proxy_failed_requests_metric to use label factory * feat(prometheus.py): new 'litellm_proxy_failed_requests_by_tag_metric' allows tracking failed requests by tag on proxy * fix(prometheus.py): fix exception logging * feat(prometheus.py): add new 'litellm_request_total_latency_by_tag_metric' enables tracking latency by use-case * feat(prometheus.py): add new llm api latency by tag metric * feat(prometheus.py): new litellm_deployment_latency_per_output_token_by_tag metric allows tracking deployment latency by tag * fix(prometheus.py): refactor 'litellm_requests_metric' to use enum values + label factory * feat(prometheus.py): new litellm_proxy_total_requests_by_tag metric allows tracking total requests by tag * feat(prometheus.py): new metric litellm_deployment_successful_fallbacks_by_tag allows tracking deployment fallbacks by tag * fix(prometheus.py): new 'litellm_deployment_failed_fallbacks_by_tag' metric allows tracking failed fallbacks on deployment by custom tag * test: fix test * test: rename test to run earlier * test: skip flaky test		2024-12-26 21:21:16 -08:00
..
integrations	Litellm dev 12 26 2024 p3 (#7434 )	2024-12-26 21:21:16 -08:00
llms	(Feat) Add logging for `POST v1/fine_tuning/jobs` (#7426 )	2024-12-26 08:58:47 -08:00
passthrough_endpoints	(docs) Simplify `/vertex_ai/` pass through docs (#6910 )	2024-11-25 23:57:50 -08:00
adapter.py	feat(anthropic_adapter.py): support for translating anthropic params to openai format	2024-07-10 00:32:28 -07:00
caching.py	(feat) - provider budget improvements - ensure provider budgets work with multiple proxy instances + improve latency to ~90ms (#6886 )	2024-11-24 16:36:19 -08:00
completion.py	LiteLLM Minor Fixes and Improvements (09/12/2024) (#5658 )	2024-09-12 23:04:06 -07:00
embedding.py	Removed config dict type definition	2024-05-17 10:39:00 +08:00
files.py	Fix file type handling of uppercase extensions	2024-06-13 15:00:16 -07:00
guardrails.py	(feat) Support Dynamic Params for `guardrails` (#7415 )	2024-12-25 16:07:29 -08:00
rerank.py	(code refactor) - Add `BaseRerankConfig`. Use `BaseRerankConfig` for `cohere/rerank` and `azure_ai/rerank` (#7319 )	2024-12-19 17:03:34 -08:00
router.py	Support budget/rate limit tiers for keys (#7429 )	2024-12-26 19:05:27 -08:00
services.py	Litellm perf improvements 3 (#6573 )	2024-11-05 03:51:26 +05:30
utils.py	Support budget/rate limit tiers for keys (#7429 )	2024-12-26 19:05:27 -08:00