---
title: v1.67.0-stable - SCIM Integration
slug: v1.67.0-stable
date: 2025-04-19T10:00:00
authors:
tags:
hide_table_of_contents: false
---
import Image from '@theme/IdealImage';
import Tabs from '@theme/Tabs';
import TabItem from '@theme/TabItem';
## Key Highlights
- **SCIM Integration**: Enables identity providers (Okta, Azure AD, OneLogin, etc.) to automate user and team (group) provisioning, updates, and deprovisioning
- **Team and Tag based usage tracking**: You can now see usage and spend by team and tag, even at 1M+ spend logs.
- **Unified Responses API**: Support for calling Anthropic, Gemini, Groq, etc. via OpenAI's new Responses API.
Let's dive in.
## SCIM Integration
<Image img={require('../../img/scim_integration.png')}/>
This release adds SCIM support to LiteLLM, allowing your SSO provider (Okta, Azure AD, etc.) to automatically create, update, and delete users, teams, and memberships on LiteLLM. For example, when you remove a team in your SSO provider, the corresponding team is automatically deleted on LiteLLM.
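Once your identity provider is pointed at the proxy's SCIM base URL, provisioning happens over standard SCIM v2 endpoints. Below is a minimal sketch of creating a user directly, assuming the proxy exposes SCIM at `/scim/v2` and accepts a bearer token you've configured for your IdP (see the Get Started docs for the exact base URL and auth setup):

```python
import requests

PROXY_BASE = "http://localhost:4000"  # your LiteLLM proxy (placeholder)
SCIM_TOKEN = "sk-1234"                # token you configured for your IdP (placeholder)

# Create a user via the standard SCIM v2 Users endpoint (RFC 7643 payload)
resp = requests.post(
    f"{PROXY_BASE}/scim/v2/Users",
    headers={"Authorization": f"Bearer {SCIM_TOKEN}"},
    json={
        "schemas": ["urn:ietf:params:scim:schemas:core:2.0:User"],
        "userName": "jane@example.com",
        "displayName": "Jane Doe",
        "emails": [{"value": "jane@example.com", "primary": True}],
    },
)
resp.raise_for_status()
print(resp.json())  # SCIM user resource, including the provider-assigned id
```

In practice your IdP issues these calls for you; the sketch is just to show what happens on the wire.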
## Team and Tag based usage tracking
<Image img={require('../../img/release_notes/new_team_usage_highlight.jpg')}/>
This release improves team and tag based usage tracking at 1M+ spend logs, making it easy to monitor your LLM API spend in production. This covers:
- View daily spend by teams + tags
- View usage / spend by key, within teams
- View spend by multiple tags
- Allow internal users to view spend of teams they're a member of
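As a quick illustration, here is a minimal sketch of pulling the new team-level daily usage from the proxy, assuming an admin key and the default proxy address; the exact query parameters and response shape are documented with the linked PRs in the Management Endpoints section below:

```python
import requests

PROXY_BASE = "http://localhost:4000"  # placeholder proxy address
API_KEY = "sk-1234"                   # admin or internal-user key (placeholder)

# Fetch aggregated daily team usage via the new /team/daily/activity API
resp = requests.get(
    f"{PROXY_BASE}/team/daily/activity",
    headers={"Authorization": f"Bearer {API_KEY}"},
)
resp.raise_for_status()
for day in resp.json().get("results", []):  # response shape is an assumption
    print(day)
```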
## Unified Responses API
This release allows you to call Azure OpenAI, Anthropic, AWS Bedrock, and Google Vertex AI models via the `POST /v1/responses` endpoint on LiteLLM. This means you can now use popular tools like OpenAI Codex with your own models.
<Image img={require('../../img/release_notes/unified_responses_api_rn.png')}/>
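Because the endpoint is OpenAI-compatible, you can point the official OpenAI SDK at LiteLLM. A minimal sketch, assuming a proxy on localhost and an Anthropic model available on it (the model name is a placeholder for whatever you have configured):

```python
from openai import OpenAI

# Point the official OpenAI SDK at your LiteLLM proxy
client = OpenAI(base_url="http://localhost:4000/v1", api_key="sk-1234")

# /v1/responses now routes to non-OpenAI providers too
response = client.responses.create(
    model="anthropic/claude-3-7-sonnet-20250219",  # example model on your proxy
    input="Write a one-line haiku about load balancers.",
)
print(response.output_text)
```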
## New Models / Updated Models
- OpenAI
  - gpt-4.1, gpt-4.1-mini, gpt-4.1-nano, o3, o3-mini, o4-mini pricing - Get Started, PR
  - o4 - correctly map o4 to OpenAI o_series model
- Azure AI
  - Phi-4 output cost per token fix - PR
  - Responses API support - Get Started, PR
- Anthropic
  - Redacted message thinking support - Get Started, PR
- Cohere
  - New `/v2/chat` passthrough endpoint support w/ cost tracking - Get Started, PR
- Azure
  - Support Azure tenant_id/client_id env vars (see the sketch after this list) - Get Started, PR
  - Fix response_format check for 2025+ API versions - PR
  - Add gpt-4.1, gpt-4.1-mini, gpt-4.1-nano, o3, o3-mini, o4-mini pricing
- VLLM
  - Files - Support 'file' message type for VLLM video URLs - Get Started, PR
  - Passthrough - new `/vllm/` passthrough endpoint support - Get Started, PR
- Mistral
  - New `/mistral` passthrough endpoint support - Get Started, PR
- AWS
  - New mapped Bedrock regions - PR
- VertexAI / Google AI Studio
  - Gemini - Response format - Retain schema field ordering for Google Gemini and Vertex by specifying propertyOrdering - Get Started, PR
  - Gemini-2.5-flash - return reasoning content - Google AI Studio, Vertex AI
  - Gemini-2.5-flash - pricing + model information - PR
  - Passthrough - new `/vertex_ai/discovery` route - enables calling AgentBuilder API routes - Get Started, PR
- Fireworks AI
  - Return tool calling responses in `tool_calls` field (Fireworks incorrectly returns this as a JSON string in content) - PR
- Triton
  - Remove fixed bad_words / stop words from `/generate` call - Get Started, PR
- Other
  - Support for all LiteLLM providers on Responses API (works with Codex) - Get Started, PR
  - Fix combining multiple tool calls in streaming response - Get Started, PR
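For the Azure tenant_id/client_id change above, here is a minimal sketch of service-principal auth via environment variables. The variable names follow Azure's usual convention but are an assumption here; verify them against the linked Get Started docs:

```python
import os
import litellm

# Entra ID (service principal) credentials - env var names assumed,
# check the Get Started docs for the exact names
os.environ["AZURE_TENANT_ID"] = "your-tenant-id"
os.environ["AZURE_CLIENT_ID"] = "your-client-id"
os.environ["AZURE_CLIENT_SECRET"] = "your-client-secret"
os.environ["AZURE_API_BASE"] = "https://my-endpoint.openai.azure.com"

# "my-gpt-4.1-deployment" is a placeholder Azure deployment name
response = litellm.completion(
    model="azure/my-gpt-4.1-deployment",
    messages=[{"role": "user", "content": "ping"}],
)
print(response.choices[0].message.content)
```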
## Spend Tracking Improvements
- Cost Control - inject cache control points in prompt for cost reduction - Get Started, PR
- Spend Tags - spend tags in headers - support `x-litellm-tags` even if tag based routing is not enabled (see the sketch after this list) - Get Started, PR
- Gemini-2.5-flash - support cost calculation for reasoning tokens - PR
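A minimal sketch of attaching spend tags per request through the standard OpenAI client; the comma-separated header format is an assumption, so confirm it against the linked Get Started docs:

```python
from openai import OpenAI

client = OpenAI(base_url="http://localhost:4000/v1", api_key="sk-1234")

# Attach spend tags per request via the x-litellm-tags header;
# this now works even when tag based routing is disabled
response = client.chat.completions.create(
    model="gpt-4.1-mini",
    messages=[{"role": "user", "content": "hello"}],
    extra_headers={"x-litellm-tags": "prod,team-finance"},  # format assumed
)
print(response.id)
```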
## Management Endpoints / UI
- Users
  - Show created_at and updated_at on users page - PR
- Virtual Keys
  - Filter by key alias - https://github.com/BerriAI/litellm/pull/10085
- Usage Tab
  - Team based usage
    - New `LiteLLM_DailyTeamSpend` table for aggregate team based usage logging - PR
    - New team based usage dashboard + new `/team/daily/activity` API - PR
    - Return team alias on `/team/daily/activity` API - PR
    - Allow internal users to view spend for teams they belong to - PR
    - Allow viewing top keys by team - PR

    <Image img={require('../../img/release_notes/new_team_usage.png')}/>
  - Tag Based Usage
    - Track prompt caching metrics in daily user, team, tag tables - PR
    - Show usage by key (on all up, team, and tag usage dashboards) - PR
    - Swap old usage tab with the new usage tab
- Models
  - Make columns resizable/hideable - PR
- API Playground
  - Allow internal users to call the API playground - PR
- SCIM
  - Add LiteLLM SCIM Integration for Team and User management - Get Started, PR
## Logging / Guardrail Integrations
- GCS
  - Fix GCS pub/sub logging with env var `GCS_PROJECT_ID` - Get Started, PR
- AIM
  - Add LiteLLM call id passing to Aim guardrails on pre and post-hook calls - Get Started, PR
- Azure blob storage
  - Ensure logging works in high throughput scenarios - Get Started, PR
## General Proxy Improvements
- Support setting `litellm.modify_params` via env var - PR
- Model Discovery - check the provider's `/models` endpoint when calling the proxy's `/v1/models` endpoint - Get Started, PR
- `/utils/token_counter` - fix retrieving custom tokenizer for DB models (see the sketch below) - Get Started, PR
- Prisma migrate - handle existing columns in DB table - PR
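A minimal sketch of the `/utils/token_counter` endpoint mentioned above, assuming the request body takes `model` and `messages`; check the Get Started docs for the exact schema:

```python
import requests

PROXY_BASE = "http://localhost:4000"  # placeholder proxy address
API_KEY = "sk-1234"                   # placeholder key

# Count tokens through the proxy so DB-configured models use
# their custom tokenizer (the fix shipped in this release)
resp = requests.post(
    f"{PROXY_BASE}/utils/token_counter",
    headers={"Authorization": f"Bearer {API_KEY}"},
    json={
        "model": "gpt-4.1-mini",  # any model configured on your proxy
        "messages": [{"role": "user", "content": "hello world"}],
    },
)
resp.raise_for_status()
print(resp.json())  # includes a token count; exact fields may vary
```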