litellm

Author	SHA1	Message	Date
lowjiansheng	f1c39510cb	add 0.2.3 helm	2024-08-19 23:59:58 +08:00
Krrish Dholakia	04d69464e2	fix(ollama.py): fix ollama embeddings - pass optional params Fixes https://github.com/BerriAI/litellm/issues/5267	2024-08-19 08:45:26 -07:00
Krrish Dholakia	cc42f96d6a	fix(ollama_chat.py): fix sync tool calling Fixes https://github.com/BerriAI/litellm/issues/5245	2024-08-19 08:31:46 -07:00
Krrish Dholakia	b8e4ef0abf	docs(json_mode.md): add azure openai models to doc	2024-08-19 07:19:23 -07:00
Ishaan Jaff	398295116f	only write model tpm/rpm tracking when user set it	2024-08-18 09:58:09 -07:00
Krish Dholakia	f42ac2c9d8	Merge pull request #5264 from BerriAI/litellm_bedrock_pass_through feat: Bedrock pass-through endpoint support (All endpoints)	2024-08-18 09:55:22 -07:00
Ishaan Jaff	69afb07dea	sleep before checking	2024-08-17 19:50:37 -07:00
Krrish Dholakia	663a0c1b83	feat(Support-pass-through-for-bedrock-endpoints): Allows pass-through support for bedrock endpoints	2024-08-17 17:57:43 -07:00
Ishaan Jaff	5adb7e29b9	fix test pass through	2024-08-17 17:42:51 -07:00
Ishaan Jaff	0bc67761dc	docs access groups	2024-08-17 17:38:28 -07:00
Ishaan Jaff	3cba235109	docs virtual key access groups	2024-08-17 17:37:23 -07:00
Ishaan Jaff	83515e88ce	Merge pull request #5263 from BerriAI/litellm_support_access_groups [Feat-Proxy] Use model access groups for teams	2024-08-17 17:11:11 -07:00
Ishaan Jaff	6fee350938	feat add model access groups for teams	2024-08-17 17:10:10 -07:00
Krrish Dholakia	f7a2e04426	feat(pass_through_endpoints.py): add pass-through support for all cohere endpoints	2024-08-17 16:57:55 -07:00
Ishaan Jaff	9b239111f4	fix test update tpm / rpm limits for a key	2024-08-17 16:57:23 -07:00
Ishaan Jaff	08db691dec	use model access groups for teams	2024-08-17 16:45:53 -07:00
Ishaan Jaff	d9c91838ce	docs cleanup	2024-08-17 15:59:23 -07:00
Ishaan Jaff	eff874bf05	fix proxy all models test	2024-08-17 15:54:51 -07:00
Ishaan Jaff	78d30990a3	docs clean up virtual key access	2024-08-17 15:39:50 -07:00
Ishaan Jaff	2a18a65f9e	bump: version 1.43.17 → 1.43.18	2024-08-17 15:27:50 -07:00
Ishaan Jaff	b83fa87880	update tpm / rpm limit per model	2024-08-17 15:26:12 -07:00
Krrish Dholakia	db54b66457	style(vertex_httpx.py): make vertex error string more helpful	2024-08-17 15:09:55 -07:00
Ishaan Jaff	671663abe6	docs rate limits per model per api key	2024-08-17 14:50:17 -07:00
Krish Dholakia	be37310e94	Merge pull request #5232 from Penagwin/fix_anthropic_tool_streaming_index Fixes the `tool_use` indexes not being correctly mapped	2024-08-17 14:33:50 -07:00
Ishaan Jaff	a60fc3ad70	Merge pull request #5261 from BerriAI/litellm_set_model_rpm_tpm_limit [Feat-Proxy] set rpm/tpm limits per api key per model	2024-08-17 14:30:54 -07:00
Ishaan Jaff	653d2e6ce0	fix parallel request limiter tests	2024-08-17 14:21:59 -07:00
Ishaan Jaff	221e5b829b	fix parallel request limiter	2024-08-17 14:14:12 -07:00
Krish Dholakia	5731287f1b	Merge pull request #5221 from kiriloman/adjust-pricing-file [PRICING] Use specific llama2 and llama3 model names in Ollama	2024-08-17 14:03:20 -07:00
Krish Dholakia	1a3b686580	Merge pull request #5219 from dhlidongming/fix-messages-length-check Fix incorrect message length check in cost calculator	2024-08-17 14:01:59 -07:00
Krish Dholakia	ff6ff133ee	Merge pull request #5260 from BerriAI/google_ai_studio_pass_through Pass-through endpoints for Gemini - Google AI Studio	2024-08-17 13:51:51 -07:00
Krrish Dholakia	0df41653f3	docs(google_ai_studio.md): add docs on google ai studio pass through endpoints	2024-08-17 13:47:05 -07:00
Ishaan Jaff	b35b09ea93	docs clean up emojis	2024-08-17 13:30:11 -07:00
Ishaan Jaff	9b0bd54571	docs cleanup - reduce emojis	2024-08-17 13:28:34 -07:00
Ishaan Jaff	68b54bed85	add tpm limits per api key per model	2024-08-17 13:20:55 -07:00
Krrish Dholakia	fd44cf8d26	feat(pass_through_endpoints.py): support streaming requests	2024-08-17 12:46:57 -07:00
Ishaan Jaff	fa96610bbc	fix async_pre_call_hook in parallel request limiter	2024-08-17 12:42:28 -07:00
Ishaan Jaff	feb8c3c5b4	Merge pull request #5259 from BerriAI/litellm_return_remaining_tokens_in_header [Feat] return `x-litellm-key-remaining-requests-{model}`: 1, `x-litellm-key-remaining-tokens-{model}: None` in response headers	2024-08-17 12:41:16 -07:00
Ishaan Jaff	ee0f772b5c	feat return rmng tokens for model for api key	2024-08-17 12:35:10 -07:00
Krrish Dholakia	bc0023a409	feat(google_ai_studio_endpoints.py): support pass-through endpoint for all google ai studio requests New Feature	2024-08-17 10:46:59 -07:00
Ishaan Jaff	5985c7e933	feat - use common helper for getting model group	2024-08-17 10:46:04 -07:00
Ishaan Jaff	d630f77b73	show correct metric	2024-08-17 10:12:23 -07:00
Ishaan Jaff	412d30d362	add litellm-key-remaining-tokens on prometheus	2024-08-17 10:02:20 -07:00
Ishaan Jaff	785482f023	feat add settings for rpm/tpm limits for a model	2024-08-17 09:16:01 -07:00
Krrish Dholakia	b56ecd7e02	fix(pass_through_endpoints.py): fix returned response headers for pass-through endpoints	2024-08-17 09:00:00 -07:00
Krrish Dholakia	08411f37b4	docs(vertex_ai.md): cleanup docs	2024-08-17 08:38:01 -07:00
Krrish Dholakia	b1bed459b4	bump: version 1.43.16 → 1.43.17	2024-08-16 21:34:35 -07:00
Krish Dholakia	75af146f0e	Merge pull request #5254 from BerriAI/litellm_log_model_price_information s3 - Log model price information	2024-08-16 19:34:23 -07:00
Krish Dholakia	f3e17cd692	Merge branch 'main' into litellm_log_model_price_information	2024-08-16 19:34:16 -07:00
Krish Dholakia	a8dd2b6910	Merge pull request #5244 from BerriAI/litellm_better_error_logging_sentry refactor: replace .error() with .exception() logging for better debugging on sentry	2024-08-16 19:16:20 -07:00
Krish Dholakia	6b1be4783a	Merge pull request #5251 from Manouchehri/oidc-improvements-20240816 (oidc): Add support for loading tokens via a file, env var, and path in env var	2024-08-16 19:15:31 -07:00

16568 commits