Commit graph

16952 commits

Author SHA1 Message Date
Krrish Dholakia
50383cfe62 docs(custom_llm_server.md): add streaming example for custom llm call
Reference - https://github.com/BerriAI/litellm/issues/5147
2024-08-21 15:45:54 -07:00
Krish Dholakia
30df7c7429
Merge pull request #5253 from the-crypt-keeper/mistral-add-stop
Add the "stop" parameter to the mistral API interface
2024-08-21 15:40:54 -07:00
Krish Dholakia
9452b9398d
Merge pull request #5304 from beltranaceves/main
Fixed code snippet import typo in Structured Output docs
2024-08-21 15:40:00 -07:00
Krish Dholakia
81de1a0e90
Merge pull request #5318 from mikstew/mikio/checksum-annotation
add checksum/config pod annotation
2024-08-21 15:39:43 -07:00
Krish Dholakia
e961810139
Merge pull request #5323 from MarkRx/feature/langsmith-ids
Support LangSmith parent_run_id, trace_id, session_id
2024-08-21 15:38:50 -07:00
Krish Dholakia
7c34ead54f
Merge pull request #5322 from BerriAI/litellm_disable_storing_master_key_hash_in_db
feat(proxy_server.py): support disabling storing master key hash in db
2024-08-21 15:37:31 -07:00
Krish Dholakia
72169fd5c4
Merge branch 'main' into litellm_disable_storing_master_key_hash_in_db 2024-08-21 15:37:25 -07:00
Krrish Dholakia
008fa494a7 fix(router.py): fix linting error 2024-08-21 15:35:10 -07:00
Ishaan Jaff
2c7f5a0d27 fix test test_vertexai_multimodal_embedding 2024-08-21 15:25:13 -07:00
Krrish Dholakia
d995fff7a2 fix(vertex_httpx.py): fix json schema call to pass in response_mime_type=="application/json" 2024-08-21 15:24:04 -07:00
Ishaan Jaff
0e9efb3669 feat add multimodal embeddings on vertex 2024-08-21 15:19:48 -07:00
Ishaan Jaff
35781ab8d5 add multi modal vtx embedding 2024-08-21 15:05:59 -07:00
Krrish Dholakia
f24075bcaf test(test_caching.py): skip local test 2024-08-21 15:05:18 -07:00
Krrish Dholakia
e2d7539690 feat(caching.py): redis cluster support
Closes https://github.com/BerriAI/litellm/issues/4358
2024-08-21 15:01:52 -07:00
Ishaan Jaff
7e3dc83c0d add initial support for multimodal_embedding vertex 2024-08-21 14:29:05 -07:00
Ishaan Jaff
dd00cf2a97 add VertexMultimodalEmbeddingRequest type 2024-08-21 14:25:47 -07:00
Krish Dholakia
509ae0ca71
Merge pull request #5308 from BerriAI/litellm_team_admin_permissions
feat(user_api_key_auth.py): allow team admin to add new members to team
2024-08-21 14:21:22 -07:00
Krrish Dholakia
7aec6f0f2a fix(litellm_pre_call_utils.py): handle dynamic keys via api correctly 2024-08-21 13:37:21 -07:00
Ishaan Jaff
da2e5b2bb0
Merge pull request #5324 from BerriAI/litellm_add_qdrant_litellm_proxy
[Feat-Proxy] Add Qdrant Semantic Caching Support
2024-08-21 13:14:59 -07:00
MarkRx
19e5299a6d Support LangSmith parent_run_id, trace_id, session_id 2024-08-21 16:09:30 -04:00
Ishaan Jaff
d6493b0e7f docs semantic caching qdrant 2024-08-21 13:03:41 -07:00
Ishaan Jaff
e7ecb2fe3a fix qdrant litellm on proxy 2024-08-21 12:52:29 -07:00
Krrish Dholakia
c216e2cc2d build(config.yml): pin openai version 2024-08-21 12:50:44 -07:00
Krrish Dholakia
83bed56b66 fix(internal_user_endpoints.py): pass in user api key dict value 2024-08-21 12:46:43 -07:00
Ishaan Jaff
c6dfd2d276 fixes for using qdrant with litellm proxy 2024-08-21 12:36:41 -07:00
Krrish Dholakia
89014dfc07 feat(proxy_server.py): support disabling storing master key hash in db, for spend tracking 2024-08-21 12:35:37 -07:00
Ishaan Jaff
9dbd1b089b fix tg ai -deprecated model 2024-08-21 12:14:39 -07:00
Ishaan Jaff
228a0bd6f7 fix qdrant semantic caching test 2024-08-21 12:11:49 -07:00
Ishaan Jaff
428a74be07 fix drant url 2024-08-21 12:09:09 -07:00
Krrish Dholakia
8812da04e3 fix(vertex_httpx.py): Fix tool calling with empty param list
Fixes https://github.com/BerriAI/litellm/issues/5055
2024-08-21 09:03:34 -07:00
Ishaan Jaff
7d0196191f
Merge pull request #5018 from haadirakhangi/main
Qdrant Semantic Caching
2024-08-21 08:50:43 -07:00
Krrish Dholakia
1b6db8359a fix(utils.py): support openrouter streaming
Fixes https://github.com/BerriAI/litellm/issues/5080
2024-08-21 08:48:58 -07:00
Ishaan Jaff
dc261dd07a
Merge pull request #5317 from BerriAI/docs_use_litellm_gcp
docs - use litellm on gcp cloud run
2024-08-21 08:46:17 -07:00
Ishaan Jaff
4fb359edd8 docs - use litellm on gcp cloud run 2024-08-21 08:39:24 -07:00
Mikio Stewart
9d0103e224 fix configmap name in print 2024-08-21 08:37:21 -07:00
Krrish Dholakia
5ba517819c test(test_proxy_server.py): fix test to specify user role 2024-08-21 08:37:04 -07:00
Krish Dholakia
b2e182a42c
Merge pull request #5314 from thiswillbeyourgithub/fix_missing_openrouter_claude_sonnet_beta
fix/docs: was missing a beta model from openrouter of claude sonnet
2024-08-21 08:31:21 -07:00
Mikio Stewart
d129b848fe add checksum annotation 2024-08-21 08:30:55 -07:00
Ishaan Jaff
42df6b42c9 fix groq/3.1 reasoning model 2024-08-21 08:22:26 -07:00
Krrish Dholakia
04fc0bd7b3 feat(litellm_pre_call_utils.py): support passing tags/spend logs metadata from keys/team metadata to request 2024-08-21 08:13:36 -07:00
Ishaan Jaff
09aacfc6a8
Merge pull request #5315 from paul-gauthier/main
openrouter/anthropic/claude-3.5-sonnet: supports_assistant_prefill:true
2024-08-21 07:27:18 -07:00
Krrish Dholakia
8406016096 test(test_completion.py): fix gemini rate limit error 2024-08-21 07:17:27 -07:00
Krrish Dholakia
a6ee1e04fe test(test_completion.py): fix test 2024-08-21 07:01:02 -07:00
Paul Gauthier
2432cbc912 openrouter/anthropic/claude-3.5-sonnet: supports_assistant_prefill:true 2024-08-21 05:55:10 -07:00
thiswillbeyourgithub
00f7238c64 fix: was missing openrouter beta model from claude sonnet 2024-08-21 14:45:09 +02:00
Krrish Dholakia
de8eae20d9 docs(caching.md): add doc on enabling caching for just rate limiting features 2024-08-20 22:10:42 -07:00
Krrish Dholakia
a7578e9f2c fix(litellm_logging.py): add stricter check for special param being non none 2024-08-20 21:35:02 -07:00
Krrish Dholakia
8e9117f701 fix(utils.py): ensure consistent cost calc b/w returned header and logged object 2024-08-20 19:01:20 -07:00
Ishaan Jaff
f51f7750c0 bump: version 1.44.0 → 1.44.1 2024-08-20 17:35:06 -07:00
Ishaan Jaff
cea7b73015 enforece guardrails per API Key as enterprise 2024-08-20 17:34:28 -07:00