Commit graph

15179 commits

Author SHA1 Message Date
Krrish Dholakia
0e222cf76b feat(lakera_ai.py): support lakera custom thresholds + custom api base
Allows user to configure thresholds to trigger prompt injection rejections
2024-08-06 15:21:45 -07:00
David Manouchehri
533426e876 Add mistral.mistral-large-2407-v1:0 on Amazon Bedrock. 2024-07-25 19:36:18 -07:00
fracapuano
c86f8e1eb4 fix: now supports single tokens prediction 2024-07-25 19:35:52 -07:00
Krrish Dholakia
a190bc750f fix(custom_llm.py): pass input params to custom llm 2024-07-25 19:35:52 -07:00
Krrish Dholakia
f5fd9bd3ce feat(proxy_server.py): support custom llm handler on proxy 2024-07-25 19:35:52 -07:00
Krrish Dholakia
acda75b96f docs(custom_llm_server.md): add calling custom llm server to docs 2024-07-25 19:35:48 -07:00
Krrish Dholakia
3b8f357af4 feat(utils.py): support async streaming for custom llm provider 2024-07-25 19:35:48 -07:00
Krrish Dholakia
8c12f7f820 feat(utils.py): support sync streaming for custom llm provider 2024-07-25 19:35:48 -07:00
Krrish Dholakia
7a5c1e7599 fix(custom_llm.py): support async completion calls 2024-07-25 19:35:48 -07:00
Krrish Dholakia
b9b0612c79 feat(custom_llm.py): initial working commit for writing your own custom LLM handler
Fixes https://github.com/BerriAI/litellm/issues/4675

 Also Addresses https://github.com/BerriAI/litellm/discussions/4677
2024-07-25 19:35:48 -07:00
Ishaan Jaff
7d68536763 docs - add info about routing strategy on load balancing docs 2024-07-25 19:35:48 -07:00
Ishaan Jaff
fe3c57e1c1 feat support audio health checks for azure 2024-07-25 19:35:48 -07:00
Ishaan Jaff
5ff0e47f22 docs add example on using text to speech models 2024-07-25 19:35:48 -07:00
Ishaan Jaff
1a89f9e86d feat - support health check audio_speech 2024-07-25 19:35:48 -07:00
Ishaan Jaff
734d0c3643 fix whisper health check with litellm 2024-07-25 19:35:48 -07:00
Krrish Dholakia
c23cf1ecc5 fix(router.py): add support for diskcache to router 2024-07-25 19:35:48 -07:00
Krrish Dholakia
a5ecf6b1c1 fix(proxy_server.py): check if input list > 0 before indexing into it
resolves 'list index out of range' error
2024-07-25 19:35:48 -07:00
David Manouchehri
fb1a805959 Check for converse support first. 2024-07-25 19:35:40 -07:00
David Manouchehri
2dd30d59ae Support tool calling for Llama 3.1 on Amazon bedrock. 2024-07-25 19:35:40 -07:00
David Manouchehri
35776d7e73 Add Llama 3.1 405b for Bedrock 2024-07-25 19:35:40 -07:00
Krrish Dholakia
52648ebac6 docs(enterprise.md): cleanup docs 2024-07-25 19:35:40 -07:00
Krrish Dholakia
24ff9882ee docs(enterprise.md): cleanup docs 2024-07-25 19:35:40 -07:00
Krrish Dholakia
9ec34313b0 docs(caching.md): update caching docs to include ttl info 2024-07-25 19:35:40 -07:00
Krrish Dholakia
86973d6aca fix(main.py): fix calling openai gpt-3.5-turbo-instruct via /completions
Fixes https://github.com/BerriAI/litellm/issues/749
2024-07-25 19:35:40 -07:00
Krrish Dholakia
9955b1b486 fix(internal_user_endpoints.py): support updating budgets for /user/update 2024-07-25 19:35:29 -07:00
Ishaan Jaff
993920ca6c docs add mistral api large 2 2024-07-25 19:35:29 -07:00
Ishaan Jaff
53075030ff feat - add mistral large 2 2024-07-25 19:35:29 -07:00
Ishaan Jaff
bb44d69152 bump: version 1.42.0 → 1.42.1 2024-07-25 19:35:29 -07:00
Ishaan Jaff
910597fcfa docs groq models 2024-07-25 19:35:29 -07:00
Ishaan Jaff
6579095124 feat - add groq/llama-3.1 2024-07-25 19:35:29 -07:00
Krrish Dholakia
11cdb79fc0 test: cleanup testing 2024-07-25 19:35:29 -07:00
Krrish Dholakia
2c71f6dd04 feat(auth_check.py): support using redis cache for team objects
Allows team update / check logic to work across instances instantly
2024-07-25 19:35:29 -07:00
Krish Dholakia
e6963217ba
Merge branch 'main' into litellm_parallel_requests 2024-07-24 19:25:56 -07:00
Krish Dholakia
0ac7736b1f
Merge pull request #4638 from friendliai/feat/friendli-dedicated-endpoint
feat: add support for friendliai dedicated endpoint
2024-07-24 19:23:15 -07:00
wslee
40bb165108 support dynamic api base 2024-07-25 11:14:38 +09:00
wslee
dd10da4d46 add support for friendli dedicated endpoint 2024-07-25 11:14:35 +09:00
Krrish Dholakia
f35af3bf1c test(test_completion.py): update azure extra headers 2024-07-24 18:42:50 -07:00
Ishaan Jaff
b93b2636a9
Update README.md 2024-07-24 16:51:40 -07:00
Krrish Dholakia
b5c5ed2209 fix(key_management_endpoints.py): if budget duration set, set budget_reset_at 2024-07-24 15:02:22 -07:00
Ishaan Jaff
dc3b39ca71
Merge pull request #4864 from BerriAI/docs_add_using_groq_with_proxy
doc example using litellm proxy with groq
2024-07-24 14:34:12 -07:00
Ishaan Jaff
fe0b0ddaaa doc example using litellm proxy with groq 2024-07-24 14:33:49 -07:00
Ishaan Jaff
53dd47c5cb
Merge pull request #4862 from BerriAI/litellm_fix_unsupported_params_Error
[Fix-litellm python] Raise correct error for UnsupportedParams Error
2024-07-24 14:26:25 -07:00
Krrish Dholakia
65705fde25 test(test_embedding.py): add simple azure embedding ad token test
Addresses https://github.com/BerriAI/litellm/issues/4859#issuecomment-2248838617
2024-07-24 13:38:03 -07:00
Krrish Dholakia
77ffee4e2e test(test_completion.py): add basic test to confirm azure ad token flow works as expected 2024-07-24 13:07:25 -07:00
Krrish Dholakia
99d8b0ad68 fix(bedrock_httpx.py): fix async client check 2024-07-24 12:43:52 -07:00
Ishaan Jaff
30c27b3f92 test UnsupportedParamsError 2024-07-24 12:21:22 -07:00
Ishaan Jaff
8ea4b73c27 add UnsupportedParamsError to litellm exceptions 2024-07-24 12:20:14 -07:00
Ishaan Jaff
11512c057d feat use UnsupportedParamsError as litellm error type 2024-07-24 12:19:10 -07:00
Krrish Dholakia
d9539e518e build(docker-compose.yml): add prometheus scraper to docker compose
persists prometheus data across restarts
2024-07-24 10:09:23 -07:00
Ishaan Jaff
ac7f6598f3
Merge pull request #4853 from BerriAI/litellm_support_tags
[Feat] - Support Logging tags on langsmith
2024-07-24 09:51:56 -07:00