2831 commits

Author SHA1 Message Date
Krrish Dholakia
c7fd626805 docs(team_logging.md): add key-based logging to docs 2024-08-14 21:49:55 -07:00
Krrish Dholakia
3487d84fcc docs(pass_through.md): add doc on using langfuse client sdk w/ litellm proxy 2024-08-14 21:43:31 -07:00
Ishaan Jaff
1f631606a5 Merge pull request #5210 from BerriAI/litellm_add_prompt_caching_support
[Feat] Add Anthropic API Prompt Caching Support
2024-08-14 17:43:01 -07:00
Ishaan Jaff
912acb1cae docs using proxy with context caching anthropic 2024-08-14 17:42:48 -07:00
Ishaan Jaff
2267b8a59f docs add examples with litellm proxy 2024-08-14 17:13:26 -07:00
Ishaan Jaff
fd122aa7a3 docs add examples doing context caching anthropic sdk 2024-08-14 17:07:51 -07:00
Ishaan Jaff
e0ff4823d0 add test for caching tool calls 2024-08-14 16:19:14 -07:00
Ishaan Jaff
45e367d4d4 docs Caching - Continuing Multi-Turn Convo 2024-08-14 15:26:25 -07:00
Ishaan Jaff
69a640e9c4 test anthropic prompt caching 2024-08-14 14:59:46 -07:00
Krrish Dholakia
179dd7b893 docs(model_management.md): add section on adding additional model information to proxy config 2024-08-14 14:39:48 -07:00
Ishaan Jaff
acadabe6c9 use litellm_ prefix for new deployment metrics 2024-08-14 09:08:14 -07:00
Krrish Dholakia
4cef6df4cf docs(sidebar.js): cleanup docs 2024-08-14 09:04:52 -07:00
Zbigniew Łukasiak
963c921c5a Mismatch in example fixed 2024-08-14 15:07:10 +02:00
Ishaan Jaff
4d2cedfdb6 Merge pull request #5191 from BerriAI/litellm_load_config_from_s3
[Feat] Allow loading LiteLLM config from s3 buckets
2024-08-13 21:19:16 -07:00
Ishaan Jaff
6f7b204294 docs - set litellm config as s3 object 2024-08-13 20:26:29 -07:00
Keith Stevens
17c6a4e532 Improving the proxy docs for configuring with vllm 2024-08-13 16:07:41 -07:00
Ishaan Jaff
b24da18d2d Merge pull request #5180 from BerriAI/litellm_allow_controlling_logged_tags_langfuse
[Feat-Proxy+langfuse] LiteLLM-specific Tags on Langfuse - `cache_hit`, `cache_key`
2024-08-13 13:50:01 -07:00
Krrish Dholakia
7e99cfe938 docs(user_keys.md): cleanup instructor docs 2024-08-13 13:15:46 -07:00
Krrish Dholakia
ac8c0ecd85 docs(user_keys.md): cleanup docs 2024-08-13 13:14:27 -07:00
Ishaan Jaff
81680d6b1a docs control langfuse specific tags 2024-08-13 12:48:42 -07:00
Krrish Dholakia
718c2cfa4e docs(team_logging.md): cleanup docs 2024-08-12 19:53:05 -07:00
Krrish Dholakia
fdd9a07051 fix(utils.py): Break out of infinite streaming loop
Fixes https://github.com/BerriAI/litellm/issues/5158
2024-08-12 14:00:43 -07:00
Ishaan Jaff
dc8f9e7241 docs mark oidc as beta 2024-08-12 09:01:36 -07:00
Krrish Dholakia
8cbf8d5671 docs(perplexity.md): show how to get 'return_citations' 2024-08-12 09:01:14 -07:00
Ishaan Jaff
e46009f3d2 Merge pull request #5154 from BerriAI/litellm_send_prometheus_fallbacks_from_slack
[Feat-Proxy] send prometheus fallbacks stats to slack
2024-08-10 17:14:01 -07:00
Ishaan Jaff
cc3316104f doc new prometheus metrics 2024-08-10 17:13:36 -07:00
Ishaan Jaff
ffb7f9f280 add fallback_reports as slack alert 2024-08-10 15:26:32 -07:00
Krrish Dholakia
0ea056971c docs(prefix.md): add prefix support to docs 2024-08-10 13:55:47 -07:00
Krrish Dholakia
f10970f1b1 docs(custom_llm_server.md): clarify what to use for modifying incoming/outgoing calls 2024-08-10 12:58:43 -07:00
Ishaan Jaff
0acc6efa8f docs clean sidebar 2024-08-09 18:09:11 -07:00
Ishaan Jaff
4c08d1a21d docs migration policy 2024-08-09 18:06:37 -07:00
Ishaan Jaff
09000d4b66 docs add migration policy 2024-08-09 18:03:37 -07:00
Ishaan Jaff
fc9086759d docs prometheus metrics 2024-08-09 09:07:31 -07:00
Ishaan Jaff
a1c3167853 doc Grounding vertex ai 2024-08-09 08:31:38 -07:00
Ishaan Jaff
baaa444c8f docs fix typo 2024-08-09 08:17:36 -07:00
Krrish Dholakia
dde477494f docs(self_serve.md): add internal_user_budget_duration to docs 2024-08-08 23:54:26 -07:00
Ishaan Jaff
369ddfb49e docs vertex context caching 2024-08-08 17:18:12 -07:00
Ishaan Jaff
84c05a57d6 docs use (LLM Gateway) in some places 2024-08-08 17:00:52 -07:00
Ishaan Jaff
f179759672 docs vertex ai 2024-08-08 16:12:36 -07:00
Ishaan Jaff
e671ae58e3 Merge pull request #5119 from BerriAI/litellm_add_gemini_context_caching_litellm
[Feat-Proxy] Add Support for VertexAI context caching
2024-08-08 16:08:58 -07:00
Ishaan Jaff
d78c38f8e7 docs vertex 2024-08-08 16:07:14 -07:00
Ishaan Jaff
a3dd3a19fa docs cachedContent endpoint 2024-08-08 16:06:23 -07:00
Ishaan Jaff
8ad5a40283 doc on using litellm proxy with vertex ai content caching 2024-08-08 11:45:46 -07:00
Krrish Dholakia
2710bec02d docs(scheduler.md): cleanup docs to use /chat/completion endpoint 2024-08-07 21:49:06 -07:00
Krish Dholakia
e1610d37b9 Merge pull request #5099 from BerriAI/litellm_personal_user_budgets
fix(user_api_key_auth.py): respect team budgets over user budget, if key belongs to team
2024-08-07 20:00:16 -07:00
Krish Dholakia
baf01b47d8 Merge branch 'main' into litellm_personal_user_budgets 2024-08-07 19:59:50 -07:00
Krrish Dholakia
7e1f296981 docs(self_serve.md): cleanup docs on how to onboard new users + teams 2024-08-07 19:58:36 -07:00
Krrish Dholakia
400653992c feat(router.py): allow using .acompletion() for request prioritization
allows /chat/completion endpoint to work for request prioritization calls
2024-08-07 16:43:12 -07:00
Ishaan Jaff
e585dfba92 docs prom 2024-08-07 16:03:11 -07:00
Ishaan Jaff
04b201efed Merge pull request #5098 from BerriAI/litellm_provider_wildcard_routing
[Feat-Router + Proxy] Add provider wildcard routing
2024-08-07 14:51:42 -07:00