litellm-mirror/docs/my-website/docs
Krish Dholakia 539f166166
Support budget/rate limit tiers for keys (#7429)
* feat(proxy/utils.py): get associated litellm budget from db in combined_view for key

allows user to create rate limit tiers and associate those to keys

* feat(proxy/_types.py): update the value of key-level tpm/rpm/model max budget metrics with the associated budget table values if set

allows rate limit tiers to be easily applied to keys

* docs(rate_limit_tiers.md): add doc on setting rate limit / budget tiers

make feature discoverable

* feat(key_management_endpoints.py): return litellm_budget_table value in key generate

make it easy for user to know associated budget on key creation

* fix(key_management_endpoints.py): document 'budget_id' param in `/key/generate`

* docs(key_management_endpoints.py): document budget_id usage

* refactor(budget_management_endpoints.py): refactor budget endpoints into separate file - makes it easier to run documentation testing against it

* docs(test_api_docs.py): add budget endpoints to ci/cd doc test + add missing param info to docs

* fix(customer_endpoints.py): use new pydantic obj name

* docs(user_management_heirarchy.md): add simple doc explaining teams/keys/org/users on litellm

* Litellm dev 12 26 2024 p2 (#7432)

* (Feat) Add logging for `POST v1/fine_tuning/jobs` (#7426)

* init commit ft jobs logging

* add ft logging

* add logging for FineTuningJob

* simple FT Job create test

* (docs) - show all supported Azure OpenAI endpoints in overview (#7428)

* azure batches

* update doc

* docs azure endpoints

* docs endpoints on azure

* docs azure batches api

* docs azure batches api

* fix(key_management_endpoints.py): fix key update to actually work

* test(test_key_management.py): add e2e test asserting ui key update call works

* fix: proxy/_types - fix linting errors

* test: update test

---------

Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>

* fix: test

* fix(parallel_request_limiter.py): enforce tpm/rpm limits on key from tiers

* fix: fix linting errors

* test: fix test

* fix: remove unused import

* test: update test

* docs(customer_endpoints.py): document new model_max_budget param

* test: specify unique key alias

* docs(budget_management_endpoints.py): document new model_max_budget param

* test: fix test

* test: fix tests

---------

Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>
2024-12-26 19:05:27 -08:00
adding_provider Litellm dev 12 25 2025 p2 (#7420) 2024-12-25 18:35:34 -08:00
caching (refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208) 2024-10-14 16:34:01 +05:30
completion docs(input.md): document 'extra_headers' param support (#7268) 2024-12-17 07:19:14 -08:00
debugging docs - fix doc build time errors 2024-06-15 14:58:02 -07:00
embedding Litellm dev 11 21 2024 (#6837) 2024-11-22 01:53:52 +05:30
extras docs(code_quality.md): add doc on litellm code qa 2024-10-02 11:20:15 -04:00
guides LiteLLM Minor Fixes & Improvements (11/23/2024) (#6870) 2024-11-23 15:17:40 +05:30
langchain update langchain documentation to reflect refactor 2024-05-07 17:01:55 +05:30
observability docs: update the docs (#6923) 2024-11-28 03:43:20 +05:30
pass_through (feat) Add Bedrock knowledge base pass through endpoints (#7267) 2024-12-16 22:19:34 -08:00
projects add dbally project 2024-08-22 23:21:40 +02:00
providers (docs) - show all supported Azure OpenAI endpoints in overview (#7428) 2024-12-26 09:01:41 -08:00
proxy Support budget/rate limit tiers for keys (#7429) 2024-12-26 19:05:27 -08:00
tutorials docs: fix dead links 2024-08-26 13:28:25 -07:00
assistants.md [Feat-Proxy] Add Azure Assistants API - Create Assistant, Delete Assistant Support (#5777) 2024-09-18 16:27:33 -07:00
audio_transcription.md docs(fireworks_ai.md): add audio transcription to fireworks ai doc 2024-12-25 21:22:51 -08:00
batches.md docs - batches cost tracking (#7422) 2024-12-25 20:13:26 -08:00
benchmarks.md (docs) add benchmarks on 1K RPS (#6704) 2024-11-11 19:25:53 -08:00
budget_manager.md docs - use consistent name for LiteLLM proxy server 2024-08-03 12:54:07 -07:00
contact.md
contributing.md docs(contributing.md): add docs for contributing to litellm ui 2024-02-23 13:01:34 -08:00
data_security.md docs(data_security.md): cleanup docs 2024-10-01 15:33:10 -04:00
default_code_snippet.md
enterprise.md Litellm dev 12 23 2024 p1 (#7383) 2024-12-23 16:33:31 -08:00
exception_mapping.md Litellm dev 10 26 2024 (#6472) 2024-10-28 15:05:43 -07:00
files_endpoints.md docs files api 2024-12-24 20:46:43 -08:00
fine_tuning.md docs native vertex ft endpoint 2024-08-03 11:56:43 -07:00
getting_started.md Add Helicone 2024-07-08 15:41:35 -07:00
hosted.md docs gdpr regions cloud 2024-07-10 18:00:16 -07:00
image_generation.md Litellm dev 11 21 2024 (#6837) 2024-11-22 01:53:52 +05:30
index.md docs add response format on main pages 2024-12-16 08:41:12 -08:00
load_test.md (docs) add 1k rps load test doc (#6059) 2024-10-04 16:56:34 +05:30
load_test_advanced.md update load test doc 2024-10-04 18:47:26 +05:30
load_test_rpm.md (docs) add 1k rps load test doc (#6059) 2024-10-04 16:56:34 +05:30
load_test_sdk.md (docs) add 1k rps load test doc (#6059) 2024-10-04 16:56:34 +05:30
migration.md
migration_policy.md docs migration policy 2024-08-09 18:06:37 -07:00
moderation.md (docs + fix) Add docs on Moderations endpoint, Text Completion (#6947) 2024-11-27 16:30:48 -08:00
oidc.md (oidc): Improve docs for unofficial provider. 2024-08-16 20:30:41 +00:00
old_guardrails.md docs move lakera to free 2024-08-20 16:38:37 -07:00
projects.md
prompt_injection.md docs(sidebars.js): refactor docs 2024-08-22 18:22:50 -07:00
proxy_api.md
proxy_server.md docs - use consistent name for LiteLLM proxy server 2024-08-03 12:54:07 -07:00
realtime.md Litellm dev 10 22 2024 (#6384) 2024-10-22 21:18:54 -07:00
rerank.md docs infinity rerank api docs 2024-12-19 18:51:55 -08:00
router_architecture.md docs(router_architecture.md): add router architecture docs 2024-11-26 12:54:38 +05:30
routing.md Controll fallback prompts client-side (#7334) 2024-12-20 19:09:53 -08:00
rules.md
scheduler.md docs(scheduler.md): cleanup docs to use /chat/completion endpoint 2024-08-07 21:49:06 -07:00
sdk_custom_pricing.md organize docs 2024-08-03 12:54:07 -07:00
secret.md (feat) Use litellm/ prefix when storing virtual keys in AWS secret manager (#6765) 2024-11-15 18:07:43 -08:00
set_keys.md docs(fireworks_ai.md): add audio transcription to fireworks ai doc 2024-12-25 21:22:51 -08:00
simple_proxy_old_doc.md (docs + testing) Correctly document the timeout value used by litellm proxy is 6000 seconds + add to best practices for prod (#6339) 2024-10-23 14:09:35 +05:30
text_completion.md (docs + fix) Add docs on Moderations endpoint, Text Completion (#6947) 2024-11-27 16:30:48 -08:00
text_to_speech.md docs tts 2024-08-23 18:29:36 -07:00
troubleshoot.md docs(troubleshoot.md): cleanup docs 2024-05-27 09:51:35 -07:00
wildcard_routing.md LiteLLM Minor Fixes & Improvements (11/26/2024) (#6913) 2024-11-28 00:01:38 +05:30