1560 commits

Author SHA1 Message Date
Ishaan Jaff
9638e244f8 Merge pull request #2846 from BerriAI/litellm_docs_delete_cache_keys
docs - `delete` cache keys
2024-04-04 14:07:50 -07:00
Krrish Dholakia
d6cc1c8522 docs(vertex.md): add docs on setting google_application_credentials 2024-04-04 13:49:03 -07:00
Ishaan Jaff
6e7e46a183 docs - delete cache keys 2024-04-04 12:20:14 -07:00
Krrish Dholakia
7e37f4c493 docs(token_auth.md): update links 2024-04-03 13:23:30 -07:00
Krrish Dholakia
6a5892e823 docs(token_auth.md): update docs 2024-04-03 13:21:25 -07:00
Ishaan Jaff
4f1cb06154 (docs) openai wildcard models 2024-04-01 19:53:34 -07:00
Krrish Dholakia
abf87fe448 fix(proxy_server.py): don't require scope for team-based jwt access
If a team with the client_id exists, it should be allowed to make a request; if it doesn't, it should return an error, as discussed.
2024-04-01 18:52:00 -07:00
Krrish Dholakia
91120df3f7 docs(openai.md): fix docs to include example of calling openai on proxy 2024-04-01 12:09:22 -07:00
Krrish Dholakia
0072174ef9 docs(routing.md): refactor docs to show how to use pre-call checks and fallback across model groups 2024-04-01 11:21:27 -07:00
Ishaan Jaff
2549646d12 Merge pull request #2779 from DaxServer/update-proxy-dockerfile-branch
fix(docs): Correct Docker pull command in deploy.md
2024-04-01 07:10:45 -07:00
DaxServer
10fdf487ae fix(docs): Correct Docker pull command in deploy.md
Corrected the Docker pull command in deploy.md to remove duplicated 'docker pull' command.
2024-03-31 20:10:00 +02:00
DaxServer
947ba9d15b docs: Update references to Ollama repository url
Updated references to the Ollama repository URL from https://github.com/jmorganca/ollama to https://github.com/ollama/ollama.
2024-03-31 19:35:37 +02:00
Krrish Dholakia
08a78ba85a docs(deploy.md): fix docs for litlelm-database docker run example 2024-03-30 20:08:27 -07:00
Krish Dholakia
1a30ad0d28 Merge pull request #2748 from BerriAI/litellm_anthropic_tool_calling_list_parsing_fix
fix(factory.py): parse list in xml tool calling response (anthropic)
2024-03-30 11:27:02 -07:00
Krrish Dholakia
d768d29c27 docs(users.md): fix doc for end-user param 2024-03-29 21:54:07 -07:00
Vincelwt
35c279eff8 Merge branch 'main' into main 2024-03-30 13:21:53 +09:00
Ishaan Jaff
7b7958e440 Merge pull request #2628 from BerriAI/dependabot/npm_and_yarn/docs/my-website/webpack-dev-middleware-5.3.4
build(deps): bump webpack-dev-middleware from 5.3.3 to 5.3.4 in /docs/my-website
2024-03-29 16:12:29 -07:00
Ishaan Jaff
00c60188a4 Merge pull request #2689 from BerriAI/dependabot/npm_and_yarn/docs/my-website/express-4.19.2
build(deps): bump express from 4.18.2 to 4.19.2 in /docs/my-website
2024-03-29 16:12:17 -07:00
Ishaan Jaff
6a51d0be65 (docs) grafana metrics 2024-03-29 14:38:37 -07:00
Ishaan Jaff
41f9622f29 (docs) grafana / prometheus 2024-03-29 14:25:45 -07:00
Ishaan Jaff
925ff3cefe (docs) /metrics endpoint 2024-03-29 13:36:24 -07:00
Ishaan Jaff
6354d22cc6 (docs) cleanup 2024-03-29 13:10:26 -07:00
Krrish Dholakia
08253b3414 docs: show how tool calling parsing works + how to get raw model response 2024-03-29 11:58:49 -07:00
Krrish Dholakia
62ac3e1de4 fix(sagemaker.py): support 'model_id' param for sagemaker
allow passing inference component param to sagemaker in the same format as we handle this for bedrock
2024-03-29 08:43:17 -07:00
Krrish Dholakia
91b29de394 docs(prod.md): update prod docs with batch writing info 2024-03-28 23:42:43 -07:00
Krrish Dholakia
9f27b7578b docs(prod.md): doc improvements 2024-03-28 19:04:24 -07:00
Krrish Dholakia
ca8d6e5e51 docs(prod.md): fix docker run commands 2024-03-28 18:51:53 -07:00
Krrish Dholakia
473bab8a19 refactor(proxy/utils.py): add more debug logs 2024-03-28 18:44:35 -07:00
Krrish Dholakia
1619ca9081 docs(prod.md): cleanup doc 2024-03-28 18:34:09 -07:00
Krrish Dholakia
87edf12d22 docs(prod.md): improve docs 2024-03-28 15:35:07 -07:00
Krrish Dholakia
b073d9366f docs(prod.md): update docs with litellm spend logs server machine spec 2024-03-28 15:26:26 -07:00
Krrish Dholakia
8361a824ba docs(prod.md): add litellm spend logs server to docs 2024-03-28 15:15:10 -07:00
Krish Dholakia
b828290c81 Merge pull request #2722 from BerriAI/litellm_db_perf_improvement
feat(proxy/utils.py): enable updating db in a separate server
2024-03-28 14:56:14 -07:00
Krrish Dholakia
cc8f5aa7ef build(ghcr_deploy.yml): deploy spend logs server docker image
make it easy for user to deploy a separate spend logs server
2024-03-28 13:39:52 -07:00
Krrish Dholakia
8656f7b71d docs(gemini.md): add link to google ai studio api key 2024-03-28 10:12:59 -07:00
Ishaan Jaff
efa57bb7a7 Merge pull request #2729 from BerriAI/litellm_show_better_error_msg_with_role
(fix) show user their role when rejecting /team/new requests
2024-03-28 07:42:31 -07:00
Ishaan Jaff
9e96241ce9 (docs) update UI 2024-03-27 22:24:56 -07:00
Krrish Dholakia
74637075ef docs(call_hooks.md): show result in docs 2024-03-27 21:04:51 -07:00
Krrish Dholakia
7fece25ccb docs(call_hooks.md): show admin how to enforce user param 2024-03-27 20:58:26 -07:00
Krrish Dholakia
eec4e62d52 docs(instructor.md): improve default example 2024-03-27 12:51:05 -07:00
Krrish Dholakia
b51c21b76a docs(token_auth.md): cleanup docs 2024-03-26 21:42:07 -07:00
Krrish Dholakia
a10797c01f docs(token_auth.md): update docs 2024-03-26 21:41:08 -07:00
Krrish Dholakia
badb2166de docs(token_auth.md): update jwt auth docs with new info 2024-03-26 21:33:03 -07:00
Krrish Dholakia
7b38dbbbe9 docs(enterprise.md): update docs to turn on/off llm guard per key 2024-03-26 18:02:44 -07:00
Krish Dholakia
d51f12ca44 Merge pull request #2704 from BerriAI/litellm_jwt_auth_improvements_3
fix(handle_jwt.py): enable team-based jwt-auth access
2024-03-26 16:06:56 -07:00
Krrish Dholakia
a0f55b92e6 fix(handle_jwt.py): support public key caching ttl param 2024-03-26 14:32:55 -07:00
Ishaan Jaff
f5f7e344c7 Merge branch 'main' into litellm_remove_litellm_telemetry 2024-03-26 11:35:02 -07:00
Ishaan Jaff
d321f6f638 (docs) switch of litellm telemetry 2024-03-26 11:19:55 -07:00
Ishaan Jaff
bf5b55df69 (fix) prod.md 2024-03-25 22:30:22 -07:00
Krrish Dholakia
a9737adcc0 docs(proxy/caching.md): add ttl param to proxy/caching.md 2024-03-25 13:46:52 -07:00