Commit graph

1533 commits

Author SHA1 Message Date
Ishaan Jaff
a78ed81cd9 (docs) grafana metrics 2024-03-29 14:38:37 -07:00
Ishaan Jaff
24570bc075 (docs) grafana / prometheus 2024-03-29 14:25:45 -07:00
Ishaan Jaff
c2283235a1 (docs) /metrics endpoint 2024-03-29 13:36:24 -07:00
Ishaan Jaff
ffa29ddfef (docs) cleanup 2024-03-29 13:10:26 -07:00
Krrish Dholakia
d547944556 fix(sagemaker.py): support 'model_id' param for sagemaker
allow passing inference component param to sagemaker in the same format as we handle this for bedrock
2024-03-29 08:43:17 -07:00
Krrish Dholakia
cdb940d504 docs(prod.md): update prod docs with batch writing info 2024-03-28 23:42:43 -07:00
Krrish Dholakia
85a5291142 docs(prod.md): doc improvements 2024-03-28 19:04:24 -07:00
Krrish Dholakia
da7a00d6d2 docs(prod.md): fix docker run commands 2024-03-28 18:51:53 -07:00
Krrish Dholakia
7c44b32cc2 refactor(proxy/utils.py): add more debug logs 2024-03-28 18:44:35 -07:00
Krrish Dholakia
eb318afe52 docs(prod.md): cleanup doc 2024-03-28 18:34:09 -07:00
Krrish Dholakia
ced902f822 docs(prod.md): improve docs 2024-03-28 15:35:07 -07:00
Krrish Dholakia
eb3806feba docs(prod.md): update docs with litellm spend logs server machine spec 2024-03-28 15:26:26 -07:00
Krrish Dholakia
c15df27c1e docs(prod.md): add litellm spend logs server to docs 2024-03-28 15:15:10 -07:00
Krish Dholakia
934a9ac2b4
Merge pull request #2722 from BerriAI/litellm_db_perf_improvement
feat(proxy/utils.py): enable updating db in a separate server
2024-03-28 14:56:14 -07:00
Krrish Dholakia
a09818e72e build(ghcr_deploy.yml): deploy spend logs server docker image
make it easy for user to deploy a separate spend logs server
2024-03-28 13:39:52 -07:00
Krrish Dholakia
746cd3da11 docs(gemini.md): add link to google ai studio api key 2024-03-28 10:12:59 -07:00
Ishaan Jaff
5e5f3d5fd2
Merge pull request #2729 from BerriAI/litellm_show_better_error_msg_with_role
(fix) show user their role when rejecting /team/new requests
2024-03-28 07:42:31 -07:00
Ishaan Jaff
d5f6fe4eff (docs) update UI 2024-03-27 22:24:56 -07:00
Krrish Dholakia
526aa9230f docs(call_hooks.md): show result in docs 2024-03-27 21:04:51 -07:00
Krrish Dholakia
e6b929fff3 docs(call_hooks.md): show admin how to enforce user param 2024-03-27 20:58:26 -07:00
Krrish Dholakia
d08da5b05a docs(instructor.md): improve default example 2024-03-27 12:51:05 -07:00
Krrish Dholakia
90b859ebcb docs(token_auth.md): cleanup docs 2024-03-26 21:42:07 -07:00
Krrish Dholakia
282176c502 docs(token_auth.md): update docs 2024-03-26 21:41:08 -07:00
Krrish Dholakia
ca84e7a8e8 docs(token_auth.md): update jwt auth docs with new info 2024-03-26 21:33:03 -07:00
Krrish Dholakia
bf7cc943fb docs(enterprise.md): update docs to turn on/off llm guard per key 2024-03-26 18:02:44 -07:00
Krish Dholakia
0ab708e6f1
Merge pull request #2704 from BerriAI/litellm_jwt_auth_improvements_3
fix(handle_jwt.py): enable team-based jwt-auth access
2024-03-26 16:06:56 -07:00
Krrish Dholakia
752516df1b fix(handle_jwt.py): support public key caching ttl param 2024-03-26 14:32:55 -07:00
Ishaan Jaff
da503eab18
Merge branch 'main' into litellm_remove_litellm_telemetry 2024-03-26 11:35:02 -07:00
Ishaan Jaff
4d81df3d6f (docs) switch of litellm telemetry 2024-03-26 11:19:55 -07:00
Ishaan Jaff
995c379a63 (fix) prod.md 2024-03-25 22:30:22 -07:00
Krrish Dholakia
16ade7e556 docs(proxy/caching.md): add ttl param to proxy/caching.md 2024-03-25 13:46:52 -07:00
Krrish Dholakia
03b8444d3c docs(token_auth.md): add renaming jwt scope string to docs 2024-03-25 12:49:44 -07:00
Krrish Dholakia
53695943e3 docs(instructor.md): tutorial on using litellm with instructor 2024-03-25 08:35:11 -07:00
Krrish Dholakia
9e9de7f6e2 docs(routing.md): add fallbacks being done in order 2024-03-24 12:13:19 -07:00
Krrish Dholakia
1c60fd0e78 docs(routing.md): add url 2024-03-23 20:03:42 -07:00
Krrish Dholakia
7c74ea8b77 docs(routing.md): add proxy example to pre-call checks in routing docs 2024-03-23 20:00:50 -07:00
Ishaan Jaff
92759ef055
Merge pull request #2670 from BerriAI/litellm_docs_best_practices_prod
(docs) best prod practices
2024-03-23 19:38:22 -07:00
Krish Dholakia
c92fa1af7c
Merge pull request #2669 from BerriAI/litellm_router_pre_call_checks
feat(router.py): enable pre-call checks
2024-03-23 19:38:09 -07:00
Ishaan Jaff
09992a6122 (docs) prod best perf 2024-03-23 19:36:26 -07:00
Ishaan Jaff
d04b4dea3e (docs) best prod practices 2024-03-23 19:29:21 -07:00
Krrish Dholakia
e8e7964025 docs(routing.md): add pre-call checks to docs 2024-03-23 19:10:34 -07:00
Ishaan Jaff
2ae489c506 (docs) update config set_verbose 2024-03-23 18:54:31 -07:00
Ishaan Jaff
04a09830de
Merge pull request #2668 from BerriAI/litellm_update_deploy_docs
[Docs] Add Docs on deploying to EKS Cluster + K8
2024-03-23 18:42:36 -07:00
Ishaan Jaff
30ae52c21e (docs) using litellm on EKS 2024-03-23 17:49:00 -07:00
Ishaan Jaff
f646a4612b
Merge pull request #2667 from BerriAI/litellm_update_gunicorn_instructions
[Docs] update gunicorn instructions - Uvicorn perf is significantly better on K8s
2024-03-23 17:43:09 -07:00
Ishaan Jaff
61d2e91632 (docs) update gunicorn usage 2024-03-23 17:39:07 -07:00
Vivek Aditya
efc90b04c7 minor fix 2024-03-23 12:50:46 +05:30
Vivek Aditya
6bd49c6087 Athina docs updated with information about additional fields and a minor fix in the callback 2024-03-23 12:42:07 +05:30
Krrish Dholakia
265dd5cd4f docs(token_auth.md): add project based auth to docs 2024-03-22 17:27:40 -07:00
Krrish Dholakia
d06b9a5a47 fix(proxy_server.py): enable jwt-auth for users
allow a user to auth into the proxy via jwt's and call allowed routes
2024-03-22 17:08:10 -07:00