Ishaan Jaff
|
a78ed81cd9
|
(docs) grafana metrics
|
2024-03-29 14:38:37 -07:00 |
|
Ishaan Jaff
|
24570bc075
|
(docs) grafana / prometheus
|
2024-03-29 14:25:45 -07:00 |
|
Ishaan Jaff
|
c2283235a1
|
(docs) /metrics endpoint
|
2024-03-29 13:36:24 -07:00 |
|
Ishaan Jaff
|
ffa29ddfef
|
(docs) cleanup
|
2024-03-29 13:10:26 -07:00 |
|
Krrish Dholakia
|
d547944556
|
fix(sagemaker.py): support 'model_id' param for sagemaker
allow passing inference component param to sagemaker in the same format as we handle this for bedrock
|
2024-03-29 08:43:17 -07:00 |
|
Krrish Dholakia
|
cdb940d504
|
docs(prod.md): update prod docs with batch writing info
|
2024-03-28 23:42:43 -07:00 |
|
Krrish Dholakia
|
85a5291142
|
docs(prod.md): doc improvements
|
2024-03-28 19:04:24 -07:00 |
|
Krrish Dholakia
|
da7a00d6d2
|
docs(prod.md): fix docker run commands
|
2024-03-28 18:51:53 -07:00 |
|
Krrish Dholakia
|
7c44b32cc2
|
refactor(proxy/utils.py): add more debug logs
|
2024-03-28 18:44:35 -07:00 |
|
Krrish Dholakia
|
eb318afe52
|
docs(prod.md): cleanup doc
|
2024-03-28 18:34:09 -07:00 |
|
Krrish Dholakia
|
ced902f822
|
docs(prod.md): improve docs
|
2024-03-28 15:35:07 -07:00 |
|
Krrish Dholakia
|
eb3806feba
|
docs(prod.md): update docs with litellm spend logs server machine spec
|
2024-03-28 15:26:26 -07:00 |
|
Krrish Dholakia
|
c15df27c1e
|
docs(prod.md): add litellm spend logs server to docs
|
2024-03-28 15:15:10 -07:00 |
|
Krish Dholakia
|
934a9ac2b4
|
Merge pull request #2722 from BerriAI/litellm_db_perf_improvement
feat(proxy/utils.py): enable updating db in a separate server
|
2024-03-28 14:56:14 -07:00 |
|
Krrish Dholakia
|
a09818e72e
|
build(ghcr_deploy.yml): deploy spend logs server docker image
make it easy for user to deploy a separate spend logs server
|
2024-03-28 13:39:52 -07:00 |
|
Krrish Dholakia
|
746cd3da11
|
docs(gemini.md): add link to google ai studio api key
|
2024-03-28 10:12:59 -07:00 |
|
Ishaan Jaff
|
5e5f3d5fd2
|
Merge pull request #2729 from BerriAI/litellm_show_better_error_msg_with_role
(fix) show user their role when rejecting /team/new requests
|
2024-03-28 07:42:31 -07:00 |
|
Ishaan Jaff
|
d5f6fe4eff
|
(docs) update UI
|
2024-03-27 22:24:56 -07:00 |
|
Krrish Dholakia
|
526aa9230f
|
docs(call_hooks.md): show result in docs
|
2024-03-27 21:04:51 -07:00 |
|
Krrish Dholakia
|
e6b929fff3
|
docs(call_hooks.md): show admin how to enforce user param
|
2024-03-27 20:58:26 -07:00 |
|
Krrish Dholakia
|
d08da5b05a
|
docs(instructor.md): improve default example
|
2024-03-27 12:51:05 -07:00 |
|
Krrish Dholakia
|
90b859ebcb
|
docs(token_auth.md): cleanup docs
|
2024-03-26 21:42:07 -07:00 |
|
Krrish Dholakia
|
282176c502
|
docs(token_auth.md): update docs
|
2024-03-26 21:41:08 -07:00 |
|
Krrish Dholakia
|
ca84e7a8e8
|
docs(token_auth.md): update jwt auth docs with new info
|
2024-03-26 21:33:03 -07:00 |
|
Krrish Dholakia
|
bf7cc943fb
|
docs(enterprise.md): update docs to turn on/off llm guard per key
|
2024-03-26 18:02:44 -07:00 |
|
Krish Dholakia
|
0ab708e6f1
|
Merge pull request #2704 from BerriAI/litellm_jwt_auth_improvements_3
fix(handle_jwt.py): enable team-based jwt-auth access
|
2024-03-26 16:06:56 -07:00 |
|
Krrish Dholakia
|
752516df1b
|
fix(handle_jwt.py): support public key caching ttl param
|
2024-03-26 14:32:55 -07:00 |
|
Ishaan Jaff
|
da503eab18
|
Merge branch 'main' into litellm_remove_litellm_telemetry
|
2024-03-26 11:35:02 -07:00 |
|
Ishaan Jaff
|
4d81df3d6f
|
(docs) switch of litellm telemetry
|
2024-03-26 11:19:55 -07:00 |
|
Ishaan Jaff
|
995c379a63
|
(fix) prod.md
|
2024-03-25 22:30:22 -07:00 |
|
Krrish Dholakia
|
16ade7e556
|
docs(proxy/caching.md): add ttl param to proxy/caching.md
|
2024-03-25 13:46:52 -07:00 |
|
Krrish Dholakia
|
03b8444d3c
|
docs(token_auth.md): add renaming jwt scope string to docs
|
2024-03-25 12:49:44 -07:00 |
|
Krrish Dholakia
|
53695943e3
|
docs(instructor.md): tutorial on using litellm with instructor
|
2024-03-25 08:35:11 -07:00 |
|
Krrish Dholakia
|
9e9de7f6e2
|
docs(routing.md): add fallbacks being done in order
|
2024-03-24 12:13:19 -07:00 |
|
Krrish Dholakia
|
1c60fd0e78
|
docs(routing.md): add url
|
2024-03-23 20:03:42 -07:00 |
|
Krrish Dholakia
|
7c74ea8b77
|
docs(routing.md): add proxy example to pre-call checks in routing docs
|
2024-03-23 20:00:50 -07:00 |
|
Ishaan Jaff
|
92759ef055
|
Merge pull request #2670 from BerriAI/litellm_docs_best_practices_prod
(docs) best prod practices
|
2024-03-23 19:38:22 -07:00 |
|
Krish Dholakia
|
c92fa1af7c
|
Merge pull request #2669 from BerriAI/litellm_router_pre_call_checks
feat(router.py): enable pre-call checks
|
2024-03-23 19:38:09 -07:00 |
|
Ishaan Jaff
|
09992a6122
|
(docs) prod best perf
|
2024-03-23 19:36:26 -07:00 |
|
Ishaan Jaff
|
d04b4dea3e
|
(docs) best prod practices
|
2024-03-23 19:29:21 -07:00 |
|
Krrish Dholakia
|
e8e7964025
|
docs(routing.md): add pre-call checks to docs
|
2024-03-23 19:10:34 -07:00 |
|
Ishaan Jaff
|
2ae489c506
|
(docs) update config set_verbose
|
2024-03-23 18:54:31 -07:00 |
|
Ishaan Jaff
|
04a09830de
|
Merge pull request #2668 from BerriAI/litellm_update_deploy_docs
[Docs] Add Docs on deploying to EKS Cluster + K8
|
2024-03-23 18:42:36 -07:00 |
|
Ishaan Jaff
|
30ae52c21e
|
(docs) using litellm on EKS
|
2024-03-23 17:49:00 -07:00 |
|
Ishaan Jaff
|
f646a4612b
|
Merge pull request #2667 from BerriAI/litellm_update_gunicorn_instructions
[Docs] update gunicorn instructions - Uvicorn perf is significantly better on K8s
|
2024-03-23 17:43:09 -07:00 |
|
Ishaan Jaff
|
61d2e91632
|
(docs) update gunicorn usage
|
2024-03-23 17:39:07 -07:00 |
|
Vivek Aditya
|
efc90b04c7
|
minor fix
|
2024-03-23 12:50:46 +05:30 |
|
Vivek Aditya
|
6bd49c6087
|
Athina docs updated with information about additional fields and a minor fix in the callback
|
2024-03-23 12:42:07 +05:30 |
|
Krrish Dholakia
|
265dd5cd4f
|
docs(token_auth.md): add project based auth to docs
|
2024-03-22 17:27:40 -07:00 |
|
Krrish Dholakia
|
d06b9a5a47
|
fix(proxy_server.py): enable jwt-auth for users
allow a user to auth into the proxy via jwt's and call allowed routes
|
2024-03-22 17:08:10 -07:00 |
|