Commit graph

1474 commits

Author SHA1 Message Date
Ishaan Jaff
aca6ec85e2 (fix) add some better load testing 2024-03-22 19:48:54 -07:00
Ishaan Jaff
28e62af4e1 (fix) update load test used 2024-03-22 19:48:54 -07:00
Krrish Dholakia
d06b9a5a47 fix(proxy_server.py): enable jwt-auth for users
allow a user to auth into the proxy via jwt's and call allowed routes
2024-03-22 17:08:10 -07:00
Krrish Dholakia
9bf086386e fix(handle_jwt.py): add more logging for jwt header 2024-03-22 16:33:32 -07:00
Krish Dholakia
8c45986e72
Merge pull request #2619 from BerriAI/litellm_install_tenacity
(fix) include tenacity in req.txt
2024-03-21 20:55:56 -07:00
Krrish Dholakia
d57995a73d build(schema.prisma): use jsonProtocol to fix db connection issues
https://github.com/prisma/prisma/discussions/19978
2024-03-21 18:01:45 -07:00
Krrish Dholakia
4ac14a4e85 build(networking.tsx): fix trailing slash 2024-03-21 17:50:59 -07:00
Krrish Dholakia
d9577c3e2b build(networking.tsx): modify url to prevent redirects 2024-03-21 17:43:41 -07:00
Krrish Dholakia
33964233a5 fix(proxy_server.py): allow user to disable swagger ui docs via env
user can disable swagger ui docs by setting 'NO_DOCS="True"' in their env
2024-03-21 17:15:18 -07:00
Krish Dholakia
33a433eb0a
Merge branch 'main' into litellm_llm_api_prompt_injection_check 2024-03-21 09:57:10 -07:00
Krrish Dholakia
0521e8a1d9 fix(prompt_injection_detection.py): fix type check 2024-03-21 08:56:13 -07:00
Krrish Dholakia
8e8c4e214e fix: fix linting issue 2024-03-21 08:19:09 -07:00
Ishaan Jaff
14459b89e3 (fix) include tenacity in req.txt 2024-03-21 08:10:12 -07:00
Ishaan Jaff
bcd62034ed
Merge pull request #2563 from eltociear/patch-2
Update proxy_server.py
2024-03-21 07:29:33 -07:00
Krrish Dholakia
d91f9a9f50 feat(proxy_server.py): enable llm api based prompt injection checks
run user calls through an llm api to check for prompt injection attacks. This happens in parallel to th
e actual llm call using `async_moderation_hook`
2024-03-20 22:43:42 -07:00
Krish Dholakia
007d439017
Merge pull request #2606 from BerriAI/litellm_jwt_auth_updates
fix(handle_jwt.py): track spend for user using jwt auth
2024-03-20 19:40:17 -07:00
Krrish Dholakia
f24d3ffdb6 fix(proxy_server.py): fix import 2024-03-20 19:15:06 -07:00
Krrish Dholakia
3bb0e24cb7 fix(prompt_injection_detection.py): ensure combinations are actual phrases, not just 1-2 words
reduces misflagging

https://github.com/BerriAI/litellm/issues/2601
2024-03-20 19:09:38 -07:00
Krrish Dholakia
8bb00c4ae8 fix(caching.py): enable async setting of cache for dual cache 2024-03-20 18:42:34 -07:00
Krrish Dholakia
90e17b5422 fix(handle_jwt.py): track spend for user using jwt auth 2024-03-20 10:55:52 -07:00
Ishaan Jaff
3ad6e5ffc1 (feat) start proxy with default num_workers=1 2024-03-20 10:46:32 -07:00
Krrish Dholakia
ca970a90c4 fix(handle_jwt.py): remove issuer check 2024-03-20 08:35:23 -07:00
Ishaan Jaff
4ed551dc52 (feat) better debugging for /cache/ping 2024-03-20 08:30:11 -07:00
Ishaan Jaff
2256ece5a9 (feat) litellm cache ping 2024-03-20 08:24:13 -07:00
Ishaan Jaff
8f750b71eb (fix) caching - don't require cache password 2024-03-19 20:50:16 -07:00
Krrish Dholakia
f25b03326b fix(proxy_server.py): allow user to disable scheduled reset budget task 2024-03-19 20:36:22 -07:00
Krrish Dholakia
2dfdc8dd69 Revert "Merge pull request #2593 from BerriAI/litellm_reset_budget_fix"
This reverts commit afd363129f, reversing
changes made to c94bc94ad5.
2024-03-19 20:25:41 -07:00
Krrish Dholakia
3a866dd349 bump: version 1.32.5 → 1.32.6 2024-03-19 20:17:50 -07:00
Krish Dholakia
afd363129f
Merge pull request #2593 from BerriAI/litellm_reset_budget_fix
fix(proxy/utils.py): fix reset budget logic
2024-03-19 20:17:03 -07:00
Krrish Dholakia
37795c0d92 fix(proxy_server.py): add more debug logs 2024-03-19 19:59:43 -07:00
Krrish Dholakia
f6de3a0359 fix: better debug logs 2024-03-19 19:28:26 -07:00
Ishaan Jaff
c94bc94ad5
Merge pull request #2591 from BerriAI/litellm_metrics_endpoint
[Feat] /metrics endpoint for Prometheus, Grafana
2024-03-19 18:08:22 -07:00
Krrish Dholakia
e0d3b18835 fix(proxy/utils.py): fix reset budget logic
uses fewer clients - prevents read timeouts
2024-03-19 18:02:09 -07:00
Krish Dholakia
5171d7689f
Merge pull request #2592 from BerriAI/litellm_jwt_auth
feat(handle_jwt.py): support authenticating admins into the proxy via jwt's
2024-03-19 17:54:54 -07:00
Ishaan Jaff
bed1c0478a (fix) include prom endpoint 2024-03-19 17:30:07 -07:00
Krrish Dholakia
1c68a7fd1d refactor(handle_jwt.py): print jwt version 2024-03-19 17:17:11 -07:00
Krrish Dholakia
8c91156842 build: build fixes 2024-03-19 16:59:59 -07:00
Krrish Dholakia
a8d3d51d21 docs(token_based_auth.md): add jwt auth to docs 2024-03-19 16:34:27 -07:00
Ishaan Jaff
aa1c480452 (feat) using prom litellm 2024-03-19 15:49:23 -07:00
Krrish Dholakia
302bab6f1f feat(handle_jwt.py): support authenticating admins into the proxy via jwt's 2024-03-19 15:00:27 -07:00
Ishaan Jaff
4b7e102187 (v0) prometheus metric 2024-03-19 14:48:38 -07:00
Ishaan Jaff
4dc69db8e2 (feat) new ui build 2024-03-19 13:06:30 -07:00
Krrish Dholakia
7c74a0e6e2 fix(proxy_server.py): expose disable_spend_logs flag in config general settings
Writing each spend log adds +300ms latency

https://github.com/BerriAI/litellm/issues/1714#issuecomment-1924727281
2024-03-19 12:08:37 -07:00
Krrish Dholakia
c03694ae1e fix(proxy/utils.py): cleanup logging 2024-03-19 09:29:29 -07:00
Krish Dholakia
c4dbd0407e
Merge pull request #2561 from BerriAI/litellm_batch_writing_db
fix(proxy/utils.py): move to batch writing db updates
2024-03-18 21:50:47 -07:00
Krrish Dholakia
7eaddaef10 refactor(proxy_server.py): re-add custom db client logic - prevent regressions 2024-03-18 21:16:28 -07:00
Ishaan Jaff
a3397dd6c5 (feat) update ui build 2024-03-18 20:32:28 -07:00
Ishaan Jaff
51d658e878 (fix) if litellm-proxy-budget set use it 2024-03-18 20:31:23 -07:00
Krrish Dholakia
f588bff69b fix(proxy_server.py): fix spend log update 2024-03-18 20:26:28 -07:00
Ishaan Jaff
87dd3f1235 (fix) show global spend on UI 2024-03-18 18:15:08 -07:00