Commit graph

9378 commits

Author SHA1 Message Date
Krrish Dholakia
9ef7afd2b4 test(test_completion.py): skip unresponsive endpoint 2024-03-27 20:12:22 -07:00
Krrish Dholakia
2926d5a8eb fix(proxy/utils.py): check cache before alerting user 2024-03-27 20:09:15 -07:00
Krrish Dholakia
34790c7e12 fix(proxy_server.py): fix linting error 2024-03-27 19:54:25 -07:00
Krrish Dholakia
ad43138f28 fix(proxy_server.py): fix budget add logic to accurately log who created it 2024-03-27 19:43:15 -07:00
Krish Dholakia
75f2b9dd73
Merge pull request #2725 from BerriAI/litellm_calculate_spend
feat(proxy_server.py): new `/spend/calculate` endpoint
2024-03-27 19:39:07 -07:00
Krish Dholakia
ef51544741
Merge pull request #2726 from BerriAI/litellm_enforce_user_param
feat(auth_checks.py): enable admin to enforce 'user' param for all openai endpoints
2024-03-27 19:38:52 -07:00
Ishaan Jaff
e81eb1bd11
Merge pull request #2721 from BerriAI/litellm_remove_background_tasks
(fix) Proxy - remove background tasks
2024-03-27 19:32:56 -07:00
Krish Dholakia
848b0ba67f Updated config.yml 2024-03-27 19:23:33 -07:00
Krish Dholakia
5bd136e650 Updated config.yml 2024-03-27 19:23:08 -07:00
Ishaan Jaff
31383b18cb (fix) cost tracking 2024-03-27 19:18:39 -07:00
Ishaan Jaff
0c69c05c54 (fix) google pip install pyarrow dep 2024-03-27 19:14:25 -07:00
Ishaan Jaff
73db5b45f8 (ci/cd) run again 2024-03-27 19:04:07 -07:00
Ishaan Jaff
d1fa22f9ed (fix) ci/cd google deps 2024-03-27 19:00:52 -07:00
Krrish Dholakia
4eb93832e4 feat(auth_checks.py): enable admin to enforce 'user' param for all openai endpoints 2024-03-27 17:36:27 -07:00
Krrish Dholakia
9d0ce1aefd feat(proxy_server.py): new /spend/calculate endpoint
Allows user to calculate spend before making the call
2024-03-27 17:08:55 -07:00
Krrish Dholakia
a408c46a67 fix(proxy_server.py): don't auto-create user when creating key 2024-03-27 16:48:57 -07:00
Ishaan Jaff
e4e4dd01cd (fix) access router model names in constant time 2024-03-27 16:18:08 -07:00
Krrish Dholakia
1e856443e1 feat(proxy/utils.py): enable updating db in a separate server 2024-03-27 16:02:36 -07:00
Ishaan Jaff
c97724df7d (fix) remove background tasks 2024-03-27 15:51:00 -07:00
Ishaan Jaff
93597b7ba0
Merge pull request #2719 from BerriAI/litellm_bump_uvicorn
(fix) bump uvicorn on proxy docker builds
2024-03-27 15:49:58 -07:00
onukura
f86472518d Add a feature to ollama aembedding to accept batch input 2024-03-27 21:39:19 +00:00
Ishaan Jaff
da835ad10a (fix) bump uvicorn on proxy docker builds 2024-03-27 14:35:17 -07:00
Krrish Dholakia
9b7383ac67 fix(utils.py): don't run post-call rules on a coroutine function 2024-03-27 13:16:27 -07:00
Krrish Dholakia
d08da5b05a docs(instructor.md): improve default example 2024-03-27 12:51:05 -07:00
Krrish Dholakia
62a591f90c bump: version 1.34.6 → 1.34.7 2024-03-27 10:15:28 -07:00
Krrish Dholakia
6c34e48180 feat(index.ts): initial commit for proxy edge worker
testing to see if a js worker improves proxy perf (and by how much)
2024-03-27 10:15:20 -07:00
Krish Dholakia
d259c754ef
Merge pull request #2701 from rmann-nflx/main
Updating the default Claude3 max tokens
2024-03-27 10:14:20 -07:00
Krish Dholakia
b6be1ecb37
Merge pull request #2714 from readevalprint/patch-1
Fix #2713 Remove duplicated "blocked" field on LiteLLM_TeamTable
2024-03-27 10:13:56 -07:00
Tim
d939755a73
Remove duplicated "blocked" field on LiteLLM_TeamTable 2024-03-27 17:02:18 +01:00
Andrea Ponti
25bacfacdc Add trace_name in langfuse logging v2 2024-03-27 16:58:08 +01:00
Krrish Dholakia
90b859ebcb docs(token_auth.md): cleanup docs 2024-03-26 21:42:07 -07:00
Krrish Dholakia
282176c502 docs(token_auth.md): update docs 2024-03-26 21:41:08 -07:00
Krrish Dholakia
ca84e7a8e8 docs(token_auth.md): update jwt auth docs with new info 2024-03-26 21:33:03 -07:00
Krrish Dholakia
1d054c3e71 bump: version 1.34.5 → 1.34.6 2024-03-26 21:19:07 -07:00
Krrish Dholakia
9375b131ee refactor(main.py): trigger new build 2024-03-26 21:18:51 -07:00
Krrish Dholakia
b204f0c01c fix(proxy_cli.py): fix circular import issue 2024-03-26 21:16:41 -07:00
Krish Dholakia
c1f8d346b8
Merge pull request #2706 from BerriAI/litellm_key_llm_guardrails
feat(llm_guard.py): enable key-specific llm guard check
2024-03-26 19:02:11 -07:00
Krish Dholakia
e266142d2b
Merge pull request #2705 from BerriAI/litellm_permissions_table
enable new `/team/disable` endpoint
2024-03-26 18:47:34 -07:00
Krrish Dholakia
4488480188 test(test_llm_guard.py): fix test 2024-03-26 18:37:27 -07:00
Krrish Dholakia
f62f642393 test(test_llm_guard.py): fix test 2024-03-26 18:13:15 -07:00
Krrish Dholakia
5b66cb3864 test(test_exceptions.py): handle api instability 2024-03-26 18:06:49 -07:00
Ishaan Jaff
6d6549c6c8 (fix) kub.yaml 2024-03-26 18:05:55 -07:00
Ishaan Jaff
4d22fe0947 bump: version 1.34.4 → 1.34.5 2024-03-26 18:04:05 -07:00
Krrish Dholakia
bf7cc943fb docs(enterprise.md): update docs to turn on/off llm guard per key 2024-03-26 18:02:44 -07:00
Krrish Dholakia
1046a63521 test(test_llm_guard.py): unit testing for key-level llm guard enabling 2024-03-26 17:55:53 -07:00
Krrish Dholakia
6d418a2920 fix(llm_guard.py): working llm-guard 'key-specific' mode 2024-03-26 17:47:20 -07:00
Krrish Dholakia
e10eb8f6fe feat(llm_guard.py): enable key-specific llm guard check 2024-03-26 17:21:51 -07:00
Krrish Dholakia
bec093675c build(schema.prisma): update schema to enable team blocking 2024-03-26 17:03:32 -07:00
Krrish Dholakia
77472b80eb fix(proxy_server.py): use consistent naming schema - move to /team/block 2024-03-26 16:59:36 -07:00
Krish Dholakia
7eb2c7942c
Merge pull request #2675 from onukura/ollama-embedding
Fix Ollama embedding
2024-03-26 16:08:28 -07:00