Commit graph

9897 commits

Author SHA1 Message Date
Ishaan Jaff
9465288e9e migration script 2024-04-11 09:55:14 -07:00
Krrish Dholakia
c597d610f3 bump: version 1.35.0 → 1.35.1 2024-04-11 09:28:26 -07:00
Krrish Dholakia
84d43484c6 fix(router.py): make sure pre call rpm check runs even when model not in model cost map 2024-04-11 09:27:46 -07:00
Ishaan Jaff
240aaf7af8 fix - show api_base in migration script 2024-04-11 09:26:13 -07:00
Ishaan Jaff
69c095b768
Merge pull request #2953 from BerriAI/litellm_fix_sec_issue
[security fix - Low Severity] - disable demo accounts
2024-04-11 08:07:24 -07:00
Ishaan Jaff
96cd038a73 security fix - disable demo accounts 2024-04-11 07:40:34 -07:00
Ishaan Jaff
1c1b0f929f fix - migration script for /model/new 2024-04-11 07:34:00 -07:00
Krish Dholakia
78571babb3
Merge pull request #2830 from onukura/onukura-patch-1
Update README.md
2024-04-11 06:47:53 -07:00
Krrish Dholakia
d999acd20d docs(routing.md): reorder routing strategies 2024-04-10 22:29:24 -07:00
Krrish Dholakia
83a7a9f0b7 docs(routing.md): add calling via proxy tutorial to router docs 2024-04-10 22:24:29 -07:00
Krish Dholakia
6749306239
Merge pull request #2945 from ushuz/hotfix
Fix image generation pre call hook call type
2024-04-10 21:53:00 -07:00
Krrish Dholakia
9f517b2907 docs(routing.md): add async usage based routing to docs 2024-04-10 21:51:36 -07:00
Krrish Dholakia
b8bd7bde1e bump: version 1.34.42 → 1.35.0 2024-04-10 21:45:44 -07:00
Krrish Dholakia
623613203a refactor(main.py): trigger new build
contains fixes for async batch get
2024-04-10 21:45:06 -07:00
Krrish Dholakia
76bd667b12 fix(caching.py): fix async batch redis get request 2024-04-10 21:37:45 -07:00
Ishaan Jaff
a4d7ef967e bump: version 1.34.41 → 1.34.42 2024-04-10 21:29:15 -07:00
Ishaan Jaff
8a1cdc9017
Merge pull request #2941 from BerriAI/litellm_fix_sec_report
[FIX] -  Security issue Server-Side Template Injection in /completions endpoint in
2024-04-10 21:28:49 -07:00
Ishaan Jaff
b3f62b7ce3 fix cleanup 2024-04-10 21:28:04 -07:00
Ishaan Jaff
6a5d5bcd47 fix use ImmutableSandboxedEnvironment 2024-04-10 21:26:50 -07:00
Krrish Dholakia
f4d34673d0 fix: update gitignore 2024-04-10 20:56:39 -07:00
Ishaan Jaff
f9cabf09ff fix jinja2 use ImmutableSandboxedEnvironment 2024-04-10 20:30:13 -07:00
Krish Dholakia
83e7ed94ce
Merge pull request #2942 from BerriAI/litellm_fix_router_loading
Router Async Improvements
2024-04-10 20:16:53 -07:00
Krrish Dholakia
de5dd737d2 docs(enterprise.md): improve language 2024-04-10 20:06:50 -07:00
Krrish Dholakia
79dc453863 docs(enterprise.md): fix language 2024-04-10 20:03:55 -07:00
Krrish Dholakia
266dba65e7 fix(router.py): handle 1 deployment being picked 2024-04-10 18:32:54 -07:00
Ishaan Jaff
e5bd1b644d
Merge pull request #2946 from BerriAI/litellm_fix_error_setting_env_vars
fix - /config/update setting env variables
2024-04-10 18:22:06 -07:00
Ishaan Jaff
a36c2f8e92 feat - migration script to hosted proxy 2024-04-10 18:21:57 -07:00
Krrish Dholakia
52462e8bac fix(router.py): move specific deployment check outside common functions 2024-04-10 18:06:31 -07:00
Krrish Dholakia
37ac17aebd fix(router.py): fix datetime object 2024-04-10 17:55:24 -07:00
Ishaan Jaff
e90645bd5d fix - setting env vars on proxy 2024-04-10 17:50:28 -07:00
Ishaan Jaff
bc50b0a4a1
Merge pull request #2923 from BerriAI/litellm_return_better_error_from_health
fix - return stack trace on failing /health checks - first 1000 chars
2024-04-10 17:48:13 -07:00
Krish Dholakia
3a8cd306e3
Merge pull request #2940 from Manouchehri/gemini-1.0-pro-001-and-002-fix-1
Feature: add gemini-1.0-pro-001 and gemini-1.0-pro-002 to models list
2024-04-10 17:44:53 -07:00
John HU
c9b05828ff Fix image generation pre call hook call type 2024-04-11 00:18:35 +00:00
Krrish Dholakia
2531701a2a fix(router.py): make get_cooldown_deployment logic async 2024-04-10 16:57:01 -07:00
Ishaan Jaff
680c5e0e94 bump: version 1.34.40 → 1.34.41 2024-04-10 16:48:26 -07:00
Ishaan Jaff
3424068cad
Merge pull request #2943 from BerriAI/litellm_allow_base_64_embedding
Fix - Embedding Caching - allow base 64 embedding cache hits
2024-04-10 16:47:50 -07:00
Ishaan Jaff
8bc02b34c2 test -base64 cache hits 2024-04-10 16:46:56 -07:00
Ishaan Jaff
0c26404cb3 fix - allow base64 cache hits embedding responses 2024-04-10 16:44:40 -07:00
Ishaan Jaff
05e02f829c fix - security report / vuln 2024-04-10 15:42:17 -07:00
Krrish Dholakia
a47a719caa fix(router.py): generate consistent model id's
having the same id for a deployment, lets redis usage caching work across multiple instances
2024-04-10 15:23:57 -07:00
Krrish Dholakia
180cf9bd5c feat(lowest_tpm_rpm_v2.py): move to using redis.incr and redis.mget for getting model usage from redis
makes routing work across multiple instances
2024-04-10 14:56:23 -07:00
David Manouchehri
7f137bc909
Feature: add gemini-1.0-pro-001 and gemini-1.0-pro-002 to models list 2024-04-10 21:48:56 +00:00
Ishaan Jaff
c86213ac98 fix - error on setting env vars 2024-04-10 14:15:40 -07:00
Ishaan Jaff
89ff90bceb new ui build 2024-04-10 14:06:19 -07:00
Ishaan Jaff
f6fc21e917 fix- user not allowed to access key info 2024-04-10 14:01:03 -07:00
Ishaan Jaff
c662b5557c
Merge pull request #2939 from BerriAI/litellm_fix_error_ui_sso
[UI] Fix SSO Error - user not allowed to access info
2024-04-10 13:36:33 -07:00
Ishaan Jaff
195840b761 fix - error user not allowed to access info 2024-04-10 13:34:22 -07:00
Ishaan Jaff
05aefe18ce ui - set admin when setting up sso 2024-04-10 13:33:32 -07:00
Krrish Dholakia
b2741933dc fix(proxy_cli.py): don't double load the router config
was causing callbacks to be instantiated twice - double couting usage in cache
2024-04-10 13:23:56 -07:00
Ishaan Jaff
a374e83d52 ui - new build 2024-04-10 12:11:12 -07:00