Krrish Dholakia | 7e63381df2 | test(test_key_generate_prisma.py): skip bad test | 2024-02-07 22:49:17 -08:00
Krrish Dholakia | 778bf46af2 | fix(proxy_cli.py): allow db connection limit and pool timeouts to be configurable params | 2024-02-07 21:48:38 -08:00
Krrish Dholakia | c03d538c3f | fix(utils.py): fix max connection limit | 2024-02-07 21:42:56 -08:00
Krrish Dholakia | e46a27a4a7 | feat(proxy_server.py): support batch writing failed spend logs; 87.38% improvement in spend logging reliability | 2024-02-07 19:31:14 -08:00
Krrish Dholakia | f392a226bb | fix(proxy_server.py): use set updates for more efficient spend updating | 2024-02-07 16:37:46 -08:00
Krrish Dholakia | 655fcd4d79 | fix(utils.py): fix ollama stop sequence mapping | 2024-02-07 13:14:03 -08:00
Krrish Dholakia | 66913222a1 | docs(langfuse_integration.md): docs for showing how to log errors to langfuse | 2024-02-07 12:07:11 -08:00
Krrish Dholakia | 048acc8e68 | docs(ui.md): add more help to docs | 2024-02-07 11:46:57 -08:00
Krrish Dholakia | 2fcbe06af8 | docs(ui.md): add proxy admin view to docs | 2024-02-07 11:38:27 -08:00
Krrish Dholakia | e165ec40d4 | docs(ui.md): ui doc updates | 2024-02-07 11:38:27 -08:00
ishaan-jaff | 258fe63e7d | (fix) ui - when request body is None | 2024-02-07 11:33:43 -08:00
Krrish Dholakia | 8939593826 | fix(proxy_server.py): fix merge errors | 2024-02-07 00:04:52 -08:00
Krrish Dholakia | 184e78772b | refactor(proxy_server.py): fix merge error | 2024-02-06 23:44:23 -08:00
Krrish Dholakia | 9e138b9e4e | bump: version 1.22.11 → 1.23.0 | 2024-02-06 23:39:28 -08:00
Krrish Dholakia | 46dd08c207 | refactor(main.py): trigger rebuild | 2024-02-06 23:39:28 -08:00
Krish Dholakia | f785aee0df | Merge pull request #1860 from BerriAI/litellm_spend_logging_high_traffic; fix(proxy_server.py): prisma client fixes for high traffic | 2024-02-06 23:37:43 -08:00
Krish Dholakia | df60edfa07 | Merge branch 'main' into litellm_spend_logging_high_traffic | 2024-02-06 23:36:58 -08:00
Krrish Dholakia | fd9c7a90af | fix(proxy_server.py): update user cache to with new spend | 2024-02-06 23:06:05 -08:00
Krrish Dholakia | 73d8e3e640 | fix(ollama_chat.py): fix token counting | 2024-02-06 22:18:46 -08:00
Krrish Dholakia | 4174471dac | fix(proxy_server.py): fix endpoint | 2024-02-06 22:09:30 -08:00
Krish Dholakia | 2bc710d8e9 | Merge pull request #1843 from BerriAI/litellm_admin_ui_view_all_keys; feat(ui): enable admin to view all valid keys created on the proxy | 2024-02-06 22:06:46 -08:00
Krrish Dholakia | 0874c17a31 | fix: export npm build into proxy | 2024-02-06 20:12:50 -08:00
Krrish Dholakia | 4a0df3cb4f | fix(proxy_cli.py-&&-proxy_server.py): bump reset budget intervals and fix pool limits for prisma connections | 2024-02-06 19:39:49 -08:00
ishaan-jaff | 5f4b06fb19 | (docs) caching | 2024-02-06 19:39:32 -08:00
ishaan-jaff | 5a29f362ee | (fix) allow litellm_settings to be None | 2024-02-06 19:29:39 -08:00
ishaan-jaff | e9bf16bbda | bump: version 1.22.10 → 1.22.11 | 2024-02-06 19:23:57 -08:00
ishaan-jaff | c69eaebfd8 | (fix) dockerfile for semantic caching | 2024-02-06 19:23:27 -08:00
ishaan-jaff | 7b26b3b789 | (ci/cd) run again | 2024-02-06 18:25:15 -08:00
Krrish Dholakia | b6adeec347 | fix(proxy_server.py): prisma client fixes for high traffic | 2024-02-06 17:30:36 -08:00
ishaan-jaff | 83628938ab | bump: version 1.22.9 → 1.22.10 | 2024-02-06 17:12:46 -08:00
Ishaan Jaff | 73c6ce890b | Merge pull request #1859 from BerriAI/litellm_allow_using_budgets_without_keys; [Feat] Budgets for 'user' param passed to /chat/completions, /embeddings etc | 2024-02-06 16:32:25 -08:00
ishaan-jaff | 6369424629 | (ci/cd) run again | 2024-02-06 16:08:25 -08:00
Krish Dholakia | 0fd64bc906 | Merge pull request #1839 from BerriAI/litellm_slack_langfuse_alerting; fix(proxy/utils.py): if langfuse trace id passed in, include in slack alert | 2024-02-06 15:49:00 -08:00
Krish Dholakia | 9e9fb747ce | Merge branch 'main' into litellm_slack_langfuse_alerting | 2024-02-06 15:48:52 -08:00
ishaan-jaff | 8208ebd9db | (docs) budget per end_user | 2024-02-06 15:39:45 -08:00
ishaan-jaff | 196787359f | (test) track_cost_ for end users | 2024-02-06 15:25:51 -08:00
ishaan-jaff | 52b864976b | (feat) support max_user_budget | 2024-02-06 15:19:36 -08:00
Krrish Dholakia | be81183782 | refactor(main.py): trigger deploy | 2024-02-06 15:17:40 -08:00
ishaan-jaff | 78f75647da | (fix) redisvl requirements.txt issue | 2024-02-06 15:17:40 -08:00
ishaan-jaff | 8ba2c8dbf7 | (fix) langfuse show semantic-similarity in tags | 2024-02-06 15:17:40 -08:00
ishaan-jaff | eb3b68a2f0 | (fix) dockerfile requirements.txt | 2024-02-06 15:17:40 -08:00
ishaan-jaff | 325ca43946 | (feat) show semantic-cache on health/readiness | 2024-02-06 15:17:40 -08:00
Krrish Dholakia | 0d03b28a3b | test(test_completion.py): fix test | 2024-02-06 15:17:40 -08:00
ishaan-jaff | b5db630dba | (ci/cd) run again | 2024-02-06 15:17:40 -08:00
ishaan-jaff | 43061d612d | (fix) mark semantic caching as beta test | 2024-02-06 15:17:40 -08:00
ishaan-jaff | e32c2beddd | (fix) semantic caching | 2024-02-06 15:17:40 -08:00
ishaan-jaff | 102f20fc03 | (docs) litellm semantic caching | 2024-02-06 15:17:40 -08:00
ishaan-jaff | b49b37568a | (docs) redis cache | 2024-02-06 15:17:40 -08:00
ishaan-jaff | f3de05cc54 | (fix) test-semantic caching | 2024-02-06 15:17:40 -08:00
ishaan-jaff | f8248b2c79 | (feat) redis-semantic cache on proxy | 2024-02-06 15:17:40 -08:00