Commit graph

4484 commits

Author SHA1 Message Date
Andres Barbaro
0400e95433 Add safety_settings parameter to gemini generate_content calls 2024-02-17 19:13:04 -08:00
ishaan-jaff
db4fa90edb (fix) dynamo db test - new model_spend params 2024-02-17 19:13:04 -08:00
ishaan-jaff
9845d34075 (fix) issue with storing model max budget 2024-02-17 19:13:04 -08:00
ishaan-jaff
dceece00c9 (fix) ui build 2024-02-17 19:13:04 -08:00
ishaan-jaff
9276e205bf (feat) set fast api root path 2024-02-17 19:13:04 -08:00
Krrish Dholakia
0a3284e753 docs(enterprise.md): add llama guard tutorial to enterprise docs 2024-02-17 19:13:04 -08:00
Krrish Dholakia
3ae5912f34 refactor(main.py): trigger new build 2024-02-17 19:13:04 -08:00
Alexandre Sorokine
23b6643080 fix for importllib compatibility issue for python 3.8
was tested for python versions 3.8, 3.9, 3.10, 3.11, 3.12
2024-02-17 19:13:04 -08:00
Krrish Dholakia
bf77295168 feat(llama_guard.py): add llama guard support for content moderation + new async_moderation_hook endpoint 2024-02-17 19:13:04 -08:00
Krrish Dholakia
7d647a457d test(test_custom_callback_input.py): fix image gen callback test 2024-02-17 19:13:04 -08:00
Krrish Dholakia
0cc6341ed0 fix(utils.py): support image gen logging to langfuse 2024-02-17 19:13:04 -08:00
Krrish Dholakia
045d84e167 fix(proxy_cli.py): allow user to control db connection pool + timeouts from config 2024-02-17 19:03:43 -08:00
Ishaan Jaff
bf572b495a
Merge pull request #2047 from BerriAI/litellm_ui_cost_tracking
[FIX] Spend Tracking bug for Keys made on Admin UI (when role = proxy_admin)
2024-02-17 18:59:02 -08:00
ishaan-jaff
901a1a3655 (fix) litellm bug in 2024-02-17 18:41:37 -08:00
Krrish Dholakia
ddf0911c46 feat(google_text_moderation.py): allow user to use google text moderation for content mod on proxy 2024-02-17 18:36:29 -08:00
Krrish Dholakia
73acdf3736 refactor: rename enterprise specific hooks and callbacks to be more precise 2024-02-17 17:52:45 -08:00
Krrish Dholakia
074d93cc97 feat(llama_guard.py): allow user to define custom unsafe content categories 2024-02-17 17:42:47 -08:00
Krish Dholakia
ad81e19282
Merge pull request #2013 from afbarbaro/litellm_gemini_safety_settings
Add safety_settings parameter to gemini generate_content calls
2024-02-17 16:47:29 -08:00
ishaan-jaff
44202aa25b (fix) dynamo db test - new model_spend params 2024-02-17 16:00:08 -08:00
Ishaan Jaff
35af2d666b
Merge pull request #2022 from BerriAI/litellm_budgets_per_llm
[FEAT] Track spend per model (for Key, User and Team)
2024-02-17 15:35:31 -08:00
ishaan-jaff
c069403b05 (fix) ui build 2024-02-17 14:06:52 -08:00
ishaan-jaff
844df2412e (feat) set fast api root path 2024-02-17 13:03:24 -08:00
Krrish Dholakia
3565f74338 docs(enterprise.md): add llama guard tutorial to enterprise docs 2024-02-17 09:25:49 -08:00
Krrish Dholakia
05b2a3c146 refactor(main.py): trigger new build 2024-02-17 08:25:58 -08:00
Krish Dholakia
9ce29295af
Merge pull request #2017 from sorokine/main 2024-02-17 07:07:56 -08:00
Krrish Dholakia
2a4a6995ac feat(llama_guard.py): add llama guard support for content moderation + new async_moderation_hook endpoint 2024-02-16 18:45:25 -08:00
ishaan-jaff
62d0c54cfb (fix) issue with storing model max budget 2024-02-16 18:18:35 -08:00
ishaan-jaff
e76a3c5ce5 (fix) _types for model_max_budget 2024-02-16 17:47:22 -08:00
ishaan-jaff
659a394a36 (fix) types for model_max_budget 2024-02-16 17:46:34 -08:00
ishaan-jaff
2e074a8585 (test) test_call_with_key_over_model_budget 2024-02-16 17:45:49 -08:00
ishaan-jaff
4ea354ee6e (feat) budgets per model 2024-02-16 17:19:01 -08:00
Krish Dholakia
5e7dda4f88
Merge pull request #2025 from BerriAI/litellm_langfuse_image_gen_logging
fix(utils.py): support image gen logging to langfuse
2024-02-16 17:08:07 -08:00
ishaan-jaff
e8dcf8fa13 (fix) setting model_max_budget 2024-02-16 17:00:23 -08:00
ishaan-jaff
d65c6d3869 (feat) track spend key-model, user-model, team-model 2024-02-16 16:32:17 -08:00
Krrish Dholakia
9cbdbda8e4 test(test_custom_callback_input.py): fix image gen callback test 2024-02-16 16:29:26 -08:00
ishaan-jaff
2c9d142e42 (feat) track key spend per model 2024-02-16 16:15:24 -08:00
Krrish Dholakia
f57483ea70 fix(utils.py): support image gen logging to langfuse 2024-02-16 16:12:52 -08:00
ishaan-jaff
a2f1d2ee52 (feat) set key-model budgets 2024-02-16 15:44:34 -08:00
Ishaan Jaff
4d4653b619
Merge branch 'main' into litellm_fix_generic_sso 2024-02-16 14:51:46 -08:00
Ishaan Jaff
e6b9e5b46f
Merge pull request #2020 from Lunik/feat/generic-sso-scopes
 Allow to configure scopes in generic SSO
2024-02-16 14:45:09 -08:00
ishaan-jaff
5857b02bfc (feat) generic sso, set generic_user_id_attribute_name 2024-02-16 14:40:27 -08:00
Lunik
af4537e421
Allow to configure scopes in generic SSO
Signed-off-by: Lunik <lunik@tiwabbit.fr>
2024-02-16 23:24:58 +01:00
Krrish Dholakia
5f9e141d1e fix(huggingface_restapi.py): return streamed response correctly 2024-02-16 13:25:13 -08:00
Krrish Dholakia
86886de9d8 test(test_completion.py): handle azure gpt-4 vision internal error 2024-02-16 13:16:46 -08:00
Alexandre Sorokine
3c919cbbe7 fix for importllib compatibility issue for python 3.8
was tested for python versions 3.8, 3.9, 3.10, 3.11, 3.12
2024-02-16 15:52:39 -05:00
Krrish Dholakia
0d54887149 test(test_amazing_vertex_completion.py): handle rate limit errors 2024-02-16 11:54:12 -08:00
Krrish Dholakia
01077b20b9 fix(proxy_server.py): restrict model access for /v1/completions endpoint 2024-02-16 11:54:12 -08:00
ishaan-jaff
936e460a30 (chore) fix spend tracking request tags 2024-02-16 11:54:12 -08:00
Krrish Dholakia
b3d48da640 fix(main.py): map list input to ollama prompt input format 2024-02-16 11:54:12 -08:00
Krrish Dholakia
e9b27d9811 fix(proxy_server.py): re-add /team/info endpoint (fixing merge issue) 2024-02-16 11:54:12 -08:00