Commit graph

962 commits

Author SHA1 Message Date
Krrish Dholakia
7089b13632 fix(llm_guard.py): add streaming hook for moderation calls 2024-02-20 20:31:32 -08:00
Krrish Dholakia
5b6b1f120f fix(utils.py): fix print statement 2024-02-19 23:00:41 -08:00
Krrish Dholakia
11c12e7381 fix(gemini.py): fix async streaming + add native async completions 2024-02-19 22:41:36 -08:00
Krrish Dholakia
423bd4012d refactor: rename enterprise specific hooks and callbacks to be more precise 2024-02-17 17:52:45 -08:00
Krish Dholakia
8db15adfa6 Merge pull request #2017 from sorokine/main 2024-02-17 07:07:56 -08:00
Krrish Dholakia
67cd9b1c63 feat(llama_guard.py): add llama guard support for content moderation + new async_moderation_hook endpoint 2024-02-16 18:45:25 -08:00
Krrish Dholakia
2a5a14d612 fix(utils.py): support image gen logging to langfuse 2024-02-16 16:12:52 -08:00
Alexandre Sorokine
5684b232ec fix for importllib compatibility issue for python 3.8
was tested for python versions 3.8, 3.9, 3.10, 3.11, 3.12
2024-02-16 15:52:39 -05:00
Krish Dholakia
3c0fc3ea3c Merge branch 'main' into litellm_moderations_improvements 2024-02-15 23:08:25 -08:00
Krish Dholakia
233aff5ba9 Merge pull request #1916 from RenaLu/main
Add support for Vertex AI custom models deployed on private endpoint
2024-02-15 22:47:36 -08:00
Krrish Dholakia
6943358cf7 fix(utils.py): add more exception mapping for huggingface 2024-02-15 21:26:22 -08:00
Krish Dholakia
1d04fb3cd6 Merge pull request #1989 from BerriAI/litellm_redis_url_fix
fix(redis.py): fix instantiating redis client from url
2024-02-15 21:23:17 -08:00
Krrish Dholakia
f82a335118 fix(test_streaming.py): handle hf tgi zephyr not loading for streaming issue 2024-02-15 19:24:02 -08:00
ishaan-jaff
e2752eb415 (fix) merge conflicts 2024-02-15 18:34:53 -08:00
Ishaan Jaff
8dee325f52 Merge branch 'main' into litellm_allow_users_use_generic_api_logger 2024-02-15 18:27:25 -08:00
ishaan-jaff
f304370682 (fix) merge conflict 2024-02-15 18:25:19 -08:00
Krrish Dholakia
daa1bef1ed fix(utils.py): add try-except around the tiktoken local import 2024-02-15 17:57:31 -08:00
ishaan-jaff
6267026e9c (fix) importing enterprise features 2024-02-15 17:24:27 -08:00
ishaan-jaff
85eefcb4f4 (chore) debug sys path docker error 2024-02-15 17:23:07 -08:00
Krrish Dholakia
36ec6ed4d1 fix(utils.py): fix local tiktoken package import 2024-02-15 16:18:54 -08:00
ishaan-jaff
8b1a2a8518 (feat) fix api callback imports 2024-02-15 16:15:36 -08:00
ishaan-jaff
bbface2a01 (feat) log with generic logger 2024-02-15 13:43:16 -08:00
Krish Dholakia
f565949fd5 Merge pull request #1947 from sorokine/main
Update utils.py: switched from pkg_resources to importlib.resources for `filename` stored in the `os.environ(TIKTOKEN_CACHE_DIR)`
2024-02-14 22:02:35 -08:00
Krish Dholakia
eda9fa300e Merge branch 'main' into litellm_aioboto3_sagemaker 2024-02-14 21:46:58 -08:00
Rena Lu
f5971a373e handle optional params 2024-02-14 23:27:16 +00:00
ishaan-jaff
6c1c2e8c7d (feat) add moderation on router 2024-02-14 11:00:09 -08:00
Krrish Dholakia
1a57b21fd6 fix(utils.py): fix streaming rule calling 2024-02-12 22:36:32 -08:00
Krrish Dholakia
fd6f64a4ae feat(utils.py): enable post call rules for streaming 2024-02-12 22:08:04 -08:00
Krrish Dholakia
5de569fcb1 feat(sagemaker.py): aioboto3 streaming support 2024-02-12 21:18:34 -08:00
Alexandre Sorokine
acfbda7b6b Update utils.py: switched from pkg_resources to importlib.resources for filename stored in the os/environ(TIKTOKEN_CACHE_DIR) 2024-02-12 16:28:38 -05:00
ishaan-jaff
e71009e4b4 (feat) fix litellm verbose logger 2024-02-09 19:01:21 -08:00
ishaan-jaff
1884c24d31 (bedrock) raise timeout error 2024-02-09 14:37:34 -08:00
Krrish Dholakia
07aa05bf17 fix(test_parallel_request_limiter.py): use mock responses for streaming 2024-02-08 21:45:38 -08:00
Ishaan Jaff
82f10ed322 Merge pull request #1892 from BerriAI/litellm_speed_up_s3_logging
[FEAT] 76 % Faster s3 logging Proxy / litellm.acompletion / router.acompletion 🚀
2024-02-08 11:48:36 -08:00
ishaan-jaff
6328baac6a (feat) speed up s3 logging 2024-02-08 10:59:54 -08:00
ishaan-jaff
798509a067 (feat) fix s3 cache 2024-02-08 10:01:56 -08:00
ishaan-jaff
6aef843911 (fix) azure cost calc 2024-02-07 17:33:10 -08:00
ishaan-jaff
7bed984179 (fix) cost tracking 2024-02-07 17:06:05 -08:00
ishaan-jaff
67305fcb20 (feat) use base_model for azure response_cost 2024-02-07 16:33:07 -08:00
Krrish Dholakia
ef95b70b87 fix(utils.py): fix ollama stop sequence mapping 2024-02-07 13:14:03 -08:00
ishaan-jaff
e2ccdb7a1b (feat) redis-semantic cache 2024-02-06 13:30:55 -08:00
Krrish Dholakia
659a460923 fix(utils.py): round max tokens to be int always 2024-02-06 13:17:26 -08:00
Krrish Dholakia
32639bf398 fix(utils.py): return finish reason for last vertex ai chunk 2024-02-06 09:21:03 -08:00
Krrish Dholakia
9b2a2e6c8b fix(utils.py): use print_verbose for statements, so debug can be seen when running sdk 2024-02-06 07:30:26 -08:00
Krrish Dholakia
9a526b6cd4 fix(ollama_chat.py): fix ollama chat completion token counting 2024-02-06 07:30:26 -08:00
Krrish Dholakia
1b975bbe28 fix(langfuse.py): support logging failed llm api calls to langfuse 2024-02-05 16:16:15 -08:00
Krrish Dholakia
2e3c7088d1 fix(utils.py): support together ai function calling 2024-02-05 15:30:44 -08:00
Krrish Dholakia
1c8a3973ab fix(utils.py): handle count response tokens false case token counting 2024-02-05 08:47:10 -08:00
Krish Dholakia
92d50616c2 Merge pull request #1805 from BerriAI/litellm_cost_tracking_image_gen
feat(utils.py): support cost tracking for openai/azure image gen models
2024-02-03 22:23:22 -08:00
Krrish Dholakia
f9aa05c268 fix(utils.py): support time based pricing for openai-compatible together ai 2024-02-03 19:59:32 -08:00