Commit graph

933 commits

Author SHA1 Message Date
Krrish Dholakia
fd6f64a4ae feat(utils.py): enable post call rules for streaming 2024-02-12 22:08:04 -08:00
ishaan-jaff
e71009e4b4 (feat) fix litellm verbose logger 2024-02-09 19:01:21 -08:00
ishaan-jaff
1884c24d31 (bedrock) raise timeout error 2024-02-09 14:37:34 -08:00
Krrish Dholakia
07aa05bf17 fix(test_parallel_request_limiter.py): use mock responses for streaming 2024-02-08 21:45:38 -08:00
Ishaan Jaff
82f10ed322 Merge pull request #1892 from BerriAI/litellm_speed_up_s3_logging
[FEAT] 76 % Faster s3 logging Proxy / litellm.acompletion / router.acompletion 🚀
2024-02-08 11:48:36 -08:00
ishaan-jaff
6328baac6a (feat) speed up s3 logging 2024-02-08 10:59:54 -08:00
ishaan-jaff
798509a067 (feat) fix s3 cache 2024-02-08 10:01:56 -08:00
ishaan-jaff
6aef843911 (fix) azure cost calc 2024-02-07 17:33:10 -08:00
ishaan-jaff
7bed984179 (fix) cost tracking 2024-02-07 17:06:05 -08:00
ishaan-jaff
67305fcb20 (feat) use base_model for azure response_cost 2024-02-07 16:33:07 -08:00
Krrish Dholakia
ef95b70b87 fix(utils.py): fix ollama stop sequence mapping 2024-02-07 13:14:03 -08:00
ishaan-jaff
e2ccdb7a1b (feat) redis-semantic cache 2024-02-06 13:30:55 -08:00
Krrish Dholakia
659a460923 fix(utils.py): round max tokens to be int always 2024-02-06 13:17:26 -08:00
Krrish Dholakia
32639bf398 fix(utils.py): return finish reason for last vertex ai chunk 2024-02-06 09:21:03 -08:00
Krrish Dholakia
9b2a2e6c8b fix(utils.py): use print_verbose for statements, so debug can be seen when running sdk 2024-02-06 07:30:26 -08:00
Krrish Dholakia
9a526b6cd4 fix(ollama_chat.py): fix ollama chat completion token counting 2024-02-06 07:30:26 -08:00
Krrish Dholakia
1b975bbe28 fix(langfuse.py): support logging failed llm api calls to langfuse 2024-02-05 16:16:15 -08:00
Krrish Dholakia
2e3c7088d1 fix(utils.py): support together ai function calling 2024-02-05 15:30:44 -08:00
Krrish Dholakia
1c8a3973ab fix(utils.py): handle count response tokens false case token counting 2024-02-05 08:47:10 -08:00
Krish Dholakia
92d50616c2 Merge pull request #1805 from BerriAI/litellm_cost_tracking_image_gen
feat(utils.py): support cost tracking for openai/azure image gen models
2024-02-03 22:23:22 -08:00
Krrish Dholakia
f9aa05c268 fix(utils.py): support time based pricing for openai-compatible together ai 2024-02-03 19:59:32 -08:00
Krrish Dholakia
740eda84ee fix(utils.py): support get_secret("TOGETHER_AI_TOKEN") 2024-02-03 19:35:09 -08:00
Krrish Dholakia
85a3515d83 fix(utils.py): route together ai calls to openai client
together ai is now openai-compatible

n
2024-02-03 19:22:48 -08:00
Krrish Dholakia
8195bfd286 fix(utils.py): fix conditional check 2024-02-03 18:58:58 -08:00
Krish Dholakia
45cbb3cf3d Merge branch 'main' into litellm_embedding_caching_updates 2024-02-03 18:08:47 -08:00
Krrish Dholakia
96c2c64dce feat(utils.py): support cost tracking for openai/azure image gen models 2024-02-03 17:09:54 -08:00
Krish Dholakia
39305f1e07 Merge pull request #1802 from BerriAI/litellm_vertex_ai_high_traffic_fix
fix(vertex_ai.py): treat vertex ai high-traffic error as a rate limit error - allows user-controlled backoff logic to work here
2024-02-03 15:37:05 -08:00
Krish Dholakia
dbaad8ae56 Merge pull request #1799 from BerriAI/litellm_bedrock_stable_diffusion_support
feat(bedrock.py): add stable diffusion image generation support
2024-02-03 12:59:00 -08:00
Krrish Dholakia
16a43224ba fix(vertex_ai.py): treat vertex ai high-traffic error as a rate limit error - allows backoff logic to work here 2024-02-03 12:58:16 -08:00
Krrish Dholakia
5994c1e7ef feat(bedrock.py): add stable diffusion image generation support 2024-02-03 12:08:38 -08:00
Krrish Dholakia
3f23b18dad feat(vertex_ai.py): vertex ai gecko text embedding support 2024-02-03 09:48:29 -08:00
Krrish Dholakia
7699f48930 fix(utils.py): fix cost tracking for cache hits (should be 0) 2024-02-02 17:33:39 -08:00
ishaan-jaff
2e4b1bcf4d (feat) use same id to log on s3, langfuse 2024-02-02 16:26:59 -08:00
Krrish Dholakia
34886b0261 fix(helicone.py): fix logging 2024-02-02 14:30:01 -08:00
Krrish Dholakia
af1319bd9f fix(utils.py): dynamic switch langfuse client based on keys 2024-02-02 14:03:46 -08:00
Krish Dholakia
6824014c5f Merge pull request #1761 from BerriAI/litellm_fix_dynamic_callbacks
fix(utils.py): override default success callbacks with dynamic callbacks if set
2024-02-02 13:06:55 -08:00
ishaan-jaff
fdddb69dea (feat) improve exceptions for OpenAI compatible - 2024-02-02 10:14:26 -08:00
Krrish Dholakia
e20e8beec3 test(test_keys.py): add more logging 2024-02-02 10:08:02 -08:00
ishaan-jaff
c73684a5b1 (feat) show correct provider in exceptions 2024-02-02 08:37:42 -08:00
Krrish Dholakia
16c3ab1da0 fix(utils.py): logging fixes 2024-02-02 08:31:17 -08:00
Krrish Dholakia
aa40663150 fix(utils.py): add track cost callback to callback list for team id callbacks 2024-02-02 08:18:06 -08:00
Krrish Dholakia
dbf2b0b2c8 fix(utils.py): override default success callbacks with dynamic callbacks if set 2024-02-02 06:21:43 -08:00
Krish Dholakia
83a592aa02 Merge pull request #1749 from BerriAI/litellm_vertex_ai_model_garden
feat(vertex_ai.py): vertex ai model garden support
2024-02-01 21:52:12 -08:00
Krrish Dholakia
35df1413f8 test: testing fixes 2024-02-01 21:20:46 -08:00
Krrish Dholakia
99678147df feat(utils.py): support dynamic langfuse params and team settings on proxy 2024-02-01 21:08:24 -08:00
Krrish Dholakia
e6dc63d348 feat(utils.py): support dynamic success callbacks 2024-02-01 19:42:01 -08:00
Krrish Dholakia
71ee4524c5 fix(utils.py): fix azure exception mapping 2024-02-01 19:05:20 -08:00
Krrish Dholakia
0ab8b764cc fix(utils.py): fix deepinfra streaming 2024-02-01 18:46:50 -08:00
Krrish Dholakia
635a34b543 fix(utils.py): fix streaming chunks to not return role, unless set 2024-02-01 09:55:56 -08:00
Krrish Dholakia
c506f374e5 test: add more logging 2024-01-31 19:31:07 -08:00