123 commits

Author SHA1 Message Date
Krish Dholakia
8ce1e49fbe fix(utils.py): correctly log streaming cache hits (#5417) (#5426)
Fixes https://github.com/BerriAI/litellm/issues/5401
2024-08-28 22:50:33 -07:00
Ishaan Jaff
3c8a19e343 use cost per token for jamba 2024-08-27 14:18:04 -07:00
Krrish Dholakia
e542475f39 build(config.yml): bump anyio version 2024-08-27 07:37:06 -07:00
Krrish Dholakia
6bfcb431cd fix(asyncify.py): fix linting errors 2024-08-27 07:37:06 -07:00
Krrish Dholakia
8d8c961b50 fix(asyncify.py): fix linting errors 2024-08-27 07:37:06 -07:00
Krrish Dholakia
2cf149fbad perf(sagemaker.py): asyncify hf prompt template check
leads to 189% improvement in RPS @ 100 users
2024-08-27 07:37:06 -07:00
Krrish Dholakia
9ba1b78a8f fix(streaming_utils.py): fix generic_chunk_has_all_required_fields 2024-08-26 21:13:02 -07:00
Krrish Dholakia
8e9acd117b fix(sagemaker.py): support streaming for messages api
Fixes https://github.com/BerriAI/litellm/issues/5372
2024-08-26 15:08:08 -07:00
Ishaan Jaff
e2cdb00a81 track api_call_start_time 2024-08-22 13:52:03 -07:00
Krrish Dholakia
8a05ce77e9 feat(litellm_logging.py): add 'saved_cache_cost' to standard logging payload (s3) 2024-08-21 16:58:07 -07:00
Krrish Dholakia
7aec6f0f2a fix(litellm_pre_call_utils.py): handle dynamic keys via api correctly 2024-08-21 13:37:21 -07:00
Krrish Dholakia
a7578e9f2c fix(litellm_logging.py): add stricter check for special param being non none 2024-08-20 21:35:02 -07:00
Krrish Dholakia
8e9117f701 fix(utils.py): ensure consistent cost calc b/w returned header and logged object 2024-08-20 19:01:20 -07:00
Krish Dholakia
f888204a12 Merge pull request #5287 from BerriAI/litellm_fix_response_cost_cal
fix(cost_calculator.py): only override base model if custom pricing is set
2024-08-20 11:42:48 -07:00
Ishaan Jaff
c7b3978655 Merge pull request #5288 from BerriAI/litellm_aporia_refactor
[Feat] V2 aporia guardrails litellm
2024-08-19 20:41:45 -07:00
Krrish Dholakia
55217fa8d7 feat(cost_calculator.py): only override base model if custom pricing is set 2024-08-19 16:05:49 -07:00
Ishaan Jaff
8cb62213e1 feat run aporia as post call success hook 2024-08-19 11:25:31 -07:00
Krrish Dholakia
3cafebbc65 test(test_caching.py): re-introduce testing for s3 cache w/ streaming
Closes https://github.com/BerriAI/litellm/issues/3268
2024-08-19 10:56:48 -07:00
Krrish Dholakia
f7a2e04426 feat(pass_through_endpoints.py): add pass-through support for all cohere endpoints 2024-08-17 16:57:55 -07:00
Krrish Dholakia
bc0023a409 feat(google_ai_studio_endpoints.py): support pass-through endpoint for all google ai studio requests
New Feature
2024-08-17 10:46:59 -07:00
Krish Dholakia
f3e17cd692 Merge branch 'main' into litellm_log_model_price_information 2024-08-16 19:34:16 -07:00
Krish Dholakia
a8dd2b6910 Merge pull request #5244 from BerriAI/litellm_better_error_logging_sentry
refactor: replace .error() with .exception() logging for better debugging on sentry
2024-08-16 19:16:20 -07:00
Ishaan Jaff
51da6ab64e fix databricks streaming test 2024-08-16 16:56:08 -07:00
Krrish Dholakia
a92dcdd2d6 fix(litellm_logging.py): fix price information logging to s3 2024-08-16 16:42:38 -07:00
Krrish Dholakia
178139f18d feat(litellm_logging.py): support logging model price information to s3 logs 2024-08-16 16:21:34 -07:00
Krrish Dholakia
61f4b71ef7 refactor: replace .error() with .exception() logging for better debugging on sentry 2024-08-16 09:22:47 -07:00
Krrish Dholakia
28d7de30de fix(litellm_logging.py): wrap function to safely fail 2024-08-15 18:05:06 -07:00
Krrish Dholakia
f6dba82882 feat(litellm_logging.py): cleanup payload + add response cost to logged payload 2024-08-15 17:53:25 -07:00
Krrish Dholakia
3ddeb3297d fix(litellm_logging.py): fix standard payload 2024-08-15 17:33:40 -07:00
Krrish Dholakia
cda50e5d47 fix(s3.py): fix s3 logging payload to have valid json values
Previously pydantic objects were being stringified, making them unparsable
2024-08-15 17:09:02 -07:00
Ishaan Jaff
ce9af1e1df refactor prometheus to be a customLogger class 2024-08-10 09:28:46 -07:00
Ishaan Jaff
20ef72194c use customLogger for prometheus logger 2024-08-10 09:15:23 -07:00
Krrish Dholakia
a26b23a3f4 fix(litellm_logging.py): fix calling success callback w/ stream_options true
Fixes https://github.com/BerriAI/litellm/issues/5118
2024-08-09 18:20:42 -07:00
Ishaan Jaff
68a36600c2 fix use get_file_check_sum 2024-08-08 08:03:08 -07:00
Ishaan Jaff
a4eb32f4f3 fix linting errors 2024-08-05 08:54:04 -07:00
Ishaan Jaff
566d51767f use util convert_litellm_response_object_to_dict 2024-08-05 08:40:19 -07:00
Krrish Dholakia
fe7e68adc8 fix(utils.py): fix codestral streaming 2024-08-02 07:38:06 -07:00
Krrish Dholakia
f506eb341b feat(litellm_logging.py): log exception response headers to langfuse 2024-08-01 18:07:47 -07:00
Ishaan Jaff
6c899ced28 init gcs using gcs_bucket 2024-08-01 18:07:38 -07:00
Krrish Dholakia
7404373e93 fix(google.py): fix cost tracking for vertex ai mistral models 2024-08-01 18:07:38 -07:00
Krrish Dholakia
69efc4dcdd fix(litellm_logging.py): fix linting errors 2024-08-01 17:32:22 -07:00
Krrish Dholakia
10b571ca42 fix(litellm_logging.py): use 1 cost calc function across response headers + logging integrations
Ensures consistent cost calculation when azure base models are used
2024-08-01 10:26:59 -07:00
Krrish Dholakia
185a6857f9 fix(utils.py): fix cost tracking for vertex ai partner models 2024-07-30 14:20:52 -07:00
Ishaan Jaff
f71ba63cab refactor use common helper 2024-07-27 11:39:03 -07:00
Krrish Dholakia
fe7f78fbf6 feat(vertex_httpx.py): support logging vertex ai safety results to langfuse
Closes https://github.com/BerriAI/litellm/issues/3230
2024-07-26 20:50:43 -07:00
Krrish Dholakia
fe0b55f2ca fix(utils.py): fix cache hits for streaming
Fixes https://github.com/BerriAI/litellm/issues/4109
2024-07-26 19:04:08 -07:00
Krrish Dholakia
2f773d9cb6 fix(litellm_cost_calc/google.py): support meta llama vertex ai cost tracking 2024-07-25 22:12:07 -07:00
Krish Dholakia
e770ac48a2 Merge branch 'main' into litellm_braintrust_integration 2024-07-22 22:40:39 -07:00
Krrish Dholakia
548e4f53f8 feat(redact_messages.py): allow remove sensitive key information before passing to logging integration 2024-07-22 20:58:02 -07:00
Krrish Dholakia
e4ab50e1a1 feat(braintrust_logging.py): working braintrust logging for successful calls 2024-07-22 17:04:55 -07:00