Ishaan Jaff
|
3c8a19e343
|
use cost per token for jamba
|
2024-08-27 14:18:04 -07:00 |
|
Krrish Dholakia
|
e542475f39
|
build(config.yml): bump anyio version
|
2024-08-27 07:37:06 -07:00 |
|
Krrish Dholakia
|
6bfcb431cd
|
fix(asyncify.py): fix linting errors
|
2024-08-27 07:37:06 -07:00 |
|
Krrish Dholakia
|
8d8c961b50
|
fix(asyncify.py): fix linting errors
|
2024-08-27 07:37:06 -07:00 |
|
Krrish Dholakia
|
2cf149fbad
|
perf(sagemaker.py): asyncify hf prompt template check
leads to 189% improvement in RPS @ 100 users
|
2024-08-27 07:37:06 -07:00 |
|
Krrish Dholakia
|
9ba1b78a8f
|
fix(streaming_utils.py): fix generic_chunk_has_all_required_fields
|
2024-08-26 21:13:02 -07:00 |
|
Krrish Dholakia
|
8e9acd117b
|
fix(sagemaker.py): support streaming for messages api
Fixes https://github.com/BerriAI/litellm/issues/5372
|
2024-08-26 15:08:08 -07:00 |
|
Ishaan Jaff
|
e2cdb00a81
|
track api_call_start_time
|
2024-08-22 13:52:03 -07:00 |
|
Krrish Dholakia
|
8a05ce77e9
|
feat(litellm_logging.py): add 'saved_cache_cost' to standard logging payload (s3)
|
2024-08-21 16:58:07 -07:00 |
|
Krrish Dholakia
|
7aec6f0f2a
|
fix(litellm_pre_call_utils.py): handle dynamic keys via api correctly
|
2024-08-21 13:37:21 -07:00 |
|
Krrish Dholakia
|
a7578e9f2c
|
fix(litellm_logging.py): add stricter check for special param being non none
|
2024-08-20 21:35:02 -07:00 |
|
Krrish Dholakia
|
8e9117f701
|
fix(utils.py): ensure consistent cost calc b/w returned header and logged object
|
2024-08-20 19:01:20 -07:00 |
|
Krish Dholakia
|
f888204a12
|
Merge pull request #5287 from BerriAI/litellm_fix_response_cost_cal
fix(cost_calculator.py): only override base model if custom pricing is set
|
2024-08-20 11:42:48 -07:00 |
|
Ishaan Jaff
|
c7b3978655
|
Merge pull request #5288 from BerriAI/litellm_aporia_refactor
[Feat] V2 aporia guardrails litellm
|
2024-08-19 20:41:45 -07:00 |
|
Krrish Dholakia
|
55217fa8d7
|
feat(cost_calculator.py): only override base model if custom pricing is set
|
2024-08-19 16:05:49 -07:00 |
|
Ishaan Jaff
|
8cb62213e1
|
feat run aporia as post call success hook
|
2024-08-19 11:25:31 -07:00 |
|
Krrish Dholakia
|
3cafebbc65
|
test(test_caching.py): re-introduce testing for s3 cache w/ streaming
Closes https://github.com/BerriAI/litellm/issues/3268
|
2024-08-19 10:56:48 -07:00 |
|
Krrish Dholakia
|
f7a2e04426
|
feat(pass_through_endpoints.py): add pass-through support for all cohere endpoints
|
2024-08-17 16:57:55 -07:00 |
|
Krrish Dholakia
|
bc0023a409
|
feat(google_ai_studio_endpoints.py): support pass-through endpoint for all google ai studio requests
New Feature
|
2024-08-17 10:46:59 -07:00 |
|
Krish Dholakia
|
f3e17cd692
|
Merge branch 'main' into litellm_log_model_price_information
|
2024-08-16 19:34:16 -07:00 |
|
Krish Dholakia
|
a8dd2b6910
|
Merge pull request #5244 from BerriAI/litellm_better_error_logging_sentry
refactor: replace .error() with .exception() logging for better debugging on sentry
|
2024-08-16 19:16:20 -07:00 |
|
Ishaan Jaff
|
51da6ab64e
|
fix databricks streaming test
|
2024-08-16 16:56:08 -07:00 |
|
Krrish Dholakia
|
a92dcdd2d6
|
fix(litellm_logging.py): fix price information logging to s3
|
2024-08-16 16:42:38 -07:00 |
|
Krrish Dholakia
|
178139f18d
|
feat(litellm_logging.py): support logging model price information to s3 logs
|
2024-08-16 16:21:34 -07:00 |
|
Krrish Dholakia
|
61f4b71ef7
|
refactor: replace .error() with .exception() logging for better debugging on sentry
|
2024-08-16 09:22:47 -07:00 |
|
Krrish Dholakia
|
28d7de30de
|
fix(litellm_logging.py): wrap function to safely fail
|
2024-08-15 18:05:06 -07:00 |
|
Krrish Dholakia
|
f6dba82882
|
feat(litellm_logging.py): cleanup payload + add response cost to logged payload
|
2024-08-15 17:53:25 -07:00 |
|
Krrish Dholakia
|
3ddeb3297d
|
fix(litellm_logging.py): fix standard payload
|
2024-08-15 17:33:40 -07:00 |
|
Krrish Dholakia
|
cda50e5d47
|
fix(s3.py): fix s3 logging payload to have valid json values
Previously pydantic objects were being stringified, making them unparsable
|
2024-08-15 17:09:02 -07:00 |
|
Ishaan Jaff
|
ce9af1e1df
|
refactor prometheus to be a customLogger class
|
2024-08-10 09:28:46 -07:00 |
|
Ishaan Jaff
|
20ef72194c
|
use customLogger for prometheus logger
|
2024-08-10 09:15:23 -07:00 |
|
Krrish Dholakia
|
a26b23a3f4
|
fix(litellm_logging.py): fix calling success callback w/ stream_options true
Fixes https://github.com/BerriAI/litellm/issues/5118
|
2024-08-09 18:20:42 -07:00 |
|
Ishaan Jaff
|
68a36600c2
|
fix use get_file_check_sum
|
2024-08-08 08:03:08 -07:00 |
|
Ishaan Jaff
|
a4eb32f4f3
|
fix linting errors
|
2024-08-05 08:54:04 -07:00 |
|
Ishaan Jaff
|
566d51767f
|
use util convert_litellm_response_object_to_dict
|
2024-08-05 08:40:19 -07:00 |
|
Krrish Dholakia
|
fe7e68adc8
|
fix(utils.py): fix codestral streaming
|
2024-08-02 07:38:06 -07:00 |
|
Krrish Dholakia
|
f506eb341b
|
feat(litellm_logging.py): log exception response headers to langfuse
|
2024-08-01 18:07:47 -07:00 |
|
Ishaan Jaff
|
6c899ced28
|
init gcs using gcs_bucket
|
2024-08-01 18:07:38 -07:00 |
|
Krrish Dholakia
|
7404373e93
|
fix(google.py): fix cost tracking for vertex ai mistral models
|
2024-08-01 18:07:38 -07:00 |
|
Krrish Dholakia
|
69efc4dcdd
|
fix(litellm_logging.py): fix linting errors
|
2024-08-01 17:32:22 -07:00 |
|
Krrish Dholakia
|
10b571ca42
|
fix(litellm_logging.py): use 1 cost calc function across response headers + logging integrations
Ensures consistent cost calculation when azure base models are used
|
2024-08-01 10:26:59 -07:00 |
|
Krrish Dholakia
|
185a6857f9
|
fix(utils.py): fix cost tracking for vertex ai partner models
|
2024-07-30 14:20:52 -07:00 |
|
Ishaan Jaff
|
f71ba63cab
|
refactor use common helper
|
2024-07-27 11:39:03 -07:00 |
|
Krrish Dholakia
|
fe7f78fbf6
|
feat(vertex_httpx.py): support logging vertex ai safety results to langfuse
Closes https://github.com/BerriAI/litellm/issues/3230
|
2024-07-26 20:50:43 -07:00 |
|
Krrish Dholakia
|
fe0b55f2ca
|
fix(utils.py): fix cache hits for streaming
Fixes https://github.com/BerriAI/litellm/issues/4109
|
2024-07-26 19:04:08 -07:00 |
|
Krrish Dholakia
|
2f773d9cb6
|
fix(litellm_cost_calc/google.py): support meta llama vertex ai cost tracking
|
2024-07-25 22:12:07 -07:00 |
|
Krish Dholakia
|
e770ac48a2
|
Merge branch 'main' into litellm_braintrust_integration
|
2024-07-22 22:40:39 -07:00 |
|
Krrish Dholakia
|
548e4f53f8
|
feat(redact_messages.py): allow remove sensitive key information before passing to logging integration
|
2024-07-22 20:58:02 -07:00 |
|
Krrish Dholakia
|
e4ab50e1a1
|
feat(braintrust_logging.py): working braintrust logging for successful calls
|
2024-07-22 17:04:55 -07:00 |
|
Ishaan Jaff
|
c7f72cbbde
|
feat - add support to init arize ai
|
2024-07-22 10:58:20 -07:00 |
|