Commit graph

41 commits

Author SHA1 Message Date
skucherlapati
2ad342e7bf add medlm models to cost map 2024-07-17 21:17:58 -07:00
Ishaan Jaff
14f5cab09a fix medllm test 2024-07-17 20:19:37 -07:00
skucherlapati
ce474ff008 fix failing tests on PR-4760 2024-07-17 19:32:17 -07:00
skucherlapati
db3d6925c6 add medlm cost calc 2024-07-17 14:54:54 -07:00
Krrish Dholakia
959c627dd3 fix(litellm_logging.py): log response_cost=0 for failed calls
Fixes https://github.com/BerriAI/litellm/issues/4604
2024-07-15 19:25:56 -07:00
Krrish Dholakia
b1be355d42 build(model_prices_and_context_window.json): add azure ai jamba instruct pricing + token details
Adds jamba instruct, mistral, llama3 pricing + token info for azure_ai
2024-07-13 16:34:31 -07:00
Krrish Dholakia
2163434ff3 fix(llm_cost_calc/google.py): fix google embedding cost calculation
Fixes https://github.com/BerriAI/litellm/issues/4630
2024-07-11 11:55:48 -07:00
Krrish Dholakia
407639cc7d fix(cost_calculator.py): support openai+azure tts calls 2024-07-05 20:58:08 -07:00
Krish Dholakia
869275585a
Merge branch 'main' into litellm_response_cost_headers 2024-06-27 21:33:09 -07:00
Krrish Dholakia
94c069e869 fix(cost_calculator.py): infer provider name if not given
Fixes https://github.com/BerriAI/litellm/issues/4452
2024-06-27 18:41:04 -07:00
Krrish Dholakia
f533e1da09 fix(utils.py): return 'response_cost' in completion call
Closes https://github.com/BerriAI/litellm/issues/4335
2024-06-26 17:55:57 -07:00
spdustin@gmail.com
4acc2d50ad fix: use per-token costs for claude via vertex_ai 2024-06-21 11:21:36 -05:00
Krrish Dholakia
16da21e839 feat(llm_cost_calc/google.py): do character based cost calculation for vertex ai
Calculate cost for vertex ai responses using characters in query/response

 Closes https://github.com/BerriAI/litellm/issues/4165
2024-06-19 17:18:42 -07:00
Krrish Dholakia
f597aa432b feat(cost_calculator.py): add cost calculating for dynamic context window (vertex ai / google ai studio) 2024-06-17 12:38:10 -07:00
Krrish Dholakia
3f9811c6a9 test(test_completion_cost.py): fix import 2024-06-15 12:08:49 -07:00
Krrish Dholakia
f3a845eff9 build(model_prices_and_context_window.json): update together ai model pricing - account for new categories 2024-06-08 19:56:35 -07:00
Krrish Dholakia
b26c3c7d22 fix(cost_calculator.py): fixes tgai unmapped model pricing
Fixes error where tgai helper function returned None. Enforces stronger type hints, refactors code, adds more unit testing.
2024-06-08 19:43:57 -07:00
Krrish Dholakia
15b226ca4f test(test_completion_cost.py): cleanup test 2024-06-04 16:57:25 -07:00
Krrish Dholakia
52a2f5150c fix(utils.py): fix cost calculation for openai-compatible streaming object 2024-06-04 10:36:25 -07:00
Krrish Dholakia
a2a5884df1 fix(utils.py): allow passing in custom pricing to completion_cost as params 2024-05-16 16:24:44 -07:00
Krrish Dholakia
b4a8665d11 fix(utils.py): fix custom pricing when litellm model != response obj model name 2024-05-13 15:25:35 -07:00
Ishaan Jaff
282b8d0ae4 test bedrock pricing 2024-05-08 15:26:53 -07:00
Krrish Dholakia
2cfb97141d fix(utils.py): replicate now also has token based pricing for some models 2024-04-29 08:06:15 -07:00
Krrish Dholakia
7dd94c802e fix(azure.py): support cost tracking for azure/dall-e-3 2024-03-12 10:55:54 -07:00
Krrish Dholakia
fa45c569fd feat: add cost tracking + caching for transcription calls 2024-03-09 15:43:38 -08:00
Krrish Dholakia
478307d4cf fix(bedrock.py): support anthropic messages api on bedrock (claude-3) 2024-03-04 17:15:47 -08:00
ishaan-jaff
85e9116758 (ci/cd) aws disabled our test account :( 2024-02-28 17:26:23 -08:00
Krrish Dholakia
4c951d20bc test: removing aws tests - account suspended - pending their approval 2024-02-28 13:46:20 -08:00
Krrish Dholakia
49b2dc4180 test(test_completion_cost.py): fix test 2024-02-03 22:00:49 -08:00
Krrish Dholakia
a32639fa79 fix(utils.py): support max token adjustment for sagemaker 2024-01-31 19:09:54 -08:00
Krrish Dholakia
f5da95685a feat(utils.py): support region based pricing for bedrock + use bedrock's token counts if given 2024-01-26 14:53:58 -08:00
Krrish Dholakia
ac131b3824 fix(utils.py): completion_cost support for image gen models 2024-01-25 18:08:18 -08:00
ishaan-jaff
6694975ec3 (test) azure completion_cost 2024-01-05 13:53:08 +05:30
ishaan-jaff
72e7178c9b (test) azure/embedding + completion_cost 2024-01-05 13:19:17 +05:30
ishaan-jaff
2880a7b6b4 allow users to pass custom timing for replicate 2023-09-06 18:32:40 -07:00
ishaan-jaff
fc7ad0c245 bump v 2023-09-06 18:14:58 -07:00
ishaan-jaff
8b3b682000 add replicate pricing 2023-09-06 18:14:34 -07:00
ishaan-jaff
bd77d5ac21 docs update 2023-09-06 17:16:24 -07:00
ishaan-jaff
1ba6b6761b show pricing for tg ai completion 2023-09-06 17:10:49 -07:00
ishaan-jaff
bab27634a8 rename max_tokens.json 2023-09-06 16:28:17 -07:00
ishaan-jaff
fbd67bc24c add experimental together_computer cost calc 2023-09-06 16:08:44 -07:00