Krish Dholakia
|
a9a9969d6d
|
Merge pull request #1641 from BerriAI/litellm_bedrock_region_based_pricing
feat(utils.py): support region based pricing for bedrock + use bedrock's token counts if given
|
2024-01-26 20:28:16 -08:00 |
|
Krrish Dholakia
|
e1beaf0945
|
feat(utils.py): support region based pricing for bedrock + use bedrock's token counts if given
|
2024-01-26 14:53:58 -08:00 |
|
ishaan-jaff
|
bc66c0e366
|
(feat) add support for dimensions param
|
2024-01-26 10:54:34 -08:00 |
|
Krrish Dholakia
|
cc6a58d2e0
|
test(main.py): adding more logging
|
2024-01-25 18:15:24 -08:00 |
|
Krrish Dholakia
|
7591aba27e
|
fix(main.py): allow vertex ai project and location to be set in completion() call
|
2024-01-25 16:40:23 -08:00 |
|
Krrish Dholakia
|
d88e190304
|
fix(main.py): fix logging event loop for async logging but sync streaming
|
2024-01-25 15:59:53 -08:00 |
|
Krrish Dholakia
|
806eef02dd
|
fix(main.py): fix order of assembly for streaming chunks
|
2024-01-25 14:51:08 -08:00 |
|
Krrish Dholakia
|
402235dc5d
|
fix(utils.py): fix sagemaker async logging for sync streaming
https://github.com/BerriAI/litellm/issues/1592
|
2024-01-25 12:49:45 -08:00 |
|
Ishaan Jaff
|
97dd61a6cb
|
Merge pull request #1561 from BerriAI/litellm_sagemaker_streaming
[Feat] Add REAL Sagemaker streaming
|
2024-01-22 22:10:20 -08:00 |
|
ishaan-jaff
|
eb9b3987fb
|
(v0) sagemaker streaming
|
2024-01-22 21:50:40 -08:00 |
|
Krrish Dholakia
|
db2b7bfd4e
|
fix(openai.py): fix linting issue
|
2024-01-22 18:20:15 -08:00 |
|
Krrish Dholakia
|
e423aeff85
|
fix: support streaming custom cost completion tracking
|
2024-01-22 15:15:34 -08:00 |
|
Krrish Dholakia
|
85b9ad7def
|
fix(main.py): support custom pricing for embedding calls
|
2024-01-22 15:15:34 -08:00 |
|
Krrish Dholakia
|
480c3d3991
|
feat(utils.py): support custom cost tracking per second
https://github.com/BerriAI/litellm/issues/1374
|
2024-01-22 15:15:34 -08:00 |
|
ishaan-jaff
|
033de31ec9
|
(feat) mock_response set custom_llm_provider in hidden param
|
2024-01-22 14:22:16 -08:00 |
|
Krrish Dholakia
|
2ecc2f12cd
|
fix(gemini.py): support streaming
|
2024-01-19 20:21:34 -08:00 |
|
Tanaro Laptop
|
af0f939079
|
change max_tokens type to int
|
2024-01-20 01:50:27 +01:00 |
|
Ishaan Jaff
|
268f9e3c7e
|
Merge pull request #1513 from costly-ai/main
Allow overriding headers for anthropic
|
2024-01-19 15:21:45 -08:00 |
|
ishaan-jaff
|
cf2bf1fe60
|
(fix) return usage in mock_completion
|
2024-01-19 11:25:47 -08:00 |
|
Keegan McCallum
|
48948805a8
|
Allow overriding headers for anthropic
|
2024-01-18 20:12:59 -08:00 |
|
Krrish Dholakia
|
aa01ed0f38
|
fix(main.py): read azure ad token from optional params extra body
|
2024-01-18 17:14:03 -08:00 |
|
Krrish Dholakia
|
cc89aa7456
|
fix(bedrock.py): add support for sts based boto3 initialization
https://github.com/BerriAI/litellm/issues/1476
|
2024-01-17 12:08:59 -08:00 |
|
ishaan-jaff
|
cd2828e93a
|
(feat) set custom_llm_provider in stream chunk builder
|
2024-01-13 11:09:22 -08:00 |
|
Krrish Dholakia
|
0bcca3fed3
|
refactor(main.py): trigger rebuild
|
2024-01-13 15:55:56 +05:30 |
|
ishaan-jaff
|
f7bdee69fb
|
(fix) always check if response has hidden_param attr
|
2024-01-12 17:51:34 -08:00 |
|
ishaan-jaff
|
e83c70ea55
|
(feat) set custom_llm_provider for embedding hidden params
|
2024-01-12 17:35:08 -08:00 |
|
ishaan-jaff
|
f9271b59b4
|
(v0)
|
2024-01-12 17:05:51 -08:00 |
|
Krrish Dholakia
|
becdabe837
|
fix(main.py): support text completion routing
|
2024-01-12 11:24:31 +05:30 |
|
Krrish Dholakia
|
cbb021c9af
|
refactor(main.py): trigger new release
|
2024-01-12 00:14:12 +05:30 |
|
Krrish Dholakia
|
0e1ea4325c
|
fix(azure.py): support health checks to text completion endpoints
|
2024-01-12 00:13:01 +05:30 |
|
Krish Dholakia
|
7ecfc09221
|
Merge branch 'main' into litellm_embedding_caching_updates
|
2024-01-11 23:58:51 +05:30 |
|
Krrish Dholakia
|
36068b707a
|
fix(proxy_cli.py): read db url from config, not just environment
|
2024-01-11 19:19:29 +05:30 |
|
Krrish Dholakia
|
f3b7e98da7
|
fix(main.py): init custom llm provider earlier
|
2024-01-11 18:30:10 +05:30 |
|
Krrish Dholakia
|
4de82617c0
|
fix(main.py): add back **kwargs for acompletion
|
2024-01-11 16:55:19 +05:30 |
|
Krrish Dholakia
|
66addb1a01
|
fix(utils.py): support caching individual items in embedding input list
https://github.com/BerriAI/litellm/issues/1350
|
2024-01-11 16:51:34 +05:30 |
|
Krrish Dholakia
|
1472dc3f54
|
fix: n
|
2024-01-11 16:30:05 +05:30 |
|
ishaan-jaff
|
c41b47dc8b
|
(fix) acompletion kwargs type hints
|
2024-01-11 14:22:37 +05:30 |
|
ishaan-jaff
|
29393fb512
|
(fix) acompletion typehints - pass kwargs
|
2024-01-11 11:49:55 +05:30 |
|
ishaan-jaff
|
cea0d6c8b0
|
(fix) litellm.acompletion with type hints
|
2024-01-11 10:47:12 +05:30 |
|
Ishaan Jaff
|
6e1be43595
|
Merge pull request #1200 from MateoCamara/explicit-args-acomplete
feat: added explicit args to acomplete
|
2024-01-11 10:39:05 +05:30 |
|
Krrish Dholakia
|
e71154f286
|
fix(main.py): fix streaming completion token counting error
|
2024-01-10 23:44:35 +05:30 |
|
Mateo Cámara
|
fb37ea291e
|
Merge branch 'main' into explicit-args-acomplete
|
2024-01-09 13:07:37 +01:00 |
|
Mateo Cámara
|
8b84117367
|
Reverted changes made by the IDE automatically
|
2024-01-09 12:55:12 +01:00 |
|
ishaan-jaff
|
84271cb608
|
(feat) add exception mapping for litellm.image_generation
|
2024-01-09 16:54:47 +05:30 |
|
Mateo Cámara
|
6a9d846506
|
Added the new acompletion parameters based on CompletionRequest attributes
|
2024-01-09 12:05:31 +01:00 |
|
Krrish Dholakia
|
5daa3ce237
|
fix(main.py): support cost calculation for text completion streaming object
|
2024-01-08 12:41:43 +05:30 |
|
Krrish Dholakia
|
e4a5a3395c
|
fix(huggingface_restapi.py): support timeouts for huggingface + openai text completions
https://github.com/BerriAI/litellm/issues/1334
|
2024-01-08 11:40:56 +05:30 |
|
Krrish Dholakia
|
176af67aac
|
fix(caching.py): support ttl, s-max-age, and no-cache cache controls
https://github.com/BerriAI/litellm/issues/1306
|
2024-01-03 12:42:43 +05:30 |
|
ishaan-jaff
|
f582ef666f
|
(fix) counting response tokens+streaming
|
2024-01-03 12:06:39 +05:30 |
|
ishaan-jaff
|
0e8809abf2
|
(feat) add xinference as an embedding provider
|
2024-01-02 15:32:26 +05:30 |
|