Krrish Dholakia
|
01c7e18f31
|
fix(utils.py): include system fingerprint in streaming response object
|
2023-11-30 08:45:52 -08:00 |
|
Krrish Dholakia
|
e98fac66da
|
fix(utils.py): fix register model cost map
|
2023-11-29 21:12:29 -08:00 |
|
Krrish Dholakia
|
0d200cd8dc
|
feat(main.py): allow updating model cost via completion()
|
2023-11-29 20:14:39 -08:00 |
|
ishaan-jaff
|
09caab549a
|
(feat) async embeddings: OpenAI
|
2023-11-29 19:35:08 -08:00 |
|
ishaan-jaff
|
2c74dbed17
|
(chore) util: remove_model_id
|
2023-11-29 17:30:33 -08:00 |
|
Krrish Dholakia
|
5411d5a6fd
|
fix(utils.py): raise stop iteration exception on bedrock stream close
|
2023-11-29 16:43:11 -08:00 |
|
Krrish Dholakia
|
ab76daa90b
|
fix(bedrock.py): support ai21 / bedrock streaming
|
2023-11-29 16:35:06 -08:00 |
|
ishaan-jaff
|
2bbd9c063d
|
(fix) OpenAI embedding
|
2023-11-29 16:09:31 -08:00 |
|
Krrish Dholakia
|
96a27ce954
|
fix(utils.py): stop sequence filtering for amazon titan models
|
2023-11-29 16:04:14 -08:00 |
|
Krrish Dholakia
|
6c98715b94
|
bump: version 1.7.13 → 1.7.14
|
2023-11-29 15:19:18 -08:00 |
|
Krrish Dholakia
|
2b437a2699
|
fix(utils.py): return last streaming chunk
|
2023-11-29 12:11:08 -08:00 |
|
Krrish Dholakia
|
b6bc75e27a
|
fix(utils.py): fix parallel tool calling when streaming
|
2023-11-29 10:56:21 -08:00 |
|
Krrish Dholakia
|
9024a47dc2
|
fix(utils.py): bedrock/cohere optional params
|
2023-11-29 08:08:48 -08:00 |
|
Krrish Dholakia
|
4e9aa0d338
|
fix(utils.py): fix bedrock/cohere supported params
|
2023-11-28 17:42:50 -08:00 |
|
Krrish Dholakia
|
bb1267eb07
|
fix(router.py): fix exponential backoff to use retry-after if present in headers
|
2023-11-28 17:25:03 -08:00 |
|
Krrish Dholakia
|
5ed957ebbe
|
fix(utils.py): bug fix return only non-null responses
|
2023-11-28 09:43:42 -08:00 |
|
Krrish Dholakia
|
e8331a4647
|
fix(utils.py): azure tool calling streaming
|
2023-11-27 19:07:38 -08:00 |
|
Krrish Dholakia
|
4cdd930fa2
|
fix(stream_chunk_builder): adding support for tool calling in completion counting
|
2023-11-27 18:39:47 -08:00 |
|
ishaan-jaff
|
9cef551623
|
(feat) raise APIConnectionError error for Azure +OpenAI
|
2023-11-27 18:08:47 -08:00 |
|
Krrish Dholakia
|
04f745e314
|
fix(router.py): speed improvements to the router
|
2023-11-27 17:35:26 -08:00 |
|
ishaan-jaff
|
d0538e32c9
|
(feat) completion: sagemaker debugging - show boto3 request sent
|
2023-11-27 09:04:50 -08:00 |
|
Krrish Dholakia
|
59ba1560e5
|
fix(router.py): fix fallbacks
|
2023-11-25 19:34:20 -08:00 |
|
Krrish Dholakia
|
ab0bc87427
|
fix(router.py): check if fallbacks is none
|
2023-11-25 14:58:07 -08:00 |
|
Krrish Dholakia
|
95579fda7d
|
fix(utils.py): fix bedrock + cohere calls
|
2023-11-25 14:45:42 -08:00 |
|
Krrish Dholakia
|
2eb7386095
|
fix(utils.py): fix embedding response object conversion
|
2023-11-25 14:25:06 -08:00 |
|
Krrish Dholakia
|
e43e1e9ab1
|
fix(utils.py): fix linting errors
|
2023-11-25 14:19:14 -08:00 |
|
Krrish Dholakia
|
ec81b393e2
|
fix(utils.py): fix linting erros
|
2023-11-25 14:14:46 -08:00 |
|
Krrish Dholakia
|
a070bee5be
|
fix(utils.py): fix linting issues
|
2023-11-25 14:11:40 -08:00 |
|
Krrish Dholakia
|
8970f12780
|
fix(utils.py): fix linting issues
|
2023-11-25 13:48:50 -08:00 |
|
Krrish Dholakia
|
6d9f7b8f9d
|
fix: fix nlp cloud streaming
|
2023-11-25 13:45:23 -08:00 |
|
Krrish Dholakia
|
30f47d3169
|
bump: version 1.7.0 → 1.7.1
|
2023-11-25 12:34:28 -08:00 |
|
Krrish Dholakia
|
dac76a4861
|
fix(utils.py): fix embedding response output parsing
|
2023-11-25 12:06:57 -08:00 |
|
Krrish Dholakia
|
e732fb8b97
|
fix(main.py): logit bias mapping for batch_completions
|
2023-11-24 16:05:51 -08:00 |
|
ishaan-jaff
|
0f873c756d
|
(fix) completion: OpenAI/Azure filter out None params
|
2023-11-24 14:01:21 -08:00 |
|
Krrish Dholakia
|
4a5dae3941
|
fix(main.py): fix streaming_chunk_builder to return usage
|
2023-11-24 11:27:04 -08:00 |
|
ishaan-jaff
|
19fb24cd15
|
(feat) cost tracking for azure llms
|
2023-11-23 21:41:38 -08:00 |
|
Krrish Dholakia
|
7d221fe863
|
fix(utils.py): make failure logging sync
|
2023-11-23 20:19:27 -08:00 |
|
ishaan-jaff
|
695eaac542
|
(fix) cost calculator for FT: gpt-3.5
|
2023-11-23 18:28:21 -08:00 |
|
Krrish Dholakia
|
1cf85b4eb9
|
fix(utils.py): vertex ai api error exception mapping
|
2023-11-23 17:50:50 -08:00 |
|
Krrish Dholakia
|
94dc3f66f3
|
fix(utils.py): remove eos token for zephyr models
|
2023-11-23 17:47:39 -08:00 |
|
Krrish Dholakia
|
f24786095a
|
fix(vertex_ai.py): fix exception mapping for vertex ai
|
2023-11-23 17:35:33 -08:00 |
|
ishaan-jaff
|
43ae3b94fa
|
(fix) debugging: viewing raw POST request
|
2023-11-23 16:28:19 -08:00 |
|
ishaan-jaff
|
37dedeee04
|
(fix) debugging: POST request
|
2023-11-23 16:08:59 -08:00 |
|
ishaan-jaff
|
bb00595429
|
(feat) cost: azure gpt + testing
|
2023-11-23 14:20:48 -08:00 |
|
ishaan-jaff
|
46fc8be4c8
|
(feat) cost tracking ft:gpt-3.5-turbo
|
2023-11-23 13:58:59 -08:00 |
|
ishaan-jaff
|
da7c07574f
|
(test) cost calc on azure
|
2023-11-23 13:50:09 -08:00 |
|
Krrish Dholakia
|
4f183dc6a0
|
fix(utils.py): support reading api keys dynamically from the os environment
|
2023-11-23 13:41:56 -08:00 |
|
Krrish Dholakia
|
3e76d4b422
|
feat(router.py): add server cooldown logic
|
2023-11-22 15:59:48 -08:00 |
|
ishaan-jaff
|
e8ff4d5eca
|
(feat) clean out junk params from litellm embedding
|
2023-11-22 13:50:45 -08:00 |
|
Krrish Dholakia
|
b0801f61e6
|
test(test_caching.py): cleaning up tests
|
2023-11-22 13:43:48 -08:00 |
|