| ishaan-jaff | e58b3d5df0 | (feat) add azure/gpt-4-1106-preview | 2023-11-29 18:21:31 -08:00 |
| ishaan-jaff | 7bcc23e8e9 | (fix) router: set default rpm/tpm when not set | 2023-11-29 18:13:27 -08:00 |
| ishaan-jaff | c1914a01bc | (docs) routing | 2023-11-29 18:09:39 -08:00 |
| ishaan-jaff | f299120394 | (docs) router | 2023-11-29 18:08:00 -08:00 |
| ishaan-jaff | 305faab542 | (test) router:get_available_deployment | 2023-11-29 17:54:41 -08:00 |
| ishaan-jaff | 23af756531 | (feat) router: random pick based on tpm/rpm | 2023-11-29 17:54:06 -08:00 |
| ishaan-jaff | 2c74dbed17 | (chore) util: remove_model_id | 2023-11-29 17:30:33 -08:00 |
| ishaan-jaff | 7a38a45d62 | (test) test weighted selection router | 2023-11-29 17:30:18 -08:00 |
| ishaan-jaff | 48416f8018 | (test) add rpm to load test profiling | 2023-11-29 17:14:34 -08:00 |
| ishaan-jaff | 088d2bc081 | (fix) use weighted shuffle when rpm set | 2023-11-29 17:13:11 -08:00 |
| Krrish Dholakia | 38efc21f81 | bump: version 1.7.18 → 1.7.19 | 2023-11-29 16:50:11 -08:00 |
| Krrish Dholakia | 61185aa12c | fix(main.py): fix null finish reason issue for ollama | 2023-11-29 16:50:11 -08:00 |
| ishaan-jaff | 69eca78000 | (docs) simple proxy | 2023-11-29 16:44:40 -08:00 |
| Krrish Dholakia | c2f642dbec | bump: version 1.7.17 → 1.7.18 | 2023-11-29 16:43:11 -08:00 |
| Krrish Dholakia | 5411d5a6fd | fix(utils.py): raise stop iteration exception on bedrock stream close | 2023-11-29 16:43:11 -08:00 |
| Ishaan Jaff | 286ce586be | Update README.md | 2023-11-29 16:40:16 -08:00 |
| ishaan-jaff | 4b78481fbd | (docs) simple proxy | 2023-11-29 16:38:36 -08:00 |
| ishaan-jaff | 2d0432c5b7 | (docs) simple proxy | 2023-11-29 16:36:07 -08:00 |
| Krrish Dholakia | 52c9159a54 | bump: version 1.7.16 → 1.7.17 | 2023-11-29 16:35:06 -08:00 |
| Krrish Dholakia | ab76daa90b | fix(bedrock.py): support ai21 / bedrock streaming | 2023-11-29 16:35:06 -08:00 |
| ishaan-jaff | 3b89cff65e | (docs) simple proxy | 2023-11-29 16:33:00 -08:00 |
| ishaan-jaff | 032cd0121b | (docs) simple proxy | 2023-11-29 16:31:08 -08:00 |
| ishaan-jaff | 3cc8305ec6 | (fix) proxy: /health | 2023-11-29 16:23:37 -08:00 |
| ishaan-jaff | d3672452ce | (test) 1k requests | 2023-11-29 16:22:18 -08:00 |
| ishaan-jaff | 3c6764efef | (feat) proxy+ router: support 1k request/second | 2023-11-29 16:22:04 -08:00 |
| ishaan-jaff | da75b15176 | (feat) completion: add rpm, tpm as litellm params | 2023-11-29 16:19:05 -08:00 |
| ishaan-jaff | 9bf603889f | (fix) azure: remove max retries before completion | 2023-11-29 16:09:31 -08:00 |
| ishaan-jaff | 66bc0fc343 | (fix) proxy: /health works with router updates | 2023-11-29 16:09:31 -08:00 |
| ishaan-jaff | 8a398a1777 | (feat) proxy: add weighted shuffle + set cooldown to 1s | 2023-11-29 16:09:31 -08:00 |
| ishaan-jaff | 2bbd9c063d | (fix) OpenAI embedding | 2023-11-29 16:09:31 -08:00 |
| Krrish Dholakia | 3c254cc555 | bump: version 1.7.15 → 1.7.16 | 2023-11-29 16:04:22 -08:00 |
| Krrish Dholakia | 96a27ce954 | fix(utils.py): stop sequence filtering for amazon titan models | 2023-11-29 16:04:14 -08:00 |
| Krrish Dholakia | dccfe1cc3e | bump: version 1.7.14 → 1.7.15 | 2023-11-29 15:37:21 -08:00 |
| Krrish Dholakia | 04a1c20bc5 | fix(router.py): skip api key when generating model id for router deployments | 2023-11-29 15:37:08 -08:00 |
| Krrish Dholakia | 6c98715b94 | bump: version 1.7.13 → 1.7.14 | 2023-11-29 15:19:18 -08:00 |
| Krrish Dholakia | 451851e6a4 | fix(main.py): have stream_chunk_builder return successful response even if token_counter fails | 2023-11-29 15:19:11 -08:00 |
| Krrish Dholakia | e41bd77df4 | bump: version 1.7.12 → 1.7.13 | 2023-11-29 12:22:27 -08:00 |
| Krrish Dholakia | 7175cac1de | test(test_custom_logger.py): comment out streaming/success test due to pytest build issues | 2023-11-29 12:18:02 -08:00 |
| Krrish Dholakia | 2b437a2699 | fix(utils.py): return last streaming chunk | 2023-11-29 12:11:08 -08:00 |
| Krrish Dholakia | a9ed768991 | fix(azure.py): fix error handling for openai/azure streaming | 2023-11-29 11:52:24 -08:00 |
| Krrish Dholakia | b6bc75e27a | fix(utils.py): fix parallel tool calling when streaming | 2023-11-29 10:56:21 -08:00 |
| Krrish Dholakia | 9024a47dc2 | fix(utils.py): bedrock/cohere optional params | 2023-11-29 08:08:48 -08:00 |
| ishaan-jaff | f307e82a41 | (fix) proxy: making receiving data print_verbose | 2023-11-29 07:50:52 -08:00 |
| Krrish Dholakia | 5fc7cd28d9 | bump: version 1.7.11 → 1.7.12 | 2023-11-28 21:34:56 -08:00 |
| ishaan-jaff | 6f71299bb0 | (fix) embedding pop out client from params | 2023-11-28 21:22:01 -08:00 |
| Krrish Dholakia | 383dd53e86 | fix(main.py): passing client as a litellm-specific kwarg | 2023-11-28 21:20:05 -08:00 |
| Krrish Dholakia | 3154cd1758 | test(test_exceptions.py): cleanup testing | 2023-11-28 21:11:49 -08:00 |
| ishaan-jaff | 4c4dde60c7 | (ci/cd) test exceptions | 2023-11-28 21:11:12 -08:00 |
| ishaan-jaff | afd20098be | (feat) router: init client for OpenAI compatible providers | 2023-11-28 17:49:53 -08:00 |
| Krrish Dholakia | b9ae6275ca | bump: version 1.7.10 → 1.7.11 | 2023-11-28 17:42:57 -08:00 |
|