Krrish Dholakia
|
2c1103247d
|
test: fix flaky test
|
2023-12-01 15:46:39 -08:00 |
|
Krrish Dholakia
|
4da20ac0b9
|
fix(proxy_server.py): accept max tokens as int
|
2023-12-01 15:19:34 -08:00 |
|
Krrish Dholakia
|
ac4429c35f
|
test(test_completion.py): commenting out flaky test
|
2023-12-01 15:06:54 -08:00 |
|
Krrish Dholakia
|
7b99cfe6b9
|
test(test_token_counter.py): include testing for function calls
|
2023-11-30 18:24:21 -08:00 |
|
Krrish Dholakia
|
032f71adb2
|
fix(router.py): support cloudflare ai gateway for azure models on router
|
2023-11-30 14:09:06 -08:00 |
|
ishaan-jaff
|
936c27c9ee
|
(fix +test) langfuse log metadata
|
2023-11-30 13:53:43 -08:00 |
|
Krrish Dholakia
|
a5c99d8594
|
fix(langfuse.py): fix metadata logging for langfuse
|
2023-11-30 13:45:34 -08:00 |
|
Krrish Dholakia
|
82553e8aac
|
fix(azure.py): fix linting errors
|
2023-11-30 13:32:29 -08:00 |
|
Krrish Dholakia
|
4f07c8565a
|
feat(main.py): add support for azure-openai via cloudflare ai gateway
|
2023-11-30 13:19:49 -08:00 |
|
Krrish Dholakia
|
01c7e18f31
|
fix(utils.py): include system fingerprint in streaming response object
|
2023-11-30 08:45:52 -08:00 |
|
Krrish Dholakia
|
0d200cd8dc
|
feat(main.py): allow updating model cost via completion()
|
2023-11-29 20:14:39 -08:00 |
|
ishaan-jaff
|
4c1ef4e270
|
(chore) fix testing
|
2023-11-29 20:05:13 -08:00 |
|
Krrish Dholakia
|
a9fdae0d23
|
test(test_streaming.py): refactor testing
|
2023-11-29 19:58:04 -08:00 |
|
Krrish Dholakia
|
c312ac4ca8
|
fix(main.py): don't pass stream to petals
|
2023-11-29 19:58:04 -08:00 |
|
Krrish Dholakia
|
1f5a1122fc
|
fix(replicate.py): fix custom prompt formatting
|
2023-11-29 19:44:09 -08:00 |
|
ishaan-jaff
|
c05da0797b
|
(feat) Embedding: Async Azure
|
2023-11-29 19:43:47 -08:00 |
|
ishaan-jaff
|
53554bae85
|
(test) aembedding
|
2023-11-29 19:36:42 -08:00 |
|
ishaan-jaff
|
10e21ae978
|
(test) aembedding
|
2023-11-29 19:35:32 -08:00 |
|
ishaan-jaff
|
3891462b29
|
(fix) router: azure/embedding support
|
2023-11-29 19:06:36 -08:00 |
|
ishaan-jaff
|
305faab542
|
(test) router:get_available_deployment
|
2023-11-29 17:54:41 -08:00 |
|
ishaan-jaff
|
7a38a45d62
|
(test) test weighted selection router
|
2023-11-29 17:30:18 -08:00 |
|
ishaan-jaff
|
48416f8018
|
(test) add rpm to load test profiling
|
2023-11-29 17:14:34 -08:00 |
|
Krrish Dholakia
|
ab76daa90b
|
fix(bedrock.py): support ai21 / bedrock streaming
|
2023-11-29 16:35:06 -08:00 |
|
Krrish Dholakia
|
7175cac1de
|
test(test_custom_logger.py): comment out streaming/success test due to pytest build issues
|
2023-11-29 12:18:02 -08:00 |
|
Krrish Dholakia
|
2b437a2699
|
fix(utils.py): return last streaming chunk
|
2023-11-29 12:11:08 -08:00 |
|
Krrish Dholakia
|
a9ed768991
|
fix(azure.py): fix error handling for openai/azure streaming
|
2023-11-29 11:52:24 -08:00 |
|
Krrish Dholakia
|
b6bc75e27a
|
fix(utils.py): fix parallel tool calling when streaming
|
2023-11-29 10:56:21 -08:00 |
|
Krrish Dholakia
|
3154cd1758
|
test(test_exceptions.py): cleanup testing
|
2023-11-28 21:11:49 -08:00 |
|
ishaan-jaff
|
4c4dde60c7
|
(ci/cd) test exceptions
|
2023-11-28 21:11:12 -08:00 |
|
Krrish Dholakia
|
bb1267eb07
|
fix(router.py): fix exponential backoff to use retry-after if present in headers
|
2023-11-28 17:25:03 -08:00 |
|
ishaan-jaff
|
f4a7760ea1
|
(feat+test) use passed OpenAI client
|
2023-11-28 16:09:10 -08:00 |
|
ishaan-jaff
|
01c38d37fa
|
(test) pass client to Azure completion
|
2023-11-28 15:57:11 -08:00 |
|
ishaan-jaff
|
1a0b683a8e
|
(test) using client: compleition
|
2023-11-28 15:44:56 -08:00 |
|
Krrish Dholakia
|
094144de58
|
fix(router.py): removing model id before making call
|
2023-11-28 10:09:45 -08:00 |
|
Krrish Dholakia
|
150b91d476
|
fix(utils.py): fix streaming on-success logging
|
2023-11-28 09:11:47 -08:00 |
|
Krrish Dholakia
|
c4aea7432f
|
build: adding debug logs to gitignore
|
2023-11-27 22:05:07 -08:00 |
|
ishaan-jaff
|
c52861906b
|
(test) router cooldowns
|
2023-11-27 22:03:02 -08:00 |
|
Krrish Dholakia
|
e8331a4647
|
fix(utils.py): azure tool calling streaming
|
2023-11-27 19:07:38 -08:00 |
|
Krrish Dholakia
|
4cdd930fa2
|
fix(stream_chunk_builder): adding support for tool calling in completion counting
|
2023-11-27 18:39:47 -08:00 |
|
Krrish Dholakia
|
04f745e314
|
fix(router.py): speed improvements to the router
|
2023-11-27 17:35:26 -08:00 |
|
ishaan-jaff
|
18d9222945
|
(test) litellm using uuid for model name
|
2023-11-27 16:37:39 -08:00 |
|
ishaan-jaff
|
37f3b1edd1
|
(ci/cd) run again
|
2023-11-27 11:12:11 -08:00 |
|
ishaan-jaff
|
a4754f9098
|
(test) competion
|
2023-11-27 10:13:46 -08:00 |
|
ishaan-jaff
|
26938f697e
|
(feat) completion:debugging - show raw POST request
|
2023-11-27 10:13:37 -08:00 |
|
ishaan-jaff
|
90687d51f1
|
(test) sagemaker add chat models
|
2023-11-27 10:11:56 -08:00 |
|
Krrish Dholakia
|
fa713abfc3
|
fix(router.py): check for fallbacks in completion params for router
|
2023-11-25 18:46:45 -08:00 |
|
ishaan-jaff
|
2c4f52b6b1
|
(fix) linting
|
2023-11-25 15:46:07 -08:00 |
|
ishaan-jaff
|
0b56d584aa
|
(test) openai exception mapping for openai.AuthenticationError
|
2023-11-25 15:43:46 -08:00 |
|
ishaan-jaff
|
afaca3f819
|
(fix) acompletion: Raise same error as completion
|
2023-11-25 15:33:46 -08:00 |
|
Krrish Dholakia
|
ab0bc87427
|
fix(router.py): check if fallbacks is none
|
2023-11-25 14:58:07 -08:00 |
|