Commit graph

976 commits

Author SHA1 Message Date
Krrish Dholakia
2c1103247d test: fix flaky test 2023-12-01 15:46:39 -08:00
Krrish Dholakia
4da20ac0b9 fix(proxy_server.py): accept max tokens as int 2023-12-01 15:19:34 -08:00
Krrish Dholakia
ac4429c35f test(test_completion.py): commenting out flaky test 2023-12-01 15:06:54 -08:00
Krrish Dholakia
7b99cfe6b9 test(test_token_counter.py): include testing for function calls 2023-11-30 18:24:21 -08:00
Krrish Dholakia
032f71adb2 fix(router.py): support cloudflare ai gateway for azure models on router 2023-11-30 14:09:06 -08:00
ishaan-jaff
936c27c9ee (fix +test) langfuse log metadata 2023-11-30 13:53:43 -08:00
Krrish Dholakia
a5c99d8594 fix(langfuse.py): fix metadata logging for langfuse 2023-11-30 13:45:34 -08:00
Krrish Dholakia
82553e8aac fix(azure.py): fix linting errors 2023-11-30 13:32:29 -08:00
Krrish Dholakia
4f07c8565a feat(main.py): add support for azure-openai via cloudflare ai gateway 2023-11-30 13:19:49 -08:00
Krrish Dholakia
01c7e18f31 fix(utils.py): include system fingerprint in streaming response object 2023-11-30 08:45:52 -08:00
Krrish Dholakia
0d200cd8dc feat(main.py): allow updating model cost via completion() 2023-11-29 20:14:39 -08:00
ishaan-jaff
4c1ef4e270 (chore) fix testing 2023-11-29 20:05:13 -08:00
Krrish Dholakia
a9fdae0d23 test(test_streaming.py): refactor testing 2023-11-29 19:58:04 -08:00
Krrish Dholakia
c312ac4ca8 fix(main.py): don't pass stream to petals 2023-11-29 19:58:04 -08:00
Krrish Dholakia
1f5a1122fc fix(replicate.py): fix custom prompt formatting 2023-11-29 19:44:09 -08:00
ishaan-jaff
c05da0797b (feat) Embedding: Async Azure 2023-11-29 19:43:47 -08:00
ishaan-jaff
53554bae85 (test) aembedding 2023-11-29 19:36:42 -08:00
ishaan-jaff
10e21ae978 (test) aembedding 2023-11-29 19:35:32 -08:00
ishaan-jaff
3891462b29 (fix) router: azure/embedding support 2023-11-29 19:06:36 -08:00
ishaan-jaff
305faab542 (test) router:get_available_deployment 2023-11-29 17:54:41 -08:00
ishaan-jaff
7a38a45d62 (test) test weighted selection router 2023-11-29 17:30:18 -08:00
ishaan-jaff
48416f8018 (test) add rpm to load test profiling 2023-11-29 17:14:34 -08:00
Krrish Dholakia
ab76daa90b fix(bedrock.py): support ai21 / bedrock streaming 2023-11-29 16:35:06 -08:00
Krrish Dholakia
7175cac1de test(test_custom_logger.py): comment out streaming/success test due to pytest build issues 2023-11-29 12:18:02 -08:00
Krrish Dholakia
2b437a2699 fix(utils.py): return last streaming chunk 2023-11-29 12:11:08 -08:00
Krrish Dholakia
a9ed768991 fix(azure.py): fix error handling for openai/azure streaming 2023-11-29 11:52:24 -08:00
Krrish Dholakia
b6bc75e27a fix(utils.py): fix parallel tool calling when streaming 2023-11-29 10:56:21 -08:00
Krrish Dholakia
3154cd1758 test(test_exceptions.py): cleanup testing 2023-11-28 21:11:49 -08:00
ishaan-jaff
4c4dde60c7 (ci/cd) test exceptions 2023-11-28 21:11:12 -08:00
Krrish Dholakia
bb1267eb07 fix(router.py): fix exponential backoff to use retry-after if present in headers 2023-11-28 17:25:03 -08:00
ishaan-jaff
f4a7760ea1 (feat+test) use passed OpenAI client 2023-11-28 16:09:10 -08:00
ishaan-jaff
01c38d37fa (test) pass client to Azure completion 2023-11-28 15:57:11 -08:00
ishaan-jaff
1a0b683a8e (test) using client: compleition 2023-11-28 15:44:56 -08:00
Krrish Dholakia
094144de58 fix(router.py): removing model id before making call 2023-11-28 10:09:45 -08:00
Krrish Dholakia
150b91d476 fix(utils.py): fix streaming on-success logging 2023-11-28 09:11:47 -08:00
Krrish Dholakia
c4aea7432f build: adding debug logs to gitignore 2023-11-27 22:05:07 -08:00
ishaan-jaff
c52861906b (test) router cooldowns 2023-11-27 22:03:02 -08:00
Krrish Dholakia
e8331a4647 fix(utils.py): azure tool calling streaming 2023-11-27 19:07:38 -08:00
Krrish Dholakia
4cdd930fa2 fix(stream_chunk_builder): adding support for tool calling in completion counting 2023-11-27 18:39:47 -08:00
Krrish Dholakia
04f745e314 fix(router.py): speed improvements to the router 2023-11-27 17:35:26 -08:00
ishaan-jaff
18d9222945 (test) litellm using uuid for model name 2023-11-27 16:37:39 -08:00
ishaan-jaff
37f3b1edd1 (ci/cd) run again 2023-11-27 11:12:11 -08:00
ishaan-jaff
a4754f9098 (test) competion 2023-11-27 10:13:46 -08:00
ishaan-jaff
26938f697e (feat) completion:debugging - show raw POST request 2023-11-27 10:13:37 -08:00
ishaan-jaff
90687d51f1 (test) sagemaker add chat models 2023-11-27 10:11:56 -08:00
Krrish Dholakia
fa713abfc3 fix(router.py): check for fallbacks in completion params for router 2023-11-25 18:46:45 -08:00
ishaan-jaff
2c4f52b6b1 (fix) linting 2023-11-25 15:46:07 -08:00
ishaan-jaff
0b56d584aa (test) openai exception mapping for openai.AuthenticationError 2023-11-25 15:43:46 -08:00
ishaan-jaff
afaca3f819 (fix) acompletion: Raise same error as completion 2023-11-25 15:33:46 -08:00
Krrish Dholakia
ab0bc87427 fix(router.py): check if fallbacks is none 2023-11-25 14:58:07 -08:00