Commit graph

477 commits

Author SHA1 Message Date
ishaan-jaff
99515c2e25 (fix) linting 2023-11-20 17:00:20 -08:00
ishaan-jaff
11ec2710c6 (fix) completion: max_retries using OpenAI client 2023-11-20 16:57:37 -08:00
canada4663
74ed37c4f2 bedrock embedding changes pre-testing 2023-11-18 21:00:06 -08:00
ishaan-jaff
32f22adf8b (feat) openai improve logging post_call 2023-11-17 15:51:27 -08:00
Krrish Dholakia
17bb1184bd fix(main.py): fix linting issue 2023-11-17 15:45:00 -08:00
Krrish Dholakia
0ab6b2451d fix(acompletion): support client side timeouts + raise exceptions correctly for async calls 2023-11-17 15:39:47 -08:00
Krrish Dholakia
7ef1014e59 fix(factory.py): for ollama models check if it's instruct or not before applying prompt template 2023-11-16 15:45:08 -08:00
Krrish Dholakia
51bf637656 feat: global client for sync + async calls (openai + Azure only) 2023-11-16 14:44:13 -08:00
Krrish Dholakia
d7f7694848 fix(openai.py): fix linting issues 2023-11-16 12:57:53 -08:00
Krrish Dholakia
a94c09c13c fix(openai.py): handling extra headers 2023-11-16 12:48:21 -08:00
Krrish Dholakia
f99a161d98 fix(azure.py): fix linting errors 2023-11-16 12:15:50 -08:00
Krrish Dholakia
bf0f8b824c fix(azure.py): use openai client sdk for handling sync+async calling 2023-11-16 12:08:12 -08:00
Krrish Dholakia
a23c0a2599 fix(openai.py): fix linting issues 2023-11-16 11:01:28 -08:00
Krrish Dholakia
bb51216846 fix(openai.py): supporting openai client sdk for handling sync + async calls (incl. for openai-compatible apis) 2023-11-16 10:35:03 -08:00
Ishaan Jaff
d6ad62d793
Merge pull request #811 from dchristian3188/bedrock-llama
Bedrock llama
2023-11-16 07:57:50 -08:00
Krrish Dholakia
93aae8669d fix(openai.py): using openai sdk for completion calls 2023-11-15 20:25:27 -08:00
Krrish Dholakia
da68e1ea81 perf(openai.py): extending request timeout 2023-11-15 18:45:45 -08:00
Krrish Dholakia
a575ff867f fix(openai.py): fix linting errors 2023-11-15 18:33:01 -08:00
Krrish Dholakia
d090562dc1 fix(openai.py): move chat completions post call to requests 2023-11-15 18:29:07 -08:00
Krrish Dholakia
fc8b54eb73 fix(openai.py): switch back to using requests instead of httpx 2023-11-15 18:25:21 -08:00
Krrish Dholakia
2750a26633 fix(openai.py): fix timeout errors 2023-11-15 18:03:14 -08:00
Krrish Dholakia
a3d280baa3 test: cleanup testing 2023-11-15 17:57:14 -08:00
Krrish Dholakia
9fd7c5b343 test: set request timeout at request level 2023-11-15 17:42:31 -08:00
Krrish Dholakia
d6e142428b fix(openai-+-azure): async calling 2023-11-15 17:03:02 -08:00
Krrish Dholakia
03efc9185e fix(huggingface_restapi.py): async implementation 2023-11-15 16:54:15 -08:00
Krrish Dholakia
bcea28e2e4 fix(utils): fixing exception mapping 2023-11-15 15:51:17 -08:00
Krrish Dholakia
a59494571f fix(huggingface_restapi.py): fix linting errors 2023-11-15 15:34:21 -08:00
Krrish Dholakia
1a705bfbcb refactor(huggingface_restapi.py): moving async completion + streaming to real async calls 2023-11-15 15:14:21 -08:00
Krrish Dholakia
8c104e9c6a fix(azure.py-+-proxy_server.py): fix function calling response object + support router on proxy 2023-11-15 13:15:16 -08:00
Krrish Dholakia
efe81032f4 fix: fix linting errors 2023-11-14 21:51:39 -08:00
Krrish Dholakia
59eaeba92a fix(router.py): enabling retrying with expo backoff (without tenacity) for router 2023-11-14 20:57:51 -08:00
Krrish Dholakia
1738341dcb fix(main.py): misrouting ollama models to nlp cloud 2023-11-14 18:55:08 -08:00
ishaan-jaff
f650be4fee (feat) completion debug view HF POST request 2023-11-14 17:57:41 -08:00
ishaan-jaff
e82b8ed7e2 (feat) debug ollama POST request 2023-11-14 17:53:48 -08:00
ishaan-jaff
7c317b78eb (feat) add ability to view POST requests from litellm.completion() 2023-11-14 17:27:20 -08:00
Krrish Dholakia
a7222f257c fix(main.py): keep client consistent across calls + exponential backoff retry on ratelimit errors 2023-11-14 16:26:05 -08:00
ishaan-jaff
75cf3b3555 (fix) AI21 exception mapping - raise error when status !=200 2023-11-14 15:01:22 -08:00
ishaan-jaff
0779dfbbd6 (fix) azure_ad_token 2023-11-14 12:20:14 -08:00
ishaan-jaff
ed56f280d9 (feat) add azure_ad_token 2023-11-14 12:12:14 -08:00
Krrish Dholakia
e9e86cac79 fix(palm.py): exception mapping bad requests / filtered responses 2023-11-14 11:53:13 -08:00
Krrish Dholakia
b722594792 test(utils.py): add logging and fix azure streaming 2023-11-13 17:24:13 -08:00
David Christian
fe7e3ff038 added support for bedrock llama models 2023-11-13 15:41:21 -08:00
Krrish Dholakia
8a3b771e50 fix(tests): fixing response objects for testing 2023-11-13 14:39:30 -08:00
Krrish Dholakia
d4de55b053 fix(together_ai.py): exception mapping for tgai 2023-11-13 13:17:15 -08:00
Krrish Dholakia
45b6f8b853 refactor: fixing linting issues 2023-11-11 18:52:28 -08:00
Krrish Dholakia
ae35c13015 refactor(ai21,-aleph-alpha,-ollama): making ai21, aleph-alpha, ollama compatible with openai v1 sdk 2023-11-11 17:49:13 -08:00
Krrish Dholakia
c6ce3fedcd fix(main.py): fix caching for router 2023-11-11 17:45:23 -08:00
Krrish Dholakia
4f42beb9d9 refactor(huggingface,-anthropic,-replicate,-sagemaker): making huggingface, anthropic, replicate, sagemaker compatible openai v1 sdk 2023-11-11 17:38:15 -08:00
Krrish Dholakia
547598a134 refactor(bedrock.py-+-cohere.py): making bedrock and cohere compatible with openai v1 sdk 2023-11-11 17:33:19 -08:00
Krrish Dholakia
39c2597c33 refactor(azure.py): working azure completion calls with openai v1 sdk 2023-11-11 16:44:39 -08:00