ishaan-jaff | 99515c2e25 | (fix) linting | 2023-11-20 17:00:20 -08:00
ishaan-jaff | 11ec2710c6 | (fix) completion: max_retries using OpenAI client | 2023-11-20 16:57:37 -08:00
canada4663 | 74ed37c4f2 | bedrock embedding changes pre-testing | 2023-11-18 21:00:06 -08:00
ishaan-jaff | 32f22adf8b | (feat) openai improve logging post_call | 2023-11-17 15:51:27 -08:00
Krrish Dholakia | 17bb1184bd | fix(main.py): fix linting issue | 2023-11-17 15:45:00 -08:00
Krrish Dholakia | 0ab6b2451d | fix(acompletion): support client side timeouts + raise exceptions correctly for async calls | 2023-11-17 15:39:47 -08:00
Krrish Dholakia | 7ef1014e59 | fix(factory.py): for ollama models check if it's instruct or not before applying prompt template | 2023-11-16 15:45:08 -08:00
Krrish Dholakia | 51bf637656 | feat: global client for sync + async calls (openai + Azure only) | 2023-11-16 14:44:13 -08:00
Krrish Dholakia | d7f7694848 | fix(openai.py): fix linting issues | 2023-11-16 12:57:53 -08:00
Krrish Dholakia | a94c09c13c | fix(openai.py): handling extra headers | 2023-11-16 12:48:21 -08:00
Krrish Dholakia | f99a161d98 | fix(azure.py): fix linting errors | 2023-11-16 12:15:50 -08:00
Krrish Dholakia | bf0f8b824c | fix(azure.py): use openai client sdk for handling sync+async calling | 2023-11-16 12:08:12 -08:00
Krrish Dholakia | a23c0a2599 | fix(openai.py): fix linting issues | 2023-11-16 11:01:28 -08:00
Krrish Dholakia | bb51216846 | fix(openai.py): supporting openai client sdk for handling sync + async calls (incl. for openai-compatible apis) | 2023-11-16 10:35:03 -08:00
Ishaan Jaff | d6ad62d793 | Merge pull request #811 from dchristian3188/bedrock-llama: Bedrock llama | 2023-11-16 07:57:50 -08:00
Krrish Dholakia | 93aae8669d | fix(openai.py): using openai sdk for completion calls | 2023-11-15 20:25:27 -08:00
Krrish Dholakia | da68e1ea81 | perf(openai.py): extending request timeout | 2023-11-15 18:45:45 -08:00
Krrish Dholakia | a575ff867f | fix(openai.py): fix linting errors | 2023-11-15 18:33:01 -08:00
Krrish Dholakia | d090562dc1 | fix(openai.py): move chat completions post call to requests | 2023-11-15 18:29:07 -08:00
Krrish Dholakia | fc8b54eb73 | fix(openai.py): switch back to using requests instead of httpx | 2023-11-15 18:25:21 -08:00
Krrish Dholakia | 2750a26633 | fix(openai.py): fix timeout errors | 2023-11-15 18:03:14 -08:00
Krrish Dholakia | a3d280baa3 | test: cleanup testing | 2023-11-15 17:57:14 -08:00
Krrish Dholakia | 9fd7c5b343 | test: set request timeout at request level | 2023-11-15 17:42:31 -08:00
Krrish Dholakia | d6e142428b | fix(openai-+-azure): async calling | 2023-11-15 17:03:02 -08:00
Krrish Dholakia | 03efc9185e | fix(huggingface_restapi.py): async implementation | 2023-11-15 16:54:15 -08:00
Krrish Dholakia | bcea28e2e4 | fix(utils): fixing exception mapping | 2023-11-15 15:51:17 -08:00
Krrish Dholakia | a59494571f | fix(huggingface_restapi.py): fix linting errors | 2023-11-15 15:34:21 -08:00
Krrish Dholakia | 1a705bfbcb | refactor(huggingface_restapi.py): moving async completion + streaming to real async calls | 2023-11-15 15:14:21 -08:00
Krrish Dholakia | 8c104e9c6a | fix(azure.py-+-proxy_server.py): fix function calling response object + support router on proxy | 2023-11-15 13:15:16 -08:00
Krrish Dholakia | efe81032f4 | fix: fix linting errors | 2023-11-14 21:51:39 -08:00
Krrish Dholakia | 59eaeba92a | fix(router.py): enabling retrying with expo backoff (without tenacity) for router | 2023-11-14 20:57:51 -08:00
Krrish Dholakia | 1738341dcb | fix(main.py): misrouting ollama models to nlp cloud | 2023-11-14 18:55:08 -08:00
ishaan-jaff | f650be4fee | (feat) completion debug view HF POST request | 2023-11-14 17:57:41 -08:00
ishaan-jaff | e82b8ed7e2 | (feat) debug ollama POST request | 2023-11-14 17:53:48 -08:00
ishaan-jaff | 7c317b78eb | (feat) add ability to view POST requests from litellm.completion() | 2023-11-14 17:27:20 -08:00
Krrish Dholakia | a7222f257c | fix(main.py): keep client consistent across calls + exponential backoff retry on ratelimit errors | 2023-11-14 16:26:05 -08:00
ishaan-jaff | 75cf3b3555 | (fix) AI21 exception mapping - raise error when status !=200 | 2023-11-14 15:01:22 -08:00
ishaan-jaff | 0779dfbbd6 | (fix) azure_ad_token | 2023-11-14 12:20:14 -08:00
ishaan-jaff | ed56f280d9 | (feat) add azure_ad_token | 2023-11-14 12:12:14 -08:00
Krrish Dholakia | e9e86cac79 | fix(palm.py): exception mapping bad requests / filtered responses | 2023-11-14 11:53:13 -08:00
Krrish Dholakia | b722594792 | test(utils.py): add logging and fix azure streaming | 2023-11-13 17:24:13 -08:00
David Christian | fe7e3ff038 | added support for bedrock llama models | 2023-11-13 15:41:21 -08:00
Krrish Dholakia | 8a3b771e50 | fix(tests): fixing response objects for testing | 2023-11-13 14:39:30 -08:00
Krrish Dholakia | d4de55b053 | fix(together_ai.py): exception mapping for tgai | 2023-11-13 13:17:15 -08:00
Krrish Dholakia | 45b6f8b853 | refactor: fixing linting issues | 2023-11-11 18:52:28 -08:00
Krrish Dholakia | ae35c13015 | refactor(ai21,-aleph-alpha,-ollama): making ai21, aleph-alpha, ollama compatible with openai v1 sdk | 2023-11-11 17:49:13 -08:00
Krrish Dholakia | c6ce3fedcd | fix(main.py): fix caching for router | 2023-11-11 17:45:23 -08:00
Krrish Dholakia | 4f42beb9d9 | refactor(huggingface,-anthropic,-replicate,-sagemaker): making huggingface, anthropic, replicate, sagemaker compatible openai v1 sdk | 2023-11-11 17:38:15 -08:00
Krrish Dholakia | 547598a134 | refactor(bedrock.py-+-cohere.py): making bedrock and cohere compatible with openai v1 sdk | 2023-11-11 17:33:19 -08:00
Krrish Dholakia | 39c2597c33 | refactor(azure.py): working azure completion calls with openai v1 sdk | 2023-11-11 16:44:39 -08:00