Commit graph

345 commits

Author SHA1 Message Date
Krrish Dholakia
fff0228c20 fix(factory.py): support togethercomputer codellama pt 2023-12-06 19:02:58 -08:00
Krrish Dholakia
0295509b3b fix(factory.py): fix claude 2.1 prompt template to handle system, assistant, user prompt 2023-12-06 18:02:06 -08:00
Krrish Dholakia
f1c1ec8523 fix(bedrock.py): fix embeddings call 2023-12-06 14:16:00 -08:00
Krrish Dholakia
b24c9b4cbf refactor: fix linting 2023-12-06 13:27:40 -08:00
Krrish Dholakia
d962d5d4c0 fix(bedrock.py): adding support for cohere embeddings 2023-12-06 13:25:18 -08:00
Krrish Dholakia
102de97960 refactor: fix linting errors 2023-12-06 11:46:15 -08:00
Krrish Dholakia
94f065f83c feat(sagemaker.py): support huggingface embedding models 2023-12-06 11:41:38 -08:00
Krrish Dholakia
648d41c96f fix(sagemaker.py): prompt templating fixes 2023-12-05 17:47:44 -08:00
Krrish Dholakia
ff949490de docs(input.md): add hf_model_name to docs 2023-12-05 16:56:18 -08:00
Krrish Dholakia
88845dddb1 fix(sagemaker.py): bring back llama2 templating for sagemaker 2023-12-05 16:42:19 -08:00
Krrish Dholakia
54d8a9df3f fix(sagemaker.py): enable passing hf model name for prompt template 2023-12-05 16:31:59 -08:00
Krrish Dholakia
a38504ff1b fix(sagemaker.py): fix meta llama model name for sagemaker custom deployment 2023-12-05 16:23:03 -08:00
Krrish Dholakia
3c60682eb4 fix(sagemaker.py): accept all amazon neuron llama2 models 2023-12-05 16:19:28 -08:00
Krrish Dholakia
01fc7f1931 fix(sagemaker.py): add support for amazon neuron llama models 2023-12-05 16:18:20 -08:00
Krrish Dholakia
b4c78c7b9e fix(utils.py): support sagemaker llama2 custom endpoints 2023-12-05 16:05:15 -08:00
Krrish Dholakia
71e64c34cb fix(huggingface_restapi.py): raise better exceptions for unprocessable hf responses 2023-12-05 07:28:21 -08:00
Krish Dholakia
b90fcbdac4
Merge pull request #970 from nbaldwin98/fixing-replicate-sys-prompt
fix system prompts for replicate
2023-12-04 16:39:44 -08:00
ishaan-jaff
32ecc1a677 (feat) replicate/deployments: add POST Req view 2023-12-04 13:43:03 -08:00
chabala98
c2e2e927fb fix system prompts for replicate 2023-12-01 13:16:35 +01:00
ishaan-jaff
1081d4c766 (feat) aembedding: return raw openai response 2023-11-30 20:02:47 -08:00
Krrish Dholakia
c473abde49 fix(azure.py): logging fix 2023-11-30 14:13:40 -08:00
Krrish Dholakia
032f71adb2 fix(router.py): support cloudflare ai gateway for azure models on router 2023-11-30 14:09:06 -08:00
Krrish Dholakia
82553e8aac fix(azure.py): fix linting errors 2023-11-30 13:32:29 -08:00
Krrish Dholakia
4f07c8565a feat(main.py): add support for azure-openai via cloudflare ai gateway 2023-11-30 13:19:49 -08:00
ishaan-jaff
4ed5b3b46d (chore) linting fix 2023-11-29 19:58:12 -08:00
Krrish Dholakia
1f5a1122fc fix(replicate.py): fix custom prompt formatting 2023-11-29 19:44:09 -08:00
ishaan-jaff
c05da0797b (feat) Embedding: Async Azure 2023-11-29 19:43:47 -08:00
ishaan-jaff
09caab549a (feat) async embeddings: OpenAI 2023-11-29 19:35:08 -08:00
Krrish Dholakia
ab76daa90b fix(bedrock.py): support ai21 / bedrock streaming 2023-11-29 16:35:06 -08:00
ishaan-jaff
9bf603889f (fix) azure: remove max retries before completion 2023-11-29 16:09:31 -08:00
Krrish Dholakia
a9ed768991 fix(azure.py): fix error handling for openai/azure streaming 2023-11-29 11:52:24 -08:00
ishaan-jaff
0f0ddcc0fb (fix) using AzureOpenAI client 2023-11-28 17:17:40 -08:00
ishaan-jaff
8609694b49 (fix) completion:openai-pop out max_retries from completion kwargs 2023-11-28 17:09:58 -08:00
ishaan-jaff
f4a7760ea1 (feat+test) use passed OpenAI client 2023-11-28 16:09:10 -08:00
ishaan-jaff
400a268934 (feat) completion: Azure allow users to pass client to router 2023-11-28 15:56:52 -08:00
ishaan-jaff
7914623fbc (feat) allow users to pass azure client for acmompletion 2023-11-28 15:44:56 -08:00
Krrish Dholakia
e8331a4647 fix(utils.py): azure tool calling streaming 2023-11-27 19:07:38 -08:00
ishaan-jaff
9d259d08e7 (linting) fix 2023-11-27 10:27:51 -08:00
ishaan-jaff
26938f697e (feat) completion:debugging - show raw POST request 2023-11-27 10:13:37 -08:00
ishaan-jaff
f7ae01da8a (feat) completion:sagemaker - support chat models 2023-11-27 10:11:10 -08:00
ishaan-jaff
e407b185ee (feat) completion:sagemaker - better debugging 2023-11-27 09:08:20 -08:00
ishaan-jaff
afaca3f819 (fix) acompletion: Raise same error as completion 2023-11-25 15:33:46 -08:00
Krrish Dholakia
6d9f7b8f9d fix: fix nlp cloud streaming 2023-11-25 13:45:23 -08:00
Krrish Dholakia
30f47d3169 bump: version 1.7.0 → 1.7.1 2023-11-25 12:34:28 -08:00
Krrish Dholakia
620633ec28 fix(openai.py): fix linting issues 2023-11-25 12:21:29 -08:00
Krrish Dholakia
dac76a4861 fix(utils.py): fix embedding response output parsing 2023-11-25 12:06:57 -08:00
ishaan-jaff
2e08acba93 (feat) embedding: better logging 2023-11-25 11:10:06 -08:00
ishaan-jaff
77a9eb8a77 (feat) logging: for embedding openai.py 2023-11-25 11:10:06 -08:00
ishaan-jaff
23466107a7 (feat) 10x faster embeddings 2023-11-24 17:02:57 -08:00
ishaan-jaff
824136667f (fix) add azure/ to model. TY Krrish ! 2023-11-23 21:44:08 -08:00