2221 commits

Author SHA1 Message Date
Krrish Dholakia
bab36c2c6f work for hf inference endpoint 2023-09-11 18:37:55 -07:00
ishaan-jaff
56bd8c1c52 olla upgrades, fix streaming, add non streaming resp 2023-09-09 14:07:13 -07:00
ishaan-jaff
599be6a374 raise vllm error 2023-09-08 15:27:01 -07:00
Krrish Dholakia
6b3cb18983 fix linting issues 2023-09-06 20:43:59 -07:00
Krrish Dholakia
35cf6ef0a1 batch completions for vllm now works too 2023-09-06 19:26:19 -07:00
ishaan-jaff
8b3b682000 add replicate pricing 2023-09-06 18:14:34 -07:00
Krrish Dholakia
4cfcabd919 adding support for vllm 2023-09-06 18:07:44 -07:00
ishaan-jaff
04f8b20651 fix linting errors @krrishdholakia 2023-09-06 16:38:42 -07:00
Krrish Dholakia
0ace48d719 update custom prompt template function 2023-09-06 13:14:36 -07:00
Krrish Dholakia
44f71aa321 logging replicate response logs 2023-09-06 11:28:40 -07:00
ishaan-jaff
1da6026622 add flan + vicuna + fix replicate errors 2023-09-06 11:23:58 -07:00
Krrish Dholakia
48ee4a08ac updates 2023-09-06 11:21:48 -07:00
ishaan-jaff
d4c4a138ca add replicate support for max_tokens 2023-09-06 10:38:21 -07:00
ishaan-jaff
bc9b629726 add Replicate Error class 2023-09-06 10:25:40 -07:00
ishaan-jaff
1c61b7b229 add replicate streaming 2023-09-06 10:23:13 -07:00
ishaan-jaff
c45b132675 use replicate http requests instead 2023-09-06 09:43:05 -07:00
Krrish Dholakia
3d6836417e adding prompt template for falcon 180b 2023-09-06 08:44:13 -07:00
Krrish Dholakia
074f6dbfaf fixing linting errors 2023-09-05 14:52:57 -07:00
Krrish Dholakia
af33a85043 only use tgai's prompt template for llama2 instruct models 2023-09-05 12:25:52 -07:00
Krrish Dholakia
64f3d3c56e prompt formatting for together ai llama2 models 2023-09-05 11:57:13 -07:00
ishaan-jaff
fe4caf5c3d baseten client mapping 2023-09-04 15:41:37 -07:00
ishaan-jaff
79acfb4dab fix aleph alpha client init 2023-09-04 15:14:09 -07:00
Krrish Dholakia
2384806cfd adding first-party + custom prompt templates for huggingface 2023-09-04 14:54:09 -07:00
ishaan-jaff
a474b89779 clean up hugging face completion() 2023-09-04 14:41:06 -07:00
ishaan-jaff
2bf9ee4ecf v0 bedrock support 2023-09-04 12:40:40 -07:00
ishaan-jaff
126830f08a allow users to set AWS_REGION_NAME 2023-09-04 11:57:22 -07:00
ishaan-jaff
e03d442e8f add optional params for llama-2 2023-09-04 11:41:20 -07:00
ishaan-jaff
4a4ee51df3 working sagemaker support 2023-09-04 11:30:34 -07:00
ishaan-jaff
138c26d98d v0 add sagemaker 2023-09-04 11:02:20 -07:00
ishaan-jaff
38564ddc82 clean out AI21 Init Client calls 2023-09-04 10:08:53 -07:00
ishaan-jaff
f2b0fa90ab remove init for together_ai completion calls 2023-09-04 09:59:24 -07:00
ishaan-jaff
bc065f08df remove init AnthropicClient for completion calls 2023-09-04 09:34:15 -07:00
ishaan-jaff
09ae510a58 use api_base instead of custom_api_base 2023-09-02 17:11:30 -07:00
Krrish Dholakia
83b8af8567 adding support for aleph alpha 2023-09-02 13:15:41 -07:00
Krrish Dholakia
14d4c7ead2 update baseten handler to handle TGI calls 2023-08-30 19:14:48 -07:00
Krrish Dholakia
b4b2dbf005 clean up print statements 2023-08-30 16:11:49 -07:00
Krrish Dholakia
fcaf514546 updates 2023-08-30 16:05:42 -07:00
Krrish Dholakia
0ea59702fd add huggingface 2023-08-30 16:05:33 -07:00
Krrish Dholakia
daa949a539 return logprobs for hf models 2023-08-30 15:16:26 -07:00
Krrish Dholakia
546ad43b15 adding context window exceeded error to huggingface 2023-08-29 16:46:04 -07:00
Krrish Dholakia
509120bf61 add context window exceeded error for anthropic 2023-08-29 16:28:07 -07:00
Krrish Dholakia
436e8eadb2 adding coverage for ai21 2023-08-29 13:32:20 -07:00
Krrish Dholakia
f11599e50c add coverage for rate limit errors to togetherai 2023-08-29 12:54:56 -07:00
Krrish Dholakia
e8eb92c108 new logger client 2023-08-28 14:59:00 -07:00
Krrish Dholakia
d542066d4b fixes to streaming for ai21, baseten, and openai text completions 2023-08-28 09:42:51 -07:00
ishaan-jaff
b713acb0a4 formatting improvements 2023-08-28 09:20:50 -07:00
ishaan-jaff
70b323e0f5 anthropic fixes 2023-08-28 09:17:29 -07:00
ishaan-jaff
65941644dc anthropic py fixes 2023-08-28 09:15:29 -07:00
Ishaan Jaff
8c35ffe884 Merge branch 'main' into fix-streaming-anthropic-2 2023-08-28 09:05:51 -07:00
Krrish Dholakia
0ac17646d9 fix anthropic streaming 2023-08-26 18:54:51 -07:00