Commit graph

2095 commits

Author SHA1 Message Date
ishaan-jaff
1eed36eb1d add replicate pricing 2023-09-06 18:14:34 -07:00
Krrish Dholakia
7290a972e5 adding support for vllm 2023-09-06 18:07:44 -07:00
ishaan-jaff
f4955431c8 docs update 2023-09-06 17:16:24 -07:00
ishaan-jaff
99a3d18a36 show pricing for tg ai completion 2023-09-06 17:10:49 -07:00
ishaan-jaff
955ba98093 rename max_tokens.json 2023-09-06 16:28:17 -07:00
ishaan-jaff
e30a70f0d0 add experimental together_computer cost calc 2023-09-06 16:08:44 -07:00
Krrish Dholakia
311bfb7bb7 logging replicate response logs 2023-09-06 11:28:40 -07:00
ishaan-jaff
a2d425f7de add flan + vicuna + fix replicate errors 2023-09-06 11:23:58 -07:00
ishaan-jaff
99d3f8be81 add vicuna translation 2023-09-06 11:14:24 -07:00
Krrish Dholakia
3defed9a24 bump version 2023-09-06 11:05:11 -07:00
ishaan-jaff
870c6592b7 send optional_params for llama2-70b chat replicate 2023-09-06 11:01:39 -07:00
ishaan-jaff
89ebdab2b3 add replicate support for max_tokens 2023-09-06 10:38:21 -07:00
Krrish Dholakia
1d10d3198d updates to exception mapping 2023-09-06 10:36:22 -07:00
ishaan-jaff
74e0e90620 add replicate streaming 2023-09-06 10:23:13 -07:00
Krrish Dholakia
6132471463 update docs on together ai 2023-09-06 08:26:05 -07:00
ishaan-jaff
54a77ec457 remove install_and_import remove petals 2023-09-05 15:06:24 -07:00
Krrish Dholakia
5ae420317e adding first-party + custom prompt templates for huggingface 2023-09-04 14:54:09 -07:00
Krrish Dholakia
57dcd1fee9 update exception mapping and get model cost map 2023-09-04 11:53:20 -07:00
ishaan-jaff
44f44ad5a3 add optional params for llama-2 2023-09-04 11:41:20 -07:00
yujong-lee
74c89c123a Remove print 2023-09-03 17:47:36 +09:00
ishaan-jaff
4a994dc498 use api_base instead of custom_api_base 2023-09-02 17:11:30 -07:00
ishaan-jaff
bd0a118552 test without id 2023-09-02 16:54:04 -07:00
Krrish Dholakia
a07251a514 fix azure streaming 2023-09-02 16:10:19 -07:00
Krrish Dholakia
78cd8d6e27 update to logging 2023-09-02 15:45:45 -07:00
Krrish Dholakia
a972676655 adding support for aleph alpha 2023-09-02 13:15:41 -07:00
Krrish Dholakia
5cd84b709e fix streaming cache logic 2023-09-01 16:56:07 -07:00
Krrish Dholakia
deb2f89a49 updates to tests 2023-09-01 16:01:28 -07:00
ishaan-jaff
042b182c31 use dynamic config args 2023-09-01 15:39:30 -07:00
Krrish Dholakia
455794c628 updates to logging 2023-09-01 14:38:52 -07:00
Krrish Dholakia
db5e7a3515 fixes to litedebugger 2023-09-01 13:22:16 -07:00
Krrish Dholakia
1ec46dd4f1 cleanup 2023-09-01 12:08:46 -07:00
Krrish Dholakia
bfa79ea3c0 fixes to core logging 2023-09-01 11:51:49 -07:00
Krrish Dholakia
1f2732a835 update model split tests with ui 2023-08-31 16:42:40 -07:00
Krrish Dholakia
0f973402ee add completion call id 2023-08-31 12:54:38 -07:00
ishaan-jaff
4f72dbc69c add langfuse integration 2023-08-31 11:04:17 -07:00
ishaan-jaff
98e0010df1 litellm call id's in responses when client true 2023-08-31 10:09:10 -07:00
ishaan-jaff
7d5d6751dc write to tmp dir 2023-08-31 08:55:19 -07:00
yujong-lee
e969358656 Cleanup test, at least pass with openai 2023-08-31 20:33:34 +09:00
yujong-lee
3f9c710ef4 Fix! 2023-08-31 20:18:17 +09:00
Krrish Dholakia
4927e5879f update baseten handler to handle TGI calls 2023-08-30 19:14:48 -07:00
Krrish Dholakia
1385c26aff return logprobs for hf models 2023-08-30 15:16:26 -07:00
Krrish Dholakia
eb33e1fa71 adding testing to make sure litedebugger is opt-in only 2023-08-30 09:55:33 -07:00
Ishaan Jaff
19f77b7ac4 Merge pull request #240 from estill01/async_iterator_fix
Add async iterator to fix/enable streaming responses
2023-08-29 19:08:35 -07:00
Krrish Dholakia
259de2d117 adding context window exceeded error to huggingface 2023-08-29 16:46:04 -07:00
Krrish Dholakia
7c4cc5942d adding contextwindow exceeded error mapping for replicate 2023-08-29 16:36:31 -07:00
Krrish Dholakia
c00cb299fc add context window exceeded error for anthropic 2023-08-29 16:28:07 -07:00
Krrish Dholakia
b40d9f79b5 add context window exceeded error mapping to openai 2023-08-29 16:08:00 -07:00
estill01
53c2d7b3f9 Add async iterator to fix/enable streaming responses 2023-08-29 15:44:44 -07:00
Krrish Dholakia
be271b58e8 adding new exception mapping details to documentation 2023-08-29 15:40:16 -07:00
Krrish Dholakia
efbde50a03 adding coverage for openrouter in exception mapping 2023-08-29 15:30:24 -07:00