Krrish Dholakia
|
072d93696c
|
remove print statement
|
2023-09-07 08:21:25 -07:00 |
|
ishaan-jaff
|
2880a7b6b4
|
allow users to pass custom timing for replicate
|
2023-09-06 18:32:40 -07:00 |
|
ishaan-jaff
|
8b3b682000
|
add replicate pricing
|
2023-09-06 18:14:34 -07:00 |
|
Krrish Dholakia
|
4cfcabd919
|
adding support for vllm
|
2023-09-06 18:07:44 -07:00 |
|
ishaan-jaff
|
bd77d5ac21
|
docs update
|
2023-09-06 17:16:24 -07:00 |
|
ishaan-jaff
|
1ba6b6761b
|
show pricing for tg ai completion
|
2023-09-06 17:10:49 -07:00 |
|
ishaan-jaff
|
bab27634a8
|
rename max_tokens.json
|
2023-09-06 16:28:17 -07:00 |
|
ishaan-jaff
|
fbd67bc24c
|
add experimental together_computer cost calc
|
2023-09-06 16:08:44 -07:00 |
|
Krrish Dholakia
|
44f71aa321
|
logging replicate response logs
|
2023-09-06 11:28:40 -07:00 |
|
ishaan-jaff
|
1da6026622
|
add flan + vicuna + fix replicate errors
|
2023-09-06 11:23:58 -07:00 |
|
ishaan-jaff
|
af60b2ba77
|
add vicuna translation
|
2023-09-06 11:14:24 -07:00 |
|
Krrish Dholakia
|
afcd6b28cc
|
bump version
|
2023-09-06 11:05:11 -07:00 |
|
ishaan-jaff
|
0ddda7c035
|
send optional_params for llama2-70b chat replicate
|
2023-09-06 11:01:39 -07:00 |
|
ishaan-jaff
|
d4c4a138ca
|
add replicate support for max_tokens
|
2023-09-06 10:38:21 -07:00 |
|
Krrish Dholakia
|
ef43141554
|
updates to exception mapping
|
2023-09-06 10:36:22 -07:00 |
|
ishaan-jaff
|
1c61b7b229
|
add replicate streaming
|
2023-09-06 10:23:13 -07:00 |
|
Krrish Dholakia
|
b4a9699138
|
update docs on together ai
|
2023-09-06 08:26:05 -07:00 |
|
ishaan-jaff
|
2a36f06763
|
remove install_and_import remove petals
|
2023-09-05 15:06:24 -07:00 |
|
Krrish Dholakia
|
2384806cfd
|
adding first-party + custom prompt templates for huggingface
|
2023-09-04 14:54:09 -07:00 |
|
Krrish Dholakia
|
73bb1b96e9
|
update exception mapping and get model cost map
|
2023-09-04 11:53:20 -07:00 |
|
ishaan-jaff
|
e03d442e8f
|
add optional params for llama-2
|
2023-09-04 11:41:20 -07:00 |
|
yujong-lee
|
44d2b2ca74
|
Remove print
|
2023-09-03 17:47:36 +09:00 |
|
ishaan-jaff
|
09ae510a58
|
use api_base instead of custom_api_base
|
2023-09-02 17:11:30 -07:00 |
|
ishaan-jaff
|
e6836985c8
|
test without id
|
2023-09-02 16:54:04 -07:00 |
|
Krrish Dholakia
|
56853aca53
|
fix azure streaming
|
2023-09-02 16:10:19 -07:00 |
|
Krrish Dholakia
|
895e3c1a94
|
update to logging
|
2023-09-02 15:45:45 -07:00 |
|
Krrish Dholakia
|
83b8af8567
|
adding support for aleph alpha
|
2023-09-02 13:15:41 -07:00 |
|
Krrish Dholakia
|
0e445e6e69
|
fix streaming cache logic
|
2023-09-01 16:56:07 -07:00 |
|
Krrish Dholakia
|
4d17c57373
|
updates to tests
|
2023-09-01 16:01:28 -07:00 |
|
ishaan-jaff
|
51e486441d
|
use dynamic config args
|
2023-09-01 15:39:30 -07:00 |
|
Krrish Dholakia
|
34ed4cc23c
|
updates to logging
|
2023-09-01 14:38:52 -07:00 |
|
Krrish Dholakia
|
0ff4515f20
|
fixes to litedebugger
|
2023-09-01 13:22:16 -07:00 |
|
Krrish Dholakia
|
1d73f05ae1
|
cleanup
|
2023-09-01 12:08:46 -07:00 |
|
Krrish Dholakia
|
fda6dba1ec
|
fixes to core logging
|
2023-09-01 11:51:49 -07:00 |
|
Krrish Dholakia
|
b44299cce7
|
update model split tests with ui
|
2023-08-31 16:42:40 -07:00 |
|
Krrish Dholakia
|
66bfd70253
|
add completion call id
|
2023-08-31 12:54:38 -07:00 |
|
ishaan-jaff
|
f466c443fe
|
add langfuse integration
|
2023-08-31 11:04:17 -07:00 |
|
ishaan-jaff
|
beac62b8fe
|
litellm call id's in responses when client true
|
2023-08-31 10:09:10 -07:00 |
|
ishaan-jaff
|
bb766c34ca
|
write to tmp dir
|
2023-08-31 08:55:19 -07:00 |
|
yujong-lee
|
2cc9074d09
|
Cleanup test, at least pass with openai
|
2023-08-31 20:33:34 +09:00 |
|
yujong-lee
|
119e75655f
|
Fix!
|
2023-08-31 20:18:17 +09:00 |
|
Krrish Dholakia
|
14d4c7ead2
|
update baseten handler to handle TGI calls
|
2023-08-30 19:14:48 -07:00 |
|
Krrish Dholakia
|
daa949a539
|
return logprobs for hf models
|
2023-08-30 15:16:26 -07:00 |
|
Krrish Dholakia
|
b90e5ed6db
|
adding testing to make sure litedebugger is opt-in only
|
2023-08-30 09:55:33 -07:00 |
|
Ishaan Jaff
|
6651cabd69
|
Merge pull request #240 from estill01/async_iterator_fix
Add async iterator to fix/enable streaming responses
|
2023-08-29 19:08:35 -07:00 |
|
Krrish Dholakia
|
546ad43b15
|
adding context window exceeded error to huggingface
|
2023-08-29 16:46:04 -07:00 |
|
Krrish Dholakia
|
ad493a3109
|
adding contextwindow exceeded error mapping for replicate
|
2023-08-29 16:36:31 -07:00 |
|
Krrish Dholakia
|
509120bf61
|
add context window exceeded error for anthropic
|
2023-08-29 16:28:07 -07:00 |
|
Krrish Dholakia
|
f7955d52b5
|
add context window exceeded error mapping to openai
|
2023-08-29 16:08:00 -07:00 |
|
estill01
|
4d009a94b9
|
Add async iterator to fix/enable streaming responses
|
2023-08-29 15:44:44 -07:00 |
|