Ayush Somani
|
0bf3de0a01
|
(docs) update docker compose docs
|
2023-12-06 10:37:45 +05:30 |
|
Ayush Somani
|
b0fcc5af67
|
Merge branch 'main' into main
|
2023-12-06 10:32:59 +05:30 |
|
ishaan-jaff
|
642c62f7b7
|
(fix) proxy: better debugging when -debug is on
|
2023-12-05 18:19:15 -08:00 |
|
ishaan-jaff
|
48aa00d6c0
|
(fix) proxy - clean up print statement
|
2023-12-05 18:14:01 -08:00 |
|
ishaan-jaff
|
27d7d7ba9c
|
(feat) proxy cli, better description of config yaml param
|
2023-12-05 18:11:29 -08:00 |
|
ishaan-jaff
|
56acded998
|
(router) better debugging using config.yaml
|
2023-12-05 18:07:27 -08:00 |
|
ishaan-jaff
|
155e99b9a3
|
(fix) prox cli: remove deprecated param
|
2023-12-05 18:04:08 -08:00 |
|
Krrish Dholakia
|
39bb972168
|
bump: version 1.10.9 → 1.10.10
|
2023-12-05 18:01:58 -08:00 |
|
ishaan-jaff
|
cb52e3347e
|
(fix) proxy: make yaml load print_verbose
|
2023-12-05 18:00:00 -08:00 |
|
Krrish Dholakia
|
648d41c96f
|
fix(sagemaker.py): prompt templating fixes
|
2023-12-05 17:47:44 -08:00 |
|
ishaan-jaff
|
0eccc1b1f8
|
(test) router: call 1 deployment
|
2023-12-05 17:35:35 -08:00 |
|
ishaan-jaff
|
1fa9ddd739
|
(chore) linting fix
|
2023-12-05 17:29:09 -08:00 |
|
ishaan-jaff
|
e788a34da4
|
(chore) linting fix
|
2023-12-05 17:26:03 -08:00 |
|
ishaan-jaff
|
4d5313343b
|
(feat) proxy /embedding check 1 deploy call
|
2023-12-05 17:22:07 -08:00 |
|
ishaan-jaff
|
3af4f7fb0f
|
(fix) proxy: /chat/cmp - check 1 deployment
|
2023-12-05 17:19:48 -08:00 |
|
ishaan-jaff
|
a532cf14ae
|
(feat) router - track original deployment names
|
2023-12-05 17:19:48 -08:00 |
|
Krrish Dholakia
|
e4fae5a3e8
|
docs(aws_sagemaker.md): support for all huggingface/jumpstart modelsn
|
2023-12-05 16:59:37 -08:00 |
|
Krrish Dholakia
|
1addaecf48
|
docs(aws_sagemaker.md): add hf_model_name to sagemaker docs
|
2023-12-05 16:58:19 -08:00 |
|
ishaan-jaff
|
703a575a5d
|
(test) call 1 deployment on router
|
2023-12-05 16:56:38 -08:00 |
|
ishaan-jaff
|
bb6a1968b3
|
(fix) router - allow user to call 1 deployment
|
2023-12-05 16:56:38 -08:00 |
|
Krrish Dholakia
|
ff949490de
|
docs(input.md): add hf_model_name to docs
|
2023-12-05 16:56:18 -08:00 |
|
Krrish Dholakia
|
88845dddb1
|
fix(sagemaker.py): bring back llama2 templating for sagemaker
|
2023-12-05 16:42:19 -08:00 |
|
Krrish Dholakia
|
54d8a9df3f
|
fix(sagemaker.py): enable passing hf model name for prompt template
|
2023-12-05 16:31:59 -08:00 |
|
Krrish Dholakia
|
a38504ff1b
|
fix(sagemaker.py): fix meta llama model name for sagemaker custom deployment
|
2023-12-05 16:23:03 -08:00 |
|
Krrish Dholakia
|
3c60682eb4
|
fix(sagemaker.py): accept all amazon neuron llama2 models
|
2023-12-05 16:19:28 -08:00 |
|
Krrish Dholakia
|
01fc7f1931
|
fix(sagemaker.py): add support for amazon neuron llama models
|
2023-12-05 16:18:20 -08:00 |
|
ishaan-jaff
|
d2dab362df
|
(fix) proxy debugging display Init API key
|
2023-12-05 16:08:17 -08:00 |
|
Krrish Dholakia
|
b4c78c7b9e
|
fix(utils.py): support sagemaker llama2 custom endpoints
|
2023-12-05 16:05:15 -08:00 |
|
ishaan-jaff
|
09c2c1610d
|
bump: version 1.10.8 → 1.10.9
|
2023-12-05 15:37:39 -08:00 |
|
ishaan-jaff
|
c4bda13820
|
(fix) sagemaker Llama-2 70b
|
2023-12-05 15:32:17 -08:00 |
|
Krrish Dholakia
|
68ca2a28d4
|
docs: adds redis url to router + proxy docs
|
2023-12-05 15:08:00 -08:00 |
|
Krrish Dholakia
|
4d7ff1b33b
|
fix(proxy_server.py): don't override exceptions if they're of type httpexception
|
2023-12-05 14:33:28 -08:00 |
|
ishaan-jaff
|
c717ed4d05
|
(test) router: test async embedding + embedding
|
2023-12-05 14:28:23 -08:00 |
|
ishaan-jaff
|
3ff57493f4
|
(test) router: openai async, sync, stream, no stream
|
2023-12-05 14:21:37 -08:00 |
|
ishaan-jaff
|
bc70a6fba8
|
(test) router: add tests for azure completion, acompletion
|
2023-12-05 13:59:27 -08:00 |
|
ishaan-jaff
|
0d1b42eda5
|
(test) azure - test async + sync embedding
|
2023-12-05 13:35:05 -08:00 |
|
Krrish Dholakia
|
d606a9cb4c
|
refactor(router.py): linting fixes
|
2023-12-05 13:33:44 -08:00 |
|
ishaan-jaff
|
63939c0a11
|
(fix) linting
|
2023-12-05 13:30:12 -08:00 |
|
ishaan-jaff
|
1463cc6023
|
(test) router Azure regular chat completion call
|
2023-12-05 13:28:07 -08:00 |
|
ishaan-jaff
|
4e3040b357
|
(chore) linting fix
|
2023-12-05 13:23:35 -08:00 |
|
ishaan-jaff
|
e579918dd9
|
(test) Router: Test Azure acompletion, stream
|
2023-12-05 13:22:27 -08:00 |
|
Krrish Dholakia
|
58ab0a3f03
|
fix(router.py): fix cache init
|
2023-12-05 12:54:27 -08:00 |
|
ishaan-jaff
|
5829227d86
|
(test) router streaming + azure
|
2023-12-05 12:54:00 -08:00 |
|
ishaan-jaff
|
3f84ab04c4
|
(fix) router: Azure Client Init
|
2023-12-05 12:54:00 -08:00 |
|
ishaan-jaff
|
d9f083b5f8
|
(fix) router: remove misleading print statement
|
2023-12-05 12:54:00 -08:00 |
|
Krrish Dholakia
|
397eefabe1
|
test: remove local test
|
2023-12-05 12:45:52 -08:00 |
|
Krrish Dholakia
|
a9b50a12c5
|
bump: version 1.10.7 → 1.10.8
|
2023-12-05 12:38:42 -08:00 |
|
Krrish Dholakia
|
2a02fcbffb
|
fix(utils.py): map cohere finish reasons
|
2023-12-05 12:38:18 -08:00 |
|
Krrish Dholakia
|
55b34f969c
|
bump: version 1.10.6 → 1.10.7
|
2023-12-05 12:26:44 -08:00 |
|
Krrish Dholakia
|
ef7795add6
|
fix(utils.py): set text if empty string
|
2023-12-05 12:26:44 -08:00 |
|