Commit graph

1562 commits

Author SHA1 Message Date
Krrish Dholakia
ff949490de docs(input.md): add hf_model_name to docs 2023-12-05 16:56:18 -08:00
Krrish Dholakia
54d8a9df3f fix(sagemaker.py): enable passing hf model name for prompt template 2023-12-05 16:31:59 -08:00
Krrish Dholakia
b4c78c7b9e fix(utils.py): support sagemaker llama2 custom endpoints 2023-12-05 16:05:15 -08:00
Krrish Dholakia
68ca2a28d4 docs: adds redis url to router + proxy docs 2023-12-05 15:08:00 -08:00
ishaan-jaff
c717ed4d05 (test) router: test async embedding + embedding 2023-12-05 14:28:23 -08:00
ishaan-jaff
3ff57493f4 (test) router: openai async, sync, stream, no stream 2023-12-05 14:21:37 -08:00
ishaan-jaff
bc70a6fba8 (test) router: add tests for azure completion, acompletion 2023-12-05 13:59:27 -08:00
ishaan-jaff
0d1b42eda5 (test) azure - test async + sync embedding 2023-12-05 13:35:05 -08:00
ishaan-jaff
1463cc6023 (test) router Azure regular chat completion call 2023-12-05 13:28:07 -08:00
ishaan-jaff
e579918dd9 (test) Router: Test Azure acompletion, stream 2023-12-05 13:22:27 -08:00
ishaan-jaff
5829227d86 (test) router streaming + azure 2023-12-05 12:54:00 -08:00
Krrish Dholakia
b46c73a46e fix: fix proxy testing 2023-12-05 11:13:09 -08:00
ishaan-jaff
3bdf61f02a (test) test bedrock on router 2023-12-05 09:35:08 -08:00
ishaan-jaff
9ba17657ad (feat) init redis cache with **kwargs 2023-12-04 20:50:08 -08:00
Krrish Dholakia
030bd22078 feat(proxy_server.py): allow user to override api key auth 2023-12-04 18:32:58 -08:00
ishaan-jaff
fa5b453d39 (test) init router 2023-12-04 17:31:12 -08:00
ishaan-jaff
886b52d448 (test) init router clients 2023-12-04 17:31:12 -08:00
ishaan-jaff
cba98cf530 (test) init router with 4 clients 2023-12-04 17:31:12 -08:00
Krrish Dholakia
e0ccb281d8 feat(utils.py): add async success callbacks for custom functions 2023-12-04 16:42:40 -08:00
ishaan-jaff
a9905bcd0a (test) fix config 2023-12-04 16:00:04 -08:00
ishaan-jaff
9b3a0c69f5 (fix) config testing 2023-12-04 15:24:46 -08:00
ishaan-jaff
b7281825d3 (test) add streaming sagemaker test 2023-12-04 15:20:06 -08:00
ishaan-jaff
41365b6e47 (test) palm/stream 2023-12-04 15:07:07 -08:00
ishaan-jaff
533b93f714 (test) proxy: reading configs 2023-12-04 14:49:59 -08:00
ishaan-jaff
50284771b7 (test) test_reading proxy 2023-12-04 13:24:41 -08:00
ishaan-jaff
de4a7b719d (test) proxy: reading config.yaml 2023-12-04 13:16:19 -08:00
Krrish Dholakia
728b879c33 fix(utils.py): fix azure streaming bug 2023-12-04 12:38:22 -08:00
Krrish Dholakia
d0d8ba46c5 test(test_proxy_server_keys.py): fix relative import 2023-12-04 11:35:43 -08:00
Krrish Dholakia
63e55f1865 fix(proxy_server.py): fix /key/generate post endpoint 2023-12-04 10:44:13 -08:00
ishaan-jaff
ebd9404cfd (test) proxy: don't overwrite user 2023-12-04 10:19:35 -08:00
ishaan-jaff
93f5c266da (test) test completion: if 'user' passed to API 2023-12-04 09:50:36 -08:00
Krrish Dholakia
3b70a6864f test: commenting out local test 2023-12-02 20:53:24 -08:00
Krrish Dholakia
6c0eec4ff4 fix(main.py): fix pydantic warning for usage dict 2023-12-02 20:02:55 -08:00
Krrish Dholakia
5a4c054eef fix(proxy_server.py): add testing for model info being added to /model/new 2023-12-02 14:15:45 -08:00
ishaan-jaff
f897d5e849 (ci/cd) HF embedding 2023-12-02 10:57:33 -08:00
ishaan-jaff
6a0ebc1bb3 (chore) cleanup proxy 2023-12-02 09:37:14 -08:00
Krrish Dholakia
2c1103247d test: fix flaky test 2023-12-01 15:46:39 -08:00
Krrish Dholakia
4da20ac0b9 fix(proxy_server.py): accept max tokens as int 2023-12-01 15:19:34 -08:00
Krrish Dholakia
ac4429c35f test(test_completion.py): commenting out flaky test 2023-12-01 15:06:54 -08:00
Krrish Dholakia
7b99cfe6b9 test(test_token_counter.py): include testing for function calls 2023-11-30 18:24:21 -08:00
Krrish Dholakia
032f71adb2 fix(router.py): support cloudflare ai gateway for azure models on router 2023-11-30 14:09:06 -08:00
ishaan-jaff
936c27c9ee (fix +test) langfuse log metadata 2023-11-30 13:53:43 -08:00
Krrish Dholakia
a5c99d8594 fix(langfuse.py): fix metadata logging for langfuse 2023-11-30 13:45:34 -08:00
Krrish Dholakia
82553e8aac fix(azure.py): fix linting errors 2023-11-30 13:32:29 -08:00
Krrish Dholakia
4f07c8565a feat(main.py): add support for azure-openai via cloudflare ai gateway 2023-11-30 13:19:49 -08:00
Krrish Dholakia
01c7e18f31 fix(utils.py): include system fingerprint in streaming response object 2023-11-30 08:45:52 -08:00
Krrish Dholakia
0d200cd8dc feat(main.py): allow updating model cost via completion() 2023-11-29 20:14:39 -08:00
ishaan-jaff
4c1ef4e270 (chore) fix testing 2023-11-29 20:05:13 -08:00
Krrish Dholakia
a9fdae0d23 test(test_streaming.py): refactor testing 2023-11-29 19:58:04 -08:00
Krrish Dholakia
c312ac4ca8 fix(main.py): don't pass stream to petals 2023-11-29 19:58:04 -08:00