Ishaan Jaff
|
a965c4dfb1
|
Update proxy_server_config.yaml
|
2023-11-30 09:26:54 -08:00 |
|
Krrish Dholakia
|
062ede96e3
|
refactor(proxy_server.py): fix linting issues
|
2023-11-30 09:24:59 -08:00 |
|
ishaan-jaff
|
cbc358ef77
|
(docs) proxy: Render deploys
|
2023-11-30 09:21:07 -08:00 |
|
Krrish Dholakia
|
b4b7acdb72
|
fix(utils.py): fix azure completion cost calculation
|
2023-11-30 09:19:35 -08:00 |
|
Krrish Dholakia
|
7ee089b5ca
|
fix(proxy_server.py): provide an endpoint that gives model-specific info from proxy
|
2023-11-30 09:08:19 -08:00 |
|
Ishaan Jaff
|
f742d14582
|
Update Dockerfile
|
2023-11-30 09:03:04 -08:00 |
|
Ishaan Jaff
|
57df87120a
|
Update Dockerfile
|
2023-11-30 09:00:23 -08:00 |
|
Ishaan Jaff
|
6fddd078b9
|
Update Dockerfile
|
2023-11-30 08:57:44 -08:00 |
|
Ishaan Jaff
|
96151cf260
|
Update Dockerfile
|
2023-11-30 08:57:10 -08:00 |
|
ishaan-jaff
|
a56d4a1e83
|
(fix) proxy: print cwd()
|
2023-11-30 08:52:06 -08:00 |
|
Ishaan Jaff
|
1772da92c0
|
Update Dockerfile
|
2023-11-30 08:46:38 -08:00 |
|
Krrish Dholakia
|
8ab31b99a9
|
bump: version 1.8.0 → 1.8.1
|
2023-11-30 08:45:52 -08:00 |
|
Krrish Dholakia
|
01c7e18f31
|
fix(utils.py): include system fingerprint in streaming response object
|
2023-11-30 08:45:52 -08:00 |
|
Ishaan Jaff
|
78421d89aa
|
Update Dockerfile
|
2023-11-30 08:44:25 -08:00 |
|
Ishaan Jaff
|
2a2fffdd09
|
Update Dockerfile
|
2023-11-30 08:40:24 -08:00 |
|
ishaan-jaff
|
ecdc5bdad6
|
(dos) config.yaml
|
2023-11-30 08:34:36 -08:00 |
|
Ishaan Jaff
|
f7da28eb34
|
Merge pull request #958 from rishabgit/main
docs: adds gpt-3.5-turbo-1106 in supported models
|
2023-11-30 08:26:55 -08:00 |
|
Rishab Mallick
|
5693626ba5
|
docs: adds gpt-3.5-turbo-1106 in supported models
|
2023-11-30 13:32:51 +05:30 |
|
ishaan-jaff
|
9331f67c69
|
(docs) add pplx-online
|
2023-11-29 21:18:42 -08:00 |
|
ishaan-jaff
|
08fd41d43d
|
(feat) add pplx-online llms
|
2023-11-29 21:15:36 -08:00 |
|
Krrish Dholakia
|
e98fac66da
|
fix(utils.py): fix register model cost map
|
2023-11-29 21:12:29 -08:00 |
|
Krrish Dholakia
|
03fb143a87
|
docs(simple_proxy.md): adding librechat tutorial to docs
|
2023-11-29 21:03:17 -08:00 |
|
ishaan-jaff
|
e5ce45dc2c
|
(cleanup) proxy/health
|
2023-11-29 20:15:52 -08:00 |
|
Krrish Dholakia
|
583571a43a
|
bump: version 1.7.21 → 1.8.0
|
2023-11-29 20:14:54 -08:00 |
|
Krrish Dholakia
|
0d200cd8dc
|
feat(main.py): allow updating model cost via completion()
|
2023-11-29 20:14:39 -08:00 |
|
ishaan-jaff
|
4c1ef4e270
|
(chore) fix testing
|
2023-11-29 20:05:13 -08:00 |
|
Krrish Dholakia
|
50cc4a8595
|
fix(proxy_server.py): have /health and /routes be router endpoints
|
2023-11-29 19:59:56 -08:00 |
|
ishaan-jaff
|
4ed5b3b46d
|
(chore) linting fix
|
2023-11-29 19:58:12 -08:00 |
|
Krrish Dholakia
|
a9fdae0d23
|
test(test_streaming.py): refactor testing
|
2023-11-29 19:58:04 -08:00 |
|
Krrish Dholakia
|
7b53bf7d9a
|
bump: version 1.7.20 → 1.7.21
|
2023-11-29 19:58:04 -08:00 |
|
Krrish Dholakia
|
c312ac4ca8
|
fix(main.py): don't pass stream to petals
|
2023-11-29 19:58:04 -08:00 |
|
ishaan-jaff
|
9780efca4b
|
(feat) router: async client Azure, OpenAI
|
2023-11-29 19:45:08 -08:00 |
|
Krrish Dholakia
|
2760cdcce5
|
bump: version 1.7.19 → 1.7.20
|
2023-11-29 19:44:16 -08:00 |
|
Krrish Dholakia
|
1f5a1122fc
|
fix(replicate.py): fix custom prompt formatting
|
2023-11-29 19:44:09 -08:00 |
|
ishaan-jaff
|
c05da0797b
|
(feat) Embedding: Async Azure
|
2023-11-29 19:43:47 -08:00 |
|
ishaan-jaff
|
53554bae85
|
(test) aembedding
|
2023-11-29 19:36:42 -08:00 |
|
ishaan-jaff
|
10e21ae978
|
(test) aembedding
|
2023-11-29 19:35:32 -08:00 |
|
ishaan-jaff
|
09caab549a
|
(feat) async embeddings: OpenAI
|
2023-11-29 19:35:08 -08:00 |
|
ishaan-jaff
|
3891462b29
|
(fix) router: azure/embedding support
|
2023-11-29 19:06:36 -08:00 |
|
ishaan-jaff
|
e58b3d5df0
|
(feat) add azure/gpt-4-1106-preview
|
2023-11-29 18:21:31 -08:00 |
|
ishaan-jaff
|
7bcc23e8e9
|
(fix) router: set default rpm/tpm when not set
|
2023-11-29 18:13:27 -08:00 |
|
ishaan-jaff
|
c1914a01bc
|
(docs) routing
|
2023-11-29 18:09:39 -08:00 |
|
ishaan-jaff
|
f299120394
|
(docs) router
|
2023-11-29 18:08:00 -08:00 |
|
ishaan-jaff
|
305faab542
|
(test) router:get_available_deployment
|
2023-11-29 17:54:41 -08:00 |
|
ishaan-jaff
|
23af756531
|
(feat) router: random pick based on tpm/rpm
|
2023-11-29 17:54:06 -08:00 |
|
ishaan-jaff
|
2c74dbed17
|
(chore) util: remove_model_id
|
2023-11-29 17:30:33 -08:00 |
|
ishaan-jaff
|
7a38a45d62
|
(test) test weighted selection router
|
2023-11-29 17:30:18 -08:00 |
|
ishaan-jaff
|
48416f8018
|
(test) add rpm to load test profiling
|
2023-11-29 17:14:34 -08:00 |
|
ishaan-jaff
|
088d2bc081
|
(fix) use weighted shuffle when rpm set
|
2023-11-29 17:13:11 -08:00 |
|
Krrish Dholakia
|
38efc21f81
|
bump: version 1.7.18 → 1.7.19
|
2023-11-29 16:50:11 -08:00 |
|