Commit graph

3875 commits

Author SHA1 Message Date
ishaan-jaff
cbc358ef77 (docs) proxy: Render deploys 2023-11-30 09:21:07 -08:00
Krrish Dholakia
b4b7acdb72 fix(utils.py): fix azure completion cost calculation 2023-11-30 09:19:35 -08:00
Krrish Dholakia
7ee089b5ca fix(proxy_server.py): provide an endpoint that gives model-specific info from proxy 2023-11-30 09:08:19 -08:00
Ishaan Jaff
f742d14582
Update Dockerfile 2023-11-30 09:03:04 -08:00
Ishaan Jaff
57df87120a
Update Dockerfile 2023-11-30 09:00:23 -08:00
Ishaan Jaff
6fddd078b9
Update Dockerfile 2023-11-30 08:57:44 -08:00
Ishaan Jaff
96151cf260
Update Dockerfile 2023-11-30 08:57:10 -08:00
ishaan-jaff
a56d4a1e83 (fix) proxy: print cwd() 2023-11-30 08:52:06 -08:00
Ishaan Jaff
1772da92c0
Update Dockerfile 2023-11-30 08:46:38 -08:00
Krrish Dholakia
8ab31b99a9 bump: version 1.8.0 → 1.8.1 2023-11-30 08:45:52 -08:00
Krrish Dholakia
01c7e18f31 fix(utils.py): include system fingerprint in streaming response object 2023-11-30 08:45:52 -08:00
Ishaan Jaff
78421d89aa
Update Dockerfile 2023-11-30 08:44:25 -08:00
Ishaan Jaff
2a2fffdd09
Update Dockerfile 2023-11-30 08:40:24 -08:00
ishaan-jaff
ecdc5bdad6 (dos) config.yaml 2023-11-30 08:34:36 -08:00
Ishaan Jaff
f7da28eb34
Merge pull request #958 from rishabgit/main
docs: adds gpt-3.5-turbo-1106 in supported models
2023-11-30 08:26:55 -08:00
Rishab Mallick
5693626ba5 docs: adds gpt-3.5-turbo-1106 in supported models 2023-11-30 13:32:51 +05:30
ishaan-jaff
9331f67c69 (docs) add pplx-online 2023-11-29 21:18:42 -08:00
ishaan-jaff
08fd41d43d (feat) add pplx-online llms 2023-11-29 21:15:36 -08:00
Krrish Dholakia
e98fac66da fix(utils.py): fix register model cost map 2023-11-29 21:12:29 -08:00
Krrish Dholakia
03fb143a87 docs(simple_proxy.md): adding librechat tutorial to docs 2023-11-29 21:03:17 -08:00
ishaan-jaff
e5ce45dc2c (cleanup) proxy/health 2023-11-29 20:15:52 -08:00
Krrish Dholakia
583571a43a bump: version 1.7.21 → 1.8.0 2023-11-29 20:14:54 -08:00
Krrish Dholakia
0d200cd8dc feat(main.py): allow updating model cost via completion() 2023-11-29 20:14:39 -08:00
ishaan-jaff
4c1ef4e270 (chore) fix testing 2023-11-29 20:05:13 -08:00
Krrish Dholakia
50cc4a8595 fix(proxy_server.py): have /health and /routes be router endpoints 2023-11-29 19:59:56 -08:00
ishaan-jaff
4ed5b3b46d (chore) linting fix 2023-11-29 19:58:12 -08:00
Krrish Dholakia
a9fdae0d23 test(test_streaming.py): refactor testing 2023-11-29 19:58:04 -08:00
Krrish Dholakia
7b53bf7d9a bump: version 1.7.20 → 1.7.21 2023-11-29 19:58:04 -08:00
Krrish Dholakia
c312ac4ca8 fix(main.py): don't pass stream to petals 2023-11-29 19:58:04 -08:00
ishaan-jaff
9780efca4b (feat) router: async client Azure, OpenAI 2023-11-29 19:45:08 -08:00
Krrish Dholakia
2760cdcce5 bump: version 1.7.19 → 1.7.20 2023-11-29 19:44:16 -08:00
Krrish Dholakia
1f5a1122fc fix(replicate.py): fix custom prompt formatting 2023-11-29 19:44:09 -08:00
ishaan-jaff
c05da0797b (feat) Embedding: Async Azure 2023-11-29 19:43:47 -08:00
ishaan-jaff
53554bae85 (test) aembedding 2023-11-29 19:36:42 -08:00
ishaan-jaff
10e21ae978 (test) aembedding 2023-11-29 19:35:32 -08:00
ishaan-jaff
09caab549a (feat) async embeddings: OpenAI 2023-11-29 19:35:08 -08:00
ishaan-jaff
3891462b29 (fix) router: azure/embedding support 2023-11-29 19:06:36 -08:00
ishaan-jaff
e58b3d5df0 (feat) add azure/gpt-4-1106-preview 2023-11-29 18:21:31 -08:00
ishaan-jaff
7bcc23e8e9 (fix) router: set default rpm/tpm when not set 2023-11-29 18:13:27 -08:00
ishaan-jaff
c1914a01bc (docs) routing 2023-11-29 18:09:39 -08:00
ishaan-jaff
f299120394 (docs) router 2023-11-29 18:08:00 -08:00
ishaan-jaff
305faab542 (test) router:get_available_deployment 2023-11-29 17:54:41 -08:00
ishaan-jaff
23af756531 (feat) router: random pick based on tpm/rpm 2023-11-29 17:54:06 -08:00
ishaan-jaff
2c74dbed17 (chore) util: remove_model_id 2023-11-29 17:30:33 -08:00
ishaan-jaff
7a38a45d62 (test) test weighted selection router 2023-11-29 17:30:18 -08:00
ishaan-jaff
48416f8018 (test) add rpm to load test profiling 2023-11-29 17:14:34 -08:00
ishaan-jaff
088d2bc081 (fix) use weighted shuffle when rpm set 2023-11-29 17:13:11 -08:00
Krrish Dholakia
38efc21f81 bump: version 1.7.18 → 1.7.19 2023-11-29 16:50:11 -08:00
Krrish Dholakia
61185aa12c fix(main.py): fix null finish reason issue for ollama 2023-11-29 16:50:11 -08:00
ishaan-jaff
69eca78000 (docs) simple proxy 2023-11-29 16:44:40 -08:00