Commit graph

1961 commits

Author SHA1 Message Date
Krrish Dholakia
7c962637f5 fix(utils.py): fix cost calculation to handle tool input 2023-12-08 09:53:46 -08:00
Krrish Dholakia
8d5c3ea36c fix(requirements.txt): add ollama async_generator in requirements.txt 2023-12-08 09:20:04 -08:00
Krrish Dholakia
30204497e0 fix: setting cache responses on proxy 2023-12-07 20:39:40 -08:00
Krrish Dholakia
665939bc48 fix(proxy_server.py): add call hooks pre+post completion and embedding calls 2023-12-07 20:35:32 -08:00
Krrish Dholakia
c739b29eb0 feat(proxy_server.py): enable background health checks 2023-12-07 19:40:06 -08:00
ishaan-jaff
f744445db4 (fix) make print_verbose non blocking 2023-12-07 17:31:32 -08:00
Krrish Dholakia
69c34493ce fix(router.py): fix default caching response value 2023-12-07 13:44:31 -08:00
ishaan-jaff
c6d19046f2 (test) proxy - async custom logger 2023-12-07 13:19:17 -08:00
Krrish Dholakia
4d1c294837 docs(config.md): adding docs on parallel request rate limiting 2023-12-07 11:27:48 -08:00
Krrish Dholakia
e835bbd88b refactor(proxy_server.py): linting fix 2023-12-06 22:49:30 -08:00
Krrish Dholakia
583de6ab92 fix(bedrock.py): fix output format for cohere embeddings 2023-12-06 22:47:01 -08:00
ishaan-jaff
a7584a2b75 (test) unset model_group_alias_map after test 2023-12-06 20:35:14 -08:00
ishaan-jaff
13a0a9f22c (feat) proxy use model_group_alias_map 2023-12-06 20:23:24 -08:00
Krrish Dholakia
2af2a17bc8 test: fix proxy server testing 2023-12-06 18:38:53 -08:00
ishaan-jaff
aa5bc26f59 (feat) proxy: protect health endpoint 2023-12-06 18:14:54 -08:00
Krrish Dholakia
b7e75b940a test: fix config import for proxy testing 2023-12-06 17:40:38 -08:00
ishaan-jaff
0532c038ff (feat) proxy print set callbacks 2023-12-06 17:16:24 -08:00
Krrish Dholakia
37f717d0f7 fix(proxy_server.py): make headers json serializable 2023-12-06 17:09:02 -08:00
Krrish Dholakia
8cbcc85496 fix(proxy_server.py): enable rate limiting concurrent user requests 2023-12-06 15:11:05 -08:00
ishaan-jaff
d7f81cd9aa (feat) proxy - custom on failure callback 2023-12-06 14:43:47 -08:00
ishaan-jaff
58f9ef8960 (chore) print verbose 2023-12-06 14:14:20 -08:00
ishaan-jaff
d628e6b86b (fix) print statements 2023-12-06 14:11:23 -08:00
ishaan-jaff
60317f1476 (fix) proxy /model/new writing to config 2023-12-06 14:11:23 -08:00
Krrish Dholakia
0dff2ebf55 fix(proxy_server.py): allow worker config to just be the config filepath 2023-12-06 14:03:25 -08:00
ishaan-jaff
6b8d1a20f5 (feat) proxy: use async_callback function 2023-12-06 13:51:24 -08:00
Krrish Dholakia
a18bdb3f2e fix(bedrock.py): adding support for cohere embeddings 2023-12-06 13:25:18 -08:00
ishaan-jaff
9703afdef1 (fix) linting 2023-12-06 13:14:26 -08:00
ishaan-jaff
e5d03889fa (feat) /v1/model/info 2023-12-06 13:03:29 -08:00
Krrish Dholakia
6d1a5089e3 refactor: fix linting errors 2023-12-06 11:46:15 -08:00
ishaan-jaff
edb3f82629 (fix) proxy - move new health check import 2023-12-06 10:13:06 -08:00
Ishaan Jaff
f3d8825290 Merge pull request #1023 from PSU3D0/speedup_health_endpoint
(feat) Speedup health endpoint
2023-12-06 09:52:13 -08:00
Krrish Dholakia
1546b85bc7 feat(proxy_server.py): adding /model/delete endpoint 2023-12-05 22:38:38 -08:00
ishaan-jaff
905f366f43 (fix) router - allow users to call a specific_model explicit 2023-12-05 21:57:00 -08:00
Frank Colson
497602606f Speedup health endpoint 2023-12-05 22:09:01 -07:00
ishaan-jaff
1365a536f3 (fix) proxy: better debugging when -debug is on 2023-12-05 18:19:15 -08:00
ishaan-jaff
70a4fbcfeb (fix) proxy - clean up print statement 2023-12-05 18:14:01 -08:00
ishaan-jaff
457c47b0c9 (fix) proxy: make yaml load print_verbose 2023-12-05 18:00:00 -08:00
ishaan-jaff
bc6a97495e (feat) proxy /embedding check 1 deploy call 2023-12-05 17:22:07 -08:00
ishaan-jaff
0c4190bec7 (fix) proxy: /chat/cmp - check 1 deployment 2023-12-05 17:19:48 -08:00
ishaan-jaff
862d4bdef1 (fix) proxy debugging display Init API key 2023-12-05 16:08:17 -08:00
Krrish Dholakia
4328a66a3a fix(proxy_server.py): don't override exceptions if they're of type httpexception 2023-12-05 14:33:28 -08:00
Krrish Dholakia
94abb14b99 fix(_redis.py): support additional params for redis 2023-12-05 12:16:51 -08:00
Krrish Dholakia
a858a2752d refactor(proxy_server.py): clean up print statements in proxy server 2023-12-05 11:14:16 -08:00
Krrish Dholakia
9318a29fb1 fix: fix proxy testing 2023-12-05 11:13:09 -08:00
Krrish Dholakia
4848b41abe feat(proxy_server.py): allow user to override api key auth 2023-12-04 18:32:58 -08:00
Krrish Dholakia
d1a525b6c9 feat(utils.py): add async success callbacks for custom functions 2023-12-04 16:42:40 -08:00
ishaan-jaff
080dfe2bd5 (fix) proxy: custom callbacks 2023-12-04 11:29:26 -08:00
ishaan-jaff
7a83691e46 (feat) proxy: set custom headers in metadata 2023-12-04 10:49:01 -08:00
Krrish Dholakia
813bb15a00 fix(proxy_server.py): fix /key/generate post endpoint 2023-12-04 10:44:13 -08:00
ishaan-jaff
89972239a6 (fix) proxy 2023-12-04 10:23:55 -08:00