Commit graph

523 commits

Author SHA1 Message Date
Krrish Dholakia
2231601d5a fix(ollama.py): fix async completion calls for ollama 2023-12-13 13:10:25 -08:00
Krrish Dholakia
66e0c06476 fix(proxy/utils.py): fix db writes on retry 2023-12-11 21:14:12 -08:00
Krrish Dholakia
47d0884c0c test(test_custom_callback_unit.py): adding unit tests for custom callbacks + fixing related bugs 2023-12-11 11:44:09 -08:00
Krrish Dholakia
8eb7dc6393 fix(proxy_server.py): support for streaming 2023-12-09 16:23:04 -08:00
Krrish Dholakia
9c6584a376 fix(proxy_server.py): enable pre+post-call hooks and max parallel request limits 2023-12-08 17:11:30 -08:00
Krrish Dholakia
b198234f78 fix(proxy/utils.py): correctly raise exceptions on db call fail 2023-12-08 13:35:42 -08:00
Krrish Dholakia
04fc583baf fix(proxy/utils.py): add backoff/retry logic to db read/writes 2023-12-08 13:34:31 -08:00
Krrish Dholakia
8460924f1d fix(proxy_server.py): initializing sentry in proxy logging before db init 2023-12-08 11:58:16 -08:00
Krrish Dholakia
4ff969bf6d feat(proxy_server.py): add sentry logging for db read/writes 2023-12-08 11:40:19 -08:00
Krrish Dholakia
316abe38a9 fix(proxy/utils.py): tracking db failed writes 2023-12-08 10:40:02 -08:00
Krrish Dholakia
665939bc48 fix(proxy_server.py): add call hooks pre+post completion and embedding calls 2023-12-07 20:35:32 -08:00
Krrish Dholakia
ae1b793ec7 docs(deploy.md): add docker instructions to deploy docs 2023-12-07 09:22:54 -08:00
Krrish Dholakia
545ee2e16f test: fix linting errors 2023-12-04 18:43:01 -08:00
Krrish Dholakia
4848b41abe feat(proxy_server.py): allow user to override api key auth 2023-12-04 18:32:58 -08:00
Krrish Dholakia
f337b6569e test: fix linting errors 2023-12-04 12:45:15 -08:00
Krrish Dholakia
813bb15a00 fix(proxy_server.py): fix /key/generate post endpoint 2023-12-04 10:44:13 -08:00
Krrish Dholakia
3560c9356e fix: fix linting issues 2023-12-02 19:30:03 -08:00
Krrish Dholakia
6015bff80b fix(proxy_server.py): hash keys 2023-12-02 19:30:03 -08:00
Krrish Dholakia
32cdd0a613 feat(proxy_server.py): tracking spend per api key 2023-11-24 15:14:06 -08:00
ishaan-jaff
865e3959f3 (feat) proxy: cost tracking add track_cost_callback 2023-11-24 05:53:23 -08:00
Krrish Dholakia
e4deb09eb6 fix(router.py): add support for context window fallbacks on router 2023-11-23 16:43:02 -08:00
ishaan-jaff
c64aad7335 (feat) proxy: cost tracking per completion request 2023-11-23 16:08:59 -08:00
Krrish Dholakia
17867e6e15 fix(factory.py): fixing llama-2 non-chat models prompt templating 2023-11-07 21:33:54 -08:00