Commit graph

1984 commits

Author SHA1 Message Date
Krrish Dholakia
e396fcb55c fix(main.py): pass user_id + encoding_format for logging + to openai/azure 2023-12-12 15:46:44 -08:00
Krrish Dholakia
d059d1b101 fix(sagemaker.py): debug streaming 2023-12-12 15:45:07 -08:00
ishaan-jaff
d5bb71fb48 (fix) proxy : cleanup router 2023-12-11 22:12:22 -08:00
Krrish Dholakia
66e0c06476 fix(proxy/utils.py): fix db writes on retry 2023-12-11 21:14:12 -08:00
Krrish Dholakia
47d0884c0c test(test_custom_callback_unit.py): adding unit tests for custom callbacks + fixing related bugs 2023-12-11 11:44:09 -08:00
ishaan-jaff
4216617688 (test) assert metadata in callbacks 2023-12-11 09:47:38 -08:00
Krrish Dholakia
a5fddf55dd fix(proxy_server.py): fix /model/new adding new model issue 2023-12-09 22:44:11 -08:00
ishaan-jaff
587649d37f (fix) proxy /model/info 2023-12-09 22:35:09 -08:00
Krrish Dholakia
0fbe0f017a fix(proxy_server.py): fix key gen error 2023-12-09 22:04:59 -08:00
Krrish Dholakia
01394a7bbd fix: minor bug fixes 2023-12-09 21:24:52 -08:00
Krrish Dholakia
8eb7dc6393 fix(proxy_server.py): support for streaming 2023-12-09 16:23:04 -08:00
ishaan-jaff
827cc66c15 (feat) proxy - cache allow bool 2023-12-09 15:55:38 -08:00
ishaan-jaff
9951c6d569 (fix) proxy - show logging when Cache created 2023-12-09 15:21:04 -08:00
Krrish Dholakia
043d0c0c89 fix(proxy_server.py): fix linting errors 2023-12-09 12:17:33 -08:00
Krrish Dholakia
2048ef445d fix(proxy_server.py): fix pydantic version errors 2023-12-09 12:09:49 -08:00
Krrish Dholakia
a8f5b4b69c fix(proxy_server.py): add custom json helper func to work across multiple pydantic versions 2023-12-09 11:13:03 -08:00
Krrish Dholakia
612be8c844 fix(proxy_server.py): add logging around /key/generate 2023-12-09 10:26:12 -08:00
ishaan-jaff
f97ebdb4bd (docs) proxy - OTEL + traceloop 2023-12-08 18:39:52 -08:00
ishaan-jaff
1e3d909408 (fix) linting 2023-12-08 17:26:14 -08:00
Krrish Dholakia
9c6584a376 fix(proxy_server.py): enable pre+post-call hooks and max parallel request limits 2023-12-08 17:11:30 -08:00
ishaan-jaff
de6880dc09 (feat) pass model_info, proxy_server_request to callback 2023-12-08 14:26:18 -08:00
Krrish Dholakia
8460924f1d fix(proxy_server.py): initializing sentry in proxy logging before db init 2023-12-08 11:58:16 -08:00
Krrish Dholakia
4ff969bf6d feat(proxy_server.py): add sentry logging for db read/writes 2023-12-08 11:40:19 -08:00
Krrish Dholakia
7c962637f5 fix(utils.py): fix cost calculation to handle tool input 2023-12-08 09:53:46 -08:00
Krrish Dholakia
8d5c3ea36c fix(requirements.txt): add ollama async_generator in requirements.txt 2023-12-08 09:20:04 -08:00
Krrish Dholakia
30204497e0 fix: setting cache responses on proxy 2023-12-07 20:39:40 -08:00
Krrish Dholakia
665939bc48 fix(proxy_server.py): add call hooks pre+post completion and embedding calls 2023-12-07 20:35:32 -08:00
Krrish Dholakia
c739b29eb0 feat(proxy_server.py): enable background health checks 2023-12-07 19:40:06 -08:00
ishaan-jaff
f744445db4 (fix) make print_verbose non blocking 2023-12-07 17:31:32 -08:00
Krrish Dholakia
69c34493ce fix(router.py): fix default caching response value 2023-12-07 13:44:31 -08:00
ishaan-jaff
c6d19046f2 (test) proxy - async custom logger 2023-12-07 13:19:17 -08:00
Krrish Dholakia
4d1c294837 docs(config.md): adding docs on parallel request rate limiting 2023-12-07 11:27:48 -08:00
Krrish Dholakia
e835bbd88b refactor(proxy_server.py): linting fix 2023-12-06 22:49:30 -08:00
Krrish Dholakia
583de6ab92 fix(bedrock.py): fix output format for cohere embeddings 2023-12-06 22:47:01 -08:00
ishaan-jaff
a7584a2b75 (test) unset model_group_alias_map after test 2023-12-06 20:35:14 -08:00
ishaan-jaff
13a0a9f22c (feat) proxy use model_group_alias_map 2023-12-06 20:23:24 -08:00
Krrish Dholakia
2af2a17bc8 test: fix proxy server testing 2023-12-06 18:38:53 -08:00
ishaan-jaff
aa5bc26f59 (feat) proxy: protect health endpoint 2023-12-06 18:14:54 -08:00
Krrish Dholakia
b7e75b940a test: fix config import for proxy testing 2023-12-06 17:40:38 -08:00
ishaan-jaff
0532c038ff (feat) proxy print set callbacks 2023-12-06 17:16:24 -08:00
Krrish Dholakia
37f717d0f7 fix(proxy_server.py): make headers json serializable 2023-12-06 17:09:02 -08:00
Krrish Dholakia
8cbcc85496 fix(proxy_server.py): enable rate limiting concurrent user requests 2023-12-06 15:11:05 -08:00
ishaan-jaff
d7f81cd9aa (feat) proxy - custom on failure callback 2023-12-06 14:43:47 -08:00
ishaan-jaff
58f9ef8960 (chore) print verbose 2023-12-06 14:14:20 -08:00
ishaan-jaff
d628e6b86b (fix) print statements 2023-12-06 14:11:23 -08:00
ishaan-jaff
60317f1476 (fix) proxy /model/new writing to config 2023-12-06 14:11:23 -08:00
Krrish Dholakia
0dff2ebf55 fix(proxy_server.py): allow worker config to just be the config filepath 2023-12-06 14:03:25 -08:00
ishaan-jaff
6b8d1a20f5 (feat) proxy: use async_callback function 2023-12-06 13:51:24 -08:00
Krrish Dholakia
a18bdb3f2e fix(bedrock.py): adding support for cohere embeddings 2023-12-06 13:25:18 -08:00
ishaan-jaff
9703afdef1 (fix) linting 2023-12-06 13:14:26 -08:00