Commit graph

4686 commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Krrish Dholakia | bdf29ca71f | fix(sagemaker.py): debug streaming | 2023-12-12 15:45:07 -08:00 |
| ishaan-jaff | 6d76878382 | (fix) pydantic: Field "model_list" has conflict with protected namespace "model_". | 2023-12-12 12:38:11 -08:00 |
| ishaan-jaff | f8c7528bff | (fix) proxy : cleanup router | 2023-12-11 22:12:22 -08:00 |
| Krrish Dholakia | 3e7604420e | fix(proxy/utils.py): fix db writes on retry | 2023-12-11 21:14:12 -08:00 |
| Krrish Dholakia | ea89a8a938 | test(test_custom_callback_unit.py): adding unit tests for custom callbacks + fixing related bugs | 2023-12-11 11:44:09 -08:00 |
| ishaan-jaff | 71eee5087d | (test) assert metadata in callbacks | 2023-12-11 09:47:38 -08:00 |
| ishaan-jaff | 43b0da3ebc | (docs) - proxy_config.yaml | 2023-12-11 09:06:27 -08:00 |
| Krrish Dholakia | 22f04e3b33 | fix(proxy_server.py): fix /model/new adding new model issue | 2023-12-09 22:44:11 -08:00 |
| ishaan-jaff | 127f25703a | (fix) proxy /model/info | 2023-12-09 22:35:09 -08:00 |
| Krrish Dholakia | 07b4c72a98 | fix(proxy_server.py): fix key gen error | 2023-12-09 22:04:59 -08:00 |
| Krrish Dholakia | f10bb708c0 | fix: fix run_ollama_serve to only run if api base is none | 2023-12-09 21:31:46 -08:00 |
| Krrish Dholakia | 82484691cb | fix: minor bug fixes | 2023-12-09 21:24:52 -08:00 |
| Krrish Dholakia | 6ef0e8485e | fix(proxy_server.py): support for streaming | 2023-12-09 16:23:04 -08:00 |
| ishaan-jaff | 1f1f749ad9 | (feat) proxy - cache allow bool | 2023-12-09 15:55:38 -08:00 |
| ishaan-jaff | 994c204e45 | (fix) proxy - show logging when Cache created | 2023-12-09 15:21:04 -08:00 |
| Krrish Dholakia | dd6caa8b55 | fix(proxy_server.py): fix linting errors | 2023-12-09 12:17:33 -08:00 |
| Krrish Dholakia | ed50522863 | fix(proxy_server.py): fix pydantic version errors | 2023-12-09 12:09:49 -08:00 |
| Krrish Dholakia | c1c81af9d4 | fix(proxy_server.py): add custom json helper func to work across multiple pydantic versions | 2023-12-09 11:13:03 -08:00 |
| Krrish Dholakia | c482c399ff | fix(proxy_server.py): add logging around /key/generate | 2023-12-09 10:26:12 -08:00 |
| ishaan-jaff | 0dc7d538b5 | (docs) proxy - OTEL + traceloop | 2023-12-08 18:39:52 -08:00 |
| ishaan-jaff | 5efe336968 | (fix) linting | 2023-12-08 17:26:14 -08:00 |
| Krrish Dholakia | 5fa2b6e5ad | fix(proxy_server.py): enable pre+post-call hooks and max parallel request limits | 2023-12-08 17:11:30 -08:00 |
| ishaan-jaff | be94a8c478 | (feat) pass model_info, proxy_server_request to callback | 2023-12-08 14:26:18 -08:00 |
| Krrish Dholakia | bab958561e | fix(proxy/utils.py): correctly raise exceptions on db call fail | 2023-12-08 13:35:42 -08:00 |
| Krrish Dholakia | 9f0b83248a | fix(proxy/utils.py): add backoff/retry logic to db read/writes | 2023-12-08 13:34:31 -08:00 |
| Krrish Dholakia | f06c3e0f08 | fix(proxy_server.py): initializing sentry in proxy logging before db init | 2023-12-08 11:58:16 -08:00 |
| Krrish Dholakia | 7aec95ed7c | feat(proxy_server.py): add sentry logging for db read/writes | 2023-12-08 11:40:19 -08:00 |
| Krrish Dholakia | 4e6a8d09d0 | fix(proxy/utils.py): tracking db failed writes | 2023-12-08 10:40:02 -08:00 |
| Krrish Dholakia | 1b35736797 | fix(utils.py): fix cost calculation to handle tool input | 2023-12-08 09:53:46 -08:00 |
| Krrish Dholakia | 9494c2cd9e | fix(requirements.txt): add ollama async_generator in requirements.txt | 2023-12-08 09:20:04 -08:00 |
| Krrish Dholakia | 2f1c5aa0c7 | fix: setting cache responses on proxy | 2023-12-07 20:39:40 -08:00 |
| Krrish Dholakia | f5afc429b3 | fix(proxy_server.py): add call hooks pre+post completion and embedding calls | 2023-12-07 20:35:32 -08:00 |
| Krrish Dholakia | 9cf3051ea2 | feat(proxy_server.py): enable background health checks | 2023-12-07 19:40:06 -08:00 |
| ishaan-jaff | fd04b48764 | (feat) async callbacks with litellm.completion() | 2023-12-07 18:09:57 -08:00 |
| ishaan-jaff | 762f28e4d7 | (fix) make print_verbose non blocking | 2023-12-07 17:31:32 -08:00 |
| Krrish Dholakia | e5638e2c5d | fix(router.py): fix default caching response value | 2023-12-07 13:44:31 -08:00 |
| ishaan-jaff | 2bc583c2a6 | (test) proxy - async custom logger | 2023-12-07 13:19:17 -08:00 |
| Krrish Dholakia | d77e0cc716 | docs(config.md): adding docs on parallel request rate limiting | 2023-12-07 11:27:48 -08:00 |
| Krrish Dholakia | c7aaa4adf8 | docs(deploy.md): add docker instructions to deploy docs | 2023-12-07 09:22:54 -08:00 |
| Krrish Dholakia | bd8d59e693 | refactor(proxy_server.py): linting fix | 2023-12-06 22:49:30 -08:00 |
| Krrish Dholakia | c1e95740b0 | fix(bedrock.py): fix output format for cohere embeddings | 2023-12-06 22:47:01 -08:00 |
| ishaan-jaff | fa70b1f85b | (test) unset model_group_alias_map after test | 2023-12-06 20:35:14 -08:00 |
| ishaan-jaff | 900b8d66f3 | (feat) proxy use model_group_alias_map | 2023-12-06 20:23:24 -08:00 |
| Krrish Dholakia | c0eedf28fc | test: fix proxy server testing | 2023-12-06 18:38:53 -08:00 |
| ishaan-jaff | 19b1deb200 | (feat) proxy: protect health endpoint | 2023-12-06 18:14:54 -08:00 |
| Krrish Dholakia | 45b4140615 | test: fix config import for proxy testing | 2023-12-06 17:40:38 -08:00 |
| Krrish Dholakia | d814184bc3 | test: fix test imports | 2023-12-06 17:21:47 -08:00 |
| ishaan-jaff | b60dc20f4b | (fix) proxy edit custom logger | 2023-12-06 17:16:24 -08:00 |
| ishaan-jaff | 1bac052eca | (fix) proxy use async logging | 2023-12-06 17:16:24 -08:00 |
| ishaan-jaff | dfb30d38fa | (feat) proxy print set callbacks | 2023-12-06 17:16:24 -08:00 |