Commit graph

408 commits

Author SHA1 Message Date
Krrish Dholakia
81078c4004 fix(proxy/utils.py): jsonify object before db writes 2023-12-21 13:03:14 +05:30
Sihyeon Jang
4f41c3c513 fix: success_callback logic for cost_tracking
Signed-off-by: Sihyeon Jang <uneedsihyeon@gmail.com>
2023-12-21 16:09:59 +09:00
ishaan-jaff
683a1ee979 (feat) proxy key/generate pass metadata in requests 2023-12-20 13:42:49 +05:30
ishaan-jaff
229b56fc35 (docs) swagger - add embedding tag 2023-12-20 09:04:56 +05:30
ishaan-jaff
aa78415894 (docs) swager - add embeddings tag 2023-12-20 06:29:36 +05:30
ishaan-jaff
9548334e2f (docs) swagger docs add description 2023-12-20 06:27:26 +05:30
ishaan-jaff
8b26e64b5d (fix) proxy: add link t swagger docs on startup 2023-12-20 06:02:05 +05:30
ishaan-jaff
ce1b0b89ba (fix) proxy - health checks support cli model 2023-12-19 12:55:20 +05:30
Krrish Dholakia
3923c389fd build(Dockerfile): fixing build requirements 2023-12-16 17:52:30 -08:00
Krrish Dholakia
3291de9e11 fix(proxy_server.py): setup depedencies on server startup 2023-12-16 11:56:11 -08:00
ishaan-jaff
6b7d0eada4 (feat) proxy - set cache configs on proxy 2023-12-16 14:44:39 +05:30
ishaan-jaff
37bf94dcdf (fix) proxy remove errant print 2023-12-16 12:26:02 +05:30
Krrish Dholakia
4791dda66f feat(proxy_server.py): enable infinite retries on rate limited requests 2023-12-15 20:03:41 -08:00
ishaan-jaff
0b4f1dee42 (feat) proxy: embeddings-add OpenAI exception mapping 2023-12-15 14:02:24 +05:30
Krrish Dholakia
adaf341f54 fix(proxy_server.py): allow post-call rules to be passed in as a file 2023-12-14 18:21:09 -08:00
Krrish Dholakia
2a4c1a1803 fix(proxy_server.py): don't pass in user param if not sent 2023-12-14 14:17:33 -08:00
Krrish Dholakia
1608dd7e0b fix(main.py): support async streaming for text completions endpoint 2023-12-14 13:56:32 -08:00
Krrish Dholakia
db8d793618 fix(proxy_server.py): handle scenario where no json body is passed in - e.g. GET request
https://github.com/BerriAI/litellm/issues/1062
2023-12-14 09:57:35 -08:00
ishaan-jaff
d1cf41888b (feat) proxy add docstring for /test 2023-12-14 16:51:16 +05:30
ishaan-jaff
80fc8050eb (feat) proxy - model group alias 2023-12-14 13:24:10 +05:30
Krrish Dholakia
effdddc1c8 fix(custom_logger.py): enable pre_call hooks to modify incoming data to proxy 2023-12-13 16:20:37 -08:00
Krrish Dholakia
7b8851cce5 fix(ollama.py): fix async completion calls for ollama 2023-12-13 13:10:25 -08:00
Krrish Dholakia
693292a64c feat(proxy_server.py): add new /key/update endpoint 2023-12-12 17:18:51 -08:00
Krrish Dholakia
a266ad0319 refactor(proxy_server.py): code cleanup 2023-12-12 15:49:41 -08:00
Krrish Dholakia
8b07a6c046 fix(main.py): pass user_id + encoding_format for logging + to openai/azure 2023-12-12 15:46:44 -08:00
Krrish Dholakia
bdf29ca71f fix(sagemaker.py): debug streaming 2023-12-12 15:45:07 -08:00
ishaan-jaff
f8c7528bff (fix) proxy : cleanup router 2023-12-11 22:12:22 -08:00
Krrish Dholakia
3e7604420e fix(proxy/utils.py): fix db writes on retry 2023-12-11 21:14:12 -08:00
Krrish Dholakia
ea89a8a938 test(test_custom_callback_unit.py): adding unit tests for custom callbacks + fixing related bugs 2023-12-11 11:44:09 -08:00
ishaan-jaff
71eee5087d (test) assert metadata in callbacks 2023-12-11 09:47:38 -08:00
Krrish Dholakia
22f04e3b33 fix(proxy_server.py): fix /model/new adding new model issue 2023-12-09 22:44:11 -08:00
ishaan-jaff
127f25703a (fix) proxy /model/info 2023-12-09 22:35:09 -08:00
Krrish Dholakia
07b4c72a98 fix(proxy_server.py): fix key gen error 2023-12-09 22:04:59 -08:00
Krrish Dholakia
82484691cb fix: minor bug fixes 2023-12-09 21:24:52 -08:00
Krrish Dholakia
6ef0e8485e fix(proxy_server.py): support for streaming 2023-12-09 16:23:04 -08:00
ishaan-jaff
1f1f749ad9 (feat) proxy - cache allow bool 2023-12-09 15:55:38 -08:00
ishaan-jaff
994c204e45 (fix) proxy - show logging when Cache created 2023-12-09 15:21:04 -08:00
Krrish Dholakia
dd6caa8b55 fix(proxy_server.py): fix linting errors 2023-12-09 12:17:33 -08:00
Krrish Dholakia
ed50522863 fix(proxy_server.py): fix pydantic version errors 2023-12-09 12:09:49 -08:00
Krrish Dholakia
c1c81af9d4 fix(proxy_server.py): add custom json helper func to work across multiple pydantic versions 2023-12-09 11:13:03 -08:00
Krrish Dholakia
c482c399ff fix(proxy_server.py): add logging around /key/generate 2023-12-09 10:26:12 -08:00
ishaan-jaff
0dc7d538b5 (docs) proxy - OTEL + traceloop 2023-12-08 18:39:52 -08:00
ishaan-jaff
5efe336968 (fix) linting 2023-12-08 17:26:14 -08:00
Krrish Dholakia
5fa2b6e5ad fix(proxy_server.py): enable pre+post-call hooks and max parallel request limits 2023-12-08 17:11:30 -08:00
ishaan-jaff
be94a8c478 (feat) pass model_info, proxy_server_request to callback 2023-12-08 14:26:18 -08:00
Krrish Dholakia
f06c3e0f08 fix(proxy_server.py): initializing sentry in proxy logging before db init 2023-12-08 11:58:16 -08:00
Krrish Dholakia
7aec95ed7c feat(proxy_server.py): add sentry logging for db read/writes 2023-12-08 11:40:19 -08:00
Krrish Dholakia
1b35736797 fix(utils.py): fix cost calculation to handle tool input 2023-12-08 09:53:46 -08:00
Krrish Dholakia
9494c2cd9e fix(requirements.txt): add ollama async_generator in requirements.txt 2023-12-08 09:20:04 -08:00
Krrish Dholakia
2f1c5aa0c7 fix: setting cache responses on proxy 2023-12-07 20:39:40 -08:00