Krrish Dholakia | 2a4c1a1803 | fix(proxy_server.py): don't pass in user param if not sent | 2023-12-14 14:17:33 -08:00
Krrish Dholakia | 1608dd7e0b | fix(main.py): support async streaming for text completions endpoint | 2023-12-14 13:56:32 -08:00
Krrish Dholakia | db8d793618 | fix(proxy_server.py): handle scenario where no json body is passed in - e.g. GET request https://github.com/BerriAI/litellm/issues/1062 | 2023-12-14 09:57:35 -08:00
ishaan-jaff | d1cf41888b | (feat) proxy add docstring for /test | 2023-12-14 16:51:16 +05:30
ishaan-jaff | 80fc8050eb | (feat) proxy - model group alias | 2023-12-14 13:24:10 +05:30
Krrish Dholakia | effdddc1c8 | fix(custom_logger.py): enable pre_call hooks to modify incoming data to proxy | 2023-12-13 16:20:37 -08:00
Krrish Dholakia | 7b8851cce5 | fix(ollama.py): fix async completion calls for ollama | 2023-12-13 13:10:25 -08:00
Krrish Dholakia | 693292a64c | feat(proxy_server.py): add new /key/update endpoint | 2023-12-12 17:18:51 -08:00
Krrish Dholakia | a266ad0319 | refactor(proxy_server.py): code cleanup | 2023-12-12 15:49:41 -08:00
Krrish Dholakia | 8b07a6c046 | fix(main.py): pass user_id + encoding_format for logging + to openai/azure | 2023-12-12 15:46:44 -08:00
Krrish Dholakia | bdf29ca71f | fix(sagemaker.py): debug streaming | 2023-12-12 15:45:07 -08:00
ishaan-jaff | f8c7528bff | (fix) proxy : cleanup router | 2023-12-11 22:12:22 -08:00
Krrish Dholakia | 3e7604420e | fix(proxy/utils.py): fix db writes on retry | 2023-12-11 21:14:12 -08:00
Krrish Dholakia | ea89a8a938 | test(test_custom_callback_unit.py): adding unit tests for custom callbacks + fixing related bugs | 2023-12-11 11:44:09 -08:00
ishaan-jaff | 71eee5087d | (test) assert metadata in callbacks | 2023-12-11 09:47:38 -08:00
Krrish Dholakia | 22f04e3b33 | fix(proxy_server.py): fix /model/new adding new model issue | 2023-12-09 22:44:11 -08:00
ishaan-jaff | 127f25703a | (fix) proxy /model/info | 2023-12-09 22:35:09 -08:00
Krrish Dholakia | 07b4c72a98 | fix(proxy_server.py): fix key gen error | 2023-12-09 22:04:59 -08:00
Krrish Dholakia | 82484691cb | fix: minor bug fixes | 2023-12-09 21:24:52 -08:00
Krrish Dholakia | 6ef0e8485e | fix(proxy_server.py): support for streaming | 2023-12-09 16:23:04 -08:00
ishaan-jaff | 1f1f749ad9 | (feat) proxy - cache allow bool | 2023-12-09 15:55:38 -08:00
ishaan-jaff | 994c204e45 | (fix) proxy - show logging when Cache created | 2023-12-09 15:21:04 -08:00
Krrish Dholakia | dd6caa8b55 | fix(proxy_server.py): fix linting errors | 2023-12-09 12:17:33 -08:00
Krrish Dholakia | ed50522863 | fix(proxy_server.py): fix pydantic version errors | 2023-12-09 12:09:49 -08:00
Krrish Dholakia | c1c81af9d4 | fix(proxy_server.py): add custom json helper func to work across multiple pydantic versions | 2023-12-09 11:13:03 -08:00
Krrish Dholakia | c482c399ff | fix(proxy_server.py): add logging around /key/generate | 2023-12-09 10:26:12 -08:00
ishaan-jaff | 0dc7d538b5 | (docs) proxy - OTEL + traceloop | 2023-12-08 18:39:52 -08:00
ishaan-jaff | 5efe336968 | (fix) linting | 2023-12-08 17:26:14 -08:00
Krrish Dholakia | 5fa2b6e5ad | fix(proxy_server.py): enable pre+post-call hooks and max parallel request limits | 2023-12-08 17:11:30 -08:00
ishaan-jaff | be94a8c478 | (feat) pass model_info, proxy_server_request to callback | 2023-12-08 14:26:18 -08:00
Krrish Dholakia | f06c3e0f08 | fix(proxy_server.py): initializing sentry in proxy logging before db init | 2023-12-08 11:58:16 -08:00
Krrish Dholakia | 7aec95ed7c | feat(proxy_server.py): add sentry logging for db read/writes | 2023-12-08 11:40:19 -08:00
Krrish Dholakia | 1b35736797 | fix(utils.py): fix cost calculation to handle tool input | 2023-12-08 09:53:46 -08:00
Krrish Dholakia | 9494c2cd9e | fix(requirements.txt): add ollama async_generator in requirements.txt | 2023-12-08 09:20:04 -08:00
Krrish Dholakia | 2f1c5aa0c7 | fix: setting cache responses on proxy | 2023-12-07 20:39:40 -08:00
Krrish Dholakia | f5afc429b3 | fix(proxy_server.py): add call hooks pre+post completion and embedding calls | 2023-12-07 20:35:32 -08:00
Krrish Dholakia | 9cf3051ea2 | feat(proxy_server.py): enable background health checks | 2023-12-07 19:40:06 -08:00
ishaan-jaff | 762f28e4d7 | (fix) make print_verbose non blocking | 2023-12-07 17:31:32 -08:00
Krrish Dholakia | e5638e2c5d | fix(router.py): fix default caching response value | 2023-12-07 13:44:31 -08:00
ishaan-jaff | 2bc583c2a6 | (test) proxy - async custom logger | 2023-12-07 13:19:17 -08:00
Krrish Dholakia | d77e0cc716 | docs(config.md): adding docs on parallel request rate limiting | 2023-12-07 11:27:48 -08:00
Krrish Dholakia | bd8d59e693 | refactor(proxy_server.py): linting fix | 2023-12-06 22:49:30 -08:00
Krrish Dholakia | c1e95740b0 | fix(bedrock.py): fix output format for cohere embeddings | 2023-12-06 22:47:01 -08:00
ishaan-jaff | fa70b1f85b | (test) unset model_group_alias_map after test | 2023-12-06 20:35:14 -08:00
ishaan-jaff | 900b8d66f3 | (feat) proxy use model_group_alias_map | 2023-12-06 20:23:24 -08:00
Krrish Dholakia | c0eedf28fc | test: fix proxy server testing | 2023-12-06 18:38:53 -08:00
ishaan-jaff | 19b1deb200 | (feat) proxy: protect health endpoint | 2023-12-06 18:14:54 -08:00
Krrish Dholakia | 45b4140615 | test: fix config import for proxy testing | 2023-12-06 17:40:38 -08:00
ishaan-jaff | dfb30d38fa | (feat) proxy print set callbacks | 2023-12-06 17:16:24 -08:00
Krrish Dholakia | 58848841e1 | fix(proxy_server.py): make headers json serializable | 2023-12-06 17:09:02 -08:00