ishaan-jaff
|
587649d37f
|
(fix) proxy /model/info
|
2023-12-09 22:35:09 -08:00 |
|
Krrish Dholakia
|
0fbe0f017a
|
fix(proxy_server.py): fix key gen error
|
2023-12-09 22:04:59 -08:00 |
|
Krrish Dholakia
|
01394a7bbd
|
fix: minor bug fixes
|
2023-12-09 21:24:52 -08:00 |
|
Krrish Dholakia
|
8eb7dc6393
|
fix(proxy_server.py): support for streaming
|
2023-12-09 16:23:04 -08:00 |
|
ishaan-jaff
|
827cc66c15
|
(feat) proxy - cache allow bool
|
2023-12-09 15:55:38 -08:00 |
|
ishaan-jaff
|
9951c6d569
|
(fix) proxy - show logging when Cache created
|
2023-12-09 15:21:04 -08:00 |
|
Krrish Dholakia
|
043d0c0c89
|
fix(proxy_server.py): fix linting errors
|
2023-12-09 12:17:33 -08:00 |
|
Krrish Dholakia
|
2048ef445d
|
fix(proxy_server.py): fix pydantic version errors
|
2023-12-09 12:09:49 -08:00 |
|
Krrish Dholakia
|
a8f5b4b69c
|
fix(proxy_server.py): add custom json helper func to work across multiple pydantic versions
|
2023-12-09 11:13:03 -08:00 |
|
Krrish Dholakia
|
612be8c844
|
fix(proxy_server.py): add logging around /key/generate
|
2023-12-09 10:26:12 -08:00 |
|
ishaan-jaff
|
f97ebdb4bd
|
(docs) proxy - OTEL + traceloop
|
2023-12-08 18:39:52 -08:00 |
|
ishaan-jaff
|
1e3d909408
|
(fix) linting
|
2023-12-08 17:26:14 -08:00 |
|
Krrish Dholakia
|
9c6584a376
|
fix(proxy_server.py): enable pre+post-call hooks and max parallel request limits
|
2023-12-08 17:11:30 -08:00 |
|
ishaan-jaff
|
de6880dc09
|
(feat) pass model_info, proxy_server_request to callback
|
2023-12-08 14:26:18 -08:00 |
|
Krrish Dholakia
|
8460924f1d
|
fix(proxy_server.py): initializing sentry in proxy logging before db init
|
2023-12-08 11:58:16 -08:00 |
|
Krrish Dholakia
|
4ff969bf6d
|
feat(proxy_server.py): add sentry logging for db read/writes
|
2023-12-08 11:40:19 -08:00 |
|
Krrish Dholakia
|
7c962637f5
|
fix(utils.py): fix cost calculation to handle tool input
|
2023-12-08 09:53:46 -08:00 |
|
Krrish Dholakia
|
8d5c3ea36c
|
fix(requirements.txt): add ollama async_generator in requirements.txt
|
2023-12-08 09:20:04 -08:00 |
|
Krrish Dholakia
|
30204497e0
|
fix: setting cache responses on proxy
|
2023-12-07 20:39:40 -08:00 |
|
Krrish Dholakia
|
665939bc48
|
fix(proxy_server.py): add call hooks pre+post completion and embedding calls
|
2023-12-07 20:35:32 -08:00 |
|
Krrish Dholakia
|
c739b29eb0
|
feat(proxy_server.py): enable background health checks
|
2023-12-07 19:40:06 -08:00 |
|
ishaan-jaff
|
f744445db4
|
(fix) make print_verbose non blocking
|
2023-12-07 17:31:32 -08:00 |
|
Krrish Dholakia
|
69c34493ce
|
fix(router.py): fix default caching response value
|
2023-12-07 13:44:31 -08:00 |
|
ishaan-jaff
|
c6d19046f2
|
(test) proxy - async custom logger
|
2023-12-07 13:19:17 -08:00 |
|
Krrish Dholakia
|
4d1c294837
|
docs(config.md): adding docs on parallel request rate limiting
|
2023-12-07 11:27:48 -08:00 |
|
Krrish Dholakia
|
e835bbd88b
|
refactor(proxy_server.py): linting fix
|
2023-12-06 22:49:30 -08:00 |
|
Krrish Dholakia
|
583de6ab92
|
fix(bedrock.py): fix output format for cohere embeddings
|
2023-12-06 22:47:01 -08:00 |
|
ishaan-jaff
|
a7584a2b75
|
(test) unset model_group_alias_map after test
|
2023-12-06 20:35:14 -08:00 |
|
ishaan-jaff
|
13a0a9f22c
|
(feat) proxy use model_group_alias_map
|
2023-12-06 20:23:24 -08:00 |
|
Krrish Dholakia
|
2af2a17bc8
|
test: fix proxy server testing
|
2023-12-06 18:38:53 -08:00 |
|
ishaan-jaff
|
aa5bc26f59
|
(feat) proxy: protect health endpoint
|
2023-12-06 18:14:54 -08:00 |
|
Krrish Dholakia
|
b7e75b940a
|
test: fix config import for proxy testing
|
2023-12-06 17:40:38 -08:00 |
|
ishaan-jaff
|
0532c038ff
|
(feat) proxy print set callbacks
|
2023-12-06 17:16:24 -08:00 |
|
Krrish Dholakia
|
37f717d0f7
|
fix(proxy_server.py): make headers json serializable
|
2023-12-06 17:09:02 -08:00 |
|
Krrish Dholakia
|
8cbcc85496
|
fix(proxy_server.py): enable rate limiting concurrent user requests
|
2023-12-06 15:11:05 -08:00 |
|
ishaan-jaff
|
d7f81cd9aa
|
(feat) proxy - custom on failure callback
|
2023-12-06 14:43:47 -08:00 |
|
ishaan-jaff
|
58f9ef8960
|
(chore) print verbose
|
2023-12-06 14:14:20 -08:00 |
|
ishaan-jaff
|
d628e6b86b
|
(fix) print statements
|
2023-12-06 14:11:23 -08:00 |
|
ishaan-jaff
|
60317f1476
|
(fix) proxy /model/new writing to config
|
2023-12-06 14:11:23 -08:00 |
|
Krrish Dholakia
|
0dff2ebf55
|
fix(proxy_server.py): allow worker config to just be the config filepath
|
2023-12-06 14:03:25 -08:00 |
|
ishaan-jaff
|
6b8d1a20f5
|
(feat) proxy: use async_callback function
|
2023-12-06 13:51:24 -08:00 |
|
Krrish Dholakia
|
a18bdb3f2e
|
fix(bedrock.py): adding support for cohere embeddings
|
2023-12-06 13:25:18 -08:00 |
|
ishaan-jaff
|
9703afdef1
|
(fix) linting
|
2023-12-06 13:14:26 -08:00 |
|
ishaan-jaff
|
e5d03889fa
|
(feat) /v1/model/info
|
2023-12-06 13:03:29 -08:00 |
|
Krrish Dholakia
|
6d1a5089e3
|
refactor: fix linting errors
|
2023-12-06 11:46:15 -08:00 |
|
ishaan-jaff
|
edb3f82629
|
(fix) proxy - move new health check import
|
2023-12-06 10:13:06 -08:00 |
|
Ishaan Jaff
|
f3d8825290
|
Merge pull request #1023 from PSU3D0/speedup_health_endpoint
(feat) Speedup health endpoint
|
2023-12-06 09:52:13 -08:00 |
|
Krrish Dholakia
|
1546b85bc7
|
feat(proxy_server.py): adding /model/delete endpoint
|
2023-12-05 22:38:38 -08:00 |
|
ishaan-jaff
|
905f366f43
|
(fix) router - allow users to call a specific_model explicit
|
2023-12-05 21:57:00 -08:00 |
|
Frank Colson
|
497602606f
|
Speedup health endpoint
|
2023-12-05 22:09:01 -07:00 |
|