ishaan-jaff
|
e77b7e5a50
|
(docs) proxy: add tags=chat/completions + add response type
|
2023-12-01 10:00:48 -08:00 |
|
ishaan-jaff
|
2977d04f56
|
(fix) proxy: raise exceptions
|
2023-12-01 09:21:06 -08:00 |
|
ishaan-jaff
|
1e9aa69268
|
(fix) proxy: use orjson.loads()
|
2023-11-30 20:24:31 -08:00 |
|
ishaan-jaff
|
06805c7f47
|
(fix) formatting
|
2023-11-30 20:03:56 -08:00 |
|
ishaan-jaff
|
10d5ab8643
|
(feat) proxy: /embedding -> use ORJSON responses
|
2023-11-30 20:00:35 -08:00 |
|
ishaan-jaff
|
2d55cc753e
|
(feat) proxy: use orjson
|
2023-11-30 19:50:47 -08:00 |
|
ishaan-jaff
|
853af29a25
|
(test) load test embedding
|
2023-11-30 19:04:51 -08:00 |
|
Frank Colson
|
7ddfeb75bc
|
Add backwards compatability
|
2023-11-30 16:35:19 -07:00 |
|
Frank Colson
|
5e6913dff2
|
Use poetry extras for proxy
|
2023-11-30 16:23:34 -07:00 |
|
ishaan-jaff
|
bc2299184b
|
(fix) proxy - don't overwrite metadata passed
|
2023-11-30 15:15:47 -08:00 |
|
ishaan-jaff
|
9a1accfe2a
|
(chore) proxy: remove junk load test
|
2023-11-30 13:31:23 -08:00 |
|
ishaan-jaff
|
be8bdb580a
|
(test) proxy + router: add bursty load test
|
2023-11-30 13:17:11 -08:00 |
|
ishaan-jaff
|
a8a6838867
|
(docs) example: azure config.yaml
|
2023-11-30 13:16:41 -08:00 |
|
Krrish Dholakia
|
062ede96e3
|
refactor(proxy_server.py): fix linting issues
|
2023-11-30 09:24:59 -08:00 |
|
Krrish Dholakia
|
b4b7acdb72
|
fix(utils.py): fix azure completion cost calculation
|
2023-11-30 09:19:35 -08:00 |
|
Krrish Dholakia
|
7ee089b5ca
|
fix(proxy_server.py): provide an endpoint that gives model-specific info from proxy
|
2023-11-30 09:08:19 -08:00 |
|
ishaan-jaff
|
a56d4a1e83
|
(fix) proxy: print cwd()
|
2023-11-30 08:52:06 -08:00 |
|
ishaan-jaff
|
ecdc5bdad6
|
(dos) config.yaml
|
2023-11-30 08:34:36 -08:00 |
|
ishaan-jaff
|
e5ce45dc2c
|
(cleanup) proxy/health
|
2023-11-29 20:15:52 -08:00 |
|
Krrish Dholakia
|
50cc4a8595
|
fix(proxy_server.py): have /health and /routes be router endpoints
|
2023-11-29 19:59:56 -08:00 |
|
ishaan-jaff
|
3cc8305ec6
|
(fix) proxy: /health
|
2023-11-29 16:23:37 -08:00 |
|
ishaan-jaff
|
d3672452ce
|
(test) 1k requests
|
2023-11-29 16:22:18 -08:00 |
|
ishaan-jaff
|
66bc0fc343
|
(fix) proxy: /health works with router updates
|
2023-11-29 16:09:31 -08:00 |
|
ishaan-jaff
|
f307e82a41
|
(fix) proxy: making receiving data print_verbose
|
2023-11-29 07:50:52 -08:00 |
|
Krrish Dholakia
|
2b06fea4a8
|
fix(proxy_server.py): ensure /models returns unique model names
|
2023-11-28 17:32:20 -08:00 |
|
ishaan-jaff
|
ee6f5a84db
|
(test) load test completion
|
2023-11-28 15:44:56 -08:00 |
|
ishaan-jaff
|
ae7f0ae0b6
|
(feat) proxy: add logs on router performance
|
2023-11-28 15:44:56 -08:00 |
|
Krrish Dholakia
|
4ea52dd571
|
fix(proxy_server.py): support reading master key from os environment
|
2023-11-28 14:05:17 -08:00 |
|
ishaan-jaff
|
3ca4487e77
|
(feat) proxy set num_retries=3
|
2023-11-27 19:33:59 -08:00 |
|
ishaan-jaff
|
40d9e8ab23
|
(test) load test
|
2023-11-27 18:08:47 -08:00 |
|
ishaan-jaff
|
8560794963
|
(test) load test router
|
2023-11-27 16:37:57 -08:00 |
|
ishaan-jaff
|
ba228a9e0a
|
(fix) proxy set litellm attributes
|
2023-11-27 13:39:18 -08:00 |
|
ishaan-jaff
|
5e2c13fb11
|
(test) load test proxy completion
|
2023-11-27 12:13:21 -08:00 |
|
ishaan-jaff
|
9747cc5aad
|
(feat) --health for checking config models
|
2023-11-27 12:13:21 -08:00 |
|
Krrish Dholakia
|
56bb39e52c
|
fix(acompletion): fix acompletion raise exception issue when custom llm provider is none
|
2023-11-27 11:34:48 -08:00 |
|
Krrish Dholakia
|
aafba24e84
|
fix(proxy_server.py): fix user model returned in /models
|
2023-11-27 08:04:49 -08:00 |
|
Krrish Dholakia
|
e4f302a8e2
|
fix(proxy_server.py): expose a /health endpoint
|
2023-11-25 18:28:47 -08:00 |
|
ishaan-jaff
|
a688df79b1
|
(feat) proxy: make chat/completions async
|
2023-11-25 12:54:03 -08:00 |
|
ishaan-jaff
|
dca0a5ad0f
|
(test) load test embedding: proxy
|
2023-11-24 17:14:44 -08:00 |
|
ishaan-jaff
|
111c7afaca
|
(docs) proxy performance
|
2023-11-24 17:07:46 -08:00 |
|
Krrish Dholakia
|
d62da29cbe
|
fix: fix linting issues
|
2023-11-24 15:46:25 -08:00 |
|
Krrish Dholakia
|
bc84b38154
|
feat(proxy_server.py): new /key/info endpoint to access key information (master key only)
|
2023-11-24 15:24:50 -08:00 |
|
Krrish Dholakia
|
4f22e7de18
|
feat(proxy_server.py): tracking spend per api key
|
2023-11-24 15:14:06 -08:00 |
|
Krrish Dholakia
|
16e1070dbe
|
test: refactor testing order
|
2023-11-24 12:47:28 -08:00 |
|
Krrish Dholakia
|
2a033fd8a2
|
test(test_router_cooldowns.py): adding logging
|
2023-11-24 12:30:08 -08:00 |
|
Krrish Dholakia
|
2e8d582a34
|
fix(proxy_server.py): fix linting issues
|
2023-11-24 11:39:01 -08:00 |
|
ishaan-jaff
|
8edfcd8e5d
|
(fix) prisma using: secrets.compare_digest
|
2023-11-24 10:02:08 -08:00 |
|
David Manouchehri
|
ac08e3616c
|
Fix timing attack on master_key.
|
2023-11-24 12:12:29 -05:00 |
|
David Manouchehri
|
5b6f227170
|
Fix master key check.
|
2023-11-24 12:03:30 -05:00 |
|
David Manouchehri
|
3fa3a767b3
|
Fix OpenAPI auth spec.
|
2023-11-24 11:59:33 -05:00 |
|