Commit graph

517 commits

Author SHA1 Message Date
ishaan-jaff
7c34024411 (dos) config.yaml 2023-11-30 08:34:36 -08:00
ishaan-jaff
370e78647b (cleanup) proxy/health 2023-11-29 20:15:52 -08:00
Krrish Dholakia
d7b70d47de fix(proxy_server.py): have /health and /routes be router endpoints 2023-11-29 19:59:56 -08:00
ishaan-jaff
681f2e6078 (fix) proxy: /health 2023-11-29 16:23:37 -08:00
ishaan-jaff
700fee3eba (test) 1k requests 2023-11-29 16:22:18 -08:00
ishaan-jaff
f29114dadc (fix) proxy: /health works with router updates 2023-11-29 16:09:31 -08:00
ishaan-jaff
229f50394d (fix) proxy: making receiving data print_verbose 2023-11-29 07:50:52 -08:00
Krrish Dholakia
6162d32b5a fix(proxy_server.py): ensure /models returns unique model names 2023-11-28 17:32:20 -08:00
ishaan-jaff
bcc58e16be (test) load test completion 2023-11-28 15:44:56 -08:00
ishaan-jaff
062cf64c43 (feat) proxy: add logs on router performance 2023-11-28 15:44:56 -08:00
Krrish Dholakia
8fff23b944 fix(proxy_server.py): support reading master key from os environment 2023-11-28 14:05:17 -08:00
ishaan-jaff
42df7f9a08 (feat) proxy set num_retries=3 2023-11-27 19:33:59 -08:00
ishaan-jaff
547edd24e6 (test) load test 2023-11-27 18:08:47 -08:00
ishaan-jaff
aef3d2699f (test) load test router 2023-11-27 16:37:57 -08:00
ishaan-jaff
a606d951c5 (fix) proxy set litellm attributes 2023-11-27 13:39:18 -08:00
ishaan-jaff
367468d655 (test) load test proxy completion 2023-11-27 12:13:21 -08:00
ishaan-jaff
3fbd2a853f (feat) --health for checking config models 2023-11-27 12:13:21 -08:00
Krrish Dholakia
fb680ce4a2 fix(acompletion): fix acompletion raise exception issue when custom llm provider is none 2023-11-27 11:34:48 -08:00
Krrish Dholakia
1b723d4694 fix(proxy_server.py): fix user model returned in /models 2023-11-27 08:04:49 -08:00
Krrish Dholakia
8884ceb606 fix(proxy_server.py): expose a /health endpoint 2023-11-25 18:28:47 -08:00
ishaan-jaff
b0552cad35 (feat) proxy: make chat/completions async 2023-11-25 12:54:03 -08:00
ishaan-jaff
c0dfc8d9b3 (test) load test embedding: proxy 2023-11-24 17:14:44 -08:00
ishaan-jaff
32b7c236e6 (docs) proxy performance 2023-11-24 17:07:46 -08:00
Krrish Dholakia
e421642ba8 fix: fix linting issues 2023-11-24 15:46:25 -08:00
Krrish Dholakia
bcf0595bd4 feat(proxy_server.py): new /key/info endpoint to access key information (master key only) 2023-11-24 15:24:50 -08:00
Krrish Dholakia
32cdd0a613 feat(proxy_server.py): tracking spend per api key 2023-11-24 15:14:06 -08:00
Krrish Dholakia
9618718080 test: refactor testing order 2023-11-24 12:47:28 -08:00
Krrish Dholakia
f630901a77 test(test_router_cooldowns.py): adding logging 2023-11-24 12:30:08 -08:00
Krrish Dholakia
6f40fd8ee2 fix(proxy_server.py): fix linting issues 2023-11-24 11:39:01 -08:00
ishaan-jaff
b1db3a38d7 (fix) prisma using: secrets.compare_digest 2023-11-24 10:02:08 -08:00
David Manouchehri
a1a0af824a Fix timing attack on master_key. 2023-11-24 12:12:29 -05:00
David Manouchehri
5165461c51 Fix master key check. 2023-11-24 12:03:30 -05:00
David Manouchehri
75dcb8ba2a Fix OpenAPI auth spec. 2023-11-24 11:59:33 -05:00
ishaan-jaff
865e3959f3 (feat) proxy: cost tracking add track_cost_callback 2023-11-24 05:53:23 -08:00
Kumaran Rajendhiran
9fb31448a9 Fail gracefully if ollama is already being served 2023-11-24 16:52:55 +05:30
Krrish Dholakia
daa45b4031 fix(proxy_server.py): fix linting errors 2023-11-23 21:42:39 -08:00
Krrish Dholakia
8030a9b8d1 feat(proxy_server.py): /key/delete endpoint 2023-11-23 21:37:53 -08:00
ishaan-jaff
7ccfbde6fd (fix) proxy: /embeddings 2023-11-23 21:16:51 -08:00
ishaan-jaff
d8c7417647 (fix) proxy: prisma.schema 2023-11-23 20:11:42 -08:00
Krrish Dholakia
e4deb09eb6 fix(router.py): add support for context window fallbacks on router 2023-11-23 16:43:02 -08:00
ishaan-jaff
c64aad7335 (feat) proxy: cost tracking per completion request 2023-11-23 16:08:59 -08:00
Krrish Dholakia
7610b1f0af feat(proxy_server.py): add in-memory caching for user api keys 2023-11-23 13:21:45 -08:00
ishaan-jaff
4ade4d4e8a (feat) proxy server: add spend column 2023-11-23 11:46:59 -08:00
Krrish Dholakia
0e3064ac8c fix(router.py): fix caching for tracking cooldowns + usage 2023-11-23 11:13:32 -08:00
ishaan-jaff
9570636474 (Feat) update config.yaml example on proxy 2023-11-23 10:54:30 -08:00
ishaan-jaff
b9f0316032 (feat) proxy: caching - show redis settings when initializing 2023-11-23 10:52:50 -08:00
Krish Dholakia
31bb24e9c1 Merge pull request #881 from Manouchehri/lambda-1 Add AWS Lambda Support 2023-11-23 10:38:34 -08:00
ishaan-jaff
9648a8594b (feat) proxy: add curl command test + read cache config 2023-11-23 10:31:04 -08:00
David Manouchehri
ed5b075080 Add mangum. 2023-11-23 00:04:47 -05:00
Krrish Dholakia
2df4791ae9 fix: fix linting errors 2023-11-22 19:59:25 -08:00