Commit graph

1853 commits

Author SHA1 Message Date
ishaan-jaff
3fbd2a853f (feat) --health for checking config models 2023-11-27 12:13:21 -08:00
Krrish Dholakia
fb680ce4a2 fix(acompletion): fix acompletion raise exception issue when custom llm provider is none 2023-11-27 11:34:48 -08:00
Krrish Dholakia
1b723d4694 fix(proxy_server.py): fix user model returned in /models 2023-11-27 08:04:49 -08:00
Krrish Dholakia
8884ceb606 fix(proxy_server.py): expose a /health endpoint 2023-11-25 18:28:47 -08:00
ishaan-jaff
b0552cad35 (feat) proxy: make chat/completions async 2023-11-25 12:54:03 -08:00
Krrish Dholakia
e421642ba8 fix: fix linting issues 2023-11-24 15:46:25 -08:00
Krrish Dholakia
bcf0595bd4 feat(proxy_server.py): new /key/info endpoint to access key information (master key only) 2023-11-24 15:24:50 -08:00
Krrish Dholakia
32cdd0a613 feat(proxy_server.py): tracking spend per api key 2023-11-24 15:14:06 -08:00
Krrish Dholakia
6f40fd8ee2 fix(proxy_server.py): fix linting issues 2023-11-24 11:39:01 -08:00
ishaan-jaff
b1db3a38d7 (fix) prisma using: secrets.compare_digest 2023-11-24 10:02:08 -08:00
David Manouchehri
a1a0af824a Fix timing attack on master_key. 2023-11-24 12:12:29 -05:00
David Manouchehri
5165461c51 Fix master key check. 2023-11-24 12:03:30 -05:00
David Manouchehri
75dcb8ba2a Fix OpenAPI auth spec. 2023-11-24 11:59:33 -05:00
ishaan-jaff
865e3959f3 (feat) proxy: cost tracking add track_cost_callback 2023-11-24 05:53:23 -08:00
Kumaran Rajendhiran
9fb31448a9 Fail gracefully if ollama is already being served 2023-11-24 16:52:55 +05:30
Krrish Dholakia
daa45b4031 fix(proxy_server.py): fix linting errors 2023-11-23 21:42:39 -08:00
Krrish Dholakia
8030a9b8d1 feat(proxy_server.py): /key/delete endpoint 2023-11-23 21:37:53 -08:00
ishaan-jaff
7ccfbde6fd (fix) proxy: /embeddings 2023-11-23 21:16:51 -08:00
Krrish Dholakia
7610b1f0af feat(proxy_server.py): add in-memory caching for user api keys 2023-11-23 13:21:45 -08:00
ishaan-jaff
4ade4d4e8a (feat) proxy server: add spend column 2023-11-23 11:46:59 -08:00
ishaan-jaff
b9f0316032 (feat) proxy: caching - show redis settings when initializing 2023-11-23 10:52:50 -08:00
ishaan-jaff
9648a8594b (feat) proxy: add curl command test + read cache config 2023-11-23 10:31:04 -08:00
Krrish Dholakia
310b24a436 feat(proxy_server): add /v1/embeddings endpoint
n
2023-11-22 14:03:27 -08:00
Krrish Dholakia
bd87e30058 test(test_caching.py): cleaning up tests 2023-11-22 13:43:48 -08:00
Krrish Dholakia
e495a8a9c2 fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints 2023-11-22 13:22:58 -08:00
ishaan-jaff
cfd30bb152 (feat) proxy server add /routes to see available routes 2023-11-22 13:20:21 -08:00
Krrish Dholakia
57e894ad5e fix(proxy_server.py): If master key is set, only master key can be used to generate new keys 2023-11-22 10:18:28 -08:00
Krrish Dholakia
2a681e578c fix(proxy_server): fix linting issues 2023-11-22 08:47:59 -08:00
ishaan-jaff
97a8177dfc (fix) caching config 2023-11-21 20:48:56 -08:00
ishaan-jaff
4db51c6eae (fix) cache configs 2023-11-21 20:48:56 -08:00
Krrish Dholakia
e96bb8868d fix(proxy_server.py): fix /models endpoint 2023-11-21 20:15:43 -08:00
ishaan-jaff
fd3462fb4f (fix) proxy server set model list through headers 2023-11-21 19:33:48 -08:00
Krrish Dholakia
550ddb4a6b docs(routing.md): update routing docs 2023-11-21 19:32:50 -08:00
Krrish Dholakia
9205f70b0f docs(routing.md): add queueing to docs 2023-11-21 18:01:02 -08:00
ishaan-jaff
70af870005 (fix) explicitly run prisma generate 2023-11-21 17:42:42 -08:00
ishaan-jaff
3f442d85a4 (fix) prisma 2023-11-21 17:38:34 -08:00
ishaan-jaff
e4b7040500 (fix) prisma always installed on deploys 2023-11-21 17:27:07 -08:00
Krrish Dholakia
7fb3a71b47 refactor(proxy_server.py): using celery workers instead of rq for concurrency 2023-11-21 16:31:56 -08:00
ishaan-jaff
cfafdb3463 (feat)proxy: readon config per request 2023-11-21 16:26:05 -08:00
Krrish Dholakia
68c955409d refactor(proxy_server.py): refactoring background rq worker 2023-11-21 13:47:09 -08:00
Krrish Dholakia
7fb25ef2dd fix(proxy_server.py): defaulting status to queued 2023-11-21 12:40:30 -08:00
Krrish Dholakia
8a77885c1c refactor(proxy_server.py): experimental endpoint - defined job status 2023-11-21 12:37:00 -08:00
Krrish Dholakia
0b0e931e21 refactor(proxy_server.py): refactoring experimental endpoint names 2023-11-21 12:12:15 -08:00
Krrish Dholakia
b8e62f3d0c feat(proxy_server.py): EXPERIMENTAL: adding queuing endpoints to openai proxy server 2023-11-21 12:06:23 -08:00
ishaan-jaff
259f4b977a (fix) proxy server - raise 500 on exceptions 2023-11-21 08:36:34 -08:00
Krrish Dholakia
3e3bb0180d fix(proxy_server.py): run ollama serve when ollama in config.yaml 2023-11-21 08:35:04 -08:00
Krrish Dholakia
2ac804a42f feat(proxy_server.py): enable model aliases 2023-11-20 16:51:04 -08:00
Krrish Dholakia
7472be1529 fix(routing.py): update token usage on streaming 2023-11-20 14:19:25 -08:00
Krrish Dholakia
1812b7ae37 fix(proxy_server.py): fix linting errors 2023-11-18 17:35:16 -08:00
Krrish Dholakia
59b547a65e docs(simple_proxy.md): adding token based auth to docs 2023-11-18 17:34:11 -08:00