Commit graph

3803 commits

Author SHA1 Message Date
ishaan-jaff
da7c07574f (test) cost calc on azure 2023-11-23 13:50:09 -08:00
ishaan-jaff
5408a40cc8 (feat) cost calc: add azure cost tracking 2023-11-23 13:50:09 -08:00
Krrish Dholakia
c074023e14 fix: fix linting issues 2023-11-23 13:47:43 -08:00
Krrish Dholakia
4f183dc6a0 fix(utils.py): support reading api keys dynamically from the os environment 2023-11-23 13:41:56 -08:00
Krrish Dholakia
87aa36a2ec fix(caching.py): fix linting issues 2023-11-23 13:21:45 -08:00
Krrish Dholakia
b10102c285 bump: version 1.5.0 → 1.6.0 2023-11-23 13:21:45 -08:00
Krrish Dholakia
8c4e8d6c62 feat(proxy_server.py): add in-memory caching for user api keys 2023-11-23 13:21:45 -08:00
ishaan-jaff
8291f239a4 (feat) proxy server: add spend column 2023-11-23 11:46:59 -08:00
ishaan-jaff
65d978622d (docs) proxy 2023-11-23 11:46:59 -08:00
Ishaan Jaff
ed17e04bc2
Update README.md 2023-11-23 11:39:39 -08:00
Krrish Dholakia
276041e3bb docs(routing.md): updating routing docs to include cooldown info 2023-11-23 11:27:22 -08:00
ishaan-jaff
2c50ea94c8 (test) embedding: azurea - dd response_ms 2023-11-23 11:15:52 -08:00
Krrish Dholakia
61fc76a8c4 fix(router.py): fix caching for tracking cooldowns + usage 2023-11-23 11:13:32 -08:00
ishaan-jaff
94c1d71b2c (docs) proxy server 2023-11-23 11:13:26 -08:00
ishaan-jaff
28a192df3a (docs) simple proxy 2023-11-23 11:10:50 -08:00
ishaan-jaff
9d3d10da4c (Feat) update config.yaml example on proxy 2023-11-23 10:54:30 -08:00
ishaan-jaff
1dcab433b4 (feat) proxy: caching - show redis settings when initializing 2023-11-23 10:52:50 -08:00
ishaan-jaff
71042ee345 (docs) caching 2023-11-23 10:40:32 -08:00
Krish Dholakia
2fb94b4cc3
Merge pull request #882 from Manouchehri/docker-cleanup-1
Improve Dockerfile
2023-11-23 10:38:45 -08:00
Krish Dholakia
da12c13c5a
Merge pull request #881 from Manouchehri/lambda-1
Add AWS Lambda Support
2023-11-23 10:38:34 -08:00
ishaan-jaff
efe6fb8831 (docs) proxy server caching 2023-11-23 10:35:34 -08:00
ishaan-jaff
ae3eeb16ab (feat) proxy: add curl command test + read cache config 2023-11-23 10:31:04 -08:00
Krish Dholakia
dbd5f2b3cc
Update README.md 2023-11-23 10:03:14 -08:00
ishaan-jaff
fbd2ab4c71 (test) caching ensure we always test add/get caching redis 2023-11-23 08:27:44 -08:00
ishaan-jaff
b15b723567 (docs) proxy server: add caching 2023-11-23 08:08:12 -08:00
ishaan-jaff
44e867499f (docs) proxy 2023-11-23 07:55:12 -08:00
Krish Dholakia
6ba4eeb961
Merge pull request #885 from Codium-ai/bugfix/hf_timeout
Do not timeout when calling HF through acomplete
2023-11-23 07:48:59 -08:00
Krish Dholakia
c51bfc686b
Merge pull request #884 from maqsoodshaik/main
this commit fixes #883
2023-11-23 07:47:28 -08:00
Ori Kotek
e74ac03169
Do not timeout when calling HF through acomplete 2023-11-23 15:56:59 +02:00
maqsoodshaik
0f89c3375a this commit fixes #883 2023-11-23 12:45:38 +01:00
David Manouchehri
c7e3c399c2
Add default expose. 2023-11-23 04:19:54 -05:00
David Manouchehri
f7ad15c392
Reduce Docker size by not copying unneeded files. 2023-11-23 04:18:53 -05:00
David Manouchehri
ecdb7a59e3
Use -function for naming. 2023-11-23 02:09:09 -05:00
David Manouchehri
ab09ad9a74
Make Function URL public. 2023-11-23 01:05:24 -05:00
David Manouchehri
60ca43b6d9
Add mangum. 2023-11-23 00:04:47 -05:00
ishaan-jaff
1af7575c86 (docs) rename reliability -> Fallbacks, num retries 2023-11-22 20:55:53 -08:00
ishaan-jaff
db146bc40a (test) router with fallback deployments 2023-11-22 20:52:56 -08:00
Ishaan Jaff
629415c91a
Merge pull request #880 from Manouchehri/patch-1
(docs) Fix missing `-r` in pip command
2023-11-22 20:45:25 -08:00
David Manouchehri
94b1d09973
(docs) Fix missing -r in pip command 2023-11-22 23:41:16 -05:00
ishaan-jaff
8ebc1b974c (chore) run ci/cd again 2023-11-22 20:34:14 -08:00
David Manouchehri
4d9aa7a3f7
Add symlink and only copy in source dir to stay under 50MB compressed limit for Lambdas. 2023-11-22 23:07:33 -05:00
Krrish Dholakia
2f93c0155a fix: fix linting errors 2023-11-22 19:59:25 -08:00
Krrish Dholakia
5d5ca9f7ef fix(router.py): add support for cooldowns with redis 2023-11-22 19:54:22 -08:00
David Manouchehri
56d28ba69e
Don't commit .sam-aws. 2023-11-22 22:53:55 -05:00
David Manouchehri
65ec095c35
(test) Bump runtime. 2023-11-22 22:52:39 -05:00
David Manouchehri
bf55602e26
(test) lambda / sam build support 2023-11-22 22:39:34 -05:00
ishaan-jaff
cb41b14cc2 (test) proxy test exception mapping 2023-11-22 16:22:05 -08:00
ishaan-jaff
4260e0c1f0 (fix) linting error 2023-11-22 16:22:05 -08:00
Krrish Dholakia
a45be1d16a bump: version 1.4.0 → 1.5.0 2023-11-22 15:59:57 -08:00
Krrish Dholakia
3e76d4b422 feat(router.py): add server cooldown logic 2023-11-22 15:59:48 -08:00