ishaan-jaff
|
525e01824a
|
(test) ensure exception raised when incorrect key passed
|
2023-11-23 16:08:59 -08:00 |
|
Krrish Dholakia
|
8ac03e492f
|
fix(router.py): enable fallbacks for sync completions
|
2023-11-23 16:06:46 -08:00 |
|
ishaan-jaff
|
bb00595429
|
(feat) cost: azure gpt + testing
|
2023-11-23 14:20:48 -08:00 |
|
ishaan-jaff
|
90a9116a17
|
(test) strict test ft:gpt-3.5 cost calc
|
2023-11-23 14:11:01 -08:00 |
|
ishaan-jaff
|
46fc8be4c8
|
(feat) cost tracking ft:gpt-3.5-turbo
|
2023-11-23 13:58:59 -08:00 |
|
ishaan-jaff
|
da7c07574f
|
(test) cost calc on azure
|
2023-11-23 13:50:09 -08:00 |
|
ishaan-jaff
|
5408a40cc8
|
(feat) cost calc: add azure cost tracking
|
2023-11-23 13:50:09 -08:00 |
|
Krrish Dholakia
|
c074023e14
|
fix: fix linting issues
|
2023-11-23 13:47:43 -08:00 |
|
Krrish Dholakia
|
4f183dc6a0
|
fix(utils.py): support reading api keys dynamically from the os environment
|
2023-11-23 13:41:56 -08:00 |
|
Krrish Dholakia
|
87aa36a2ec
|
fix(caching.py): fix linting issues
|
2023-11-23 13:21:45 -08:00 |
|
Krrish Dholakia
|
b10102c285
|
bump: version 1.5.0 → 1.6.0
|
2023-11-23 13:21:45 -08:00 |
|
Krrish Dholakia
|
8c4e8d6c62
|
feat(proxy_server.py): add in-memory caching for user api keys
|
2023-11-23 13:21:45 -08:00 |
|
ishaan-jaff
|
8291f239a4
|
(feat) proxy server: add spend column
|
2023-11-23 11:46:59 -08:00 |
|
ishaan-jaff
|
65d978622d
|
(docs) proxy
|
2023-11-23 11:46:59 -08:00 |
|
Ishaan Jaff
|
ed17e04bc2
|
Update README.md
|
2023-11-23 11:39:39 -08:00 |
|
Krrish Dholakia
|
276041e3bb
|
docs(routing.md): updating routing docs to include cooldown info
|
2023-11-23 11:27:22 -08:00 |
|
ishaan-jaff
|
2c50ea94c8
|
(test) embedding: azurea - dd response_ms
|
2023-11-23 11:15:52 -08:00 |
|
Krrish Dholakia
|
61fc76a8c4
|
fix(router.py): fix caching for tracking cooldowns + usage
|
2023-11-23 11:13:32 -08:00 |
|
ishaan-jaff
|
94c1d71b2c
|
(docs) proxy server
|
2023-11-23 11:13:26 -08:00 |
|
ishaan-jaff
|
28a192df3a
|
(docs) simple proxy
|
2023-11-23 11:10:50 -08:00 |
|
ishaan-jaff
|
9d3d10da4c
|
(Feat) update config.yaml example on proxy
|
2023-11-23 10:54:30 -08:00 |
|
ishaan-jaff
|
1dcab433b4
|
(feat) proxy: caching - show redis settings when initializing
|
2023-11-23 10:52:50 -08:00 |
|
ishaan-jaff
|
71042ee345
|
(docs) caching
|
2023-11-23 10:40:32 -08:00 |
|
Krish Dholakia
|
2fb94b4cc3
|
Merge pull request #882 from Manouchehri/docker-cleanup-1
Improve Dockerfile
|
2023-11-23 10:38:45 -08:00 |
|
Krish Dholakia
|
da12c13c5a
|
Merge pull request #881 from Manouchehri/lambda-1
Add AWS Lambda Support
|
2023-11-23 10:38:34 -08:00 |
|
ishaan-jaff
|
efe6fb8831
|
(docs) proxy server caching
|
2023-11-23 10:35:34 -08:00 |
|
ishaan-jaff
|
ae3eeb16ab
|
(feat) proxy: add curl command test + read cache config
|
2023-11-23 10:31:04 -08:00 |
|
Krish Dholakia
|
dbd5f2b3cc
|
Update README.md
|
2023-11-23 10:03:14 -08:00 |
|
ishaan-jaff
|
fbd2ab4c71
|
(test) caching ensure we always test add/get caching redis
|
2023-11-23 08:27:44 -08:00 |
|
ishaan-jaff
|
b15b723567
|
(docs) proxy server: add caching
|
2023-11-23 08:08:12 -08:00 |
|
ishaan-jaff
|
44e867499f
|
(docs) proxy
|
2023-11-23 07:55:12 -08:00 |
|
Krish Dholakia
|
6ba4eeb961
|
Merge pull request #885 from Codium-ai/bugfix/hf_timeout
Do not timeout when calling HF through acomplete
|
2023-11-23 07:48:59 -08:00 |
|
Krish Dholakia
|
c51bfc686b
|
Merge pull request #884 from maqsoodshaik/main
this commit fixes #883
|
2023-11-23 07:47:28 -08:00 |
|
Ori Kotek
|
e74ac03169
|
Do not timeout when calling HF through acomplete
|
2023-11-23 15:56:59 +02:00 |
|
maqsoodshaik
|
0f89c3375a
|
this commit fixes #883
|
2023-11-23 12:45:38 +01:00 |
|
David Manouchehri
|
c7e3c399c2
|
Add default expose.
|
2023-11-23 04:19:54 -05:00 |
|
David Manouchehri
|
f7ad15c392
|
Reduce Docker size by not copying unneeded files.
|
2023-11-23 04:18:53 -05:00 |
|
David Manouchehri
|
ecdb7a59e3
|
Use -function for naming.
|
2023-11-23 02:09:09 -05:00 |
|
David Manouchehri
|
ab09ad9a74
|
Make Function URL public.
|
2023-11-23 01:05:24 -05:00 |
|
David Manouchehri
|
60ca43b6d9
|
Add mangum.
|
2023-11-23 00:04:47 -05:00 |
|
ishaan-jaff
|
1af7575c86
|
(docs) rename reliability -> Fallbacks, num retries
|
2023-11-22 20:55:53 -08:00 |
|
ishaan-jaff
|
db146bc40a
|
(test) router with fallback deployments
|
2023-11-22 20:52:56 -08:00 |
|
Ishaan Jaff
|
629415c91a
|
Merge pull request #880 from Manouchehri/patch-1
(docs) Fix missing `-r` in pip command
|
2023-11-22 20:45:25 -08:00 |
|
David Manouchehri
|
94b1d09973
|
(docs) Fix missing -r in pip command
|
2023-11-22 23:41:16 -05:00 |
|
ishaan-jaff
|
8ebc1b974c
|
(chore) run ci/cd again
|
2023-11-22 20:34:14 -08:00 |
|
David Manouchehri
|
4d9aa7a3f7
|
Add symlink and only copy in source dir to stay under 50MB compressed limit for Lambdas.
|
2023-11-22 23:07:33 -05:00 |
|
Krrish Dholakia
|
2f93c0155a
|
fix: fix linting errors
|
2023-11-22 19:59:25 -08:00 |
|
Krrish Dholakia
|
5d5ca9f7ef
|
fix(router.py): add support for cooldowns with redis
|
2023-11-22 19:54:22 -08:00 |
|
David Manouchehri
|
56d28ba69e
|
Don't commit .sam-aws.
|
2023-11-22 22:53:55 -05:00 |
|
David Manouchehri
|
65ec095c35
|
(test) Bump runtime.
|
2023-11-22 22:52:39 -05:00 |
|