David Manouchehri
|
c7e3c399c2
|
Add default expose.
|
2023-11-23 04:19:54 -05:00 |
|
David Manouchehri
|
f7ad15c392
|
Reduce Docker size by not copying unneeded files.
|
2023-11-23 04:18:53 -05:00 |
|
David Manouchehri
|
ecdb7a59e3
|
Use -function for naming.
|
2023-11-23 02:09:09 -05:00 |
|
David Manouchehri
|
ab09ad9a74
|
Make Function URL public.
|
2023-11-23 01:05:24 -05:00 |
|
David Manouchehri
|
60ca43b6d9
|
Add mangum.
|
2023-11-23 00:04:47 -05:00 |
|
David Manouchehri
|
4d9aa7a3f7
|
Add symlink and only copy in source dir to stay under 50MB compressed limit for Lambdas.
|
2023-11-22 23:07:33 -05:00 |
|
David Manouchehri
|
56d28ba69e
|
Don't commit .sam-aws.
|
2023-11-22 22:53:55 -05:00 |
|
David Manouchehri
|
65ec095c35
|
(test) Bump runtime.
|
2023-11-22 22:52:39 -05:00 |
|
David Manouchehri
|
bf55602e26
|
(test) lambda / sam build support
|
2023-11-22 22:39:34 -05:00 |
|
ishaan-jaff
|
cb41b14cc2
|
(test) proxy test exception mapping
|
2023-11-22 16:22:05 -08:00 |
|
ishaan-jaff
|
4260e0c1f0
|
(fix) linting error
|
2023-11-22 16:22:05 -08:00 |
|
Krrish Dholakia
|
a45be1d16a
|
bump: version 1.4.0 → 1.5.0
|
2023-11-22 15:59:57 -08:00 |
|
Krrish Dholakia
|
3e76d4b422
|
feat(router.py): add server cooldown logic
|
2023-11-22 15:59:48 -08:00 |
|
ishaan-jaff
|
4ece219ec5
|
(docs) simple proxy
|
2023-11-22 15:01:26 -08:00 |
|
ishaan-jaff
|
d0f11e7a13
|
(docs) input params for litellm.embedding()
|
2023-11-22 14:40:52 -08:00 |
|
ishaan-jaff
|
5abd566b7c
|
(feat) embedding() support for timeouts
|
2023-11-22 14:25:55 -08:00 |
|
ishaan-jaff
|
c38782521c
|
(test)timeout error on openai embedding
|
2023-11-22 14:25:55 -08:00 |
|
ishaan-jaff
|
40e88eec4b
|
(test)timeout errors
|
2023-11-22 14:25:55 -08:00 |
|
ishaan-jaff
|
3059f30672
|
(test) verify azure response have expected keys
|
2023-11-22 14:25:55 -08:00 |
|
ishaan-jaff
|
4247df02c7
|
(fix) Azure - only use ad_token when api_key is None
|
2023-11-22 14:25:55 -08:00 |
|
ishaan-jaff
|
b3bca98561
|
(feat) embedding() remove junk params
|
2023-11-22 14:25:55 -08:00 |
|
Krrish Dholakia
|
0b4e10e068
|
test(test_embedding.py): fix the embedding test
|
2023-11-22 14:09:45 -08:00 |
|
Krrish Dholakia
|
448ec0a571
|
feat(proxy_server): add /v1/embeddings endpoint
n
|
2023-11-22 14:03:27 -08:00 |
|
ishaan-jaff
|
40dd38508f
|
(test) embedding stricter testing
|
2023-11-22 13:50:45 -08:00 |
|
ishaan-jaff
|
e8ff4d5eca
|
(feat) clean out junk params from litellm embedding
|
2023-11-22 13:50:45 -08:00 |
|
Krrish Dholakia
|
b0801f61e6
|
test(test_caching.py): cleaning up tests
|
2023-11-22 13:43:48 -08:00 |
|
Krrish Dholakia
|
78582e158a
|
fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints
|
2023-11-22 13:22:58 -08:00 |
|
ishaan-jaff
|
ba73224a3a
|
(feat) proxy server add /routes to see available routes
|
2023-11-22 13:20:21 -08:00 |
|
Krrish Dholakia
|
604ad41eac
|
fix(proxy_server.py): If master key is set, only master key can be used to generate new keys
|
2023-11-22 10:18:28 -08:00 |
|
Krrish Dholakia
|
10fe16c965
|
fix(utils.py): add param mapping for perplexity, anyscale, deepinfra
n
n
|
2023-11-22 10:04:27 -08:00 |
|
Krrish Dholakia
|
e7bb4a0cbd
|
fix(proxy_server): fix linting issues
|
2023-11-22 08:47:59 -08:00 |
|
ishaan-jaff
|
7a4be44805
|
(docs) request q
|
2023-11-22 08:10:30 -08:00 |
|
Krrish Dholakia
|
a4406e1784
|
bump: version 1.3.4 → 1.4.0
|
2023-11-21 21:19:27 -08:00 |
|
Krrish Dholakia
|
76f46902ed
|
feat(router.py): adding latency-based routing strategy
|
2023-11-21 21:19:27 -08:00 |
|
ishaan-jaff
|
2f3e13e43b
|
(test) load test with api.litellm.ai
|
2023-11-21 21:07:27 -08:00 |
|
ishaan-jaff
|
b770ff2404
|
(test) load test proxy
|
2023-11-21 21:04:46 -08:00 |
|
ishaan-jaff
|
d1ad84c26d
|
(test) load test q
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
359f542c10
|
(test) load test q
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
6aa8b41fb3
|
(fix) caching config
|
2023-11-21 20:48:56 -08:00 |
|
ishaan-jaff
|
3c30705b76
|
(fix) cache configs
|
2023-11-21 20:48:56 -08:00 |
|
Krrish Dholakia
|
d7f292c108
|
bump: version 1.3.3 → 1.3.4
|
2023-11-21 20:16:00 -08:00 |
|
Krrish Dholakia
|
904def6119
|
fix(proxy_server.py): fix /models endpoint
|
2023-11-21 20:15:43 -08:00 |
|
Krrish Dholakia
|
e5fa4eb314
|
fix(celery_app.py): add retries to worker
|
2023-11-21 20:07:16 -08:00 |
|
Krrish Dholakia
|
381fdcd37b
|
fix(utils.py): add response ms for async calls
|
2023-11-21 19:59:00 -08:00 |
|
ishaan-jaff
|
898f15af2b
|
(docs) update routing with api.litellm.ai
|
2023-11-21 19:54:29 -08:00 |
|
ishaan-jaff
|
580e6dc166
|
(test) test q
|
2023-11-21 19:45:46 -08:00 |
|
ishaan-jaff
|
363d74cf7d
|
(fix) proxy server set model list through headers
|
2023-11-21 19:33:48 -08:00 |
|
Krrish Dholakia
|
826f56a6a0
|
docs(routing.md): update routing docs
|
2023-11-21 19:32:50 -08:00 |
|
ishaan-jaff
|
fd3895878d
|
(test) test q
|
2023-11-21 18:15:00 -08:00 |
|
ishaan-jaff
|
f91bc03f4c
|
(fix) add psutil req.txt
|
2023-11-21 18:15:00 -08:00 |
|