Commit graph

3566 commits

Author SHA1 Message Date
David Manouchehri
c7e3c399c2
Add default expose. 2023-11-23 04:19:54 -05:00
David Manouchehri
f7ad15c392
Reduce Docker size by not copying unneeded files. 2023-11-23 04:18:53 -05:00
David Manouchehri
ecdb7a59e3
Use -function for naming. 2023-11-23 02:09:09 -05:00
David Manouchehri
ab09ad9a74
Make Function URL public. 2023-11-23 01:05:24 -05:00
David Manouchehri
60ca43b6d9
Add mangum. 2023-11-23 00:04:47 -05:00
David Manouchehri
4d9aa7a3f7
Add symlink and only copy in source dir to stay under 50MB compressed limit for Lambdas. 2023-11-22 23:07:33 -05:00
David Manouchehri
56d28ba69e
Don't commit .sam-aws. 2023-11-22 22:53:55 -05:00
David Manouchehri
65ec095c35
(test) Bump runtime. 2023-11-22 22:52:39 -05:00
David Manouchehri
bf55602e26
(test) lambda / sam build support 2023-11-22 22:39:34 -05:00
ishaan-jaff
cb41b14cc2 (test) proxy test exception mapping 2023-11-22 16:22:05 -08:00
ishaan-jaff
4260e0c1f0 (fix) linting error 2023-11-22 16:22:05 -08:00
Krrish Dholakia
a45be1d16a bump: version 1.4.0 → 1.5.0 2023-11-22 15:59:57 -08:00
Krrish Dholakia
3e76d4b422 feat(router.py): add server cooldown logic 2023-11-22 15:59:48 -08:00
ishaan-jaff
4ece219ec5 (docs) simple proxy 2023-11-22 15:01:26 -08:00
ishaan-jaff
d0f11e7a13 (docs) input params for litellm.embedding() 2023-11-22 14:40:52 -08:00
ishaan-jaff
5abd566b7c (feat) embedding() support for timeouts 2023-11-22 14:25:55 -08:00
ishaan-jaff
c38782521c (test)timeout error on openai embedding 2023-11-22 14:25:55 -08:00
ishaan-jaff
40e88eec4b (test)timeout errors 2023-11-22 14:25:55 -08:00
ishaan-jaff
3059f30672 (test) verify azure response have expected keys 2023-11-22 14:25:55 -08:00
ishaan-jaff
4247df02c7 (fix) Azure - only use ad_token when api_key is None 2023-11-22 14:25:55 -08:00
ishaan-jaff
b3bca98561 (feat) embedding() remove junk params 2023-11-22 14:25:55 -08:00
Krrish Dholakia
0b4e10e068 test(test_embedding.py): fix the embedding test 2023-11-22 14:09:45 -08:00
Krrish Dholakia
448ec0a571 feat(proxy_server): add /v1/embeddings endpoint
n
2023-11-22 14:03:27 -08:00
ishaan-jaff
40dd38508f (test) embedding stricter testing 2023-11-22 13:50:45 -08:00
ishaan-jaff
e8ff4d5eca (feat) clean out junk params from litellm embedding 2023-11-22 13:50:45 -08:00
Krrish Dholakia
b0801f61e6 test(test_caching.py): cleaning up tests 2023-11-22 13:43:48 -08:00
Krrish Dholakia
78582e158a fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints 2023-11-22 13:22:58 -08:00
ishaan-jaff
ba73224a3a (feat) proxy server add /routes to see available routes 2023-11-22 13:20:21 -08:00
Krrish Dholakia
604ad41eac fix(proxy_server.py): If master key is set, only master key can be used to generate new keys 2023-11-22 10:18:28 -08:00
Krrish Dholakia
10fe16c965 fix(utils.py): add param mapping for perplexity, anyscale, deepinfra
n

n
2023-11-22 10:04:27 -08:00
Krrish Dholakia
e7bb4a0cbd fix(proxy_server): fix linting issues 2023-11-22 08:47:59 -08:00
ishaan-jaff
7a4be44805 (docs) request q 2023-11-22 08:10:30 -08:00
Krrish Dholakia
a4406e1784 bump: version 1.3.4 → 1.4.0 2023-11-21 21:19:27 -08:00
Krrish Dholakia
76f46902ed feat(router.py): adding latency-based routing strategy 2023-11-21 21:19:27 -08:00
ishaan-jaff
2f3e13e43b (test) load test with api.litellm.ai 2023-11-21 21:07:27 -08:00
ishaan-jaff
b770ff2404 (test) load test proxy 2023-11-21 21:04:46 -08:00
ishaan-jaff
d1ad84c26d (test) load test q 2023-11-21 20:48:56 -08:00
ishaan-jaff
359f542c10 (test) load test q 2023-11-21 20:48:56 -08:00
ishaan-jaff
6aa8b41fb3 (fix) caching config 2023-11-21 20:48:56 -08:00
ishaan-jaff
3c30705b76 (fix) cache configs 2023-11-21 20:48:56 -08:00
Krrish Dholakia
d7f292c108 bump: version 1.3.3 → 1.3.4 2023-11-21 20:16:00 -08:00
Krrish Dholakia
904def6119 fix(proxy_server.py): fix /models endpoint 2023-11-21 20:15:43 -08:00
Krrish Dholakia
e5fa4eb314 fix(celery_app.py): add retries to worker 2023-11-21 20:07:16 -08:00
Krrish Dholakia
381fdcd37b fix(utils.py): add response ms for async calls 2023-11-21 19:59:00 -08:00
ishaan-jaff
898f15af2b (docs) update routing with api.litellm.ai 2023-11-21 19:54:29 -08:00
ishaan-jaff
580e6dc166 (test) test q 2023-11-21 19:45:46 -08:00
ishaan-jaff
363d74cf7d (fix) proxy server set model list through headers 2023-11-21 19:33:48 -08:00
Krrish Dholakia
826f56a6a0 docs(routing.md): update routing docs 2023-11-21 19:32:50 -08:00
ishaan-jaff
fd3895878d (test) test q 2023-11-21 18:15:00 -08:00
ishaan-jaff
f91bc03f4c (fix) add psutil req.txt 2023-11-21 18:15:00 -08:00