Krrish Dholakia
|
5b0968b380
|
fix(__init__.py): fix linting error
|
2023-12-01 20:08:08 -08:00 |
|
Krrish Dholakia
|
fbdcde1a54
|
fix(proxy_server.py): fix linting errors
|
2023-12-01 19:45:09 -08:00 |
|
Krrish Dholakia
|
284fb64f4d
|
feat: support for azure key vault
|
2023-12-01 19:36:06 -08:00 |
|
Krrish Dholakia
|
60d6b6bc37
|
fix(router.py): fix exponential backoff to use retry-after if present in headers
|
2023-11-28 17:25:03 -08:00 |
|
ishaan-jaff
|
00454df83f
|
(fix) add timeout to __init__ litellm
|
2023-11-27 07:49:18 -08:00 |
|
Krrish Dholakia
|
8884ceb606
|
fix(proxy_server.py): expose a /health endpoint
|
2023-11-25 18:28:47 -08:00 |
|
Krrish Dholakia
|
68168cc743
|
fix(router.py): fix retry logic
|
2023-11-24 13:27:44 -08:00 |
|
Krrish Dholakia
|
27fd144950
|
docs(simple_proxy.md): add tutorial for doing fallbacks + retries + timeouts on the proxy
|
2023-11-24 12:20:38 -08:00 |
|
Krrish Dholakia
|
9bb2c7ee0f
|
fix(utils.py): add param mapping for perplexity, anyscale, deepinfra
n
n
|
2023-11-22 10:04:27 -08:00 |
|
Krrish Dholakia
|
7fb3a71b47
|
refactor(proxy_server.py): using celery workers instead of rq for concurrency
|
2023-11-21 16:31:56 -08:00 |
|
Krrish Dholakia
|
68c955409d
|
refactor(proxy_server.py): refactoring background rq worker
|
2023-11-21 13:47:09 -08:00 |
|
Krrish Dholakia
|
b8e62f3d0c
|
feat(proxy_server.py): EXPERIMENTAL: adding queuing endpoints to openai proxy server
|
2023-11-21 12:06:23 -08:00 |
|
Krrish Dholakia
|
c7e2cbd995
|
fix(utils.py): adding support for rules + mythomax/alpaca prompt template
|
2023-11-20 18:58:15 -08:00 |
|
Krrish Dholakia
|
952dd61e0e
|
fix(init.py): exposing apiconnectionerror
|
2023-11-20 08:12:29 -08:00 |
|
ishaan-jaff
|
e9f6741b0b
|
(v1.0+ breaking change) get_max_tokens -> return int
|
2023-11-17 10:38:50 -08:00 |
|
Krrish Dholakia
|
f14bd24b46
|
fix(openai.py): fix linting issues
|
2023-11-16 11:01:28 -08:00 |
|
Krrish Dholakia
|
4bd471644e
|
fix(openai.py): switch back to using requests instead of httpx
|
2023-11-15 18:25:21 -08:00 |
|
Krrish Dholakia
|
ef4e5b9636
|
test: set request timeout at request level
|
2023-11-15 17:42:31 -08:00 |
|
ishaan-jaff
|
c44fb66d7d
|
(chore) removing bloat
|
2023-11-15 17:11:56 -08:00 |
|
Krrish Dholakia
|
e35ce15a89
|
refactor(huggingface_restapi.py): moving async completion + streaming to real async calls
|
2023-11-15 15:14:21 -08:00 |
|
Krrish Dholakia
|
54bdca4ba6
|
docs(routing.md): add contributor s/o
|
2023-11-15 13:20:02 -08:00 |
|
Krrish Dholakia
|
0f6713993d
|
fix(router.py): enabling retrying with expo backoff (without tenacity) for router
|
2023-11-14 20:57:51 -08:00 |
|
Krrish Dholakia
|
9b582b2c85
|
fix(main.py): keep client consistent across calls + exponential backoff retry on ratelimit errors
|
2023-11-14 16:26:05 -08:00 |
|
Krrish Dholakia
|
c0a757a25f
|
refactor(azure.py): working azure completion calls with openai v1 sdk
|
2023-11-11 16:44:39 -08:00 |
|
Krrish Dholakia
|
a5ec85b1f2
|
refactor(openai.py): making it compatible for openai v1
BREAKING CHANGE:
|
2023-11-11 15:33:02 -08:00 |
|
ishaan-jaff
|
b2f7076b9b
|
(fix) add APIError to litellm module
|
2023-11-11 11:33:02 -08:00 |
|
Krrish Dholakia
|
17f5e46080
|
refactor(azure.py): moving azure openai calls to http calls
|
2023-11-08 16:52:18 -08:00 |
|
Krrish Dholakia
|
597dc8f34c
|
bump: version 0.13.1 → 0.13.2.dev1
|
2023-11-05 21:12:13 -08:00 |
|
Krrish Dholakia
|
078c369e35
|
bump: version 0.13.1.dev2 → 0.13.1.dev3
|
2023-11-04 22:31:53 -07:00 |
|
Krrish Dholakia
|
e633566253
|
feat(utils.py): adding additional states for custom logging
|
2023-11-04 17:07:20 -07:00 |
|
Krrish Dholakia
|
2310e0c6eb
|
test(test_completion.py): num retries 3
|
2023-11-04 15:55:56 -07:00 |
|
Krrish Dholakia
|
fa24a61976
|
refactor(proxy_server.py): print statement showing how to add debug for logs
|
2023-11-03 17:41:14 -07:00 |
|
Krrish Dholakia
|
127972a80b
|
build(litellm_server/utils.py): add support for general settings + num retries as a module variable
|
2023-11-02 20:56:41 -07:00 |
|
ishaan-jaff
|
2bc891240b
|
(feat) add embed-english-v3.0
|
2023-11-02 10:05:22 -07:00 |
|
ishaan-jaff
|
f66e9c6bce
|
(feat) detect amazon.titan-embed-text-v1 as bedrock embedding model
|
2023-11-01 14:46:33 -07:00 |
|
Krrish Dholakia
|
147d69f230
|
feat(main.py): add support for maritalk api
|
2023-10-30 17:36:51 -07:00 |
|
ishaan-jaff
|
d5ad0f975f
|
(fix) remove bloat completion_with_split_tests
|
2023-10-27 18:13:30 -07:00 |
|
Krrish Dholakia
|
715ea54544
|
fix(utils.py): adding support for anyscale models
|
2023-10-25 09:08:10 -07:00 |
|
Krrish Dholakia
|
743dda9724
|
fix(init.py): fix linting errors
|
2023-10-24 15:54:21 -07:00 |
|
Krrish Dholakia
|
558d2582b5
|
fix(utils.py): allow url to be passed to model cost map
|
2023-10-24 15:38:05 -07:00 |
|
ishaan-jaff
|
bdb2da75ee
|
(feat) add perplexity llms
|
2023-10-20 14:39:29 -07:00 |
|
ishaan-jaff
|
d4c81814f1
|
(feat) native perplexity support
|
2023-10-20 14:29:07 -07:00 |
|
Krrish Dholakia
|
2fa22dc91e
|
feat(utils.py): adding encode and decode functions
|
2023-10-20 11:59:47 -07:00 |
|
Iskren Chernev
|
2219b12350
|
Load deepinfra models from json
|
2023-10-20 11:34:17 +03:00 |
|
Krrish Dholakia
|
f10a4ce16b
|
feat(utils.py): add register model helper function
|
2023-10-19 18:26:36 -07:00 |
|
Krrish Dholakia
|
154c2a025c
|
test(init.py): improving testing for get_model_cost_map
b69449a811
|
2023-10-18 14:55:36 -07:00 |
|
Krrish Dholakia
|
2df47f1913
|
fix(init.py): adding local cached copy of model mapping for fallbacks
|
2023-10-18 13:59:12 -07:00 |
|
Krrish Dholakia
|
f23431fdc4
|
feat(router.py): add routing capabilities for multiple deployments
|
2023-10-17 22:28:31 -07:00 |
|
canada4663
|
f560530dd1
|
remove duplicate entry for bedrock in provider list
|
2023-10-14 21:44:16 -07:00 |
|
canada4663
|
25a2609e9d
|
Update __init__.py model_list to include bedrock models
|
2023-10-14 16:29:18 -07:00 |
|