Commit graph

219 commits

Author SHA1 Message Date
ishaan-jaff
0bf29a14e8 init vertex_vision_models 2023-12-16 18:37:00 +05:30
ishaan-jaff
9bdd6e73bb (feat) proxy logs: dynamodb - set table name 2023-12-15 21:38:44 +05:30
ishaan-jaff
353433e5ce (feat) add openai.NotFoundError 2023-12-15 10:18:02 +05:30
Krrish Dholakia
1608dd7e0b fix(main.py): support async streaming for text completions endpoint 2023-12-14 13:56:32 -08:00
ishaan-jaff
9ee16bc962 (feat) caching - add supported call types 2023-12-14 22:27:14 +05:30
ishaan-jaff
c0cc78b943 (feat) mistral - add exception mapping 2023-12-14 18:57:39 +05:30
ishaan-jaff
7945664e61 (feat) add mistral api 2023-12-14 18:17:48 +05:30
Krrish Dholakia
8d688b6217 fix(utils.py): support caching for embedding + log cache hits
n

n
2023-12-13 18:37:30 -08:00
Krrish Dholakia
0f29cda8d9 test(test_amazing_vertex_completion.py): fix testing 2023-12-13 16:41:26 -08:00
Krrish Dholakia
ef7a6e3ae1 feat(vertex_ai.py): adds support for gemini-pro on vertex ai 2023-12-13 10:26:30 -08:00
Krrish Dholakia
4bf875d3ed fix(router.py): fix least-busy routing 2023-12-08 20:29:49 -08:00
ishaan-jaff
ee70c4e822 (feat) router - add model_group_alias_map 2023-12-06 20:13:33 -08:00
ishaan-jaff
b3f039627e (feat) litellm - add _async_failure_callback 2023-12-06 14:43:47 -08:00
Krrish Dholakia
d962d5d4c0 fix(bedrock.py): adding support for cohere embeddings 2023-12-06 13:25:18 -08:00
Frank Colson
95e5331090 Use litellm logging convention 2023-12-05 22:28:23 -07:00
Krrish Dholakia
e0ccb281d8 feat(utils.py): add async success callbacks for custom functions 2023-12-04 16:42:40 -08:00
Krrish Dholakia
eae5b3ce50 fix(__init__.py): fix linting error 2023-12-01 20:08:08 -08:00
Krrish Dholakia
328113a28e fix(proxy_server.py): fix linting errors 2023-12-01 19:45:09 -08:00
Krrish Dholakia
368fee224e feat: support for azure key vault 2023-12-01 19:36:06 -08:00
Krrish Dholakia
bb1267eb07 fix(router.py): fix exponential backoff to use retry-after if present in headers 2023-11-28 17:25:03 -08:00
ishaan-jaff
e9947be7b2 (fix) add timeout to __init__ litellm 2023-11-27 07:49:18 -08:00
Krrish Dholakia
e4f302a8e2 fix(proxy_server.py): expose a /health endpoint 2023-11-25 18:28:47 -08:00
Krrish Dholakia
2686894823 fix(router.py): fix retry logic 2023-11-24 13:27:44 -08:00
Krrish Dholakia
12dbdc4c15 docs(simple_proxy.md): add tutorial for doing fallbacks + retries + timeouts on the proxy 2023-11-24 12:20:38 -08:00
Krrish Dholakia
10fe16c965 fix(utils.py): add param mapping for perplexity, anyscale, deepinfra
n

n
2023-11-22 10:04:27 -08:00
Krrish Dholakia
b16646e584 refactor(proxy_server.py): using celery workers instead of rq for concurrency 2023-11-21 16:31:56 -08:00
Krrish Dholakia
a1f6b9b531 refactor(proxy_server.py): refactoring background rq worker 2023-11-21 13:47:09 -08:00
Krrish Dholakia
c6a4744947 feat(proxy_server.py): EXPERIMENTAL: adding queuing endpoints to openai proxy server 2023-11-21 12:06:23 -08:00
Krrish Dholakia
855964ed45 fix(utils.py): adding support for rules + mythomax/alpaca prompt template 2023-11-20 18:58:15 -08:00
Krrish Dholakia
e75c07f726 fix(init.py): exposing apiconnectionerror 2023-11-20 08:12:29 -08:00
ishaan-jaff
bd82559553 (v1.0+ breaking change) get_max_tokens -> return int 2023-11-17 10:38:50 -08:00
Krrish Dholakia
a23c0a2599 fix(openai.py): fix linting issues 2023-11-16 11:01:28 -08:00
Krrish Dholakia
fc8b54eb73 fix(openai.py): switch back to using requests instead of httpx 2023-11-15 18:25:21 -08:00
Krrish Dholakia
9fd7c5b343 test: set request timeout at request level 2023-11-15 17:42:31 -08:00
ishaan-jaff
2a81cc567d (chore) removing bloat 2023-11-15 17:11:56 -08:00
Krrish Dholakia
1a705bfbcb refactor(huggingface_restapi.py): moving async completion + streaming to real async calls 2023-11-15 15:14:21 -08:00
Krrish Dholakia
33d823a18d docs(routing.md): add contributor s/o 2023-11-15 13:20:02 -08:00
Krrish Dholakia
59eaeba92a fix(router.py): enabling retrying with expo backoff (without tenacity) for router 2023-11-14 20:57:51 -08:00
Krrish Dholakia
a7222f257c fix(main.py): keep client consistent across calls + exponential backoff retry on ratelimit errors 2023-11-14 16:26:05 -08:00
Krrish Dholakia
39c2597c33 refactor(azure.py): working azure completion calls with openai v1 sdk 2023-11-11 16:44:39 -08:00
Krrish Dholakia
d3323ba637 refactor(openai.py): making it compatible for openai v1
BREAKING CHANGE:
2023-11-11 15:33:02 -08:00
ishaan-jaff
59c76db34e (fix) add APIError to litellm module 2023-11-11 11:33:02 -08:00
Krrish Dholakia
53abc31c27 refactor(azure.py): moving azure openai calls to http calls 2023-11-08 16:52:18 -08:00
Krrish Dholakia
21ae940992 bump: version 0.13.1 → 0.13.2.dev1 2023-11-05 21:12:13 -08:00
Krrish Dholakia
3a4370ae20 bump: version 0.13.1.dev2 → 0.13.1.dev3 2023-11-04 22:31:53 -07:00
Krrish Dholakia
c3916a7754 feat(utils.py): adding additional states for custom logging 2023-11-04 17:07:20 -07:00
Krrish Dholakia
5bd6d6ff4c test(test_completion.py): num retries 3 2023-11-04 15:55:56 -07:00
Krrish Dholakia
4e1885734a refactor(proxy_server.py): print statement showing how to add debug for logs 2023-11-03 17:41:14 -07:00
Krrish Dholakia
e3a1c58dd9 build(litellm_server/utils.py): add support for general settings + num retries as a module variable 2023-11-02 20:56:41 -07:00
ishaan-jaff
744e69f01f (feat) add embed-english-v3.0 2023-11-02 10:05:22 -07:00