Commit graph

956 commits

Author SHA1 Message Date
Krrish Dholakia
01ed325019 test(test_logging): adding print statements for debugging circle ci 2023-10-07 16:20:17 -07:00
ishaan-jaff
38b8c8cc57 (test) add back tg ai streaming test 2023-10-07 16:15:54 -07:00
Krrish Dholakia
e1576bf97f style(openai.py): using typing for params 2023-10-07 15:51:16 -07:00
Krrish Dholakia
55bd413585 style(main.py): clean up print statement 2023-10-07 15:43:40 -07:00
Krrish Dholakia
cf7e2595b8 fix(utils): adds complete streaming response to success handler 2023-10-07 15:42:00 -07:00
ishaan-jaff
a294438802 (feat) sagemaker auth in completion 2023-10-07 15:27:58 -07:00
ishaan-jaff
cdabb22093 (test) bedrock auth in completion 2023-10-07 15:27:58 -07:00
ishaan-jaff
fc10894a86 (feat) bedrock auth in completion 2023-10-07 15:27:58 -07:00
Krrish Dholakia
c6d36fb59d docs(completion-docs): adds more details on provider-specific params 2023-10-07 13:49:30 -07:00
ishaan-jaff
8997999b92 (test) new azure test for passing base, version 2023-10-07 13:27:07 -07:00
ishaan-jaff
b907e7cb23 test(completion): test bumping a version 2023-10-07 11:11:54 -07:00
Krrish Dholakia
476b10bb96 fix(test_client.py): adds docs for test_client.py 2023-10-07 11:07:35 -07:00
Krrish Dholakia
60a3acf463 docs(docs.md): cleanup litellm api docs 2023-10-07 10:58:14 -07:00
Krrish Dholakia
5f946c201c fix(proxy_cli.py): check if model passed in 2023-10-07 07:52:02 -07:00
ishaan-jaff
12fa08b336 (tests) mark tests as asyncio 2023-10-06 21:04:34 -07:00
ishaan-jaff
a2a5c2b9bd test(rate limit aware acompletion calls): fix test name 2023-10-06 20:52:19 -07:00
ishaan-jaff
065e5fafe3 test(rate limit aware acompletion calls): testing 2023-10-06 20:48:53 -07:00
ishaan-jaff
b188816a89 feat(rate limit aware acompletion calls): 2023-10-06 20:48:53 -07:00
Krrish Dholakia
0d45650972 feat(proxy_cli.py): when user calls ollama model, run ollama serve 2023-10-06 16:46:52 -07:00
ishaan-jaff
e0eedbc183 chore(stash rate limit manager changes ): 2023-10-06 16:22:02 -07:00
Krrish Dholakia
37d7837b63 feat(ollama.py): exposing ollama config 2023-10-06 15:52:58 -07:00
ishaan-jaff
e089aa6a9a fix(test fix): 2023-10-06 15:48:09 -07:00
ishaan-jaff
f4eb94b7aa test commitizen bump 2023-10-06 15:41:38 -07:00
ishaan-jaff
0dbe507a0e bump: version 0.2.5 → 0.3.0 2023-10-06 15:40:01 -07:00
ishaan-jaff
7411f7643b test(batch_completion): add tests for batch_completion and rate limit handler 2023-10-06 15:37:09 -07:00
Krrish Dholakia
5ab3a4b8d7 fix(add-custom-success-callback-for-streaming): add custom success callback for streaming 2023-10-06 15:02:02 -07:00
Krrish Dholakia
666623fcf3 fix(proxy_server.py): make completion call handle "v1" in endpoint url 2023-10-06 09:17:02 -07:00
Krrish Dholakia
366d48929d style(main.py): adding spacing 2023-10-06 06:16:17 -07:00
Krrish Dholakia
37f9626422 fix(bedrock.py): adding provider-specific configs 2023-10-05 23:49:20 -07:00
Krrish Dholakia
6506862927 fix(test_completion.py): fix testing for claude 2023-10-05 23:07:26 -07:00
Krrish Dholakia
793de65405 fix(test_completion.py): fix azure test 2023-10-05 22:55:17 -07:00
Krrish Dholakia
6a572cdf1c fix: azure flag check 2023-10-05 22:44:41 -07:00
Krrish Dholakia
2326ad9d38 fix: fixing mypy linting errors and being backwards compatible for azure=true flag 2023-10-05 22:36:32 -07:00
Krrish Dholakia
69cdf5347a style(test_completion.py): fix merge conflict 2023-10-05 22:09:38 -07:00
ishaan-jaff
0da3150aa3 fix(n param in completion()): fix error thrown when passing n for cohere 2023-10-05 19:54:13 -07:00
ishaan-jaff
605369bc2a fix(llmonitor callback): correctly set user_id 2023-10-05 19:36:39 -07:00
ishaan-jaff
2e1b02e189 fix(completion()): add request_timeout as a param, fix claude error when request_timeout set 2023-10-05 19:05:28 -07:00
ishaan-jaff
7460506084 fix ollama tests comments 2023-10-05 11:20:34 -07:00
ishaan-jaff
f0d6d713e0 ollama default api_base to http://localhost:11434 2023-10-05 11:04:51 -07:00
Krrish Dholakia
1492916a37 adding custom prompt templates to ollama 2023-10-05 10:48:16 -07:00
ishaan-jaff
8f0edd5a78 remove results.json from tests 2023-10-04 16:10:12 -07:00
ishaan-jaff
c76eab4e74 add test batch completions 2023-10-04 16:03:58 -07:00
ishaan-jaff
1dc264d7e4 add ratelimitmanager 2023-10-04 16:03:58 -07:00
ishaan-jaff
b8f6de1289 fix linting 2023-10-04 16:03:58 -07:00
ishaan-jaff
4b30a394ce example output 2023-10-04 16:03:58 -07:00
ishaan-jaff
c8810b3f00 make rate limit hadler a class 2 2023-10-04 16:03:58 -07:00
ishaan-jaff
e82ba26651 make RateLimitHandler a class 2023-10-04 16:03:58 -07:00
Krish Dholakia
5827b30042 Merge pull request #530 from vedant-z/patch-1: Update utils.py 2023-10-04 15:42:59 -07:00
ishaan-jaff
b17e3b6160 add test batch com rate limits 2023-10-04 14:46:11 -07:00
ishaan-jaff
743c97c8da add batch_completion_rate_limits 2023-10-04 14:46:11 -07:00