Commit graph

631 commits

Author SHA1 Message Date
ishaan-jaff
c170917719 (fix) completion: OpenAI/Azure filter out None params 2023-11-24 14:01:21 -08:00
Krrish Dholakia
5a9a3aa89c fix(main.py): fix streaming_chunk_builder to return usage 2023-11-24 11:27:04 -08:00
ishaan-jaff
b520ace9dd (feat) cost tracking for azure llms 2023-11-23 21:41:38 -08:00
Krrish Dholakia
1b26a0931e fix(utils.py): make failure logging sync 2023-11-23 20:19:27 -08:00
ishaan-jaff
4f536885e9 (fix) cost calculator for FT: gpt-3.5 2023-11-23 18:28:21 -08:00
Krrish Dholakia
c89c41b3dc fix(utils.py): vertex ai api error exception mapping 2023-11-23 17:50:50 -08:00
Krrish Dholakia
90385ef6a5 fix(utils.py): remove eos token for zephyr models 2023-11-23 17:47:39 -08:00
Krrish Dholakia
78d13ea6eb fix(vertex_ai.py): fix exception mapping for vertex ai 2023-11-23 17:35:33 -08:00
ishaan-jaff
cde8996798 (fix) debugging: viewing raw POST request 2023-11-23 16:28:19 -08:00
ishaan-jaff
193ef781ae (fix) debugging: POST request 2023-11-23 16:08:59 -08:00
ishaan-jaff
288e3e962a (feat) cost: azure gpt + testing 2023-11-23 14:20:48 -08:00
ishaan-jaff
78e3d886a4 (feat) cost tracking ft:gpt-3.5-turbo 2023-11-23 13:58:59 -08:00
ishaan-jaff
3b9359ea7c (test) cost calc on azure 2023-11-23 13:50:09 -08:00
Krrish Dholakia
e4f40f4535 fix(utils.py): support reading api keys dynamically from the os environment 2023-11-23 13:41:56 -08:00
Krrish Dholakia
a2207d462e feat(router.py): add server cooldown logic 2023-11-22 15:59:48 -08:00
ishaan-jaff
c1ab1120bd (feat) clean out junk params from litellm embedding 2023-11-22 13:50:45 -08:00
Krrish Dholakia
bd87e30058 test(test_caching.py): cleaning up tests 2023-11-22 13:43:48 -08:00
Krrish Dholakia
9bb2c7ee0f fix(utils.py): add param mapping for perplexity, anyscale, deepinfra
n

n
2023-11-22 10:04:27 -08:00
Krrish Dholakia
efc2bfe295 fix(utils.py): add response ms for async calls 2023-11-21 19:59:00 -08:00
Krrish Dholakia
c0e3b2ece9 fix(utils.py): fix pre call rules 2023-11-21 07:10:04 -08:00
Krrish Dholakia
8e1dcc540f fix(main.py): revert model alias map change 2023-11-20 21:07:52 -08:00
Krrish Dholakia
bdd9a933ad fix(utils.py): fix rules calling 2023-11-20 21:06:36 -08:00
Krrish Dholakia
d83c2b9ee8 fix(main.py): fix model alias map logic 2023-11-20 20:49:10 -08:00
Krrish Dholakia
35e5a757b0 fix(openai.py-+-azure.py): fix linting issues 2023-11-20 19:29:23 -08:00
Krrish Dholakia
c7e2cbd995 fix(utils.py): adding support for rules + mythomax/alpaca prompt template 2023-11-20 18:58:15 -08:00
ishaan-jaff
2f1180418b (fix) linting error 2023-11-20 18:32:43 -08:00
ishaan-jaff
372b4654c3 (fix) pydantic errors openai usage 2023-11-20 18:28:19 -08:00
ishaan-jaff
bb4ee4be0a (fix) completion - always map finish_reason 2023-11-20 17:24:16 -08:00
ishaan-jaff
756f356897 (fix) completion: max_retries using OpenAI client 2023-11-20 16:57:37 -08:00
Krrish Dholakia
7472be1529 fix(routing.py): update token usage on streaming 2023-11-20 14:19:25 -08:00
Krrish Dholakia
854b749535 fix(utils.py): expanding exception mapping coverage for vertex ai 2023-11-18 20:05:40 -08:00
ishaan-jaff
0fabd4caf8 (fix) streaming completion azure 2023-11-18 19:04:41 -08:00
ishaan-jaff
70dc8441f6 (fix) streaming ensure response obj is initialized 2023-11-18 17:31:58 -08:00
ishaan-jaff
e527a45ffc (feat) print_verbose Raw openai chunk 2023-11-18 17:12:49 -08:00
ishaan-jaff
8f402e04c9 (fix) streaming openai + function calling 2023-11-18 17:01:46 -08:00
ishaan-jaff
edf98cabae (fix) streaming + function / tool calling 2023-11-18 16:23:29 -08:00
Krrish Dholakia
cf0a9f591c fix(router.py): introducing usage-based-routing 2023-11-17 17:56:26 -08:00
ishaan-jaff
0ba90475c9 (feat) support parallel function calling 2023-11-17 15:51:27 -08:00
ishaan-jaff
698f47c226 (feat) improve logging - show model_call_details 2023-11-17 15:51:27 -08:00
Krrish Dholakia
02ed97d0b2 fix(acompletion): support client side timeouts + raise exceptions correctly for async calls 2023-11-17 15:39:47 -08:00
ishaan-jaff
ef8d82a54c (feat) completion: add response_format, seed, tools, tool_choice 2023-11-17 13:59:57 -08:00
ishaan-jaff
e9f6741b0b (v1.0+ breaking change) get_max_tokens -> return int 2023-11-17 10:38:50 -08:00
Krrish Dholakia
1e0560e4d2 fix(utils.py): improve exception mapping for vertex ai 2023-11-16 22:02:26 -08:00
Krrish Dholakia
d9123ea2e8 docs(routing.md): update tutorial on deploying router 2023-11-16 21:46:43 -08:00
Krrish Dholakia
48a508bab6 feat: global client for sync + async calls (openai + Azure only) 2023-11-16 14:44:13 -08:00
Krrish Dholakia
a6e9f147d3 fix(openai.py): handling extra headers 2023-11-16 12:48:21 -08:00
ishaan-jaff
56838ee815 (fix) bedrock meta llama optional params 2023-11-16 12:38:27 -08:00
ishaan-jaff
55a054f3f6 (fix) only decode chunk when it's not a str 2023-11-16 12:24:31 -08:00
Krrish Dholakia
e54056f0ed fix(azure.py): use openai client sdk for handling sync+async calling 2023-11-16 12:08:12 -08:00
ishaan-jaff
aa84ca04d8 (fix) HF api + streaming 2023-11-16 11:59:56 -08:00