Commit graph

462 commits

Author SHA1 Message Date
Krrish Dholakia
86ef2a02f7 fix(azure.py): adding support for aiohttp calls on azure + openai 2023-11-09 10:40:33 -08:00
Krrish Dholakia
7c5fd5824b refactor(azure.py): fix linting errors 2023-11-08 19:24:53 -08:00
Krrish Dholakia
6f4707bbb3 refactor(azure.py): moving embeddings to http call 2023-11-08 19:07:21 -08:00
Krrish Dholakia
70311502c8 refactor(openai.py): moving embedding calls to http 2023-11-08 19:01:17 -08:00
Krrish Dholakia
c2cbdb23fd refactor(openai.py): moving openai text completion calls to http 2023-11-08 18:40:03 -08:00
Krrish Dholakia
c57ed0a9d7 refactor(openai.py): moving openai chat completion calls to http 2023-11-08 17:40:41 -08:00
Krrish Dholakia
53abc31c27 refactor(azure.py): moving azure openai calls to http calls 2023-11-08 16:52:18 -08:00
ishaan-jaff
a404b0fc3b (fix) remove errant print from hf 2023-11-08 11:49:15 -08:00
Krrish Dholakia
ce27e08e7d (fix): llama-2 non-chat models prompt template 2023-11-07 21:33:54 -08:00
ishaan-jaff
3c67de7f04 (fix) hf don't fail when logprob is None 2023-11-06 14:22:09 -08:00
Krrish Dholakia
65c01eae23 fix(huggingface_restapi.py): output parsing chat template models 2023-11-06 11:43:12 -08:00
Krrish Dholakia
7c46e85ed6 refactor(bedrock.py): better exception mapping for bedrock + huggingface 2023-11-04 16:12:12 -07:00
Krrish Dholakia
ab54262d37 fix(timeout.py): import errors 2023-11-04 16:05:14 -07:00
Krrish Dholakia
5b3978eff4 fix(main.py): fixing print_verbose 2023-11-04 14:41:34 -07:00
Krrish Dholakia
6b40546e59 refactor(all-files): removing all print statements; adding pre-commit + flake8 to prevent future regressions 2023-11-04 12:50:15 -07:00
Krrish Dholakia
1c4dd0671b fix(bedrock.py): add exception mapping coverage for authentication scenarios 2023-11-03 18:25:34 -07:00
Krrish Dholakia
142750adff fix(bedrock.py): fix bedrock exception mapping 2023-11-03 18:14:12 -07:00
ishaan-jaff
49650af444 (fix) bedrock: remove check for - if "error" in outputText 2023-11-03 18:06:23 -07:00
ishaan-jaff
df57e9247a (fix) hf calculating usage non blocking 2023-11-03 18:03:19 -07:00
Krrish Dholakia
4e1885734a refactor(proxy_server.py): print statement showing how to add debug for logs 2023-11-03 17:41:14 -07:00
ishaan-jaff
6fc0c74878 (fix) remove errant print statements 2023-11-03 13:02:52 -07:00
ishaan-jaff
89e32db321 (fix) remove errant tg ai print statements 2023-11-03 12:59:23 -07:00
ishaan-jaff
6c82abf5bf (fix) vertex ai streaming 2023-11-03 12:54:36 -07:00
Krrish Dholakia
e3a1c58dd9 build(litellm_server/utils.py): add support for general settings + num retries as a module variable 2023-11-02 20:56:41 -07:00
Krrish Dholakia
512a1637eb feat(completion()): enable setting prompt templates via completion() 2023-11-02 16:24:01 -07:00
ishaan-jaff
03860984eb (feat) add setting input_type for cohere 2023-11-02 10:16:35 -07:00
ishaan-jaff
724e169f32 (fix) improve cohere error handling 2023-11-02 10:07:11 -07:00
Krrish Dholakia
b305492a0b fix(huggingface_restapi.py): fix linting issue 2023-11-01 16:43:35 -07:00
Krrish Dholakia
2c4cb76ce5 fix(huggingface_restapi.py): fix embeddings for sentence-transformer models 2023-11-01 16:36:46 -07:00
ishaan-jaff
2ad81bdd7b (feat) embedding() add bedrock/amazon.titan-embed-text-v1 2023-11-01 13:55:28 -07:00
ishaan-jaff
fa7e063198 (feat) add bedrock.cohere streaming 2023-10-31 22:26:43 -07:00
ishaan-jaff
9223f7cc7a (feat) textcompletion - transform hf log probs to openai text completion 2023-10-31 17:15:35 -07:00
Krish Dholakia
523c08a646 Merge pull request #717 from canada4663/main: support for custom bedrock runtime endpoint 2023-10-31 16:47:33 -07:00
Krrish Dholakia
0ed3917b09 feat(main.py): add support for maritalk api 2023-10-30 17:36:51 -07:00
canada4663
bfa297e82f add ability to use custom AWS Bedrock Runtime endpoint via environment or .env variable AWS_BEDROCK_RUNTIME_ENDPOINT or as a parameter to init_bedrock_client 2023-10-28 22:25:45 -07:00
canada4663
1add0d604e add in ability to read AWS boto3 default AWS Region name if litellm .env config AWS_REGION_NAME not available 2023-10-28 21:43:48 -07:00
ishaan-jaff
6515eef382 (fix) bedrock region name defaults to None 2023-10-27 10:17:45 -07:00
ishaan-jaff
63928fa166 (feat) use usage class for model responses for cohere, hf, tg ai, cohere 2023-10-27 09:58:47 -07:00
ishaan-jaff
194f85e92f (feat) add model_response.usage.completion_tokens for bedrock, palm, petals, sagemaker 2023-10-27 09:51:50 -07:00
ishaan-jaff
3691dc9b08 (feat) use usage class for vertex ai 2023-10-27 09:38:37 -07:00
ishaan-jaff
0a2de59932 (feat) use usage class for anthropic 2023-10-27 09:32:25 -07:00
Krrish Dholakia
f12dc5df21 fix(vertex_ai.py): fix output parsing 2023-10-24 12:08:22 -07:00
Krrish Dholakia
b0842e328c fix(vertex_ai.py): parse out text response from response object 2023-10-24 08:53:22 -07:00
Krrish Dholakia
4b48af7c3c fix(anthropic.py-+-bedrock.py): anthropic prompt format 2023-10-20 10:56:15 -07:00
Krrish Dholakia
00993f3575 fix: allow api base to be set for all providers; enables proxy use cases 2023-10-19 19:07:42 -07:00
Krrish Dholakia
76bf8c4be3 fix(anthropic.py): enable api base to be customized 2023-10-19 18:45:29 -07:00
Krrish Dholakia
dcb866b353 docs(proxy_server.md): update proxy server docs to include multi-agent autogen tutorial 2023-10-17 09:22:34 -07:00
Krrish Dholakia
7572086231 style: fix linting errors 2023-10-16 17:35:08 -07:00
Krrish Dholakia
7358d2e4ea bump: version 0.8.4 → 0.8.5 2023-10-14 16:43:06 -07:00
Krrish Dholakia
d77eee34f0 fix(openai.py): fix linting errors 2023-10-13 22:24:58 -07:00