Commit graph

805 commits

Author SHA1 Message Date
ishaan-jaff
00ac18e8b7 (feat) improve bedrock, sagemaker exception mapping 2024-01-15 21:22:22 -08:00
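
Note on 00ac18e8b7: the exception-mapping work translates provider-specific Bedrock/SageMaker failures into litellm's OpenAI-style exception classes, so one except-chain covers every backend. A minimal sketch (the model id and messages are placeholders):

    import litellm

    try:
        litellm.completion(
            model="bedrock/anthropic.claude-v2",  # placeholder Bedrock model id
            messages=[{"role": "user", "content": "hello"}],
        )
    except litellm.AuthenticationError as e:
        print("bad AWS credentials:", e)
    except litellm.RateLimitError as e:
        print("throttled by the provider:", e)
    except litellm.APIConnectionError as e:
        print("network failure:", e)
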
ishaan-jaff
fcc1e23a05 (fix) post_call rules 2024-01-15 20:56:25 -08:00
ishaan-jaff
e864c78d15 (feat) post call rules - fail with error message 2024-01-15 17:13:13 -08:00
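
Note on e864c78d15: with this change a post-call rule can reject a response and surface its own error message. A sketch assuming the litellm.post_call_rules hook from the docs (the rule body is illustrative):

    import litellm

    def no_short_answers(output: str):
        # reject model responses under 10 characters, with a custom message
        if len(output) < 10:
            return {"decision": False, "message": "response too short"}
        return True

    litellm.post_call_rules = [no_short_answers]

    response = litellm.completion(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": "say hi"}],
    )
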
ishaan-jaff
79ad63009e (feat) support extra body for Azure, OpenAI 2024-01-13 14:32:11 -08:00
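
Note on 79ad63009e: extra_body mirrors the OpenAI SDK parameter of the same name; litellm forwards the dict into the provider request body. Sketch (the deployment name and field are hypothetical):

    import litellm

    response = litellm.completion(
        model="azure/my-gpt-4-deployment",     # hypothetical Azure deployment
        messages=[{"role": "user", "content": "hello"}],
        extra_body={"custom_field": "value"},  # hypothetical field, passed through as-is
    )
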
ishaan-jaff
6bae534968 (fix) check if custom_llm_provider is not None 2024-01-13 12:54:03 -08:00
ishaan-jaff
53fd62b0cd (feat) use custom_llm_provider in completion_cost 2024-01-13 12:29:51 -08:00
ishaan-jaff
6b2a4714a6 (feat) return custom_llm_provider in streaming response 2024-01-12 17:14:43 -08:00
David Leen
a674de8f36 improve bedrock exception granularity 2024-01-12 16:38:55 +01:00
Ishaan Jaff
d181bd22a7 Merge pull request #1422 from dleen/httpx 2024-01-11 22:31:55 +05:30
    (fix) create httpx.Request instead of httpx.request
David Leen
6b87c13b9d (fix) create httpx.Request instead of httpx.request 2024-01-11 16:22:26 +01:00
    fixes #1420
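
Note on 6b87c13b9d: the distinction matters because httpx.request() (lowercase) sends an HTTP request immediately, while httpx.Request() only constructs a request object for a client to send later:

    import httpx

    # httpx.request(...) fires the request right away and returns a Response
    response = httpx.request("GET", "https://example.com")

    # httpx.Request(...) just builds the object; a client sends it later
    request = httpx.Request("GET", "https://example.com")
    with httpx.Client() as client:
        response = client.send(request)
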
ishaan-jaff
1fb3547e48 (feat) improve litellm verbose logs 2024-01-11 18:13:08 +05:30
ishaan-jaff
f297a4d174 (feat) show args passed to litellm.completion, acompletion on call 2024-01-11 17:56:27 +05:30
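
Note on 1fb3547e48 and f297a4d174: the extra logging is gated behind litellm's debug flag, so the call arguments are only printed when verbose mode is on. Sketch:

    import litellm

    litellm.set_verbose = True  # print request details, including the args passed in

    litellm.completion(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": "hi"}],
    )
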
Ishaan Jaff
2433d6c613 Merge pull request #1200 from MateoCamara/explicit-args-acomplete 2024-01-11 10:39:05 +05:30
    feat: added explicit args to acomplete
ishaan-jaff
f61d8596e1 (fix) working s3 logging 2024-01-11 08:57:32 +05:30
Krrish Dholakia
3080f27b54 fix(utils.py): raise correct error for azure content blocked error 2024-01-10 23:31:51 +05:30
Mateo Cámara
203089e6c7 Merge branch 'main' into explicit-args-acomplete 2024-01-09 13:07:37 +01:00
Ishaan Jaff
4cfa010dbd Merge pull request #1381 from BerriAI/litellm_content_policy_violation_exception 2024-01-09 17:18:29 +05:30
    [Feat] Add litellm.ContentPolicyViolationError
ishaan-jaff
248e5f3d92 (chore) remove deprecated completion_with_config() tests 2024-01-09 17:13:06 +05:30
ishaan-jaff
186fc4614d (feat) add ContentPolicyViolationError for azure 2024-01-09 16:58:09 +05:30
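
Note on 186fc4614d: the new exception type lets callers distinguish an Azure content-filter block from a generic bad request. Sketch (the deployment name is a placeholder):

    import litellm

    try:
        litellm.completion(
            model="azure/my-gpt-4-deployment",  # placeholder deployment
            messages=[{"role": "user", "content": "text the filter rejects"}],
        )
    except litellm.ContentPolicyViolationError as e:
        print("blocked by the content filter:", e)
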
ishaan-jaff
9da61bdf31 (fix) ContentPolicyViolationError 2024-01-09 16:53:15 +05:30
Mateo Cámara
bb06c51ede Added a test to check that acompletion uses the same parameters as the CompletionRequest attributes. Added functools to the client decorator to expose acompletion's parameters from outside. 2024-01-09 12:06:49 +01:00
ishaan-jaff
09874cc83f (v0) add ContentPolicyViolationError 2024-01-09 16:33:03 +05:30
ishaan-jaff
5f2cbfc711 (feat) litellm.completion - support ollama timeout 2024-01-09 10:34:41 +05:30
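
Note on 5f2cbfc711: with timeout supported for Ollama, a hung local server fails fast instead of blocking forever. Sketch, assuming a local Ollama instance serving llama2:

    import litellm

    response = litellm.completion(
        model="ollama/llama2",
        messages=[{"role": "user", "content": "hello"}],
        api_base="http://localhost:11434",  # Ollama's default endpoint
        timeout=5,                          # seconds before giving up
    )
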
Krrish Dholakia
dd78782133 fix(utils.py): error handling for litellm --model mistral edge case 2024-01-08 15:09:01 +05:30
Krrish Dholakia
6333fbfe56 fix(main.py): support cost calculation for text completion streaming object 2024-01-08 12:41:43 +05:30
Krrish Dholakia
9b46412279 fix(utils.py): fix logging for text completion streaming 2024-01-08 12:05:28 +05:30
Krrish Dholakia
c04fa54d19 fix(utils.py): fix exception raised 2024-01-08 07:42:17 +05:30
Krrish Dholakia
3469b5b911 fix(utils.py): map optional params for gemini 2024-01-08 07:38:55 +05:30
Krrish Dholakia
75177c2a15 bump: version 1.16.16 → 1.16.17 2024-01-08 07:16:37 +05:30
Krish Dholakia
439ee3bafc Merge pull request #1344 from BerriAI/litellm_speed_improvements 2024-01-06 22:38:10 +05:30
    Litellm speed improvements
Krrish Dholakia
5fd2f945f3 fix(factory.py): support gemini-pro-vision on google ai studio 2024-01-06 22:36:22 +05:30
    https://github.com/BerriAI/litellm/issues/1329
Krrish Dholakia
712f89b4f1 fix(utils.py): handle original_response being a json 2024-01-06 17:02:50 +05:30
ishaan-jaff
4679c7b99a (fix) caching use same "created" in response_object 2024-01-05 16:03:56 +05:30
ishaan-jaff
00b001b96b (feat) completion_cost: improve model=None error 2024-01-05 15:26:04 +05:30
ishaan-jaff
f681f0f2b2 (feat) completion_cost - embeddings + raise Exception 2024-01-05 13:11:23 +05:30
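
Note on f681f0f2b2: completion_cost() prices a returned response object against litellm's model cost map; this commit extends it to embedding responses and raises instead of failing silently for unknown models. Sketch:

    import litellm

    resp = litellm.embedding(model="text-embedding-ada-002", input=["hello world"])
    usd = litellm.completion_cost(completion_response=resp)
    print(f"embedding cost: ${usd:.8f}")
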
Krrish Dholakia
aa72d65c90 fix(utils.py): fix check for if cached response should be returned 2024-01-04 21:49:19 +05:30
Krrish Dholakia
773a0a147a fix(utils.py): raise a bad request error if litellm client raises a model/provider not found error 2024-01-04 15:50:43 +05:30
Krrish Dholakia
99d9a825de feat(proxy_server.py): abstract config update/writing and support persisting config in db 2024-01-04 14:44:59 +05:30
    allows user to opt into writing to db (SAVE_CONFIG_TO_DB) and removes any api keys before sending to db
    https://github.com/BerriAI/litellm/issues/1322
Krrish Dholakia
8cee267a5b fix(caching.py): support ttl, s-max-age, and no-cache cache controls 2024-01-03 12:42:43 +05:30
    https://github.com/BerriAI/litellm/issues/1306
ishaan-jaff
96cb6f3b10 (fix) azure+stream: count completion tokens 2024-01-03 12:06:39 +05:30
ishaan-jaff
f3b8d9c3ef (fix) counting response tokens+streaming 2024-01-03 12:06:39 +05:30
Krrish Dholakia
a778f8a00e bump: version 1.16.10 → 1.16.11 2024-01-02 22:26:47 +05:30
ishaan-jaff
790dcff5e0 (feat) add xinference as an embedding provider 2024-01-02 15:32:26 +05:30
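
Note on 790dcff5e0: once registered as a provider, Xinference models are addressed with the xinference/ prefix. Sketch, assuming a locally running Xinference server (the model name and endpoint are placeholders):

    import litellm

    response = litellm.embedding(
        model="xinference/bge-base-en",       # placeholder Xinference model
        api_base="http://127.0.0.1:9997/v1",  # assumed local Xinference endpoint
        input=["good morning from litellm"],
    )
    print(len(response.data[0]["embedding"]))
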
Krrish Dholakia
0fffcc1579 fix(utils.py): support token counting for gpt-4-vision models 2024-01-02 14:41:42 +05:30
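
Note on 0fffcc1579: counting tokens for gpt-4-vision requests has to handle the content-block message format, not just plain strings. A sketch using litellm.token_counter (the image URL is a placeholder):

    import litellm

    messages = [{
        "role": "user",
        "content": [
            {"type": "text", "text": "What is in this image?"},
            {"type": "image_url", "image_url": {"url": "https://example.com/cat.png"}},
        ],
    }]

    print(litellm.token_counter(model="gpt-4-vision-preview", messages=messages))
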
Krrish Dholakia
a83e2e07cf fix(router.py): correctly raise no model available error 2024-01-01 21:22:42 +05:30
    https://github.com/BerriAI/litellm/issues/1289
ishaan-jaff
535a547b66 (fix) use cloudflare optional params 2023-12-30 12:22:31 +05:30
Krrish Dholakia
77be3e3114 fix(main.py): don't set timeout as an optional api param 2023-12-30 11:47:07 +05:30
Krrish Dholakia
38f55249e1 fix(router.py): support retry and fallbacks for atext_completion 2023-12-30 11:19:32 +05:30
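
Note on 38f55249e1: this makes Router.atext_completion honor the same retry/fallback policy as chat completions. A sketch with a single deployment (the names and key are placeholders):

    import asyncio
    import litellm

    router = litellm.Router(
        model_list=[{
            "model_name": "gpt-3.5-turbo-instruct",
            "litellm_params": {"model": "gpt-3.5-turbo-instruct", "api_key": "sk-..."},
        }],
        num_retries=2,  # retries happen before any fallback fires
    )

    async def main():
        resp = await router.atext_completion(
            model="gpt-3.5-turbo-instruct",
            prompt="Say hello",
        )
        print(resp)

    asyncio.run(main())
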
ishaan-jaff
5d6954895f (fix) timeout optional param 2023-12-30 11:07:52 +05:30
ishaan-jaff
2f4cd3b569 (feat) proxy - support dynamic timeout per request 2023-12-30 10:55:42 +05:30
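
Note on 2f4cd3b569: a proxy client can now set timeout in the request body instead of relying on a server-wide default. A sketch against a locally running proxy (the URL and key are placeholders, and the field name follows the commit message):

    import requests

    resp = requests.post(
        "http://0.0.0.0:8000/chat/completions",  # assumed local litellm proxy address
        headers={"Authorization": "Bearer sk-1234"},
        json={
            "model": "gpt-3.5-turbo",
            "messages": [{"role": "user", "content": "hello"}],
            "timeout": 1,  # seconds, applied to this request only
        },
    )
    print(resp.json())
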