Commit graph

3243 commits

Author SHA1 Message Date
Krrish Dholakia
3080f27b54 fix(utils.py): raise correct error for azure content blocked error 2024-01-10 23:31:51 +05:30
Krrish Dholakia
6a8d518e44 test(test_lowest_latency_routing.py): use the correct cache key 2024-01-10 22:15:01 +05:30
Krrish Dholakia
5bc44353e0 feat(proxy_cli.py): move print statements to show actually deployed port 2024-01-10 22:09:58 +05:30
ishaan-jaff
1ff9785c6b (fix) test - moved to circe ci dockerfile 2024-01-10 21:54:13 +05:30
Krrish Dholakia
954d1b071c test: remove invalid arg 2024-01-10 21:53:29 +05:30
Ishaan Jaff
58d0366447 Merge pull request #1399 from BerriAI/litellm_default_use_gunicorn: LiteLLM Proxy - Use Gunicorn with Uvicorn workers 2024-01-10 21:46:04 +05:30
Krrish Dholakia
9a829ff956 refactor: cleanup duplicates 2024-01-10 21:42:20 +05:30
Krrish Dholakia
31917176ff fix(lowest_latency.py): fix merge issue 2024-01-10 21:37:46 +05:30
Krish Dholakia
298e937586 Merge branch 'main' into litellm_latency_routing_updates 2024-01-10 21:33:54 +05:30
Krrish Dholakia
7f269e92c5 test(test_completion_with_retries.py): remove duplicate test 2024-01-10 21:17:30 +05:30
Krrish Dholakia
14a65eb730 test(test_proxy_server_keys.py): removing as this is now tested via the docker build job 2024-01-10 21:05:12 +05:30
Krrish Dholakia
162f6f1ed3 refactor: refactor key tests 2024-01-10 20:58:29 +05:30
Krrish Dholakia
0d86c4ce5b refactor: move proxy server key testing 2024-01-10 20:56:52 +05:30
Krrish Dholakia
990c32a5d6 test(test_router.py): handle image gen timeouts 2024-01-10 20:56:52 +05:30
Krrish Dholakia
fe632c08a4 fix(router.py): allow user to control the latency routing time window 2024-01-10 20:56:52 +05:30
Krrish Dholakia
bb04a340a5 fix(lowest_latency.py): add back tpm/rpm checks, configurable time window 2024-01-10 20:52:01 +05:30
ishaan-jaff
4d8d58f0c9 (test) temp - comment out deployed proxy keygen test 2024-01-10 18:09:59 +05:30
Krrish Dholakia
b06d7f0cb6 build(config.yml): reintroduce mounting config.yaml 2024-01-10 18:03:57 +05:30
ishaan-jaff
2b9174c3d7 (feat) add comments on starting with gunicorn 2024-01-10 17:50:51 +05:30
ishaan-jaff
67dc9adc71 (fix) import gunicorn 2024-01-10 17:47:34 +05:30
ishaan-jaff
873965df22 (chore) remove old uvicorn logic 2024-01-10 17:39:05 +05:30
ishaan-jaff
5136d5980f (fix) use gunicorn to start proxt 2024-01-10 17:09:03 +05:30
ishaan-jaff
c7fe33202d v0 2024-01-10 16:29:38 +05:30
ishaan-jaff
f6012124b7 (fix) test for captured caplog 2024-01-10 13:17:34 +05:30
ishaan-jaff
a064786f3d (ci/cd) retry deployed proxy test 3 times 2024-01-10 13:10:16 +05:30
Krish Dholakia
de7d4e64a4 Merge pull request #1390 from Manouchehri/fix-json-caching-1: (caching) Fix incorrect usage of str, which created invalid JSON. 2024-01-10 12:21:25 +05:30
Krrish Dholakia
ebe752fb61 build(pyproject.toml): drop certifi dependency (unused) 2024-01-10 08:09:03 +05:30
David Manouchehri
8a07476524 (caching) Fix incorrect usage of str, which created invalid JSON. 2024-01-09 14:21:41 -05:00
Krrish Dholakia
befbcbb7ac fix: fix linting issues 2024-01-09 22:55:18 +05:30
Krrish Dholakia
ba7026639d fix: fix linting errors 2024-01-09 22:16:16 +05:30
Krrish Dholakia
35f9666dc2 feat(proxy_server.py): adds working dynamo db support for key gen 2024-01-09 18:23:34 +05:30
Mateo Cámara
203089e6c7 Merge branch 'main' into explicit-args-acomplete 2024-01-09 13:07:37 +01:00
Mateo Cámara
9aedd4e794 Moved test to a new file 2024-01-09 13:02:12 +01:00
Mateo Cámara
0ec976b3d1 Reverted changes made by the IDE automatically 2024-01-09 12:55:12 +01:00
Ishaan Jaff
4cfa010dbd Merge pull request #1381 from BerriAI/litellm_content_policy_violation_exception: [Feat] Add litellm.ContentPolicyViolationError 2024-01-09 17:18:29 +05:30
ishaan-jaff
248e5f3d92 (chore) remove deprecated completion_with_config() tests 2024-01-09 17:13:06 +05:30
ishaan-jaff
c0b56b6575 (test) catch litellm.ContentPolicyViolationError 2024-01-09 17:04:04 +05:30
ishaan-jaff
186fc4614d (feat) add ContentPolicyViolationError for azure 2024-01-09 16:58:09 +05:30
ishaan-jaff
170ae74118 (feat) add exception mapping for litellm.image_generation 2024-01-09 16:54:47 +05:30
ishaan-jaff
f0c10377cf (test) ContentPolicyViolationError 2024-01-09 16:53:57 +05:30
ishaan-jaff
9da61bdf31 (fix) ContentPolicyViolationError 2024-01-09 16:53:15 +05:30
Mateo Cámara
bb06c51ede Added test to check if acompletion is using the same parameters as CompletionRequest attributes. Added functools to client decorator to expose acompletion parameters from outside. 2024-01-09 12:06:49 +01:00
Mateo Cámara
48b2f69c93 Added the new acompletion parameters based on CompletionRequest attributes 2024-01-09 12:05:31 +01:00
ishaan-jaff
cf98343eb5 (test) content policy violation error 2024-01-09 16:34:20 +05:30
Mateo Cámara
178a57492b Merge remote-tracking branch 'origin/main' into explicit-args-acomplete 2024-01-09 12:04:04 +01:00
ishaan-jaff
09874cc83f (v0) add ContentPolicyViolationError 2024-01-09 16:33:03 +05:30
ishaan-jaff
47f56f0d19 (test) deployed proxy key/gen 2024-01-09 16:07:33 +05:30
Ishaan Jaff
cdeb864e28 Merge pull request #1376 from BerriAI/litellm_deployed_proxy_prisma: [Test+Fix] Use deployed proxy with Prisma 2024-01-09 16:02:27 +05:30
ishaan-jaff
b5f9f05491 (test) fix - skip HF is currently loading exception 2024-01-09 15:53:19 +05:30
ishaan-jaff
bf65306ec3 (chore) undo extra space 2024-01-09 15:38:14 +05:30