153 commits

Author SHA1 Message Date
Ishaan Jaff
215a407ebf [Feat Proxy] Allow using hypercorn for http v2 (#5950)
* use run_hypercorn

* add docs on using hypercorn
2024-09-28 15:03:50 -07:00
Krish Dholakia
0a03f2f11e LiteLLM Minor Fixes & Improvements (09/25/2024) (#5893)
* fix(langfuse.py): support new langfuse prompt_chat class init params

* fix(langfuse.py): handle new init values on prompt chat + prompt text templates

fixes error caused during langfuse logging

* docs(openai_compatible.md): clarify `openai/` handles correct routing for `/v1/completions` route

Fixes https://github.com/BerriAI/litellm/issues/5876

* fix(utils.py): handle unmapped gemini model optional param translation

Fixes https://github.com/BerriAI/litellm/issues/5888

* fix(o1_transformation.py): fix o-1 validation, to not raise error if temperature=1

Fixes https://github.com/BerriAI/litellm/issues/5884

* fix(prisma_client.py): refresh iam token

Fixes https://github.com/BerriAI/litellm/issues/5896

* fix: pass drop params where required

* fix(utils.py): pass drop_params correctly

* fix(types/vertex_ai.py): fix generation config

* test(test_max_completion_tokens.py): fix test

* fix(vertex_and_google_ai_studio_gemini.py): fix map openai params
2024-09-26 16:41:44 -07:00
steffen-sbt
357dd3cad5 Add the option to specify a schema in the postgres DB, also modify docs (#5640) 2024-09-11 14:53:52 -07:00
Ishaan Jaff
b5d1d93c14 refactor secret managers 2024-09-03 10:58:02 -07:00
Ishaan Jaff
09519b74db refactor get_secret 2024-09-03 10:42:12 -07:00
Krrish Dholakia
f9ab33cbc2 feat(proxy_server.py): support azure batch api endpoints 2024-08-22 15:21:43 -07:00
Krrish Dholakia
727035913b fix(proxy_cli.py): support database_host, database_username, database_password, database_name 2024-08-19 16:17:45 -07:00
Krish Dholakia
350e4b3830 Merge pull request #5057 from BerriAI/litellm_rds_iam_auth
feat(proxy_cli.py): support iam-based auth to rds
2024-08-06 10:44:33 -07:00
Krrish Dholakia
24e20718e4 feat(proxy_cli.py): support iam-based auth to rds
Initial pr for iam-based auth support for rds
2024-08-05 17:35:48 -07:00
Krrish Dholakia
b5e22bde06 fix: bump default allowed_fails + reduce default db pool limit
Fixes issues with running proxy server in production
2024-08-05 15:07:46 -07:00
Krrish Dholakia
aecbc98b9b fix(proxy_cli.py): bump default azure api version 2024-07-08 16:28:22 -07:00
Krrish Dholakia
4c5ee771e1 fix(proxy_cli.py): run aws kms decrypt before starting proxy server 2024-06-28 16:03:56 -07:00
Ishaan Jaff
6cc427790e make sure linting runs proxy_cli.py 2024-06-20 20:20:08 -07:00
Chris Van Pelt
ae0e2ba9e5 Update proxy_cli.py
Fixed indentation so we don't get an `UnboundLocalError`. Fixes #4324
2024-06-20 17:48:16 -07:00
Krrish Dholakia
4ba5c71350 fix(proxy_cli.py): fix double counting json logs 2024-06-20 15:15:23 -07:00
Krrish Dholakia
bee79f0b70 fix(proxy_cli.py): support passing the database url as an encrypted kms key 2024-06-10 15:48:27 -07:00
Krrish Dholakia
2d2e411c11 fix(proxy/_logging.py): fix default logging level 2024-06-05 17:42:49 -07:00
Krrish Dholakia
741bfb9cef fix(proxy_cli.py): enable json logging via litellm_settings param on config
allows user to enable json logs without needing to figure out env variables
2024-05-29 21:41:20 -07:00
Krrish Dholakia
00d47f3d3b feat(proxy_cli.py): support json logs on proxy
allow user to enable 'json logs' for proxy server
2024-05-20 09:18:12 -07:00
Krrish Dholakia
943432c758 docs(prod.md): add 'disable load_dotenv' tutorial to docs 2024-05-14 19:13:22 -07:00
Krrish Dholakia
c2fa620088 fix: disable 'load_dotenv' for prod environments 2024-05-14 19:09:36 -07:00
Krrish Dholakia
5f93cae3ff feat(proxy_server.py): return litellm version in response headers 2024-05-08 16:00:08 -07:00
Krrish Dholakia
06a0ca1e80 fix(proxy_cli.py): don't double load the router config
was causing callbacks to be instantiated twice - double counting usage in cache
2024-04-10 13:23:56 -07:00
Krrish Dholakia
2d0c450079 fix(proxy_cli.py): revert db timeout change - user-controllable param
db timeout is a user controllable param, not necessary to change defaults
2024-04-03 09:37:57 -07:00
Krrish Dholakia
895073e68a fix(proxy_server.py): bump default db timeouts 2024-04-03 09:35:08 -07:00
Krrish Dholakia
46937935d1 feat(proxy/utils.py): enable updating db in a separate server 2024-03-27 16:02:36 -07:00
Krrish Dholakia
b6d705d8d7 fix(proxy_cli.py): fix circular import issue 2024-03-26 21:16:41 -07:00
Krrish Dholakia
f3a56c5af2 fix(llm_guard.py): working llm-guard 'key-specific' mode 2024-03-26 17:47:20 -07:00
Krish Dholakia
a8cdb82ef0 Merge pull request #2697 from antoniomdk/fix-database-credentials-leakage
(fix) Remove print statements from append_query_params
2024-03-26 16:07:33 -07:00
Ishaan Jaff
62c83d36a5 (fix) remove litellm.telemetry 2024-03-26 11:21:09 -07:00
Antonio Molner Domenech
22629898c9 Update print statements to use verbose logger and DEBUG level 2024-03-26 22:41:28 +07:00
Ishaan Jaff
539fc6d7e1 (feat) start proxy with default num_workers=1 2024-03-20 10:46:32 -07:00
ishaan-jaff
9e1d089770 (docs) use port 4000 2024-03-08 21:59:00 -08:00
ishaan-jaff
fc3b61fe89 (fix) default num workers 2024-03-08 15:24:08 -08:00
Krrish Dholakia
8dc9b51ef2 test(test_proxy_server.py): add back bedrock embedding tests 2024-03-04 18:28:05 -08:00
Krrish Dholakia
7263ae63c6 fix(proxy_cli.py): allow user to control db connection pool + timeouts from config 2024-02-17 19:03:43 -08:00
ishaan-jaff
8d503caa1e (feat) inc default gunicorn timeout 2024-02-12 21:49:32 -08:00
ishaan-jaff
3b57d285c2 (feat) litellm set ssl_certificates 2024-02-10 17:52:47 -08:00
Krrish Dholakia
4c6bb6a9f6 build(schema.prisma): support direct url on prisma schema 2024-02-08 17:37:37 -08:00
Krrish Dholakia
aec4572285 fix(proxy_cli.py): fix max connection limit issue on db 2024-02-07 22:57:44 -08:00
Krrish Dholakia
c429dfa746 fix(proxy_cli.py-&&-proxy_server.py): bump reset budget intervals and fix pool limits for prisma connections 2024-02-06 19:39:49 -08:00
Krish Dholakia
45cbb3cf3d Merge branch 'main' into litellm_embedding_caching_updates 2024-02-03 18:08:47 -08:00
Krrish Dholakia
9aa55ab872 build(proxy_cli.py): make running gunicorn an optional cli arg
when running proxy locally, running with uvicorn is much better for debugging
2024-01-29 15:32:34 -08:00
Krrish Dholakia
e937662e8a build(ui/litellm_dashboard_v_2): allow app owner to create keys and view their keys 2024-01-27 13:48:35 -08:00
Krrish Dholakia
9c0179d132 fix(proxy_server.py): support setting tpm/rpm limits per user / per key 2024-01-18 17:03:18 -08:00
Krrish Dholakia
3bdfb2bc6d fix(proxy_cli.py): ensure proxy always retries if db push fails to connect to db 2024-01-17 17:37:59 -08:00
ishaan-jaff
b01321cdc7 (feat) use env variables to set host, port, num_workers 2024-01-16 08:36:29 -08:00
ishaan-jaff
1ff697af3e (feat) print num_workers when starting proxy 2024-01-16 08:21:37 -08:00
ishaan-jaff
b3de8e8fa2 v0 num_workers 2024-01-16 08:15:12 -08:00
ishaan-jaff
4a7f42cb73 (fix) gunicorn - show request logs like uvicorn 2024-01-13 11:22:16 -08:00