Commit graph

15087 commits

Author SHA1 Message Date
Ishaan Jaff
dfc674622b litellm router - use free / paid tier 2024-07-18 20:35:14 -07:00
Ishaan Jaff
f3e0a89597 check if using tag based routing 2024-07-18 20:10:45 -07:00
Ishaan Jaff
08adda7091 control using enable_tag_filtering 2024-07-18 19:39:04 -07:00
Krrish Dholakia
96471c145e fix(bedrock_httpx.py): support jamba streaming 2024-07-18 19:36:50 -07:00
Ishaan Jaff
071091fd8c fix use tags as a litellm param 2024-07-18 19:34:45 -07:00
Ishaan Jaff
b6e60d481e fix remove previous code on free/paid tier 2024-07-18 19:24:13 -07:00
Ishaan Jaff
4d0fbfea83 router - refactor to tag based routing 2024-07-18 19:22:09 -07:00
Krrish Dholakia
cece76c4ee feat(bedrock_httpx.py): add ai21 jamba instruct as converse model
initial commit for adding ai21 jamba instruct support through bedrock converse
2024-07-18 18:24:06 -07:00
Ishaan Jaff
51525254e8 fix ui make ui session last 24 hours 2024-07-18 18:22:40 -07:00
Ishaan Jaff
81c77f33b8 fix test amazing vertex medlm 2024-07-18 18:16:00 -07:00
Ishaan Jaff
4b96cd46b2
Merge pull request #4786 from BerriAI/litellm_use_model_tier_keys
[Feat-Enterprise] Use free/paid tiers for Virtual Keys
2024-07-18 18:07:09 -07:00
Krrish Dholakia
b23a633cf1 fix(utils.py): fix status code in exception mapping 2024-07-18 18:04:59 -07:00
Ishaan Jaff
22ef951e0b docs free/paid tier 2024-07-18 18:01:30 -07:00
Ishaan Jaff
d266234cd6 docs using free, paid tier 2024-07-18 18:01:21 -07:00
Krrish Dholakia
e3ca2789df fix(bedrock_httpx.py): cache response stream shape
Fixes https://github.com/BerriAI/litellm/issues/4774
2024-07-18 17:41:31 -07:00
Krrish Dholakia
30d6c1d171 fix(factory.py): handle escape characters in image url
prevents mimetype errors in vertex ai calls
2024-07-18 17:36:58 -07:00
Ishaan Jaff
f6efed06de test adding free / paid tier to metadata 2024-07-18 17:31:02 -07:00
Krrish Dholakia
6d741a5424 docs(json_mode.md): add json mode to docs 2024-07-18 17:20:19 -07:00
Ishaan Jaff
fda5578263 feat - enterprise 2024-07-18 17:15:47 -07:00
Ishaan Jaff
64e38562d9 router - use free paid tier routing 2024-07-18 17:09:42 -07:00
Ishaan Jaff
88cd641089 helper to get_deployments_for_tier 2024-07-18 17:06:06 -07:00
Krrish Dholakia
4d963ab789 feat(vertex_ai_anthropic.py): support response_schema for vertex ai anthropic calls
allows passing response_schema for anthropic calls. supports schema validation.
2024-07-18 16:57:38 -07:00
Ishaan Jaff
007b959460 litellm router - use free / paid tier 2024-07-18 16:55:50 -07:00
reibs
f36c2061eb removed model from replicate registry 2024-07-18 16:47:19 -07:00
Ishaan Jaff
b9d4d67357
Merge pull request #4779 from BerriAI/docs_add_salt_key_docker_compose
Docs -  add `LITELLM_SALT_KEY` to docker compose
2024-07-18 15:28:26 -07:00
Ishaan Jaff
031ea1cbe4
Merge pull request #4782 from BerriAI/litellm_add_gemini-gemma-2-27b-it
[FEAT] -  add Google AI Studio: gemini-gemma-2-27b-it,  gemini-gemma-2-9b-it
2024-07-18 15:19:40 -07:00
Ishaan Jaff
99ffc5d4c3 feat - add gemini-gemma-2-27b-it 2024-07-18 15:18:56 -07:00
Marc Abramowitz
7b1c72e3f4 Alias /health/liveliness as /health/liveness
The latter is the more common term in Kubernetes, so it's nice to support that.
2024-07-18 14:29:32 -07:00
Ishaan Jaff
976cc3f5ea ui - fix typo on referencing base url 2024-07-18 13:36:05 -07:00
Ishaan Jaff
75ca53fab5 fix linting errors on main 2024-07-18 13:32:48 -07:00
Ishaan Jaff
b38f470135 docs on litellm docker compose 2024-07-18 13:08:06 -07:00
Ishaan Jaff
972eec2b3f docs - add instructions on LITELLM_SALT_KEY 2024-07-18 13:07:02 -07:00
Ishaan Jaff
df4aab8be9
Merge pull request #4764 from BerriAI/litellm_run_moderation_check_on_embedding
[Feat] run guardrail moderation check on embedding
2024-07-18 12:44:37 -07:00
Ishaan Jaff
eedacf5193
Merge branch 'main' into litellm_run_moderation_check_on_embedding 2024-07-18 12:44:30 -07:00
Ishaan Jaff
51b3ef87d3 docs litellm telemetry 2024-07-18 12:36:16 -07:00
Ishaan Jaff
e719986c81
Merge pull request #4776 from BerriAI/litellm_add_gpt-4o
[Feat] Add OpenAI GPT-4o mini
2024-07-18 10:44:20 -07:00
Ishaan Jaff
b2623ed8a3 add gpt-4o-mini-2024-07-18 to docs 2024-07-18 10:43:33 -07:00
Ishaan Jaff
c453519aa1 gpt-4o-mini-2024-07-18 2024-07-18 10:42:37 -07:00
Ishaan Jaff
c521736bb8 add gpt-4o 2024-07-18 10:41:08 -07:00
Krish Dholakia
d8ccd7ef79
Merge pull request #4772 from fgreinacher/feat/minimal-health
feat(proxy): support hiding health check details
2024-07-18 09:58:04 -07:00
Titusz
fcef2c4580
Add missing num_gpu ollama configuration parameter 2024-07-18 17:51:56 +02:00
Florian Greinacher
f8bec3a86c
feat(proxy): support hiding health check details 2024-07-18 17:21:12 +02:00
Krish Dholakia
57f6923ab6
Merge pull request #4729 from vingiarrusso/vgiarrusso/guardrails
Add enabled_roles to Guardrails configuration, Update Lakera guardrail moderation hook
2024-07-17 22:24:35 -07:00
Krish Dholakia
e2f2c3f02f
Merge pull request #4762 from BerriAI/litellm_aporio_integration
feat(aporio_ai.py): support aporio ai prompt injection for chat completion requests
2024-07-17 22:14:34 -07:00
Krish Dholakia
77656d9f11
Merge branch 'main' into litellm_aporio_integration 2024-07-17 22:14:29 -07:00
Krish Dholakia
f359335b5b
Merge pull request #4766 from skucherlapati/add_medlm_models_to_cost_map
Add medlm models to cost map
2024-07-17 22:07:17 -07:00
skucherlapati
2ad342e7bf add medlm models to cost map 2024-07-17 21:17:58 -07:00
skucherlapati
f1747adac6 adding medlm models 2024-07-17 21:13:35 -07:00
Ishaan Jaff
9440754e48 ci/cd run again 2024-07-17 20:37:10 -07:00
Ishaan Jaff
c16583464a ci/cd run again 2024-07-17 20:25:43 -07:00