Commit graph

3338 commits

Author SHA1 Message Date
Ishaan Jaff
19914aad11 docs new deadlock fixing architecture 2025-04-02 13:24:53 -07:00
Krish Dholakia
0519c0c507 Add Google AI Studio /v1/files upload API support (#9645)
* test: fix import for test

* fix: fix bad error string

* docs: cleanup files docs

* fix(files/main.py): cleanup error string

* style: initial commit with a provider/config pattern for files api

google ai studio files api onboarding

* fix: test

* feat(gemini/files/transformation.py): support gemini files api response transformation

* fix(gemini/files/transformation.py): return file id as gemini uri

allows id to be passed in to chat completion request, just like openai

* feat(llm_http_handler.py): support async route for files api on llm_http_handler

* fix: fix linting errors

* fix: fix model info check

* fix: fix ruff errors

* fix: fix linting errors

* Revert "fix: fix linting errors"

This reverts commit 926a5a527f.

* fix: fix linting errors

* test: fix test

* test: fix tests
2025-04-02 08:56:58 -07:00
Krrish Dholakia
19451e74dc docs: update docs 2025-04-02 07:58:45 -07:00
Tomer Bin
e848caaa4d Virtual key based policies in Aim Guardrails (#9499)
* report key alias to aim

* send litellm version to aim

* Update docs

* blacken

* add docs

* Add info part about virtual keys specific guards

* sort guardrails alphabetically

* fix ruff
2025-04-01 21:57:23 -07:00
Krrish Dholakia
f914dbd0e9 build(enterprise.md): add why enterprise to docs 2025-04-01 11:27:03 -07:00
Krrish Dholakia
351b432693 docs(anthropic.md): update docs with file message usage 2025-03-31 22:58:51 -07:00
Ishaan Jaff
cba4a4abcb Merge pull request #9567 from BerriAI/litellm_anthropic_messages_improvements
[Refactor] - Expose litellm.messages.acreate() and litellm.messages.create() to make LLM API calls in Anthropic API spec
2025-03-31 20:50:30 -07:00
Ishaan Jaff
a52591d234 Merge pull request #9562 from KPCOFGS/main
Update all_caches.md
2025-03-31 16:06:44 -07:00
Ishaan Jaff
7681f24d37 Merge pull request #9581 from GabrielLoiseau/main
docs(gemini): fix typo
2025-03-31 16:06:10 -07:00
Ishaan Jaff
6317aebb79 Merge pull request #9286 from colesmcintosh/xai-vision-model-docs-update
fix(docs): update xAI Grok vision model reference
2025-03-31 15:49:06 -07:00
Ishaan Jaff
b01a06be0a docs anthropic messages endpoint 2025-03-31 15:28:40 -07:00
Shixian Sheng
837e02910e Merge branch 'BerriAI:main' into main 2025-03-30 06:53:09 -04:00
Krrish Dholakia
c0cfaa34d4 docs: cleanup docs 2025-03-30 00:40:23 -07:00
Krrish Dholakia
e9950581ed docs(vertex.md): update docs to show 'file' message usage 2025-03-30 00:28:45 -07:00
Ishaan Jaff
c34ce5449c docs release notes 2025-03-29 23:08:20 -07:00
Ishaan Jaff
5cf23edb72 docs litellm mcp 2025-03-29 22:28:03 -07:00
Ishaan Jaff
518723bfc7 docs mcp litellm 2025-03-29 21:59:58 -07:00
Ishaan Jaff
456033b969 docs mcp 2025-03-29 21:46:18 -07:00
Ishaan Jaff
44ff6991b5 docs mcp tools 2025-03-29 20:42:14 -07:00
Ishaan Jaff
b35fc446a9 doc fix mcp 2025-03-29 20:35:53 -07:00
Ishaan Jaff
43af63d824 docs 1.65.0-stable 2025-03-29 20:03:05 -07:00
Ishaan Jaff
d9b62ec888 docs update stable release 2025-03-29 20:01:00 -07:00
Ishaan Jaff
0f18e5fe19 add litellm model name on SLP 2025-03-29 19:55:41 -07:00
Ishaan Jaff
5cba87f9c7 docs updates release notes 2025-03-29 19:51:38 -07:00
Ishaan Jaff
d06b93c5c7 docs release notes 2025-03-29 19:50:13 -07:00
Ishaan Jaff
6f590fb8d6 docs release notes 2025-03-29 19:08:23 -07:00
Ishaan Jaff
76f4095e5a docs release notes 2025-03-29 19:03:55 -07:00
Ishaan Jaff
805c9792c3 docs add mcp graphic to stable release notes 2025-03-29 19:00:26 -07:00
Ishaan Jaff
b70b885237 docs stable release notes 2025-03-29 18:21:42 -07:00
Ishaan Jaff
df8c506037 add Complete Git Diff 2025-03-29 18:09:54 -07:00
Ishaan Jaff
149e8d3b01 docs fix release notes 2025-03-29 18:06:36 -07:00
Tan Yong Sheng
af885be743 update docs for openwebui (#9636) 2025-03-29 17:40:27 -07:00
Krish Dholakia
797a4bb1d5 install prisma migration files - connects litellm proxy to litellm's prisma migration files (#9637)
* build(README.md): initial commit adding a separate folder for additional proxy files. Meant to reduce size of core package

* build(litellm-proxy-extras/): new pip package for storing migration files

allows litellm proxy to use migration files, without adding them to core repo

* build(litellm-proxy-extras/): cleanup pyproject.toml

* build: move prisma migration files inside new proxy extras package

* build(run_migration.py): update script to write to correct folder

* build(proxy_cli.py): load in migration files from litellm-proxy-extras

Closes https://github.com/BerriAI/litellm/issues/9558

* build: add MIT license to litellm-proxy-extras

* test: update test

* fix: fix schema

* bump: version 0.1.0 → 0.1.1

* build(publish-proxy-extras.sh): add script for publishing new proxy-extras version

* build(liccheck.ini): add litellm-proxy-extras to authorized packages

* fix(litellm-proxy-extras/utils.py): move prisma migrate logic inside extra proxy pkg

easier since migrations folder already there

* build(pre-commit-config.yaml): add litellm_proxy_extras to ci tests

* docs(config_settings.md): document new env var

* build(pyproject.toml): bump relevant files when litellm-proxy-extras version changed

* build(pre-commit-config.yaml): run poetry check on litellm-proxy-extras as well
2025-03-29 15:27:09 -07:00
Krrish Dholakia
596d98471f docs(bedrock.md): clarify version on docs 2025-03-29 00:31:35 -07:00
Krrish Dholakia
d6931455fa docs(bedrock.md): add latency optimized inference to docs 2025-03-29 00:31:06 -07:00
Krish Dholakia
308a2fb195 Add bedrock latency optimized inference support (#9623)
* fix(converse_transformation.py): add performanceConfig param support on bedrock

Closes https://github.com/BerriAI/litellm/issues/7606

* fix(converse_transformation.py): refactor to use more flexible single getter for params which are separate config blocks

* test(test_main.py): add e2e mock test for bedrock performance config

* build(model_prices_and_context_window.json): add versioned multimodal embedding

* refactor(multimodal_embeddings/): migrate to config pattern

* feat(vertex_ai/multimodalembeddings): calculate usage for multimodal embedding calls

Enables cost calculation for multimodal embeddings

* feat(vertex_ai/multimodalembeddings): get usage object for embedding calls

ensures accurate cost tracking for vertexai multimodal embedding calls

* fix(embedding_handler.py): remove unused imports

* fix: fix linting errors

* fix: handle response api usage calculation

* test(test_vertex_ai_multimodal_embedding_transformation.py): update tests

* test: mark flaky test

* feat(vertex_ai/multimodal_embeddings/transformation.py): support text+image+video input

* docs(vertex.md): document sending text + image to vertex multimodal embeddings

* test: remove incorrect file

* fix(multimodal_embeddings/transformation.py): fix linting error

* style: remove unused import
2025-03-29 00:23:09 -07:00
Shixian Sheng
79e98ef012 Merge branch 'BerriAI:main' into main 2025-03-28 06:44:51 -04:00
Krish Dholakia
3543b2a808 Support discovering gemini, anthropic, xai models by calling their /v1/model endpoint (#9530)
* fix: initial commit for adding provider model discovery to gemini

* feat(gemini/): add model discovery for gemini/ route

* docs(set_keys.md): update docs to show you can check available gemini models as well

* feat(anthropic/): add model discovery for anthropic api key

* feat(xai/): add model discovery for XAI

enables checking what models an xai key can call

* ci: bump ci config yml

* fix(topaz/common_utils.py): fix linting error

* fix: fix linting error for python38
2025-03-27 22:50:48 -07:00
Shixian Sheng
4ec2175e21 Merge branch 'BerriAI:main' into main 2025-03-27 20:52:06 -04:00
Krrish Dholakia
bc1f2fe3b2 docs(index.md): document new team model flow 2025-03-27 12:14:29 -07:00
Gabriel Loiseau
43cfc7ec60 Fix typo gemini.md 2025-03-27 14:53:02 +01:00
Krrish Dholakia
a5fbe50f04 docs(openai.md): add gpt-4o-transcribe to docs 2025-03-26 23:23:35 -07:00
Ishaan Jaff
6f5b467bff docs prod.md 2025-03-26 21:04:36 -07:00
Ishaan Jaff
cb44f8d64d Merge pull request #9533 from BerriAI/litellm_stability_fixes
[Reliability Fixes] - Gracefully handle exceptions when DB is having an outage
2025-03-26 18:57:38 -07:00
Ishaan Jaff
78387eba53 allow_requests_on_db_unavailable 2025-03-26 18:48:18 -07:00
Ishaan Jaff
5b4cf0c167 docs fix 2025-03-26 18:32:58 -07:00
Ishaan Jaff
147548f151 Merge pull request #9542 from BerriAI/litellm_fix_vertex_ai_ft_models
[Feature]: Support for Fine-Tuned Vertex AI LLMs
2025-03-26 16:19:41 -07:00
Ishaan Jaff
4bc3780ac9 doc fix Fine-tuned Models 2025-03-26 14:09:27 -07:00
Ishaan Jaff
502044c179 docs vertex ft model 2025-03-26 14:08:19 -07:00
Ishaan Jaff
d10e8a57d9 docs vertex ft models 2025-03-26 14:03:42 -07:00