Commit graph

18335 commits

Author SHA1 Message Date
Ishaan Jaff
1276c10338 migrate presidio to new guardrails 2024-09-04 10:12:23 -07:00
David Manouchehri
0268877f28
Add azure/gpt-4o-2024-08-06 pricing. (#5510) 2024-09-04 09:40:18 -07:00
Ishaan Jaff
4599a0ffd3 fix get llm provider logic 2024-09-04 09:28:23 -07:00
Ishaan Jaff
9ed4dad5ac test get llm provider 2024-09-04 09:22:19 -07:00
Krrish Dholakia
e7855329e8 style: ci/cd run again 2024-09-04 08:32:35 -07:00
Krrish Dholakia
25e49a59b2 test: skip flaky test 2024-09-04 08:20:57 -07:00
Ishaan Jaff
ca5a117544 dual cache use always read redis as True by default 2024-09-04 08:01:55 -07:00
Ishaan Jaff
db0ad40e98 test for pl obj 2024-09-04 08:00:49 -07:00
Ishaan Jaff
657b7eef82 add always read redis test 2024-09-04 07:57:23 -07:00
Ishaan Jaff
f5122e0325 ci/cd run again 2024-09-04 07:41:08 -07:00
Krish Dholakia
be3c7b401e
LiteLLM Minor fixes + improvements (08/03/2024) (#5488)
* fix(internal_user_endpoints.py): set budget_reset_at for /user/update

* fix(vertex_and_google_ai_studio_gemini.py): handle accumulated json

Fixes https://github.com/BerriAI/litellm/issues/5479

* fix(vertex_ai_and_gemini.py): fix assistant message function call when content is not None

Fixes https://github.com/BerriAI/litellm/issues/5490

* fix(proxy_server.py): generic state uuid for okta sso

* fix(lago.py): improve debug logs

Debugging for https://github.com/BerriAI/litellm/issues/5477

* docs(bedrock.md): add bedrock cross-region inferencing to docs

* fix(azure.py): return azure response headers on aembedding call

* feat(azure.py): return azure response headers for `/audio/transcription`

* fix(types/utils.py): standardize deepseek / anthropic prompt caching usage information

Closes https://github.com/BerriAI/litellm/issues/5285

* docs(usage.md): add docs on litellm usage object

* test(test_completion.py): mark flaky test
2024-09-03 21:21:34 -07:00
Ishaan Jaff
59042511c9 fix router debug logs 2024-09-03 20:18:10 -07:00
Ishaan Jaff
48c9d10e37 fix route debug logs 2024-09-03 20:14:06 -07:00
Ishaan Jaff
ea07622174 bump: version 1.44.15 → 1.44.16 2024-09-03 18:20:46 -07:00
Ishaan Jaff
5f73c1cb69
Merge pull request #5497 from BerriAI/litellm_bump_langfuse_v
[Feat-Proxy] bump langfuse sdk version on docker
2024-09-03 18:18:19 -07:00
Ishaan Jaff
256ab6d8e8 fix req.txt 2024-09-03 18:13:44 -07:00
Ishaan Jaff
ed951e5e35 bump langfuse sdk version on docker 2024-09-03 18:09:00 -07:00
Ishaan Jaff
e9701ffa36
Merge pull request #5494 from BerriAI/litellm_allow_setting_route_controls
[Feat-Proxy] Enterprise - allow controlling allowed private, public, admin only routes
2024-09-03 17:00:44 -07:00
Ishaan Jaff
115de195f1
Merge branch 'main' into litellm_allow_setting_route_controls 2024-09-03 17:00:22 -07:00
Ishaan Jaff
814bc67c84 docs control routes on proxy 2024-09-03 16:58:11 -07:00
Ishaan Jaff
dbe1879707 reset general settings post test 2024-09-03 15:33:22 -07:00
Ishaan Jaff
45e89f977a add test for admin only routes 2024-09-03 15:26:42 -07:00
Ishaan Jaff
0b63625673 add check for admin only routes 2024-09-03 15:03:32 -07:00
Ishaan Jaff
19dbfff620
Merge pull request #5489 from BerriAI/litellm_Add_secret_managers
[Feat] Add Google Secret Manager Support
2024-09-03 14:51:32 -07:00
Ishaan Jaff
0a42e14401
Merge pull request #5491 from BerriAI/dependabot/npm_and_yarn/docs/my-website/pagefind-1.1.1
Bump pagefind from 1.1.0 to 1.1.1 in /docs/my-website
2024-09-03 14:51:07 -07:00
Ishaan Jaff
aae2ba208d add test for allowed routes 2024-09-03 14:17:53 -07:00
Ishaan Jaff
fd6366d041 fix test google secret manager 2024-09-03 14:01:03 -07:00
Ishaan Jaff
253ef5f995 allow setting allowed routes on proxy 2024-09-03 13:59:31 -07:00
dependabot[bot]
ae37d60ba1
Bump pagefind from 1.1.0 to 1.1.1 in /docs/my-website
Bumps [pagefind](https://github.com/CloudCannon/pagefind) from 1.1.0 to 1.1.1.
- [Release notes](https://github.com/CloudCannon/pagefind/releases)
- [Changelog](https://github.com/CloudCannon/pagefind/blob/main/CHANGELOG.md)
- [Commits](https://github.com/CloudCannon/pagefind/compare/v1.1.0...v1.1.1)

---
updated-dependencies:
- dependency-name: pagefind
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-09-03 19:35:54 +00:00
Ishaan Jaff
e2372f7999
Merge pull request #5459 from yujonglee/update-canary
update canary
2024-09-03 11:58:43 -07:00
Ishaan Jaff
e33d634e5d mark test as flaky 2024-09-03 11:56:03 -07:00
Ishaan Jaff
d9843a59a7 docs secret manager link 2024-09-03 11:41:43 -07:00
Ishaan Jaff
04456ab19e test secret manager 2024-09-03 11:22:06 -07:00
Ishaan Jaff
3c898e23ea refactor secret managers 2024-09-03 10:58:02 -07:00
Ishaan Jaff
150f3c2cfa read from .env for secret manager 2024-09-03 10:53:52 -07:00
Ishaan Jaff
b0178a85cf refactor get_secret 2024-09-03 10:42:12 -07:00
Ishaan Jaff
1546a82f18 add sync_construct_request_headers 2024-09-03 10:36:10 -07:00
Krrish Dholakia
27090c97dd fix(proxy/_types.py): add lago 'charge_by' env var to proxy ui 2024-09-03 08:19:40 -07:00
Krrish Dholakia
9aa006d353 docs(bedrock.md): add multimodal embedding support to docs 2024-09-03 08:14:10 -07:00
Krrish Dholakia
cdfea7e5ae docs(batches.md): add loadbalancing multiple azure deployments on batches api to docs 2024-09-03 08:01:14 -07:00
Krrish Dholakia
ab6ddd1a49 docs(azure.md): add docs on azure token refresh 2024-09-03 07:47:15 -07:00
Krrish Dholakia
f5905e1000 docs(routing.md): add proxy loadbalancing tutorial 2024-09-03 07:38:19 -07:00
Krrish Dholakia
36ce43ed95 refactor: ci/cd run again 2024-09-03 07:17:02 -07:00
Krrish Dholakia
eadcc09f44 docs(enterprise.md): clarify how enterprise deployments work 2024-09-03 06:55:14 -07:00
Krrish Dholakia
2d47acad6c test: fix test 2024-09-03 06:40:52 -07:00
Krrish Dholakia
9df0588c2c docs(json_mode.md): update docs 2024-09-02 22:41:17 -07:00
Krish Dholakia
6ccff1b13e
fix(router.py): fix inherited type (#5485) 2024-09-02 22:03:21 -07:00
Krish Dholakia
9f3fa29624
feat(router.py): Support Loadbalancing batch azure api endpoints (#5469)
* feat(router.py): initial commit for loadbalancing azure batch api endpoints

Closes https://github.com/BerriAI/litellm/issues/5396

* fix(router.py): working `router.acreate_file()`

* feat(router.py): working router.acreate_batch endpoint

* feat(router.py): expose router.aretrieve_batch function

Make it easy for user to retrieve the batch information

* feat(router.py): support 'router.alist_batches' endpoint

Adds support for getting all batches across all endpoints

* feat(router.py): working loadbalancing on `/v1/files`

* feat(proxy_server.py): working loadbalancing on `/v1/batches`

* feat(proxy_server.py): working loadbalancing on Retrieve + List batch
2024-09-02 21:32:55 -07:00
Ishaan Jaff
7a22faaba4 bump: version 1.44.14 → 1.44.15 2024-09-02 21:27:23 -07:00
Ishaan Jaff
38bb8439b0
Merge pull request #5482 from BerriAI/litellm_track_embedding_spendLogs
[Feat] track embedding /predict in spend logs
2024-09-02 21:23:24 -07:00