Commit graph

9496 commits

Author SHA1 Message Date
Krrish Dholakia
0decc36bed fix(factory.py): handle message content being a list instead of string
Fixes https://github.com/BerriAI/litellm/issues/4679
2024-07-12 19:00:39 -07:00
Ishaan Jaff
70b96d12e9
Merge pull request #4685 from BerriAI/litellm_return_type_expired_key
[Fix] Proxy Return type=expire_key on expired Key errors
2024-07-12 18:52:51 -07:00
Krrish Dholakia
667fd2b376 docs(pass_through.md): add doc on creating custom chat endpoints on proxy
Allows developers to call proxy with anthropic sdk/boto3/etc.
2024-07-12 18:48:40 -07:00
Ishaan Jaff
7918f41aca test expired key raises correct exception 2024-07-12 18:45:01 -07:00
Ishaan Jaff
57ced1d25e raise roxyErrorTypes.expired_key on expired key 2024-07-12 18:41:39 -07:00
Ishaan Jaff
34ff0a7e57 raise expired_key error 2024-07-12 18:39:00 -07:00
Ishaan Jaff
92bf98b30f
Merge pull request #4684 from BerriAI/litellm_safe_memory_mode
[Feat] Allow safe memory mode
2024-07-12 18:32:16 -07:00
Ishaan Jaff
eb342bbe2c
Merge pull request #4683 from BerriAI/litellm_dealloc_in_mem_cache
[Fix] Mem Util - De Reference when removing from in-memory cache
2024-07-12 18:31:56 -07:00
Ishaan Jaff
24918c5041
Merge pull request #4682 from BerriAI/litellm_mem_leak_debug
show stack trace of 10 files taking up memory
2024-07-12 18:31:41 -07:00
Ishaan Jaff
cf5f11cc84
Merge pull request #4681 from BerriAI/litellm_mem_usage
[Fix] Reduce Mem Usage - only set ttl for requests to 2 mins
2024-07-12 18:31:19 -07:00
Ishaan Jaff
08efef5316 feat add safe_memory_mode 2024-07-12 18:18:39 -07:00
Ishaan Jaff
0099bf7859 de-ref unused cache items 2024-07-12 16:38:36 -07:00
Krrish Dholakia
fd743aaefd feat(opentelemetry.py): support logging call metadata to otel 2024-07-12 15:41:34 -07:00
Ishaan Jaff
1a8fce8edb show stack trace of 10 files tking up memory 2024-07-12 15:33:03 -07:00
Ishaan Jaff
8c8dcdbdb1 reduce ttil for update_request_status 2024-07-12 15:14:54 -07:00
Krrish Dholakia
f5b3cc6c02 fix(litellm_logging.py): fix condition check
Fixes https://github.com/BerriAI/litellm/issues/4633
2024-07-12 09:22:19 -07:00
Krrish Dholakia
88eb25da5c fix(bedrock_httpx.py): handle user error - malformed system prompt
if user passes in system prompt as a list of content blocks, handle that
2024-07-12 08:28:50 -07:00
Andrea Ponti
496445481d Rollback to metadata deepcopy 2024-07-12 11:25:23 +02:00
Krrish Dholakia
cff66d6151 fix(proxy_server.py): fix linting errors 2024-07-11 22:12:33 -07:00
Krish Dholakia
d72bcdbce3
Merge pull request #4669 from BerriAI/litellm_logging_only_masking
Flag for PII masking on Logging only
2024-07-11 22:03:37 -07:00
Krish Dholakia
f0b8c0e7fb
Merge pull request #4588 from Manouchehri/vertex-seed-2973
feat(vertex_httpx.py): Add seed parameter
2024-07-11 22:02:13 -07:00
Krish Dholakia
5ad341d0ff
Merge pull request #4607 from maamalama/helicone-cohere
Helicone Headers & Cohere support
2024-07-11 22:01:44 -07:00
Krish Dholakia
72f1c9181d
Merge branch 'main' into litellm_call_id_in_response 2024-07-11 21:54:49 -07:00
Krish Dholakia
79d6b69d1c
Merge pull request #4651 from msabramo/docs-logging-cleanup
Docs: Miscellaneous cleanup of `docs/my-website/docs/proxy/logging.md`
2024-07-11 21:52:20 -07:00
Krrish Dholakia
8d4e7f9967 test(test_assistants.py): handle openai api instability 2024-07-11 21:32:43 -07:00
Ishaan Jaff
4b8d33e6a8 ci/cd run again 2024-07-11 21:16:23 -07:00
Ishaan Jaff
8dbf0a634a fix supports vision test 2024-07-11 21:14:25 -07:00
Krrish Dholakia
5f5c925efd fix(guardrails.py): fix guardrail item typing 2024-07-11 21:01:56 -07:00
Krrish Dholakia
b2e46086dd fix(utils.py): fix recreating model response object when stream usage is true 2024-07-11 21:01:12 -07:00
Krrish Dholakia
1300223f51 test: fix test 2024-07-11 20:09:24 -07:00
Ishaan Jaff
e112379d2f ci/cd run again 2024-07-11 19:26:20 -07:00
Ishaan Jaff
aec468c0e9 ui new build 2024-07-11 19:13:08 -07:00
Krrish Dholakia
9d918d2ac7 fix(presidio_pii_masking.py): support logging_only pii masking 2024-07-11 18:04:12 -07:00
Krrish Dholakia
9deb9b4e3f feat(guardrails): Flag for PII Masking on Logging
Fixes https://github.com/BerriAI/litellm/issues/4580
2024-07-11 16:09:34 -07:00
Ishaan Jaff
28cfca87c1
Merge pull request #4647 from msabramo/msabramo/remove-unnecessary-imports
Remove unnecessary imports
2024-07-11 15:07:30 -07:00
Ishaan Jaff
17822db657
Merge pull request #4662 from msabramo/slack-alert-add-missing-space
Add missing space in "Failed Tracking Cost" Slack alert msg
2024-07-11 15:03:52 -07:00
Ishaan Jaff
8bf50ac5db
Merge pull request #4661 from BerriAI/litellm_fix_mh
[Fix] Model Hub - Show supports vision correctly
2024-07-11 15:03:37 -07:00
Krrish Dholakia
52b293e831 fix(types/utils.py): message role is always 'assistant' 2024-07-11 14:14:38 -07:00
Krrish Dholakia
77068575d7 fix: fix linting errors 2024-07-11 13:44:29 -07:00
Krrish Dholakia
389a51e05d fix: fix linting errors 2024-07-11 13:36:55 -07:00
Marc Abramowitz
fe27182e8e Add missing space in "Failed Tracking Cost" msg
Fixes stuff like this:

```
Message: Failed Tracking Cost forerror in tracking cost callback - Traceback (most recent call last):
                              ^^^^^^^^
```
2024-07-11 13:26:59 -07:00
Krrish Dholakia
6e9f048618 fix: move to using pydantic obj for setting values 2024-07-11 13:18:36 -07:00
Ishaan Jaff
46493303ed test get mode info for gemini/gemini-1.5-flash 2024-07-11 13:04:18 -07:00
Ishaan Jaff
341f88d191 fix supports vision 2024-07-11 12:59:42 -07:00
Krrish Dholakia
dd1048cb35 fix(main.py): fix linting errors 2024-07-11 12:11:50 -07:00
Krrish Dholakia
1ba3fcc3fb feat(utils.py): accept 'api_key' as param for validate_environment
Closes https://github.com/BerriAI/litellm/issues/4375
2024-07-11 12:02:23 -07:00
Krrish Dholakia
2163434ff3 fix(llm_cost_calc/google.py): fix google embedding cost calculation
Fixes https://github.com/BerriAI/litellm/issues/4630
2024-07-11 11:55:48 -07:00
Ishaan Jaff
e3470d8e91
Merge pull request #4658 from BerriAI/litellm_check_otel_spans
[Test-Proxy] Otel Traces
2024-07-11 10:41:51 -07:00
Ishaan Jaff
8d7db56deb fix Local only test. WIP 2024-07-11 10:30:40 -07:00
Ishaan Jaff
b4f8c7304f
Merge pull request #4652 from msabramo/shorter-success_callbacks-in-health-readiness-response
Shorter success callbacks from `/health/readiness`
2024-07-11 09:57:52 -07:00