Ishaan Jaff
|
11d1f4e430
|
doc - control guradrail per api key
|
2024-07-02 17:50:09 -07:00 |
|
Ishaan Jaff
|
f61eb8dfa1
|
docs control lakera ai per request
|
2024-07-02 17:38:07 -07:00 |
|
Ishaan Jaff
|
64e86c3305
|
docs control lakera ai per call
|
2024-07-02 17:34:48 -07:00 |
|
Ishaan Jaff
|
b6af67344c
|
feat - control lakera per call
|
2024-07-02 17:34:05 -07:00 |
|
Krish Dholakia
|
f87535140c
|
Merge pull request #4523 from BerriAI/litellm_azure_img_gen_refactor
refactor(azure.py): move azure dall-e calls to httpx client
|
2024-07-02 17:15:41 -07:00 |
|
Krrish Dholakia
|
cf5334fe8a
|
refactor(azure.py): refactor sync azure calls to httpx
|
2024-07-02 17:06:48 -07:00 |
|
Ishaan Jaff
|
9f8572e427
|
check if key does not want secret detection to run
|
2024-07-02 17:05:53 -07:00 |
|
Ishaan Jaff
|
174b2b69df
|
Merge pull request #4518 from BerriAI/litellm_fix_background_health_checks
[Fix-Proxy] Background health checks use deep copy of model list for _run_background_health_check
|
2024-07-02 16:42:34 -07:00 |
|
Ishaan Jaff
|
90a0db5618
|
Merge pull request #4519 from BerriAI/litellm_re_use_openai_azure_clients_whisper
[Fix+Test] /audio/transcriptions - use initialized OpenAI / Azure OpenAI clients
|
2024-07-02 16:42:22 -07:00 |
|
andrewmjc
|
e07b110b47
|
matching openai tool result spec
|
2024-07-02 16:57:13 -06:00 |
|
Peter Muller
|
47c97e1fa2
|
Fix test name typo in comment
|
2024-07-02 15:38:15 -07:00 |
|
Krrish Dholakia
|
589c1c6280
|
refactor(azure.py): replaces the custom transport logic for just using our httpx client
Done to fix all the http/https proxy issues people are facing with proxy.
|
2024-07-02 15:32:53 -07:00 |
|
Peter Muller
|
d9e9a8645b
|
Add tests for SageMaker region selection
|
2024-07-02 15:30:39 -07:00 |
|
Krish Dholakia
|
612af8f5be
|
Merge pull request #4492 from Manouchehri/gemini-context-caching-1
feat(vertex_httpx.py): Support cachedContent.
|
2024-07-02 14:09:25 -07:00 |
|
Krish Dholakia
|
c4e11e03d7
|
Merge pull request #4520 from BerriAI/litellm_fix_request_hanging_alert
fix(slack_alerting.py): use in-memory cache for checking request status
|
2024-07-02 13:48:19 -07:00 |
|
Krrish Dholakia
|
66c6992f8a
|
fix(slack_alerting.py): use in-memory cache for checking request status
|
2024-07-02 13:01:59 -07:00 |
|
Ishaan Jaff
|
ce7fade15e
|
test whisper re-using openai/azure clients
|
2024-07-02 12:35:15 -07:00 |
|
Ishaan Jaff
|
2b5f3c6105
|
fix use router level client for OpenAI / Azure transcription calls
|
2024-07-02 12:33:31 -07:00 |
|
Krrish Dholakia
|
3d3f725ef5
|
docs(user_keys.md): add langchain js example to docs
|
2024-07-02 12:06:48 -07:00 |
|
Ishaan Jaff
|
cd6b121642
|
use deep copy of router for _run_background_health_check
|
2024-07-02 11:29:24 -07:00 |
|
Tiger Yu
|
26630cd263
|
Merge branch 'main' into litellm-fix-vertexaibeta
|
2024-07-02 09:49:44 -07:00 |
|
Krrish Dholakia
|
79670ab82e
|
fix(main.py): get the region name from boto3 client if dynamic var not set
|
2024-07-02 09:24:07 -07:00 |
|
Krrish Dholakia
|
5aae2313f3
|
fix(aws_secret_manager.py): fix string replace
|
2024-07-02 00:42:12 -07:00 |
|
Krrish Dholakia
|
196b94455e
|
fix(dynamic_rate_limiter.py): add rpm allocation, priority + quota reservation to docs
|
2024-07-01 23:35:42 -07:00 |
|
Krish Dholakia
|
4ab83f0f46
|
Merge pull request #4497 from BerriAI/litellm_disable_cooldowns
fix(router.py): disable cooldowns
|
2024-07-01 23:10:18 -07:00 |
|
Krish Dholakia
|
011e14eb08
|
Merge branch 'main' into litellm_disable_cooldowns
|
2024-07-01 23:10:10 -07:00 |
|
Krrish Dholakia
|
6b529d4e0e
|
fix(dynamic_rate_limiter.py): support setting priority + reserving tpm/rpm
|
2024-07-01 23:08:54 -07:00 |
|
Ishaan Jaff
|
38770da2f6
|
docs prometheus tracking x-remaining tokens
|
2024-07-01 22:45:33 -07:00 |
|
Ishaan Jaff
|
5f04ef14a6
|
doc prometheus tracking
|
2024-07-01 22:41:52 -07:00 |
|
Ishaan Jaff
|
e2a2c2bde1
|
ci/cd run again
|
2024-07-01 21:36:30 -07:00 |
|
Ishaan Jaff
|
4cb098661a
|
bump: version 1.41.2 → 1.41.3
|
2024-07-01 21:35:52 -07:00 |
|
Ishaan Jaff
|
4bb418acf3
|
Merge pull request #4504 from BerriAI/litellm_fix_exception_provider_not_known
fix exception provider not known
|
2024-07-01 21:22:20 -07:00 |
|
Ishaan Jaff
|
665d8fb250
|
test - test_azure_embedding_exceptions
|
2024-07-01 21:19:47 -07:00 |
|
Ishaan Jaff
|
402799c8db
|
Merge pull request #4503 from BerriAI/litellm_log_remaining_rate_limit_prometheus
[Feat-Enterprise] log `"x-ratelimit-remaining-tokens"` and `"x-ratelimit-remaining-requests"` on prometheus
|
2024-07-01 21:11:42 -07:00 |
|
Ishaan Jaff
|
1c194f0275
|
Merge pull request #4501 from BerriAI/litellm_return_Response_headers
[Feat] Return Response headers for OpenAI / Azure OpenAI when `litellm.return_response_headers=True`
|
2024-07-01 21:11:10 -07:00 |
|
Ishaan Jaff
|
fcf65d5215
|
fix exception provider not known
|
2024-07-01 21:05:37 -07:00 |
|
Ishaan Jaff
|
4033302656
|
feat - return headers for openai audio transcriptions
|
2024-07-01 20:27:27 -07:00 |
|
Krrish Dholakia
|
460c33f70f
|
test(test_dynamic_rate_limit_handler.py): add unit tests for dynamic rpm limits
|
2024-07-01 20:20:24 -07:00 |
|
Krrish Dholakia
|
0781014706
|
test(test_dynamic_rate_limit_handler.py): refactor tests for rpm suppprt
|
2024-07-01 20:16:10 -07:00 |
|
Ishaan Jaff
|
568245b5c0
|
feat - set response headers in azure requests
|
2024-07-01 20:12:39 -07:00 |
|
Ishaan Jaff
|
c37d45d556
|
feat - prometheus log remaining headers
|
2024-07-01 20:00:47 -07:00 |
|
Krrish Dholakia
|
f23b17091d
|
fix(dynamic_rate_limiter.py): support dynamic rate limiting on rpm
|
2024-07-01 17:45:10 -07:00 |
|
Ishaan Jaff
|
04a975d486
|
feat - add response_headers in litellm_logging_obj
|
2024-07-01 17:25:15 -07:00 |
|
Krrish Dholakia
|
b761ce9dd1
|
fix(s3.py): fix s3 path logging on proxy
|
2024-07-01 17:22:31 -07:00 |
|
Ishaan Jaff
|
140f7fe254
|
return azure response headers
|
2024-07-01 17:09:06 -07:00 |
|
Ishaan Jaff
|
48946f7528
|
fix config
|
2024-07-01 17:02:15 -07:00 |
|
Ishaan Jaff
|
4b7feb3261
|
feat - return response headers for async openai requests
|
2024-07-01 17:01:42 -07:00 |
|
Peter Muller
|
c6be8326db
|
Allow calling SageMaker endpoints from different regions
|
2024-07-01 16:00:42 -07:00 |
|
Krrish Dholakia
|
a5439a621b
|
docs(debugging.md): add docs on debugging common errors
|
2024-07-01 15:13:19 -07:00 |
|
Krrish Dholakia
|
051dfee421
|
docs(routing.md): add docs on how to disable cooldowns
|
2024-07-01 15:05:38 -07:00 |
|