Commit graph

3353 commits

Author SHA1 Message Date
Krrish Dholakia
33deeda300 feat(ollama.py): support ollama /api/embed endpoint
Closes https://github.com/BerriAI/litellm/issues/5291
2024-08-20 09:10:08 -07:00
Krrish Dholakia
6f8840daa1 fix(proxy_server.py): fix invalid login message to not show passed in pwd
Closes https://github.com/BerriAI/litellm/issues/5290
2024-08-20 08:56:57 -07:00
Ishaan Jaff
9ef6ae2f7c
Merge pull request #4868 from msabramo/allow-not-displaying-feedback-box
Allow not displaying feedback box
2024-08-20 08:53:45 -07:00
Ishaan Jaff
aceab2669f test guardrails with API Key 2024-08-20 08:40:00 -07:00
Ishaan Jaff
30e985d79c feat control guardrails per API Key 2024-08-20 08:16:28 -07:00
Krrish Dholakia
e747127e3b fix(pass_through_endpoints.py): fix query param pass through 2024-08-19 21:38:30 -07:00
Ishaan Jaff
cae51fc7fc fix _get_request_ip_address 2024-08-19 21:08:27 -07:00
Ishaan Jaff
b919a1a42e fix import error guardrails 2024-08-19 20:59:39 -07:00
Ishaan Jaff
c7b3978655
Merge pull request #5288 from BerriAI/litellm_aporia_refactor
[Feat] V2 aporia guardrails litellm
2024-08-19 20:41:45 -07:00
Ishaan Jaff
6f12f6dcf8 fix importing _ENTERPRISE_Aporia 2024-08-19 20:39:24 -07:00
Ishaan Jaff
0431600f7b add testing for aporia guardrails 2024-08-19 18:50:14 -07:00
Ishaan Jaff
ca9c81470b fix aporia typo 2024-08-19 18:34:45 -07:00
Ishaan Jaff
8cd1963c11 feat - guardrails v2 2024-08-19 18:24:20 -07:00
Krrish Dholakia
f9640d8a58 feat(langfuse_endpoints.py): support team based logging for langfuse pass-through endpoints 2024-08-19 17:58:39 -07:00
Krrish Dholakia
1701c48ad5 feat(langfuse_endpoints.py): support langfuse pass through endpoints by default 2024-08-19 17:28:34 -07:00
Krrish Dholakia
8ce8680a9a fix(proxy_cli.py): support database_host, database_username, database_password, database_name 2024-08-19 16:17:45 -07:00
Ishaan Jaff
b7d4031f89 doc aporia_w_litellm 2024-08-19 14:36:55 -07:00
Ishaan Jaff
a0361b0e76 run during_call_hook 2024-08-19 12:07:46 -07:00
Ishaan Jaff
613bd1babd feat - return applied guardrails in response headers 2024-08-19 11:56:20 -07:00
Ishaan Jaff
4685b9909a feat - allow accessing data post success call 2024-08-19 11:35:33 -07:00
Ishaan Jaff
8cb62213e1 feat run aporia as post call success hook 2024-08-19 11:25:31 -07:00
Krrish Dholakia
417547b6f9 fix(user_api_key_auth.py): log requester ip address to logs on request rejection
Closes https://github.com/BerriAI/litellm/issues/5220
2024-08-19 11:03:58 -07:00
Krrish Dholakia
3cafebbc65 test(test_caching.py): re-introduce testing for s3 cache w/ streaming
Closes https://github.com/BerriAI/litellm/issues/3268
2024-08-19 10:56:48 -07:00
Krrish Dholakia
cc42f96d6a fix(ollama_chat.py): fix sync tool calling
Fixes https://github.com/BerriAI/litellm/issues/5245
2024-08-19 08:31:46 -07:00
Ishaan Jaff
398295116f inly write model tpm/rpm tracking when user set it 2024-08-18 09:58:09 -07:00
Krish Dholakia
f42ac2c9d8
Merge pull request #5264 from BerriAI/litellm_bedrock_pass_through
feat: Bedrock pass-through endpoint support (All endpoints)
2024-08-18 09:55:22 -07:00
Krrish Dholakia
663a0c1b83 feat(Support-pass-through-for-bedrock-endpoints): Allows pass-through support for bedrock endpoints 2024-08-17 17:57:43 -07:00
Ishaan Jaff
83515e88ce
Merge pull request #5263 from BerriAI/litellm_support_access_groups
[Feat-Proxy] Use model access groups for teams
2024-08-17 17:11:11 -07:00
Krrish Dholakia
f7a2e04426 feat(pass_through_endpoints.py): add pass-through support for all cohere endpoints 2024-08-17 16:57:55 -07:00
Ishaan Jaff
08db691dec use model access groups for teams 2024-08-17 16:45:53 -07:00
Ishaan Jaff
eff874bf05 fix proxy all models test 2024-08-17 15:54:51 -07:00
Ishaan Jaff
b83fa87880 update tpm / rpm limit per model 2024-08-17 15:26:12 -07:00
Krrish Dholakia
db54b66457 style(vertex_httpx.py): make vertex error string more helpful 2024-08-17 15:09:55 -07:00
Ishaan Jaff
a60fc3ad70
Merge pull request #5261 from BerriAI/litellm_set_model_rpm_tpm_limit
[Feat-Proxy] set rpm/tpm limits per api key per model
2024-08-17 14:30:54 -07:00
Krish Dholakia
ff6ff133ee
Merge pull request #5260 from BerriAI/google_ai_studio_pass_through
Pass-through endpoints for Gemini - Google AI Studio
2024-08-17 13:51:51 -07:00
Ishaan Jaff
68b54bed85 add tpm limits per api key per model 2024-08-17 13:20:55 -07:00
Krrish Dholakia
fd44cf8d26 feat(pass_through_endpoints.py): support streaming requests 2024-08-17 12:46:57 -07:00
Ishaan Jaff
fa96610bbc fix async_pre_call_hook in parallel request limiter 2024-08-17 12:42:28 -07:00
Ishaan Jaff
feb8c3c5b4
Merge pull request #5259 from BerriAI/litellm_return_remaining_tokens_in_header
[Feat] return `x-litellm-key-remaining-requests-{model}`: 1, `x-litellm-key-remaining-tokens-{model}: None` in response headers
2024-08-17 12:41:16 -07:00
Ishaan Jaff
ee0f772b5c feat return rmng tokens for model for api key 2024-08-17 12:35:10 -07:00
Krrish Dholakia
bc0023a409 feat(google_ai_studio_endpoints.py): support pass-through endpoint for all google ai studio requests
New Feature
2024-08-17 10:46:59 -07:00
Ishaan Jaff
5985c7e933 feat - use commong helper for getting model group 2024-08-17 10:46:04 -07:00
Ishaan Jaff
412d30d362 add litellm-key-remaining-tokens on prometheus 2024-08-17 10:02:20 -07:00
Ishaan Jaff
785482f023 feat add settings for rpm/tpm limits for a model 2024-08-17 09:16:01 -07:00
Krrish Dholakia
b56ecd7e02 fix(pass_through_endpoints.py): fix returned response headers for pass-through endpoitns 2024-08-17 09:00:00 -07:00
Krrish Dholakia
08411f37b4 docs(vertex_ai.md): cleanup docs 2024-08-17 08:38:01 -07:00
Krish Dholakia
f3e17cd692
Merge branch 'main' into litellm_log_model_price_information 2024-08-16 19:34:16 -07:00
Krish Dholakia
a8dd2b6910
Merge pull request #5244 from BerriAI/litellm_better_error_logging_sentry
refactor: replace .error() with .exception() logging for better debugging on sentry
2024-08-16 19:16:20 -07:00
Ishaan Jaff
1ee33478c9 track rpm/tpm usage per key+model 2024-08-16 18:28:58 -07:00
Ishaan Jaff
a6a4b944ad user api key auth rpm_limit_per_model 2024-08-16 18:22:35 -07:00