Ishaan Jaff
|
e3becc6514
|
refactor vtx image gen
|
2024-09-02 17:35:51 -07:00 |
|
Ishaan Jaff
|
638e6291f0
|
Merge pull request #5457 from BerriAI/litellm_track_spend_logs_for_vertex_pass_through_endpoints
[Feat-Proxy] track spend logs for vertex pass through endpoints
|
2024-08-31 16:30:15 -07:00 |
|
Ishaan Jaff
|
2474400796
|
fix cost tracking for vertex ai native
|
2024-08-31 08:22:27 -07:00 |
|
Ishaan Jaff
|
c60125d7be
|
add gcs bucket base
|
2024-08-30 10:41:39 -07:00 |
|
Ishaan Jaff
|
378182cba2
|
show all error types on swagger
|
2024-08-29 18:50:41 -07:00 |
|
Ishaan Jaff
|
359a003ac8
|
v0 add rerank on litellm proxy
|
2024-08-27 17:28:39 -07:00 |
|
Ishaan Jaff
|
8a72d1b9c1
|
fix set Caching Default Off
|
2024-08-24 09:43:39 -07:00 |
|
Krrish Dholakia
|
e7d914e96b
|
fix(proxy_server.py): fix post /v1/batches endpoint
Fixes https://github.com/BerriAI/litellm/issues/5279#issuecomment-2307919820
|
2024-08-23 20:38:00 -07:00 |
|
Krrish Dholakia
|
26bd9a409a
|
fix(proxy_server.py): support env vars for controlling global max parallel request retry/timeouts
fixes issue where litellm module level settings weren't working for global retries, due to time of init
|
2024-08-23 16:06:08 -07:00 |
|
Ishaan Jaff
|
dcd39dac00
|
init custom guardrail class
|
2024-08-23 10:54:42 -07:00 |
|
Krish Dholakia
|
11cbf60e4f
|
Merge branch 'main' into litellm_azure_batch_apis
|
2024-08-22 19:07:54 -07:00 |
|
Krrish Dholakia
|
b0706a6f8f
|
fix(proxy_server.py): expose flag to disable retries when max parallel request limit is hit
|
2024-08-22 16:49:52 -07:00 |
|
Krrish Dholakia
|
5959cf0b50
|
fix: fix linting errors
|
2024-08-22 15:51:59 -07:00 |
|
Krrish Dholakia
|
f9ab33cbc2
|
feat(proxy_server.py): support azure batch api endpoints
|
2024-08-22 15:21:43 -07:00 |
|
Krish Dholakia
|
f87f3987bd
|
Merge branch 'main' into litellm_redis_cluster
|
2024-08-22 11:06:14 -07:00 |
|
Ishaan Jaff
|
d2dd40e1d2
|
fix allow setting LiteLLM license as .env
|
2024-08-22 10:05:00 -07:00 |
|
Ishaan Jaff
|
468bf7c615
|
fix allow setting license in config.yaml
|
2024-08-22 09:45:15 -07:00 |
|
Ishaan Jaff
|
a067f0f926
|
add docstring for /embeddings and /completions
|
2024-08-22 09:30:47 -07:00 |
|
Ishaan Jaff
|
2caee891e0
|
add doc string for /chat/completions swagger
|
2024-08-22 09:27:40 -07:00 |
|
Ishaan Jaff
|
56057f278a
|
Merge branch 'main' into litellm_pass_through_vtx_multi_modal
|
2024-08-21 17:23:22 -07:00 |
|
Ishaan Jaff
|
152ebe2e7f
|
proxy - print embedding request when recieved
|
2024-08-21 17:00:18 -07:00 |
|
Krish Dholakia
|
3971880af4
|
Merge branch 'main' into litellm_disable_storing_master_key_hash_in_db
|
2024-08-21 15:37:25 -07:00 |
|
Krrish Dholakia
|
33c9c16388
|
feat(caching.py): redis cluster support
Closes https://github.com/BerriAI/litellm/issues/4358
|
2024-08-21 15:01:52 -07:00 |
|
Ishaan Jaff
|
b522ade709
|
docs semantic caching qdrant
|
2024-08-21 13:03:41 -07:00 |
|
Krrish Dholakia
|
9f83724697
|
feat(proxy_server.py): support disabling storing master key hash in db, for spend tracking
|
2024-08-21 12:35:37 -07:00 |
|
Krrish Dholakia
|
d163025fab
|
fix(proxy_server.py): fix invalid login message to not show passed in pwd
Closes https://github.com/BerriAI/litellm/issues/5290
|
2024-08-20 08:56:57 -07:00 |
|
Ishaan Jaff
|
338b3a6bae
|
Merge pull request #4868 from msabramo/allow-not-displaying-feedback-box
Allow not displaying feedback box
|
2024-08-20 08:53:45 -07:00 |
|
Ishaan Jaff
|
c82714757a
|
Merge pull request #5288 from BerriAI/litellm_aporia_refactor
[Feat] V2 aporia guardrails litellm
|
2024-08-19 20:41:45 -07:00 |
|
Ishaan Jaff
|
319690ab5e
|
feat - guardrails v2
|
2024-08-19 18:24:20 -07:00 |
|
Krrish Dholakia
|
4b15f5bc83
|
feat(langfuse_endpoints.py): support langfuse pass through endpoints by default
|
2024-08-19 17:28:34 -07:00 |
|
Ishaan Jaff
|
f16e0472c2
|
feat - return applied guardrails in response headers
|
2024-08-19 11:56:20 -07:00 |
|
Ishaan Jaff
|
b4bca8db82
|
feat - allow accessing data post success call
|
2024-08-19 11:35:33 -07:00 |
|
Krish Dholakia
|
5e6700f985
|
Merge pull request #5260 from BerriAI/google_ai_studio_pass_through
Pass-through endpoints for Gemini - Google AI Studio
|
2024-08-17 13:51:51 -07:00 |
|
Ishaan Jaff
|
db8f789318
|
Merge pull request #5259 from BerriAI/litellm_return_remaining_tokens_in_header
[Feat] return `x-litellm-key-remaining-requests-{model}`: 1, `x-litellm-key-remaining-tokens-{model}: None` in response headers
|
2024-08-17 12:41:16 -07:00 |
|
Ishaan Jaff
|
9f6630912d
|
feat return rmng tokens for model for api key
|
2024-08-17 12:35:10 -07:00 |
|
Krrish Dholakia
|
29bedae79f
|
feat(google_ai_studio_endpoints.py): support pass-through endpoint for all google ai studio requests
New Feature
|
2024-08-17 10:46:59 -07:00 |
|
Ishaan Jaff
|
a62277a6aa
|
feat - use commong helper for getting model group
|
2024-08-17 10:46:04 -07:00 |
|
Krrish Dholakia
|
2874b94fb1
|
refactor: replace .error() with .exception() logging for better debugging on sentry
|
2024-08-16 09:22:47 -07:00 |
|
Krish Dholakia
|
0f1b48162c
|
Merge branch 'main' into litellm_pass_through_endpoints_api
|
2024-08-15 22:39:19 -07:00 |
|
Krrish Dholakia
|
1c2decbfdd
|
feat(ui): for adding pass-through endpoints
|
2024-08-15 21:58:11 -07:00 |
|
Krrish Dholakia
|
362605ba65
|
feat(pass_through_endpoints.py): initial working CRUD endpoints for /pass_through_endoints
|
2024-08-15 21:23:26 -07:00 |
|
Ishaan Jaff
|
654df03678
|
refactor use 1 util for llm routing
|
2024-08-15 08:42:20 -07:00 |
|
Ishaan Jaff
|
4cbde8af39
|
use route_request for making llm call
|
2024-08-15 08:16:44 -07:00 |
|
Krrish Dholakia
|
7a67551927
|
feat(pass_through_endpoints.py): initial commit of crud endpoints for pass through endpoints
|
2024-08-14 21:36:07 -07:00 |
|
Krrish Dholakia
|
3ea4a3b8bd
|
feat(proxy_server.py): support returning available fields for pass_through_endpoints via `/config/field/list
|
2024-08-14 19:07:10 -07:00 |
|
Ishaan Jaff
|
2e3e06844c
|
feat read config from s3
|
2024-08-13 20:18:59 -07:00 |
|
Krrish Dholakia
|
3fa00408f1
|
fix(proxy_server.py): add info log when spend logs is skipped because disable_spend_logs=True .
|
2024-08-12 11:20:30 -07:00 |
|
Ishaan Jaff
|
dcf50ff219
|
feat add cron job for sending stats from prometheus
|
2024-08-10 16:56:03 -07:00 |
|
Krrish Dholakia
|
c919c15c4a
|
fix(proxy_server.py): ensure internal_user params only apply to internal_user role
|
2024-08-08 17:50:29 -07:00 |
|
Krrish Dholakia
|
488a78e5f4
|
fix(proxy_server.py): respect internal_user_budget_duration for sso user
|
2024-08-08 17:28:28 -07:00 |
|