litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-25 10:44:24 +00:00

Author	SHA1	Message	Date
Ishaan Jaff	6d4a6a8402	add spend tracking config.yaml	2025-03-31 19:42:00 -07:00
Ishaan Jaff	22dfc4dea9	fix user_api_key_auth example config	2025-03-26 08:36:11 -07:00
Ishaan Jaff	be25b298e6	fix async_moderation_hook	2025-03-12 18:45:54 -07:00
Ishaan Jaff	ed68ad7775	fix linting	2025-03-12 18:44:51 -07:00
Ishaan Jaff	f47987e673	(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013 ) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion	2025-03-06 00:43:08 -08:00
Ishaan Jaff	55b938dd6e	(Infra/DB) - Allow running older litellm version when out of sync with current state of DB (#8695 ) * fix check migration * clean up should_update_prisma_schema * update test * db_migration_disable_update_check * Check container logs for expected message * db_migration_disable_update_check * test_check_migration_out_of_sync * test_should_update_prisma_schema * db_migration_disable_update_check * pip install aiohttp	2025-02-20 18:30:23 -08:00
Ishaan Jaff	b8977f5e10	Revert "test fix use mock endpoints for e2e files and ft tests" This reverts commit `c921d8dd81`.	2025-02-15 15:28:18 -08:00
Ishaan Jaff	c921d8dd81	test fix use mock endpoints for e2e files and ft tests	2025-02-15 15:08:46 -08:00
Ishaan Jaff	64a4229606	(e2e testing) - add tests for using litellm `/team/` updates in multi-instance deployments with Redis (#8440 ) * add team block/unblock test * test_team_blocking_behavior_multi_instance * proxy_multi_instance_tests * test - Run Docker container 2	2025-02-10 19:33:27 -08:00
Ishaan Jaff	7e1b79d446	(Bug fix) - Langfuse / Callback settings stored in DB (#8251 ) * fix _decrypt_and_set_db_env_variables * fix proxy config * test callbacks in DB * test langfuse callbacks in db * test_e2e_langfuse_callbacks_in_db * proxy_store_model_in_db_tests * fix proxy_store_model_in_db_tests * proxy_store_model_in_db_tests * fix store_model_db_config.yaml * fix check_langfuse_request * fix test langfuse base url * ci/cd run again	2025-02-04 21:09:37 -08:00
Ishaan Jaff	d19614b8c0	(QA / testing) - Add e2e tests for key model access auth checks (#8000 ) * fix _model_matches_any_wildcard_pattern_in_list * test key model access checks * add key_model_access_denied to ProxyErrorTypes * update auth checks * test_model_access_update * test_team_model_access_patterns * fix _team_model_access_check * fix config used for otel testing * test fix test_call_with_invalid_model * fix model acces check tests * test_team_access_groups * test _model_matches_any_wildcard_pattern_in_list	2025-01-25 17:15:11 -08:00
Krish Dholakia	21e8f212d7	Litellm dev 12 25 2024 p3 (#7421 ) * refactor(prometheus.py): refactor to use a factory method for setting label values allows for enforcing end user id disabling on prometheus e2e * fix: fix linting error * fix(prometheus.py): ensure label factory drops end-user value if disabled by user * fix(prometheus.py): specify service_type in end user tracking get * test: fix test * test: add unit test for prometheus factory * test: improve test (cover flag not set scenario) * test(test_prometheus.py): e2e test covering if 'end_user_id' shows up in testing if disabled scrapes the `/metrics` endpoint and scans text to check if id appears in emitted metrics * fix(prometheus.py): stringify status code before logging it	2024-12-25 18:54:24 -08:00
Ishaan Jaff	47e12802df	(feat) `/batches` Add support for using `/batches` endpoints in OAI format (#7402 ) * run azure testing on ci/cd * update docs on azure batches endpoints * add input azure.jsonl * refactor - use separate file for batches endpoints * fixes for passing custom llm provider to /batch endpoints * pass custom llm provider to files endpoints * update azure batches doc * add info for azure batches api * update batches endpoints * use simple helper for raising proxy exception * update config.yml * fix imports * update tests * use existing settings * update env var used * update configs * update config.yml * update ft testing	2024-12-24 16:58:05 -08:00
Ishaan Jaff	b90b98b88f	(fix) LiteLLM Proxy fix GET `/files/{file_id:path}/content"` endpoint (#7342 ) * fix order of get_file_content * update e2 files tests * add e2 batches endpoint testing * update config.yml * write content to file * use correct oai_misc_config * fixes for openai batches endpoint testing * remove extra out file * fix input.jsonl	2024-12-20 21:27:45 -08:00
Krish Dholakia	516c2a6a70	Litellm remove circular imports (#7232 ) * fix(utils.py): initial commit to remove circular imports - moves llmproviders to utils.py * fix(router.py): fix 'litellm.EmbeddingResponse' import from router.py ' * refactor: fix litellm.ModelResponse import on pass through endpoints * refactor(litellm_logging.py): fix circular import for custom callbacks literal * fix(factory.py): fix circular imports inside prompt factory * fix(cost_calculator.py): fix circular import for 'litellm.Usage' * fix(proxy_server.py): fix potential circular import with `litellm.Router' * fix(proxy/utils.py): fix potential circular import in `litellm.Router` * fix: remove circular imports in 'auth_checks' and 'guardrails/' * fix(prompt_injection_detection.py): fix router impor t * fix(vertex_passthrough_logging_handler.py): fix potential circular imports in vertex pass through * fix(anthropic_pass_through_logging_handler.py): fix potential circular imports * fix(slack_alerting.py-+-ollama_chat.py): fix modelresponse import * fix(base.py): fix potential circular import * fix(handler.py): fix potential circular ref in codestral + cohere handler's * fix(azure.py): fix potential circular imports * fix(gpt_transformation.py): fix modelresponse import * fix(litellm_logging.py): add logging base class - simplify typing makes it easy for other files to type check the logging obj without introducing circular imports * fix(azure_ai/embed): fix potential circular import on handler.py * fix(databricks/): fix potential circular imports in databricks/ * fix(vertex_ai/): fix potential circular imports on vertex ai embeddings * fix(vertex_ai/image_gen): fix import * fix(watsonx-+-bedrock): cleanup imports * refactor(anthropic-pass-through-+-petals): cleanup imports * refactor(huggingface/): cleanup imports * fix(ollama-+-clarifai): cleanup circular imports * fix(openai_like/): fix impor t * fix(openai_like/): fix embedding handler cleanup imports * refactor(openai.py): cleanup imports * fix(sagemaker/transformation.py): fix import * ci(config.yml): add circular import test to ci/cd	2024-12-14 16:28:34 -08:00
Ishaan Jaff	ddfe687b13	(fix) don't block proxy startup if license check fails & using prometheus (#6839 ) * fix - don't block proxy startup if not a premium user * test_litellm_proxy_server_config_with_prometheus * add test for proxy startup * fix remove unused test * fix startup test * add comment on bad-license	2024-11-20 17:55:39 -08:00
Krish Dholakia	56e9047818	Litellm router max depth (#6501 ) * feat(router.py): add check for max fallback depth Prevent infinite loop for fallbacks Closes https://github.com/BerriAI/litellm/issues/6498 * test: update test * (fix) Prometheus - Log Postgres DB latency, status on prometheus (#6484) * fix logging DB fails on prometheus * unit testing log to otel wrapper * unit testing for service logger + prometheus * use LATENCY buckets for service logging * fix service logging * docs clarify vertex vs gemini * (router_strategy/) ensure all async functions use async cache methods (#6489) * fix router strat * use async set / get cache in router_strategy * add coverage for router strategy * fix imports * fix batch_get_cache * use async methods for least busy * fix least busy use async methods * fix test_dual_cache_increment * test async_get_available_deployment when routing_strategy="least-busy" * (fix) proxy - fix when `STORE_MODEL_IN_DB` should be set (#6492) * set store_model_in_db at the top * correctly use store_model_in_db global * (fix) `PrometheusServicesLogger` `_get_metric` should return metric in Registry (#6486) * fix logging DB fails on prometheus * unit testing log to otel wrapper * unit testing for service logger + prometheus * use LATENCY buckets for service logging * fix service logging * fix _get_metric in prom services logger * add clear doc string * unit testing for prom service logger * bump: version 1.51.0 → 1.51.1 * Add `azure/gpt-4o-mini-2024-07-18` to model_prices_and_context_window.json (#6477) * Update utils.py (#6468) Fixed missing keys * (perf) Litellm redis router fix - ~100ms improvement (#6483) * docs(exception_mapping.md): add missing exception types Fixes https://github.com/Aider-AI/aider/issues/2120#issuecomment-2438971183 * fix(main.py): register custom model pricing with specific key Ensure custom model pricing is registered to the specific model+provider key combination * test: make testing more robust for custom pricing * fix(redis_cache.py): instrument otel logging for sync redis calls ensures complete coverage for all redis cache calls * refactor: pass parent_otel_span for redis caching calls in router allows for more observability into what calls are causing latency issues * test: update tests with new params * refactor: ensure e2e otel tracing for router * refactor(router.py): add more otel tracing acrosss router catch all latency issues for router requests * fix: fix linting error * fix(router.py): fix linting error * fix: fix test * test: fix tests * fix(dual_cache.py): pass ttl to redis cache * fix: fix param * perf(cooldown_cache.py): improve cooldown cache, to store cache results in memory for 5s, prevents redis call from being made on each request reduces 100ms latency per call with caching enabled on router * fix: fix test * fix(cooldown_cache.py): handle if a result is None * fix(cooldown_cache.py): add debug statements * refactor(dual_cache.py): move to using an in-memory check for batch get cache, to prevent redis from being hit for every call * fix(cooldown_cache.py): fix linting erropr * build: merge main --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> Co-authored-by: Xingyao Wang <xingyao@all-hands.dev> Co-authored-by: vibhanshu-ob <115142120+vibhanshu-ob@users.noreply.github.com>	2024-10-29 22:05:41 -07:00
Ishaan Jaff	4d1b4beb3d	(refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208 ) * use folder for caching * fix importing caching * fix clickhouse pyright * fix linting * fix correctly pass kwargs and args * fix test case for embedding * fix linting * fix embedding caching logic * fix refactor handle utils.py * fix test_embedding_caching_azure_individual_items_reordered	2024-10-14 16:34:01 +05:30
Krish Dholakia	04e5963b65	Litellm expose disable schema update flag (#6085 ) * fix: enable new 'disable_prisma_schema_update' flag * build(config.yml): remove setup remote docker step * ci(config.yml): give container time to start up * ci(config.yml): update test * build(config.yml): actually start docker * build(config.yml): simplify grep check * fix(prisma_client.py): support reading disable_schema_update via env vars * ci(config.yml): add test to check if all general settings are documented * build(test_General_settings.py): check available dir * ci: check ../ repo path * build: check ./ * build: fix test	2024-10-05 21:26:51 -04:00
Krish Dholakia	d57be47b0f	Litellm ruff linting enforcement (#5992 ) * ci(config.yml): add a 'check_code_quality' step Addresses https://github.com/BerriAI/litellm/issues/5991 * ci(config.yml): check why circle ci doesn't pick up this test * ci(config.yml): fix to run 'check_code_quality' tests * fix(__init__.py): fix unprotected import * fix(__init__.py): don't remove unused imports * build(ruff.toml): update ruff.toml to ignore unused imports * fix: fix: ruff + pyright - fix linting + type-checking errors * fix: fix linting errors * fix(lago.py): fix module init error * fix: fix linting errors * ci(config.yml): cd into correct dir for checks * fix(proxy_server.py): fix linting error * fix(utils.py): fix bare except causes ruff linting errors * fix: ruff - fix remaining linting errors * fix(clickhouse.py): use standard logging object * fix(__init__.py): fix unprotected import * fix: ruff - fix linting errors * fix: fix linting errors * ci(config.yml): cleanup code qa step (formatting handled in local_testing) * fix(_health_endpoints.py): fix ruff linting errors * ci(config.yml): just use ruff in check_code_quality pipeline for now * build(custom_guardrail.py): include missing file * style(embedding_handler.py): fix ruff check	2024-10-01 19:44:20 -04:00
Krrish Dholakia	3fc4ae0d65	build(custom_guardrail.py): include missing file	2024-10-01 17:18:52 -04:00
Ishaan Jaff	711932294c	[Feat] Add testing for prometheus failure metrics (#5823 ) * prom - show status code and class type on prom * log exception_class name on prometheus metrics * prometheus track error code and status * add bad model * add prometheus failure metric test * remove outdated file * fix litellm_proxy_total_requests_metric * add prometheus metrics testing	2024-09-21 11:36:29 -07:00
Ishaan Jaff	1973ae8fb8	[Feat] Allow setting `supports_vision` for Custom OpenAI endpoints + Added testing (#5821 ) * add test for using images with custom openai endpoints * run all otel tests * update name of test * add custom openai model to test config * add test for setting supports_vision=True for model * fix test guardrails aporia * docs supports vison * fix yaml * fix yaml * docs supports vision * fix bedrock guardrail test * fix cohere rerank test * update model_group doc string * add better prints on test	2024-09-21 11:35:55 -07:00
Ishaan Jaff	b6ae2204a8	[Feat-Proxy] Slack Alerting - allow using os.environ/ vars for alert to webhook url (#5726 ) * allow using os.environ for slack urls * use env vars for webhook urls * fix types for get_secret * fix linting * fix linting * fix linting * linting fixes * linting fix * docs alerting slack * fix get data	2024-09-16 18:03:37 -07:00
Ishaan Jaff	414d2dcb52	call spend logs endpoint	2024-08-30 16:35:07 -07:00
Ishaan Jaff	f3f85f6141	add test for vertex basic pass throgh	2024-08-30 16:26:00 -07:00
Ishaan Jaff	8ed0ffea54	fix use existing custom_auth.py	2024-08-30 16:22:28 -07:00
Ishaan Jaff	e1e1e2e566	add example custom	2024-08-30 15:46:45 -07:00
Ishaan Jaff	a4b88c16dc	fix indentation	2024-08-29 17:01:23 -07:00
Ishaan Jaff	da2cefc45a	fix team based tag routing	2024-08-29 14:37:44 -07:00
Ishaan Jaff	f592aeaa38	add test_chat_completion_with_no_tags	2024-08-29 13:54:11 -07:00
Ishaan Jaff	c27640e6e4	add /rerank test	2024-08-27 17:50:37 -07:00
Ishaan Jaff	fb5be57bb8	v0 add rerank on litellm proxy	2024-08-27 17:28:39 -07:00
Ishaan Jaff	7d30188f84	custom_callbacks	2024-08-23 09:52:52 -07:00
Ishaan Jaff	1f0cc72531	test bedrock guardrails	2024-08-22 17:24:42 -07:00
Ishaan Jaff	0431600f7b	add testing for aporia guardrails	2024-08-19 18:50:14 -07:00
Ishaan Jaff	02ab3cb73d	test- otel span recording	2024-07-11 08:47:16 -07:00
Krrish Dholakia	4a3b084961	feat(bedrock_httpx.py): moves to using httpx client for bedrock cohere calls	2024-05-11 13:43:08 -07:00
ishaan-jaff	13eb40e7bd	v0 using custom_key_generate	2024-01-20 08:39:52 -08:00
Krrish Dholakia	2070a785a4	feat(utils.py): support google kms for secret management https://github.com/BerriAI/litellm/issues/1235	2023-12-26 15:39:40 +05:30
ishaan-jaff	ac486a3c4a	(docs) add example config.yaml	2023-12-04 18:08:57 -08:00
ishaan-jaff	07a2035651	(chore) rm old config examples	2023-12-04 13:26:55 -08:00
ishaan-jaff	50284771b7	(test) test_reading proxy	2023-12-04 13:24:41 -08:00
ishaan-jaff	de4a7b719d	(test) proxy: reading config.yaml	2023-12-04 13:16:19 -08:00
ishaan-jaff	89cd54094c	(docs) proxy: add example OTEL config yaml	2023-12-02 11:22:40 -08:00
ishaan-jaff	a8a6838867	(docs) example: azure config.yaml	2023-11-30 13:16:41 -08:00
ishaan-jaff	213b345a43	(docs) example hosted litellm yaml	2023-11-21 16:59:33 -08:00
ishaan-jaff	5835e6ed04	(docs) proxy queue config yaml	2023-11-21 16:22:00 -08:00
ishaan-jaff	d1af0af7bf	(docs) load balancer	2023-11-17 17:25:46 -08:00
ishaan-jaff	42432bedaa	(docs) add example load balancer	2023-11-17 17:25:12 -08:00

1 2

52 commits