litellm

Author	SHA1	Message	Date
Ishaan Jaff	cc83463707	fix test_async_log_proxy_authentication_errors_get_request	2024-11-27 11:58:09 -08:00
Ishaan Jaff	68e59824a3	(feat) Allow using include to include external YAML files in a config.yaml (#6922 ) * add helper to process inlcudes directive on yaml * add doc on config management * unit tests for `include` on config.yaml	2024-11-26 20:27:12 -08:00
Ishaan Jaff	4bc06392db	(feat) log proxy auth errors on datadog (#6931 ) * add new dd type for auth errors * add async_log_proxy_authentication_errors * fix comment * use async_log_proxy_authentication_errors * test_datadog_post_call_failure_hook * test_async_log_proxy_authentication_errors	2024-11-26 20:26:57 -08:00
Ishaan Jaff	aea68cbeb6	(feat) DataDog Logger - Add Failure logging + use Standard Logging payload (#6929 ) * add async_log_failure_event for dd * use standard logging payload for DD logging * use standard logging payload for DD * fix use SLP status * allow opting into _create_v0_logging_payload * add unit tests for DD logging payload * fix dd logging tests	2024-11-26 19:27:06 -08:00
Ishaan Jaff	8fd3bf34d8	(feat) pass through llm endpoints - add `PATCH` support (vertex context caching requires for update ops) (#6924 ) * add PATCH for pass through endpoints * test_pass_through_routes_support_all_methods	2024-11-26 14:39:13 -08:00
Krish Dholakia	8673f2541e	fix(key_management_endpoints.py): fix user-membership check when creating team key (#6890 ) * fix(key_management_endpoints.py): fix user-membership check when creating team key * docs: add deprecation notice on original `/v1/messages` endpoint + add better swagger tags on pass-through endpoints * fix(gemini/): fix image_url handling for gemini Fixes https://github.com/BerriAI/litellm/issues/6897 * fix(teams.tsx): fix member add when role is 'user' * fix(team_endpoints.py): /team/member_add fix adding several new members to team * test(test_vertex.py): remove redundant test * test(test_proxy_server.py): fix team member add tests	2024-11-26 14:19:24 +05:30
Krrish Dholakia	0b15662c6e	test: temporarily comment out doc test - fix ci/cd issue in separate pr	2024-11-26 13:52:40 +05:30
Krrish Dholakia	fd288c5081	test: fix test	2024-11-26 13:48:08 +05:30
Krrish Dholakia	195112565d	test: fix documentation tests	2024-11-26 13:45:00 +05:30
Ishaan Jaff	c285132ad6	(docs) Simplify `/vertex_ai/` pass through docs (#6910 ) * simplify vertex pass through docs * allow using known path for setting up pass throughs * add unit testing for vtx pass through auth	2024-11-25 23:57:50 -08:00
Ishaan Jaff	5c854650c2	(redis fix) - fix `AbstractConnection.__init__() got an unexpected keyword argument 'ssl'` (#6908 ) * add better debugging for get_redis_connection_pool + allow passing ssl=None * test_redis_with_ssl * test_redis_with_ssl * test_redis_with_ssl	2024-11-25 22:52:44 -08:00
Ishaan Jaff	552c0dd7a4	(fix) pass through endpoints - run logging async + use thread pool executor for sync logging callbacks (#6907 ) * run pass through logging async * fix use thread_pool_executor for pass through logging * test_pass_through_request_logging_failure_with_stream * fix anthropic pt logging test * test_pass_through_request_logging_failure	2024-11-25 22:52:05 -08:00
Ishaan Jaff	c60261c3bc	(feat) Add support for using @google/generative-ai JS with LiteLLM Proxy (#6899 ) * feat - allow using gemini js SDK with LiteLLM * add auth for gemini_proxy_route * basic local test for js * test cost tagging gemini js requests * add js sdk test for gemini with litellm * add docs on gemini JS SDK * run node.js tests * fix google ai studio tests * fix vertex js spend test	2024-11-25 13:13:03 -08:00
Ishaan Jaff	f77bf49772	feat - allow sending `tags` on vertex pass through requests (#6876 ) * feat - allow tagging vertex JS SDK request * add unit testing for passing headers for pass through endpoints * fix allow using vertex_ai as the primary way for pass through vertex endpoints * docs on vertex js pass tags * add e2e test for vertex pass through with spend tags * add e2e tests for streaming vertex JS with tags * fix vertex ai testing	2024-11-25 12:12:09 -08:00
Ishaan Jaff	c73ce95c01	(feat) - provider budget improvements - ensure provider budgets work with multiple proxy instances + improve latency to ~90ms (#6886 ) * use 1 file for duration_in_seconds * add to readme.md * re use duration_in_seconds * fix importing _extract_from_regex, get_last_day_of_month * fix import * update provider budget routing * fix - remove dup test * add support for using in multi instance environments * test_in_memory_redis_sync_e2e * test_in_memory_redis_sync_e2e * fix test_in_memory_redis_sync_e2e * fix code quality check * fix test provider budgets * working provider budget tests * add fixture for provider budget routing * fix router testing for provider budgets * add comments on provider budget routing * use RedisPipelineIncrementOperation * add redis async_increment_pipeline * use redis async_increment_pipeline * use lower value for testing * use redis async_increment_pipeline * use consistent key name for increment op * add handling for budget windows * fix typing async_increment_pipeline * fix set attr * add clear doc strings * unit testing for provider budgets * test_redis_increment_pipeline	2024-11-24 16:36:19 -08:00
Ishaan Jaff	34bfebe470	(QOL improvement) Provider budget routing - allow using 1s, 1d, 1mo, 2mo etc (#6885 ) * use 1 file for duration_in_seconds * add to readme.md * re use duration_in_seconds * fix importing _extract_from_regex, get_last_day_of_month * fix import * update provider budget routing * fix - remove dup test	2024-11-23 16:59:46 -08:00
Krish Dholakia	424b8b0231	Litellm dev 11 23 2024 (#6881 ) * build(ui/create_key_button.tsx): support adding tags for cost tracking/routing when making key * LiteLLM Minor Fixes & Improvements (11/23/2024) (#6870) * feat(pass_through_endpoints/): support logging anthropic/gemini pass through calls to langfuse/s3/etc. * fix(utils.py): allow disabling end user cost tracking with new param Allows proxy admin to disable cost tracking for end user - keeps prometheus metrics small * docs(configs.md): add disable_end_user_cost_tracking reference to docs * feat(key_management_endpoints.py): add support for restricting access to `/key/generate` by team/proxy level role Enables admin to restrict key creation, and assign team admins to handle distributing keys * test(test_key_management.py): add unit testing for personal / team key restriction checks * docs: add docs on restricting key creation * docs(finetuned_models.md): add new guide on calling finetuned models * docs(input.md): cleanup anthropic supported params Closes https://github.com/BerriAI/litellm/issues/6856 * test(test_embedding.py): add test for passing extra headers via embedding * feat(cohere/embed): pass client to async embedding * feat(rerank.py): add `/v1/rerank` if missing for cohere base url Closes https://github.com/BerriAI/litellm/issues/6844 * fix(main.py): pass extra_headers param to openai Fixes https://github.com/BerriAI/litellm/issues/6836 * fix(litellm_logging.py): don't disable global callbacks when dynamic callbacks are set Fixes issue where global callbacks - e.g. prometheus were overriden when langfuse was set dynamically * fix(handler.py): fix linting error * fix: fix typing * build: add conftest to proxy_admin_ui_tests/ * test: fix test * fix: fix linting errors * test: fix test * fix: fix pass through testing * feat(key_management_endpoints.py): allow proxy_admin to enforce params on key creation allows admin to force team keys to have tags * build(ui/): show teams in leftnav + allow team admin to add new members * build(ui/): show created tags in dropdown makes it easier for admin to add tags to keys * test(test_key_management.py): fix test * test: fix test * fix playwright e2e ui test * fix e2e ui testing deps * fix: fix linting errors * fix e2e ui testing * fix e2e ui testing, only run e2e ui testing in playwright --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2024-11-23 22:37:16 +05:30
Ishaan Jaff	f3ffa67553	fix e2e ui testing	2024-11-23 08:45:14 -08:00
Ishaan Jaff	a8b4e1cc03	fix playwright e2e ui test	2024-11-23 08:34:55 -08:00
Krish Dholakia	7e9d8b58f6	LiteLLM Minor Fixes & Improvements (11/23/2024) (#6870 ) * feat(pass_through_endpoints/): support logging anthropic/gemini pass through calls to langfuse/s3/etc. * fix(utils.py): allow disabling end user cost tracking with new param Allows proxy admin to disable cost tracking for end user - keeps prometheus metrics small * docs(configs.md): add disable_end_user_cost_tracking reference to docs * feat(key_management_endpoints.py): add support for restricting access to `/key/generate` by team/proxy level role Enables admin to restrict key creation, and assign team admins to handle distributing keys * test(test_key_management.py): add unit testing for personal / team key restriction checks * docs: add docs on restricting key creation * docs(finetuned_models.md): add new guide on calling finetuned models * docs(input.md): cleanup anthropic supported params Closes https://github.com/BerriAI/litellm/issues/6856 * test(test_embedding.py): add test for passing extra headers via embedding * feat(cohere/embed): pass client to async embedding * feat(rerank.py): add `/v1/rerank` if missing for cohere base url Closes https://github.com/BerriAI/litellm/issues/6844 * fix(main.py): pass extra_headers param to openai Fixes https://github.com/BerriAI/litellm/issues/6836 * fix(litellm_logging.py): don't disable global callbacks when dynamic callbacks are set Fixes issue where global callbacks - e.g. prometheus were overriden when langfuse was set dynamically * fix(handler.py): fix linting error * fix: fix typing * build: add conftest to proxy_admin_ui_tests/ * test: fix test * fix: fix linting errors * test: fix test * fix: fix pass through testing	2024-11-23 15:17:40 +05:30
Ishaan Jaff	d81ae45827	(Perf / latency improvement) improve pass through endpoint latency to ~50ms (before PR was 400ms) (#6874 ) * use correct location for types * fix types location * perf improvement for pass through endpoints * update lint check * fix import * fix ensure async clients test * fix azure.py health check * fix ollama	2024-11-22 18:47:26 -08:00
Ishaan Jaff	b2b3e40d13	(feat) use `@google-cloud/vertexai` js sdk with litellm (#6873 ) * stash gemini JS test * add vertex js sdj example * handle vertex pass through separately * tes vertex JS sdk * fix vertex_proxy_route * use PassThroughStreamingHandler * fix PassThroughStreamingHandler * use common _create_vertex_response_logging_payload_for_generate_content * test vertex js * add working vertex jest tests * move basic bass through test * use good name for test * test vertex * test_chunk_processor_yields_raw_bytes * unit tests for streaming * test_convert_raw_bytes_to_str_lines * run unit tests 1st * simplify local * docs add usage example for js * use get_litellm_virtual_key * add unit tests for vertex pass through	2024-11-22 16:50:10 -08:00
Krrish Dholakia	d8e5134935	test: skip flaky test	2024-11-22 19:23:36 +05:30
Ishaan Jaff	a6220f7a40	test - also try diff host for langfuse	2024-11-21 23:51:58 -08:00
Ishaan Jaff	701c154e35	fix test_aaateam_logging	2024-11-21 23:47:38 -08:00
Ishaan Jaff	b903134cc9	ci/cd run again	2024-11-21 23:12:54 -08:00
Ishaan Jaff	952dbb9eb7	test_langfuse_masked_input_output	2024-11-21 22:59:36 -08:00
Ishaan Jaff	366a6895e2	test_langfuse_masked_input_output	2024-11-21 22:54:18 -08:00
Ishaan Jaff	be0f0dd345	test_langfuse_masked_input_output	2024-11-21 22:51:19 -08:00
Ishaan Jaff	027967d260	test_langfuse_logging_audio_transcriptions	2024-11-21 22:46:23 -08:00
Ishaan Jaff	f398c9b172	fix test_aaateam_logging	2024-11-21 22:36:44 -08:00
Ishaan Jaff	5a2e5b43c4	fix test_aaapass_through_endpoint_pass_through_keys_langfuse	2024-11-21 22:05:00 -08:00
Ishaan Jaff	e0921da38c	test_team_logging	2024-11-21 22:01:12 -08:00
Ishaan Jaff	f77bd9a99c	test_aaalangfuse_logging_metadata	2024-11-21 21:56:36 -08:00
Ishaan Jaff	6717929206	(Feat) Allow passing `litellm_metadata` to pass through endpoints + Add e2e tests for /anthropic/ usage tracking (#6864 ) * allow passing _litellm_metadata in pass through endpoints * fix _create_anthropic_response_logging_payload * include litellm_call_id in logging * add e2e testing for anthropic spend logs * add testing for spend logs payload * add example with anthropic python SDK	2024-11-21 21:41:05 -08:00
Ishaan Jaff	b8af46e1a2	(feat) Add usage tracking for streaming `/anthropic` passthrough routes (#6842 ) * use 1 file for AnthropicPassthroughLoggingHandler * add support for anthropic streaming usage tracking * ci/cd run again * fix - add real streaming for anthropic pass through * remove unused function stream_response * working anthropic streaming logging * fix code quality * fix use 1 file for vertex success handler * use helper for _handle_logging_vertex_collected_chunks * enforce vertex streaming to use sse for streaming * test test_basic_vertex_ai_pass_through_streaming_with_spendlog * fix type hints * add comment * fix linting * add pass through logging unit testing	2024-11-21 19:36:03 -08:00
Ishaan Jaff	920f4c9f82	(fix) add linting check to ban creating `AsyncHTTPHandler` during LLM calling (#6855 ) * fix triton * fix TEXT_COMPLETION_CODESTRAL * fix REPLICATE * fix CLARIFAI * fix HUGGINGFACE * add test_no_async_http_handler_usage * fix PREDIBASE * fix anthropic use get_async_httpx_client * fix vertex fine tuning * fix dbricks get_async_httpx_client * fix get_async_httpx_client vertex * fix get_async_httpx_client * fix get_async_httpx_client * fix make_async_azure_httpx_request * fix check_for_async_http_handler * test: cleanup mistral model * add check for AsyncClient * fix check_for_async_http_handler * fix get_async_httpx_client * fix tests using in_memory_llm_clients_cache * fix langfuse import * fix import --------- Co-authored-by: Krrish Dholakia <krrishdholakia@gmail.com>	2024-11-21 19:03:02 -08:00
Krish Dholakia	7e5085dc7b	Litellm dev 11 21 2024 (#6837 ) * Fix Vertex AI function calling invoke: use JSON format instead of protobuf text format. (#6702) * test: test tool_call conversion when arguments is empty dict Fixes https://github.com/BerriAI/litellm/issues/6833 * fix(openai_like/handler.py): return more descriptive error message Fixes https://github.com/BerriAI/litellm/issues/6812 * test: skip overloaded model * docs(anthropic.md): update anthropic docs to show how to route to any new model * feat(groq/): fake stream when 'response_format' param is passed Groq doesn't support streaming when response_format is set * feat(groq/): add response_format support for groq Closes https://github.com/BerriAI/litellm/issues/6845 * fix(o1_handler.py): remove fake streaming for o1 Closes https://github.com/BerriAI/litellm/issues/6801 * build(model_prices_and_context_window.json): add groq llama3.2b model pricing Closes https://github.com/BerriAI/litellm/issues/6807 * fix(utils.py): fix handling ollama response format param Fixes https://github.com/BerriAI/litellm/issues/6848#issuecomment-2491215485 * docs(sidebars.js): refactor chat endpoint placement * fix: fix linting errors * test: fix test * test: fix test * fix(openai_like/handler): handle max retries * fix(streaming_handler.py): fix streaming check for openai-compatible providers * test: update test * test: correctly handle model is overloaded error * test: update test * test: fix test * test: mark flaky test --------- Co-authored-by: Guowang Li <Guowang@users.noreply.github.com>	2024-11-22 01:53:52 +05:30
Ishaan Jaff	a7d5536872	(fix) passthrough - allow internal users to access /anthropic (#6843 ) * fix /anthropic/ * test llm_passthrough_router * fix test_gemini_pass_through_endpoint	2024-11-21 11:46:50 -08:00
Krrish Dholakia	50d2510b60	test: cleanup mistral model	2024-11-21 23:44:50 +05:30
Ishaan Jaff	ddfe687b13	(fix) don't block proxy startup if license check fails & using prometheus (#6839 ) * fix - don't block proxy startup if not a premium user * test_litellm_proxy_server_config_with_prometheus * add test for proxy startup * fix remove unused test * fix startup test * add comment on bad-license	2024-11-20 17:55:39 -08:00
Ishaan Jaff	cc1f8ff0ba	(testing) - add e2e tests for anthropic pass through endpoints (#6840 ) * tests - add e2e tests for anthropic pass through * fix swagger * fix pass through tests	2024-11-20 17:55:13 -08:00
Ishaan Jaff	434b1d3d86	(refactor) anthropic - move _process_response in transformation.py (#6834 ) * move _process_response in transformation * fix AnthropicConfig test	2024-11-20 17:24:19 -08:00
Krish Dholakia	b11bc0374e	Litellm dev 11 20 2024 (#6838 ) * feat(customer_endpoints.py): support passing budget duration via `/customer/new` endpoint Closes https://github.com/BerriAI/litellm/issues/5651 * docs: add missing params to swagger + api documentation test * docs: add documentation for all key endpoints documents all params on swagger * docs(internal_user_endpoints.py): document all /user/new params Ensures all params are documented * docs(team_endpoints.py): add missing documentation for team endpoints Ensures 100% param documentation on swagger * docs(organization_endpoints.py): document all org params Adds documentation for all params in org endpoint * docs(customer_endpoints.py): add coverage for all params on /customer endpoints ensures all /customer/* params are documented * ci(config.yml): add endpoint doc testing to ci/cd * fix: fix internal_user_endpoints.py * fix(internal_user_endpoints.py): support 'duration' param * fix(partner_models/main.py): fix anthropic re-raise exception on vertex * fix: fix pydantic obj * build(model_prices_and_context_window.json): add new vertex claude model names vertex claude changed model names - causes cost tracking errors	2024-11-21 05:20:37 +05:30
Krish Dholakia	689cd677c6	Litellm dev 11 20 2024 (#6831 ) * feat(customer_endpoints.py): support passing budget duration via `/customer/new` endpoint Closes https://github.com/BerriAI/litellm/issues/5651 * docs: add missing params to swagger + api documentation test * docs: add documentation for all key endpoints documents all params on swagger * docs(internal_user_endpoints.py): document all /user/new params Ensures all params are documented * docs(team_endpoints.py): add missing documentation for team endpoints Ensures 100% param documentation on swagger * docs(organization_endpoints.py): document all org params Adds documentation for all params in org endpoint * docs(customer_endpoints.py): add coverage for all params on /customer endpoints ensures all /customer/* params are documented * ci(config.yml): add endpoint doc testing to ci/cd * fix: fix internal_user_endpoints.py * fix(internal_user_endpoints.py): support 'duration' param * fix(partner_models/main.py): fix anthropic re-raise exception on vertex * fix: fix pydantic obj	2024-11-21 04:06:06 +05:30
Krish Dholakia	b0be5bf3a1	LiteLLM Minor Fixes & Improvements (11/19/2024) (#6820 ) * fix(anthropic/chat/transformation.py): add json schema as values: json_schema fixes passing pydantic obj to anthropic Fixes https://github.com/BerriAI/litellm/issues/6766 * (feat): Add timestamp_granularities parameter to transcription API (#6457) * Add timestamp_granularities parameter to transcription API * add param to the local test * fix(databricks/chat.py): handle max_retries optional param handling for openai-like calls Fixes issue with calling finetuned vertex ai models via databricks route * build(ui/): add team admins via proxy ui * fix: fix linting error * test: fix test * docs(vertex.md): refactor docs * test: handle overloaded anthropic model error * test: remove duplicate test * test: fix test * test: update test to handle model overloaded error --------- Co-authored-by: Show <35062952+BrunooShow@users.noreply.github.com>	2024-11-21 00:57:58 +05:30
Krrish Dholakia	6a816bceee	test: fix test	2024-11-20 14:13:14 +05:30
Ishaan Jaff	132569dafc	ci/cd run again	2024-11-19 22:38:45 -08:00
Ishaan Jaff	8631f3bb60	use correct name for test file	2024-11-19 22:11:52 -08:00
Ishaan Jaff	8b92e4f77a	fix test_prometheus_metric_tracking	2024-11-19 22:11:30 -08:00

1 2 3 4 5 ...

522 commits