litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-26 19:24:27 +00:00

Author	SHA1	Message	Date
Krish Dholakia	dec53961f7	LiteLLM Minor Fixes and Improvements (11/09/2024) (#5634 ) * fix(caching.py): set ttl for async_increment cache fixes issue where ttl for redis client was not being set on increment_cache Fixes https://github.com/BerriAI/litellm/issues/5609 * fix(caching.py): fix increment cache w/ ttl for sync increment cache on redis Fixes https://github.com/BerriAI/litellm/issues/5609 * fix(router.py): support adding retry policy + allowed fails policy via config.yaml * fix(router.py): don't cooldown single deployments No point, as there's no other deployment to loadbalance with. * fix(user_api_key_auth.py): support setting allowed email domains on jwt tokens Closes https://github.com/BerriAI/litellm/issues/5605 * docs(token_auth.md): add user upsert + allowed email domain to jwt auth docs * fix(litellm_pre_call_utils.py): fix dynamic key logging when team id is set Fixes issue where key logging would not be set if team metadata was not none * fix(secret_managers/main.py): load environment variables correctly Fixes issue where os.environ/ was not being loaded correctly * test(test_router.py): fix test * feat(spend_tracking_utils.py): support logging additional usage params - e.g. prompt caching values for deepseek * test: fix tests * test: fix test * test: fix test * test: fix test * test: fix test	2024-09-11 22:36:06 -07:00
Krish Dholakia	7f47c48b35	LiteLLM Minor Fixes and Improvements (09/10/2024) (#5618 ) * fix(cost_calculator.py): move to debug for noisy warning message on cost calculation error Fixes https://github.com/BerriAI/litellm/issues/5610 * fix(databricks/cost_calculator.py): Handles model name issues for databricks models * fix(main.py): fix stream chunk builder for multiple tool calls Fixes https://github.com/BerriAI/litellm/issues/5591 * fix: correctly set user_alias when passed in Fixes https://github.com/BerriAI/litellm/issues/5612 * fix(types/utils.py): allow passing role for message object https://github.com/BerriAI/litellm/issues/5621 * fix(litellm_logging.py): Fix langfuse logging across multiple projects Fixes issue where langfuse logger was re-using the old logging object * feat(proxy/_types.py): support adding key-based tags for tag-based routing Enable tag based routing at key-level * fix(proxy/_types.py): fix inheritance * test(test_key_generate_prisma.py): fix test * test: fix test * fix(litellm_logging.py): return used callback object	2024-09-11 11:30:29 -07:00
Ishaan Jaff	02325f33d7	Merge branch 'main' into litellm_allow_turning_off_message_logging_for_callbacks	2024-09-09 21:59:36 -07:00
Krish Dholakia	52849e6422	LiteLLM Minor Fixes and Improvements (09/07/2024) (#5580 ) * fix(litellm_logging.py): set completion_start_time_float to end_time_float if none Fixes https://github.com/BerriAI/litellm/issues/5500 * feat(_init_.py): add new 'openai_text_completion_compatible_providers' list Fixes https://github.com/BerriAI/litellm/issues/5558 Handles correctly routing fireworks ai calls when done via text completions * fix: fix linting errors * fix: fix linting errors * fix(openai.py): fix exception raised * fix(openai.py): fix error handling * fix(_redis.py): allow all supported arguments for redis cluster (#5554) * Revert "fix(_redis.py): allow all supported arguments for redis cluster (#5554)" (#5583) This reverts commit `f2191ef4cb`. * fix(router.py): return model alias w/ underlying deployment on router.get_model_list() Fixes https://github.com/BerriAI/litellm/issues/5524#issuecomment-2336410666 * test: handle flaky tests --------- Co-authored-by: Jonas Dittrich <58814480+Kakadus@users.noreply.github.com>	2024-09-09 18:54:17 -07:00
Ishaan Jaff	09f1631f60	use redact_message_input_output_from_custom_logger	2024-09-09 16:02:24 -07:00
Ishaan Jaff	e87ab72aa4	refactor redact_message_input_output_from_custom_logger	2024-09-09 16:00:47 -07:00
Ishaan Jaff	07623cd234	Merge pull request #5573 from BerriAI/litellm_add_rerank_spend_tracking [Feat] Add cost tracking for cohere rerank	2024-09-06 18:11:30 -07:00
Krish Dholakia	2cab33b061	LiteLLM Minor Fixes and Improvements (08/06/2024) (#5567 ) * fix(utils.py): return citations for perplexity streaming Fixes https://github.com/BerriAI/litellm/issues/5535 * fix(anthropic/chat.py): support fallbacks for anthropic streaming (#5542) * fix(anthropic/chat.py): support fallbacks for anthropic streaming Fixes https://github.com/BerriAI/litellm/issues/5512 * fix(anthropic/chat.py): use module level http client if none given (prevents early client closure) * fix: fix linting errors * fix(http_handler.py): fix raise_for_status error handling * test: retry flaky test * fix otel type * fix(bedrock/embed): fix error raising * test(test_openai_batches_and_files.py): skip azure batches test (for now) quota exceeded * fix(test_router.py): skip azure batch route test (for now) - hit batch quota limits --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> * All `model_group_alias` should show up in `/models`, `/model/info` , `/model_group/info` (#5539) * fix(router.py): support returning model_alias model names in `/v1/models` * fix(proxy_server.py): support returning model alias'es on `/model/info` * feat(router.py): support returning model group alias for `/model_group/info` * fix(proxy_server.py): fix linting errors * fix(proxy_server.py): fix linting errors * build(model_prices_and_context_window.json): add amazon titan text premier pricing information Closes https://github.com/BerriAI/litellm/issues/5560 * feat(litellm_logging.py): log standard logging response object for pass through endpoints. Allows bedrock /invoke agent calls to be correctly logged to langfuse + s3 * fix(success_handler.py): fix linting error * fix(success_handler.py): fix linting errors * fix(team_endpoints.py): Allows admin to update team member budgets --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2024-09-06 17:16:24 -07:00
Ishaan Jaff	8bd57b6167	add cost tracking for rerank+ test	2024-09-06 16:06:19 -07:00
Ishaan Jaff	6f68e860e0	fix import error	2024-09-05 10:09:44 -07:00
Ishaan Jaff	7370a994f5	use correct type hints for audio transcriptions	2024-09-05 09:12:27 -07:00
Ishaan Jaff	b468ccbb77	Merge pull request #5514 from BerriAI/litellm_add_presidio [Fix-Refactor] support presidio on new guardrails config	2024-09-04 16:09:54 -07:00
Stefan Bokarev	7558e49d78	stream response (#5516 )	2024-09-04 16:02:29 -07:00
Ishaan Jaff	a5f892bb6d	fix init presidio guardrail	2024-09-04 15:22:37 -07:00
Ishaan Jaff	5e67a32d07	handle logging_only logic for guardrails	2024-09-04 13:57:04 -07:00
Ishaan Jaff	fdf9cbb846	fix get llm provider logic	2024-09-04 09:28:23 -07:00
Ishaan Jaff	44a1677c7c	test get llm provider	2024-09-04 09:22:19 -07:00
Krish Dholakia	ca4e746545	LiteLLM minor fixes + improvements (31/08/2024) (#5464 ) * fix(vertex_endpoints.py): fix vertex ai pass through endpoints * test(test_streaming.py): skip model due to end of life * feat(custom_logger.py): add special callback for model hitting tpm/rpm limits Closes https://github.com/BerriAI/litellm/issues/4096	2024-09-01 13:31:42 -07:00
Krish Dholakia	e474c3665a	Bedrock Embeddings refactor + model support (#5462 ) * refactor(bedrock): initial commit to refactor bedrock to a folder Improve code readability + maintainability * refactor: more refactor work * fix: fix imports * feat(bedrock/embeddings.py): support translating embedding into amazon embedding formats * fix: fix linting errors * test: skip test on end of life model * fix(cohere/embed.py): fix linting error * fix(cohere/embed.py): fix typing * fix(cohere/embed.py): fix post-call logging for cohere embedding call * test(test_embeddings.py): fix error message assertion in test	2024-09-01 13:29:58 -07:00
Krish Dholakia	321b0961b5	fix: Minor LiteLLM Fixes + Improvements (29/08/2024) (#5436 ) * fix(model_checks.py): support returning wildcard models on `/v1/models` Fixes https://github.com/BerriAI/litellm/issues/4903 * fix(bedrock_httpx.py): support calling bedrock via api_base Closes https://github.com/BerriAI/litellm/pull/4587 * fix(litellm_logging.py): only leave last 4 char of gemini key unmasked Fixes https://github.com/BerriAI/litellm/issues/5433 * feat(router.py): support setting 'weight' param for models on router Closes https://github.com/BerriAI/litellm/issues/5410 * test(test_bedrock_completion.py): add unit test for custom api base * fix(model_checks.py): handle no "/" in model	2024-08-29 22:40:25 -07:00
Ishaan Jaff	da43356242	Merge pull request #5431 from BerriAI/litellm_Add_fireworks_ai_health_check [Fix-Proxy] /health check for provider wildcard models (fireworks/*)	2024-08-29 14:25:05 -07:00
Ishaan Jaff	d2e286e45d	add util to pick_cheapest_model_from_llm_provider	2024-08-29 09:27:20 -07:00
Krish Dholakia	3e8f5009f4	fix(utils.py): correctly log streaming cache hits (#5417 ) (#5426 ) Fixes https://github.com/BerriAI/litellm/issues/5401	2024-08-28 22:50:33 -07:00
Ishaan Jaff	bae54ca642	use cost per token for jamba	2024-08-27 14:18:04 -07:00
Krrish Dholakia	145e7794b4	build(config.yml): bump anyio version	2024-08-27 07:37:06 -07:00
Krrish Dholakia	d02cfcde97	fix(asyncify.py): fix linting errors	2024-08-27 07:37:06 -07:00
Krrish Dholakia	d12ec470f7	fix(asyncify.py): fix linting errors	2024-08-27 07:37:06 -07:00
Krrish Dholakia	07dd3c640b	perf(sagemaker.py): asyncify hf prompt template check leads to 189% improvement in RPS @ 100 users	2024-08-27 07:37:06 -07:00
Krrish Dholakia	21bad0aa73	fix(streaming_utils.py): fix generic_chunk_has_all_required_fields	2024-08-26 21:13:02 -07:00
Krrish Dholakia	b989762bb0	fix(sagemaker.py): support streaming for messages api Fixes https://github.com/BerriAI/litellm/issues/5372	2024-08-26 15:08:08 -07:00
Ishaan Jaff	8162208a5c	track api_call_start_time	2024-08-22 13:52:03 -07:00
Krrish Dholakia	d6bc37374e	feat(litellm_logging.py): add 'saved_cache_cost' to standard logging payload (s3)	2024-08-21 16:58:07 -07:00
Krrish Dholakia	ac5c6c8751	fix(litellm_pre_call_utils.py): handle dynamic keys via api correctly	2024-08-21 13:37:21 -07:00
Krrish Dholakia	77a6f597e0	fix(litellm_logging.py): add stricter check for special param being non none	2024-08-20 21:35:02 -07:00
Krrish Dholakia	0091f64ff1	fix(utils.py): ensure consistent cost calc b/w returned header and logged object	2024-08-20 19:01:20 -07:00
Krish Dholakia	e49e454929	Merge pull request #5287 from BerriAI/litellm_fix_response_cost_cal fix(cost_calculator.py): only override base model if custom pricing is set	2024-08-20 11:42:48 -07:00
Ishaan Jaff	c82714757a	Merge pull request #5288 from BerriAI/litellm_aporia_refactor [Feat] V2 aporia guardrails litellm	2024-08-19 20:41:45 -07:00
Krrish Dholakia	cf1a1605a6	feat(cost_calculator.py): only override base model if custom pricing is set	2024-08-19 16:05:49 -07:00
Ishaan Jaff	6af497e383	feat run aporia as post call success hook	2024-08-19 11:25:31 -07:00
Krrish Dholakia	0d82089136	test(test_caching.py): re-introduce testing for s3 cache w/ streaming Closes https://github.com/BerriAI/litellm/issues/3268	2024-08-19 10:56:48 -07:00
Krrish Dholakia	1856ac585d	feat(pass_through_endpoints.py): add pass-through support for all cohere endpoints	2024-08-17 16:57:55 -07:00
Krrish Dholakia	29bedae79f	feat(google_ai_studio_endpoints.py): support pass-through endpoint for all google ai studio requests New Feature	2024-08-17 10:46:59 -07:00
Krish Dholakia	88fccb2427	Merge branch 'main' into litellm_log_model_price_information	2024-08-16 19:34:16 -07:00
Krish Dholakia	0916197c9d	Merge pull request #5244 from BerriAI/litellm_better_error_logging_sentry refactor: replace .error() with .exception() logging for better debugging on sentry	2024-08-16 19:16:20 -07:00
Ishaan Jaff	937471223a	fix databricks streaming test	2024-08-16 16:56:08 -07:00
Krrish Dholakia	9609505d0c	fix(litellm_logging.py): fix price information logging to s3	2024-08-16 16:42:38 -07:00
Krrish Dholakia	ef51f8600d	feat(litellm_logging.py): support logging model price information to s3 logs	2024-08-16 16:21:34 -07:00
Krrish Dholakia	2874b94fb1	refactor: replace .error() with .exception() logging for better debugging on sentry	2024-08-16 09:22:47 -07:00
Krrish Dholakia	b381dec0a8	fix(litellm_logging.py): wrap function to safely fail	2024-08-15 18:05:06 -07:00
Krrish Dholakia	c0448b9641	feat(litellm_logging.py): cleanup payload + add response cost to logged payload	2024-08-15 17:53:25 -07:00

1 2 3

145 commits