Amazon Bedrock Guardrails Now Supports Image-Aided Multimodal Toxicity Detection (preview) | Amazon Web Services



Today, we’re announcing the preview of multimodal toxicity detection with image support in Amazon Bedrock Guardrails. This new capability detects and filters undesirable image content in addition to text, helping you improve user experiences and manage model outputs in your generative AI applications.

Amazon Bedrock Guardrails helps you implement safeguards for generative AI applications by filtering undesirable content, redacting personally identifiable information (PII), and enhancing content safety and privacy. You can configure policies for denied topics, content filters, word filters, PII redaction, contextual grounding checks, and Automated Reasoning checks (preview) to tailor safeguards to your specific use cases and responsible AI policies.

With this launch, you can now use existing content filtering policies in Amazon Bedrock Guardrails to detect and block harmful visual content across categories such as hate, insults, sexual, and violence. You can configure the thresholds from low to high to suit the needs of your application.

This new image support works with all foundation models (FMs) in Amazon Bedrock that support image data, as well as any custom fine-tuned models you bring. It provides a consistent layer of protection across text and image modalities, making it easier to build responsible AI applications.

Tero Hottinen, Vice President, Head of Strategic Partnerships at KONE, envisions the following use case:

In its ongoing assessment, KONE recognizes the potential of Amazon Bedrock Guardrails as key components in protecting gen AI applications, particularly for relevance and contextual grounding checks, as well as multi-modal security. The company envisions integrating design diagrams and product manuals into its applications, with Amazon Bedrock Guardrails playing a key role in enabling more accurate diagnosis and analysis of multimodal content.

Here’s how it works.

Multimodal toxicity detection in action
To get started, create a guardrail in the AWS Management Console and configure the content filters for text data, image data, or both. You can also use the AWS SDKs to integrate this capability into your applications.

Create a guardrail
In the console, navigate to Amazon Bedrock and select Guardrails. From there, you can create a new guardrail and use the existing content filters to detect and block image data in addition to text data. The Hate, Insults, Sexual, and Violence categories under Configure content filters can be configured for either text or image content, or both. The Misconduct and Prompt attacks categories can only be configured for text content.
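The same filter setup can be scripted with the AWS SDK for Python (Boto3). Here’s a minimal sketch, assuming the preview’s per-filter modality fields; the guardrail name, blocked messages, and filter strengths are illustrative:

```python
def multimodal_filters_config():
    """Content filter config: Hate and Violence apply to text and images,
    while Misconduct stays text-only (image filtering is not supported
    there). Strength values (NONE/LOW/MEDIUM/HIGH) are illustrative."""
    return {
        "filtersConfig": [
            {"type": "HATE", "inputStrength": "HIGH", "outputStrength": "HIGH",
             "inputModalities": ["TEXT", "IMAGE"],
             "outputModalities": ["TEXT", "IMAGE"]},
            {"type": "VIOLENCE", "inputStrength": "MEDIUM", "outputStrength": "MEDIUM",
             "inputModalities": ["TEXT", "IMAGE"],
             "outputModalities": ["TEXT", "IMAGE"]},
            {"type": "MISCONDUCT", "inputStrength": "HIGH", "outputStrength": "HIGH",
             "inputModalities": ["TEXT"], "outputModalities": ["TEXT"]},
        ]
    }

if __name__ == "__main__":
    import boto3  # AWS SDK for Python

    bedrock = boto3.client("bedrock", region_name="us-east-1")
    response = bedrock.create_guardrail(
        name="multimodal-demo-guardrail",  # illustrative name
        blockedInputMessaging="Sorry, I can't process that input.",
        blockedOutputsMessaging="Sorry, I can't provide that response.",
        contentPolicyConfig=multimodal_filters_config(),
    )
    print(response["guardrailId"], response["version"])
```

Keeping the filter configuration in a small helper like this makes it easy to version alongside your application code and reuse when you update the guardrail later.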

Amazon Bedrock Guardrails multimodal support

After you select and configure the content filters you want to use, you can save the guardrail and start using it to build safe and responsible generative AI applications.

To test the new guardrail in the console, select the guardrail and choose Test. You have two options: test the guardrail by selecting and invoking a model, or test the guardrail without invoking a model by using the standalone Amazon Bedrock Guardrails ApplyGuardrail API.

With the ApplyGuardrail API, you can validate content at any point in your application flow before processing it or serving results to the user. You can also use the API to evaluate inputs and outputs for any self-managed (custom) or third-party FM, regardless of the underlying infrastructure. For example, you could use the API to evaluate a Meta Llama 3.2 model hosted on Amazon SageMaker or a Mistral NeMo model running on your laptop.
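As a sketch of that standalone flow with Boto3: the guardrail ID, version, and image file below are placeholders, and the image content block shape follows my reading of the preview API.

```python
def build_apply_guardrail_request(guardrail_id, guardrail_version,
                                  text, image_bytes=None, image_format="png"):
    """Build kwargs for the bedrock-runtime ApplyGuardrail API call,
    validating input content that may combine text and an image."""
    content = [{"text": {"text": text}}]
    if image_bytes is not None:
        content.append({"image": {"format": image_format,
                                  "source": {"bytes": image_bytes}}})
    return {
        "guardrailIdentifier": guardrail_id,
        "guardrailVersion": guardrail_version,
        "source": "INPUT",  # use "OUTPUT" to validate a model response instead
        "content": content,
    }

if __name__ == "__main__":
    import boto3

    runtime = boto3.client("bedrock-runtime", region_name="us-east-1")
    with open("test-image.png", "rb") as f:  # placeholder image file
        request = build_apply_guardrail_request(
            "your-guardrail-id", "DRAFT", "Describe this image.", f.read())
    response = runtime.apply_guardrail(**request)
    if response["action"] == "GUARDRAIL_INTERVENED":
        print(response["outputs"][0]["text"])  # the configured blocked message
```

Because the API takes raw content rather than a model invocation, the same call works whether the content is destined for an Amazon Bedrock model, a SageMaker endpoint, or something running locally.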

Test the guardrail by selecting and invoking a model
Choose a model that supports image inputs, such as Anthropic’s Claude 3.5 Sonnet. Verify that the prompt and response filters are enabled for image content. Next, provide a prompt, upload an image file, and select Run.
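One way to wire up the same flow programmatically is the Converse API with a guardrail attached. A minimal sketch with Boto3, where the model ID, guardrail ID, and image file are placeholders:

```python
def build_converse_request(model_id, guardrail_id, guardrail_version,
                           prompt, image_bytes, image_format="png"):
    """Build kwargs for the bedrock-runtime Converse API call: a text
    prompt plus an image, with a guardrail applied and tracing enabled."""
    return {
        "modelId": model_id,
        "messages": [{
            "role": "user",
            "content": [
                {"text": prompt},
                {"image": {"format": image_format,
                           "source": {"bytes": image_bytes}}},
            ],
        }],
        "guardrailConfig": {
            "guardrailIdentifier": guardrail_id,
            "guardrailVersion": guardrail_version,
            "trace": "enabled",  # include the guardrail trace in the response
        },
    }

if __name__ == "__main__":
    import boto3

    runtime = boto3.client("bedrock-runtime", region_name="us-east-1")
    with open("test-image.png", "rb") as f:  # placeholder image file
        kwargs = build_converse_request(
            "anthropic.claude-3-5-sonnet-20240620-v1:0",  # ID may vary by Region
            "your-guardrail-id", "DRAFT", "What is in this image?", f.read())
    response = runtime.converse(**kwargs)
    print(response["output"]["message"]["content"][0]["text"])
```

With `trace` enabled, the response also carries the guardrail’s assessments, which is handy while you tune filter strengths.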

Amazon Bedrock Guardrails multimodal support

In my example, the guardrail intervened. Choose View trace for more details.

The guardrail trace provides a record of how safety measures were applied during an interaction. It shows whether the guardrail intervened and what assessments were made on both the input (prompt) and the output (model response). In my example, the content filters blocked the input prompt because they detected insults in the image with high confidence.
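To act on such a trace programmatically, you can flatten the assessment list returned by the API. A small sketch, where the assessment shape follows my reading of the ApplyGuardrail response and the sample data is made up to mirror this example:

```python
def summarize_content_filters(assessments):
    """Flatten content policy filter hits from a guardrail assessment
    list into (type, confidence, action) tuples."""
    hits = []
    for assessment in assessments:
        for f in assessment.get("contentPolicy", {}).get("filters", []):
            hits.append((f["type"], f["confidence"], f["action"]))
    return hits

# Illustrative assessment mirroring this example: insults detected in
# the image with high confidence, so the input was blocked.
sample_assessments = [
    {"contentPolicy": {"filters": [
        {"type": "INSULTS", "confidence": "HIGH", "action": "BLOCKED"},
    ]}}
]
print(summarize_content_filters(sample_assessments))
# [('INSULTS', 'HIGH', 'BLOCKED')]
```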

Amazon Bedrock Guardrails multimodal support

Test the guardrail without invoking a model
In the console, select Use ApplyGuardrail API to test the guardrail without invoking a model. Choose whether you want to validate an input prompt or an example of model-generated output. Then, repeat the previous steps: verify that the prompt and response filters are enabled for image content, provide the content to validate, and select Run.

Amazon Bedrock Guardrails multimodal support

I reused the same image and prompt for my demo, and the guardrail intervened again. Choose View trace again for more details.

Amazon Bedrock Guardrails multimodal support

Join the preview
Multimodal toxicity detection with image support is available in preview today in Amazon Bedrock Guardrails in the US East (N. Virginia, Ohio), US West (Oregon), Asia Pacific (Mumbai, Seoul, Singapore, Tokyo), Europe (Frankfurt, Ireland, London), and AWS GovCloud (US-West) AWS Regions. To learn more, visit Amazon Bedrock Guardrails.

Try the Multimodal Toxicity Detection Content Filter in the Amazon Bedrock Console today and let us know what you think! Send feedback to AWS re:Post for Amazon Bedrock or through your usual AWS support contacts.

— Antje
