Image Redaction with cloak.business OCR

The Problem: 80% of Enterprise Data is Unstructured — and Invisible to Text-Only Tools

Most PII anonymization tools stop at text. But enterprise workflows routinely pass images to AI models:

Scanned contracts, invoices, and NDAs with client names and signatures
Identity documents — passports, driver's licenses, employee IDs
Medical forms and lab reports with patient data
Screenshots of CRM dashboards, email threads, and financial tables
Whiteboard photos from client meetings

When you ask Claude to summarize a scanned contract, the entire image — including names, addresses, account numbers, and signatures — travels to Anthropic's servers. The AI never strips it. The user never notices.

How cloak.business Handles Images

cloak.business provides two dedicated MCP tools for visual data:

1. cloak_analyze_image — OCR + PII Detection

Pass any image (JPEG, PNG, PDF page, TIFF) to cloak_analyze_image. The tool:

Runs OCR across 38 languages (Latin, Cyrillic, Arabic, CJK, Devanagari, Hebrew, and more)
Detects 320+ PII entity types across 70+ countries in the extracted text
Returns detected entities with their bounding box coordinates in the image
Supports both scanned documents (print OCR) and digital screenshots (screen OCR)

{
  "tool": "cloak_analyze_image",
  "image_url": "https://example.com/contract-scan.jpg",
  "language": "auto"
}

// Response:
{
  "entities": [
    {
      "type": "PERSON",
      "value": "Maria Hoffmann",
      "bbox": { "x": 142, "y": 88, "w": 210, "h": 24 }
    },
    {
      "type": "IBAN",
      "value": "DE89370400440532013000",
      "bbox": { "x": 80, "y": 312, "w": 380, "h": 22 }
    }
  ]
}

2. cloak_redact_image — Visual Redaction

cloak_redact_image takes an image and returns a new version with black redaction boxes drawn over every detected PII region. No coordinate math required — you pass the image, you get back a clean version.

Draws filled rectangles over detected entity bounding boxes
Returns the redacted image as base64 or a download URL
Preserves all non-PII regions of the original image
Suitable for compliance archives, legal discovery, and GDPR data access requests

{
  "tool": "cloak_redact_image",
  "image_url": "https://example.com/id-scan.png",
  "entities": ["PERSON", "BIRTH_DATE", "ID_NUMBER"]
}

// Response:
{
  "redacted_image_url": "https://cdn.cloak.business/redacted/abc123.png",
  "entities_redacted": 4,
  "ttl_seconds": 300
}

Real-World Use Cases

Legal: Scanned Document Review

Law firms use Claude to review and summarize large document sets during discovery. With cloak_redact_image, scanned exhibits are automatically redacted before the AI sees them — satisfying attorney-client privilege and court confidentiality orders without slowing down the workflow.

HR: ID Verification and Onboarding

HR teams increasingly use AI to extract data from ID documents for onboarding. Passing passport or driver's license scans to an AI without redaction violates GDPR Article 9 (special categories). cloak.business intercepts the image before it reaches any AI model, strips identity fields, and stores only non-PII metadata.

Finance: Invoice Processing Pipelines

Automated invoice processing pipelines often route scanned PDFs directly to AI models for data extraction. Each invoice contains VAT numbers, IBAN details, company names, and addresses. cloak.business intercepts and redacts before extraction — the AI gets the layout, not the PII.

Healthcare: Lab Report Summarization

Medical teams ask AI assistants to summarize lab results and clinical notes. When those documents are scans or photographs, standard text-layer anonymization fails entirely. cloak.business's OCR-first approach detects patient names, dates of birth, and national health numbers in the image layer before the AI touches the document.

MCP Setup for Image Workflows

Add cloak.business to Claude Desktop or Cursor in one step:

{
  "mcpServers": {
    "cloak": {
      "command": "npx",
      "args": ["-y", "cloak-business-mcp-server"],
      "env": {
        "CLOAK_API_KEY": "your-api-key"
      }
    }
  }
}

Image tools are available on all paid plans. The free tier includes 50 image API credits per month to test the workflow end-to-end before upgrading.

cloak.business vs anonym.legal — Image Capability

Feature	cloak.business	anonym.legal
Image OCR	38 languages	Not available
Visual redaction	Yes — bounding box redact	Not available
Entity types	320+ (70+ countries)	285+ (48 languages)
Batch processing	1–100 texts per call	Single text per call
Chrome Extension	Yes	Not available
Best for	Enterprise, visual workflows	Developers, text pipelines

Under GDPR Art. 4, a photograph of a person's face is personal data. Under Art. 9, it may be special-category data. Sending unredacted ID document scans to any third-party AI provider without explicit consent or a lawful basis is a data protection violation — regardless of whether the AI model is told "not to remember" it.

cloak.business's redact-before-send architecture ensures that only the redacted image leaves your infrastructure. The original never touches external servers. This is not a policy control — it is a technical control, which is what GDPR Art. 25 (Privacy by Design) and ISO 27001:2022 Annex A 8.11 require.

Infrastructure: All cloak.business image processing runs on Hetzner servers in Germany (Frankfurt). Images are processed transiently and not stored after redaction. The system is ISO 27001:2022 certified. See cloak.business privacy policy for data handling details.

Image PII Redaction with cloak.business — OCR + Visual Redact via MCP

The Problem: 80% of Enterprise Data is Unstructured — and Invisible to Text-Only Tools

How cloak.business Handles Images

1. cloak_analyze_image — OCR + PII Detection

2. cloak_redact_image — Visual Redaction

Real-World Use Cases

Legal: Scanned Document Review

HR: ID Verification and Onboarding

Finance: Invoice Processing Pipelines

Healthcare: Lab Report Summarization

MCP Setup for Image Workflows

cloak.business vs anonym.legal — Image Capability

Protect images before they reach AI models

Related Articles

Image PII Redaction with cloak.business — OCR + Visual Redact via MCP

The Problem: 80% of Enterprise Data is Unstructured — and Invisible to Text-Only Tools

How cloak.business Handles Images

1. cloak_analyze_image — OCR + PII Detection

2. cloak_redact_image — Visual Redaction

Real-World Use Cases

Legal: Scanned Document Review

HR: ID Verification and Onboarding

Finance: Invoice Processing Pipelines

Healthcare: Lab Report Summarization

MCP Setup for Image Workflows

cloak.business vs anonym.legal — Image Capability

GDPR and the Image Problem

Protect images before they reach AI models

Related Articles