PII/PHI Detection
Automatically detect and handle sensitive information in documents.
Supported Entity Types
| Category | Entities |
|---|---|
| Personal | PERSON, EMAIL, PHONE, ADDRESS |
| Financial | CREDIT_CARD, BANK_ACCOUNT, SSN |
| Medical | MEDICAL_RECORD, DIAGNOSIS, MEDICATION |
| Legal | CASE_NUMBER, ATTORNEY |
| Custom | User-defined patterns |
Detection Engine
GateFlow uses Microsoft Presidio with custom enhancements:
- 50+ built-in recognizers
- ML-based entity classification
- Context-aware detection
- Multilingual support
Configuration
Enable Detection
```bash
curl -X POST https://api.gateflow.ai/v1/data/documents \
  -H "Authorization: Bearer gw_prod_..." \
  -F "file=@document.pdf" \
  -F 'pii={"detect": true, "action": "redact"}'
```
Actions
| Action | Description |
|---|---|
| detect | Identify only, don't modify |
| redact | Replace with [REDACTED] |
| mask | Replace with asterisks |
| encrypt | Encrypt in place |
| tokenize | Replace with reversible token |
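To make the difference between the actions concrete, here is a minimal local sketch of what `detect`, `redact`, `mask`, and `tokenize` could do to a matched span. It is an illustration only, not GateFlow's implementation: the function names, the `tok_` token format, the in-memory vault, and the one-asterisk-per-character masking are all assumptions. `encrypt` is left out because it depends on a cipher and key management that the table does not describe.

```python
import secrets

# Hypothetical in-memory vault mapping tokens back to original values.
# A real system would need durable, access-controlled storage.
_vault = {}


def apply_action(text, start, end, action):
    """Return `text` with the span [start, end) handled according to `action`."""
    original = text[start:end]
    if action == "detect":
        return text  # identify only, leave the text unchanged
    if action == "redact":
        replacement = "[REDACTED]"
    elif action == "mask":
        # Assumption: one asterisk per original character.
        replacement = "*" * len(original)
    elif action == "tokenize":
        # Assumption: a random token that can be looked up later.
        token = "tok_" + secrets.token_hex(8)
        _vault[token] = original
        replacement = token
    else:
        raise ValueError(f"action not covered by this sketch: {action}")
    return text[:start] + replacement + text[end:]


def detokenize(token):
    """Reverse a tokenize action by looking the token up in the vault."""
    return _vault[token]


sample = "SSN: 123-45-6789"
redacted = apply_action(sample, 5, 16, "redact")
masked = apply_action(sample, 5, 16, "mask")
tokenized = apply_action(sample, 5, 16, "tokenize")
print(redacted)
print(masked)
print(detokenize(tokenized[5:]))
```

The sketch shows why `tokenize` is described as reversible while `redact` and `mask` are not: only the tokenizing branch keeps the original value anywhere.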
Custom Patterns
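Before registering a custom pattern, it can help to test the regular expression locally. The sketch below loads a configuration of the shape used in this section and runs its `EMPLOYEE_ID` pattern over a sample string. It assumes the service's regex dialect behaves like Python's `re` module for this pattern, which this page does not confirm. The helper name `find_custom_matches` and the sample text are hypothetical.

```python
import json
import re

# Same structure as the custom pattern configuration in this section.
# In JSON the backslash is escaped, so "\\d" decodes to the regex token \d.
config = json.loads(r'''
{
  "pii": {
    "custom_patterns": [
      {"name": "EMPLOYEE_ID", "pattern": "EMP-\\d{6}", "action": "mask"}
    ]
  }
}
''')


def find_custom_matches(text, patterns):
    """Return (name, matched_text, start, end) for every pattern match."""
    matches = []
    for entry in patterns:
        for m in re.finditer(entry["pattern"], text):
            matches.append((entry["name"], m.group(), m.start(), m.end()))
    return matches


sample = "Ticket opened by EMP-004211; EMP-42 is too short to match."
found = find_custom_matches(sample, config["pii"]["custom_patterns"])
print(found)
```

In Python's dialect the pattern is unanchored, so a longer run such as `EMP-1234567` would still match on its first six digits. Adding word boundaries (`\\b` in JSON) is one way to tighten it if that matters for your identifiers.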
```json
{
  "pii": {
    "custom_patterns": [
      {
        "name": "EMPLOYEE_ID",
        "pattern": "EMP-\\d{6}",
        "action": "mask"
      }
    ]
  }
}
```
Detection Results
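A client will typically filter the returned entities by confidence before acting on them. The following is a minimal sketch that parses a result of the shape documented in this section. The helper name `entities_above` and the 0.9 threshold are illustrative assumptions, and so is reading `position.end` as exclusive.

```python
import json

# Sample payload with the same fields as the detection result in this section.
response_body = '''
{
  "pii_detected": true,
  "entities": [
    {
      "type": "SSN",
      "text": "[REDACTED]",
      "position": {"start": 145, "end": 156},
      "confidence": 0.98,
      "action_taken": "redacted"
    }
  ]
}
'''


def entities_above(result, threshold):
    """Return the detected entities whose confidence is at least `threshold`."""
    if not result.get("pii_detected"):
        return []
    return [e for e in result.get("entities", []) if e["confidence"] >= threshold]


result = json.loads(response_body)
for entity in entities_above(result, 0.9):
    span = entity["position"]
    # Assumption: `end` is exclusive, so the span length is end - start.
    length = span["end"] - span["start"]
    print(entity["type"], entity["action_taken"], span["start"], length)
```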
```json
{
  "pii_detected": true,
  "entities": [
    {
      "type": "SSN",
      "text": "[REDACTED]",
      "position": {"start": 145, "end": 156},
      "confidence": 0.98,
      "action_taken": "redacted"
    }
  ]
}
```
Next Steps
- Data Classification - Classification levels
- GDPR Article 17 - Right to erasure