The extraction engine is combined with a data pipeline so that every uploaded document is read for text and scanned for PII patterns in real time. Operator dashboard, false-positive review, and batch analysis are all in one place.
PII Detection · Realtime · Privacy by Design
PII Detection & Masking
We catch personal information the moment it lands.
Posts, attachments, public-disclosure documents, internal docs — personal information hides everywhere, and finding it after the fact is too late.
We combine our extraction engine (Docpler) with data engineering pipelines so the moment a file uploads, the text is extracted and PII patterns surface immediately. Both real-time and batch modes are supported.
Proven in deployments at public-sector institutions like the Korea Tourism Organization and the Korea Copyright Commission.
Real-time detection through to operations tooling
Real-time detection
Text is extracted and PII patterns are scanned the moment a file uploads — without breaking the user flow.
Batch analysis
Documents already accumulated in the system can be analyzed in batch — useful for periodic and ad-hoc audits.
Format coverage
HWP, PDF, Office, images — the same detection runs across every business document format.
Operator dashboard
Detection trends visualized over time, with detected items exposed for false-positive review.
SDK · API delivery
Plugs into board, CMS, or upload modules via SDK or REST API. Existing user experience stays intact.
Policy-aware
Detection patterns and masking rules are configured to match your privacy-protection policy, so compliance requirements are reflected directly in what is detected and masked.
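To make the idea of policy-driven detection and masking concrete, here is a minimal sketch in Python. It is not the product's SDK or rule set: the `Rule`, `Finding`, `detect`, and `mask` names, the three regular expressions, and the `keep_last` option are all assumptions chosen for illustration. It assumes text has already been extracted from the uploaded file.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """One hypothetical policy entry: what to detect and how much to leave visible."""
    name: str
    pattern: re.Pattern
    keep_last: int = 0  # number of trailing characters left unmasked


# Illustrative patterns only; a real policy would be stricter and broader.
POLICY = [
    Rule("email", re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")),
    Rule("kr_mobile", re.compile(r"\b01[016789]-\d{3,4}-\d{4}\b"), keep_last=4),
    Rule("kr_rrn", re.compile(r"\b\d{6}-[1-4]\d{6}\b")),
]


@dataclass(frozen=True)
class Finding:
    rule: str
    start: int
    end: int
    text: str


def detect(text, policy=POLICY):
    """Return every policy match in the text, ordered by position."""
    findings = []
    for rule in policy:
        for m in rule.pattern.finditer(text):
            findings.append(Finding(rule.name, m.start(), m.end(), m.group()))
    return sorted(findings, key=lambda f: f.start)


def mask(text, policy=POLICY):
    """Replace letters and digits in each match with '*', keeping separators."""
    for rule in policy:
        def repl(m, keep=rule.keep_last):
            s = m.group()
            cut = len(s) - keep
            hidden = "".join("*" if c.isalnum() else c for c in s[:cut])
            return hidden + s[cut:]
        text = rule.pattern.sub(repl, text)
    return text
```

In this sketch, changing the policy means editing the `POLICY` list only; `detect` feeds a review queue or dashboard, while `mask` produces the text that is safe to display.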
Real-time PII detection on user-uploaded documents
Delivered as an SDK so that detection runs at the moment of upload. It was applied to the existing board features without disrupting the user experience, while still meeting the privacy-protection policy. The service operator gets a dashboard for tracking detection trends, with detected items exposed for false-positive review.
Privacy Filtering · Realtime · SDK
PII detection across documents in the public-disclosure system
For systems that need to analyze documents in real time or in batch to detect PII, the text extraction tool — distributed as an SDK — proved a clean fit.
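One way a single scan routine could serve both the real-time and batch modes is sketched below. Everything here is hypothetical: `on_upload`, `batch_scan`, and `plain_text_extract` are illustrative names, the `extract` callable stands in for the text extraction engine (whose real interface is not described here), and the single email pattern is a placeholder for a full rule set.

```python
import re
from typing import Callable, Iterable

# Placeholder pattern; a real deployment would use its configured rule set.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")


def scan_text(text: str) -> list[str]:
    """Return every substring that matches the placeholder pattern."""
    return EMAIL.findall(text)


def on_upload(filename: str, data: bytes, extract: Callable[[bytes], str]) -> dict:
    """Real-time path: meant to be called from the upload handler for one file."""
    hits = scan_text(extract(data))
    return {"file": filename, "hits": hits, "needs_review": bool(hits)}


def batch_scan(files: Iterable[tuple[str, bytes]],
               extract: Callable[[bytes], str]) -> list[dict]:
    """Batch path: run the same check over stored documents, keep flagged ones."""
    results = (on_upload(name, data, extract) for name, data in files)
    return [r for r in results if r["needs_review"]]


def plain_text_extract(data: bytes) -> str:
    """Stand-in extractor for plain text; other formats need a real extractor."""
    return data.decode("utf-8", errors="replace")
```

Because both paths share `on_upload`, a periodic audit and a live upload would flag the same documents for the same reasons, which keeps false-positive review consistent across modes.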