- Consulting services
RedactXpert 8-Wk Proof of Value
RedactXpert is an application that uses AI to assist with redacting documents.
The main aim of the tooling is to make the redaction process more efficient and save time.
RedactXpert is an Azure-hosted web application that allows users to log in with their Microsoft Entra ID accounts and upload PDF documents to be redacted.
Documents uploaded through RedactXpert are stored in Azure Blob Storage while they are being redacted in the app. They are then removed after X days (configurable to suit your requirements) by a lifecycle management policy on the Blob Storage account. Application data is stored in an Azure SQL Database and is likewise deleted after X days, in this case by a background process.
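As an illustration of the retention rule described above, the following is a minimal sketch of an Azure Blob Storage lifecycle management policy that deletes blobs after a configurable number of days. The function name, rule name, and the "uploads/" prefix are hypothetical, and the sketch assumes retention is measured from the last modification time; the actual policy used by RedactXpert is not specified here.

```python
import json


def build_retention_policy(retention_days: int, container_prefix: str) -> dict:
    """Build a lifecycle policy document that deletes block blobs after N days.

    Assumption: retention is counted from last modification. The policy
    schema also offers daysAfterCreationGreaterThan if creation time is
    the better fit.
    """
    if retention_days < 1:
        raise ValueError("retention_days must be at least 1")
    return {
        "rules": [
            {
                "enabled": True,
                # Hypothetical rule name, chosen for illustration only.
                "name": f"delete-after-{retention_days}-days",
                "type": "Lifecycle",
                "definition": {
                    "filters": {
                        "blobTypes": ["blockBlob"],
                        # Hypothetical container/prefix holding uploaded PDFs.
                        "prefixMatch": [container_prefix],
                    },
                    "actions": {
                        "baseBlob": {
                            "delete": {
                                "daysAfterModificationGreaterThan": retention_days
                            }
                        }
                    },
                },
            }
        ]
    }


if __name__ == "__main__":
    # Print the JSON that would be applied to the storage account.
    print(json.dumps(build_retention_policy(30, "uploads/"), indent=2))
```

The resulting JSON can be applied to the storage account through the Azure portal, the CLI, or an infrastructure template, so changing the retention period is a configuration change rather than a code change.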
When a document is selected in the tool, Azure Cognitive Services are used to extract personally identifiable information (PII) from the text within that document. This happens in two steps:
1. Optical Character Recognition (OCR) is performed on the document by calling the Form Recognizer service in Azure to identify the text within the document. The call includes a reference to the document in Blob Storage, which Form Recognizer then retrieves.
2. The text retrieved in the OCR process is then sent to the Language Service in Azure Cognitive Services to extract the PII from that text.
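The two steps above can be sketched as a small pipeline. This is an illustrative outline only, not RedactXpert's implementation: the names `PiiEntity`, `find_pii`, `run_ocr` and `detect_pii` are hypothetical, and the two Azure calls are passed in as plain functions so the flow can be followed (and tried with stand-ins) without Azure credentials.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class PiiEntity:
    text: str      # the matched PII text
    category: str  # e.g. "Person", "PhoneNumber"
    offset: int    # character offset into the OCR text
    length: int    # number of characters matched


def find_pii(
    blob_url: str,
    run_ocr: Callable[[str], str],
    detect_pii: Callable[[str], List[PiiEntity]],
) -> List[PiiEntity]:
    """Run OCR on a document by reference, then extract PII from its text."""
    # Step 1: OCR. Only a reference to the blob is passed; the OCR service
    # fetches the document itself. With the Azure SDK for Python this could
    # be DocumentAnalysisClient.begin_analyze_document_from_url
    # (package azure-ai-formrecognizer); that wiring is an assumption.
    text = run_ocr(blob_url)
    if not text.strip():
        return []  # nothing recognised, so nothing to send onward

    # Step 2: PII extraction on the OCR text. With the Azure SDK this could
    # be TextAnalyticsClient.recognize_pii_entities (azure-ai-textanalytics).
    return detect_pii(text)


# Stand-in functions used purely to demonstrate the flow.
def fake_ocr(url: str) -> str:
    return "Contact Jane Doe on 07700 900123."


def fake_pii(text: str) -> List[PiiEntity]:
    name = "Jane Doe"
    return [PiiEntity(name, "Person", text.index(name), len(name))]


if __name__ == "__main__":
    for entity in find_pii("https://example.invalid/doc.pdf", fake_ocr, fake_pii):
        print(entity)
```

Keeping the service calls behind simple function parameters is a design choice for the sketch: it makes the order of operations explicit and lets each step be swapped or tested independently.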
Once the PII has been extracted, it is highlighted on the PDF displayed to the user. The user can then select highlighted text to redact it, or manually highlight other text and draw their own boxes to redact other elements of the document.
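To show how extracted PII might be turned into highlights, here is a minimal sketch that maps a PII character span back to the OCR words it overlaps and returns their bounding boxes. It assumes the OCR step yields each word with its character offset and a rectangular box; the `Word` type and `boxes_for_span` function are hypothetical names for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x0, y0, x1, y1) on the page


@dataclass
class Word:
    text: str
    offset: int  # character offset of the word in the OCR text
    box: Box     # where the word sits on the page


def boxes_for_span(words: List[Word], offset: int, length: int) -> List[Box]:
    """Return the boxes of every word that overlaps the given character span."""
    end = offset + length
    return [
        w.box
        for w in words
        # Two ranges overlap when each starts before the other ends.
        if w.offset < end and w.offset + len(w.text) > offset
    ]


if __name__ == "__main__":
    # OCR text: "Contact Jane Doe"
    words = [
        Word("Contact", 0, (10.0, 10.0, 60.0, 22.0)),
        Word("Jane", 8, (65.0, 10.0, 95.0, 22.0)),
        Word("Doe", 13, (100.0, 10.0, 125.0, 22.0)),
    ]
    # A PII entity covering "Jane Doe" (offset 8, length 8).
    print(boxes_for_span(words, 8, 8))
```

The same box representation would also suit the boxes a user draws by hand, so automatic and manual redactions can be treated alike downstream.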
As the user saves their progress, it is stored in the Azure SQL Database, which tracks the redactions made so far on each document.
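As a rough illustration of tracking redactions per document, the sketch below saves and reloads a document's redaction boxes. It uses Python's built-in sqlite3 module purely as a local stand-in for Azure SQL Database, and the table and column names are hypothetical; the real schema is not described here.

```python
import sqlite3
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x0, y0, x1, y1)
Redaction = Tuple[int, Box]              # (page number, box)


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """Open the store and create the (hypothetical) redaction table."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS redaction (
               document_id TEXT NOT NULL,
               page INTEGER NOT NULL,
               x0 REAL NOT NULL, y0 REAL NOT NULL,
               x1 REAL NOT NULL, y1 REAL NOT NULL,
               created_at TEXT NOT NULL DEFAULT CURRENT_TIMESTAMP
           )"""
    )
    return conn


def save_redactions(
    conn: sqlite3.Connection, document_id: str, redactions: List[Redaction]
) -> None:
    """Replace the saved redactions for one document in a single transaction."""
    with conn:  # commits on success, rolls back if an error is raised
        conn.execute("DELETE FROM redaction WHERE document_id = ?", (document_id,))
        conn.executemany(
            "INSERT INTO redaction (document_id, page, x0, y0, x1, y1) "
            "VALUES (?, ?, ?, ?, ?, ?)",
            [(document_id, page, *box) for page, box in redactions],
        )


def load_redactions(conn: sqlite3.Connection, document_id: str) -> List[Redaction]:
    """Return the saved redactions for a document, in insertion order."""
    rows = conn.execute(
        "SELECT page, x0, y0, x1, y1 FROM redaction "
        "WHERE document_id = ? ORDER BY rowid",
        (document_id,),
    ).fetchall()
    return [(page, (x0, y0, x1, y1)) for page, x0, y0, x1, y1 in rows]


if __name__ == "__main__":
    conn = open_store()
    save_redactions(conn, "doc-1", [(1, (65.0, 10.0, 95.0, 22.0))])
    print(load_redactions(conn, "doc-1"))
```

The `created_at` column is included because a timestamp of this kind is what a background clean-up process would need in order to delete data after the configured number of days.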
Once a user has completed the redaction, they can download the document with the redactions included.