Development and evaluation of an open source software tool for deidentification of pathology reports

NewsGuard 100/100 Score

Researchers have developed new reliable open-source software to remove identifiers from patient reports.

In a study published today in the open access journal BMC Medical Informatics and Decision Making, researchers report on a new open-source computer programme able to remove 98.3% of all identifiers from 1254 pathology reports processed. This programme provides a basis for others to develop customized tools specific to report types or institutional styles.

Bruce Beckwith from Beth Israel Deaconess Medical Center, Boston, USA and colleagues from other institutions in the USA designed a programme, or 'scrubber', to remove all identifiers from pathology reports. Nineteen different types of identifiers, including name, address and social security number may be found in pathology reports. The authors implemented the programme in three hospitals and processed a total of 1800 individual pathology reports, 1254 of which had unique identifiers, in XML format. The files were then checked manually.

Beckwith et al.'s results show that 98.3% of the 3499 unique identifiers in the pathology reports scanned were correctly removed. Only 19 of them were missed. The performance varied from 94.7% to 99% between the three hospitals and the authors conclude that hospital-specific customization might be needed to obtain the best performance.

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of News Medical.
Post a new comment
Post

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.

You might also like...
Study identifies APOE4 homozygotes as high-risk group for Alzheimer's disease