Virtual Transcriptionist Dictation Assistant

Purpose

Provide a more intelligent speech recognition through AI-assisted virtual transcriptionist/dictation assistant that can address common errors that occur within radiology reports, including age, sex/gender, laterality, speech-recognition errors, and affirmative/negative correlation between findings and impression

Tag(s)

Non-Interpretative

Panel

Reading Room

Define-AI ID

19120002

Originator

Woojin Kim
Lead Woojin Kim

Panel Chair

Ben Wandtke
Non-Interpretative Panel Chairs: Alexander J Towbin, Adam Prater  

Panel Reviewers

Reading Room Subpanel

License

Creative Commons 4.0 
Status Public Commenting 
                               

Clinical Implementation


Value Proposition

Radiologists have been using speech recognition (SR) to generate their radiology reports for decades, one of the early adopters of the SR technology within healthcare. While there have been advancements in the SR and reporting technology over the years, AI has the potential to provide a more intelligent virtual transcriptionist/dictation assistant function to the radiologists to reduce errors within their reports.

Narrative(s)

A 46-year-old patient calls the radiology department angrily to discuss her radiology report that contains multiple errors as well as a confusing impression. The report described her age as being 64 years old with a description of a prostate (instead of a uterus). The technique section says, “contrast menstruation.” While the findings section of the report says “no pulmonary nodule is seen,” the impression says, “1. Pulmonary nodule.”

Workflow Description

The AI algorithms can monitor the radiology report text in real-time to look for various errors, similar to how Grammarly (https://app.grammarly.com/) works to improve the grammar of the user in real-time (similar UI also recommended for consideration). The metadata, including the patient demographics, should be matched against the radiologist’s report text to detect whenever there is an error in the patient’s age, sex/gender (which includes not only male vs. female descriptions but also anatomy - e.g., prostate in a female as in the narrative example; in addition, the algorithm needs to be able to distinguish between the patient and someone described in addition to the patient - e.g., the history section may say, “mother describes cough and fever,” but the exam is that for her infant son; related, the algorithm also needs to be able to detect and keep track when more than one person has been imaged, such as the sex of the patient and the sex of the fetus/fetuses - e.g., MRI of a woman with a male fetus and a female fetus), and laterality.

In the above example, the radiologist dictated, “contrast administered,” but the SR transcribed it as “contrast menstruation.” Such SR errors are well-known to occur. AI algorithms can provide additional intelligence to know whenever there is a term that is out of place in context. This can supplement the basic spelling and grammar checks. Additional intelligence can be used to know whenever a particular body part or finding was mentioned that would not be seen with a given exam type (e.g., description of a thumb in a foot x-ray report).

Finally, the descriptions within the Findings section should be matched with those in the Impression section to ensure concordance. In the last example in the narrative, the radiologist either failed to say “no” or the SR failed to pick up “no,” resulting in what appears to be conflicting Findings vs. Impression.

Considerations for Dataset Development


  • Need a robust heterogeneous training dataset as prosaic and templated styles to vary widely across institutions (not to mention countries and languages)

Technical Specifications


Inputs

 

Radiologist Report


Definition

The Radiologist Report

 

Radiologist Voice Recording

 

 

Definition

 

 

The voice recording/file of the radiologist for a given radiology report

 

 

Primary Outputs


Wrong Text Detection in Report Text

Definition

Detect the wrong word(s) within the report text.

Data Type

Text

Value Set

N/A

Units

N/A


Wrong Text Detection in Voice Recording

Definition

Detect the wrong word(s) within voice recording.

Data Type

Text

Value Set

N/A

Units

N/A


Correct Word Suggestion

Definition

Suggestion of the correct word(s).

Data Type

Text

Value Set

N/A

Units

N/A




Secondary Output 

Reason Word(s) is wrong

Definition

Provide user reason on why detected word(s) is wrong.

Data Type

Categorical

Value Set

· No issues

· Misused term

· Not aligned with the exam

· Not aligned with the patient

· Grammatical error

· Conflict within the report

· Conflict between the report and EHR

· Missing information

Units

N/A

Future Development Ideas


Not all errors come from the radiologists. Sometimes the metadata contains errors. Therefore, more advanced AI algorithms should incorporate other information. Using computer vision, for example, the laterality described in the report can be compared also with the actual images. A similar AI technique can be used to determine errors within views or technique descriptions (e.g., 3 views instead of 2, enhanced vs. unenhanced, etc.).