TY - JOUR
T1 - Automatic identification and classification of surgical margin status from pathology reports following prostate cancer surgery.
AU - D'Avolio, Leonard W.
AU - Litwin, Mark S.
AU - Rogers, Selwyn O.
AU - Bui, Alex A.T.
PY - 2007
Y1 - 2007
N2 - Prostate cancer removal surgeries result in tumor found at the surgical margin, otherwise known as a positive surgical margin, have a significantly higher chance of biochemical recurrence and clinical progression. To support clinical outcomes assessment a system was designed to automatically identify, extract, and classify key phrases from pathology reports describing this outcome. Heuristics and boundary detection were used to extract phrases. Phrases were then classified using support vector machines into one of three classes: 'positive (involved) margins,' 'negative (uninvolved) margins,' and 'not-applicable or definitive.' A total of 851 key phrases were extracted from a sample of 782 reports produced between 1996 and 2006 from two major hospitals. Despite differences in reporting style, at least 1 sentence containing a diagnosis was extracted from 780 of the 782 reports (99.74%). Of the 851 sentences extracted, 97.3% contained diagnoses. Overall accuracy of automated classification of extracted sentences into the three categories was 97.18%.
AB - Prostate cancer removal surgeries result in tumor found at the surgical margin, otherwise known as a positive surgical margin, have a significantly higher chance of biochemical recurrence and clinical progression. To support clinical outcomes assessment a system was designed to automatically identify, extract, and classify key phrases from pathology reports describing this outcome. Heuristics and boundary detection were used to extract phrases. Phrases were then classified using support vector machines into one of three classes: 'positive (involved) margins,' 'negative (uninvolved) margins,' and 'not-applicable or definitive.' A total of 851 key phrases were extracted from a sample of 782 reports produced between 1996 and 2006 from two major hospitals. Despite differences in reporting style, at least 1 sentence containing a diagnosis was extracted from 780 of the 782 reports (99.74%). Of the 851 sentences extracted, 97.3% contained diagnoses. Overall accuracy of automated classification of extracted sentences into the three categories was 97.18%.
UR - http://www.scopus.com/inward/record.url?scp=56149117540&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=56149117540&partnerID=8YFLogxK
M3 - Article
C2 - 18693818
AN - SCOPUS:56149117540
SN - 1559-4076
SP - 160
EP - 164
JO - AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium
JF - AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium
ER -