An exploratory study using the predicate-argument structure to develop methodology for measuring semantic similarity of radiology sentences

dc.contributor.advisorJones, Josette F.
dc.contributor.authorNewsom, Eric Tyner
dc.contributor.otherGamache, Roland E.
dc.contributor.otherMahoui, Malika
dc.date.accessioned2013-11-12T18:50:55Z
dc.date.available2013-11-12T18:50:55Z
dc.date.issued2013-11-12
dc.degree.date2013en_US
dc.degree.disciplineSchool of Informaticsen
dc.degree.grantorIndiana Universityen_US
dc.degree.levelM.S.en_US
dc.descriptionIndiana University-Purdue University Indianapolis (IUPUI)en_US
dc.description.abstractThe amount of information produced in the form of electronic free text in healthcare is increasing to levels incapable of being processed by humans for advancement of his/her professional practice. Information extraction (IE) is a sub-field of natural language processing with the goal of data reduction of unstructured free text. Pertinent to IE is an annotated corpus that frames how IE methods should create a logical expression necessary for processing meaning of text. Most annotation approaches seek to maximize meaning and knowledge by chunking sentences into phrases and mapping these phrases to a knowledge source to create a logical expression. However, these studies consistently have problems addressing semantics and none have addressed the issue of semantic similarity (or synonymy) to achieve data reduction. To achieve data reduction, a successful methodology for data reduction is dependent on a framework that can represent currently popular phrasal methods of IE but also fully represent the sentence. This study explores and reports on the benefits, problems, and requirements to using the predicate-argument statement (PAS) as the framework. A convenient sample from a prior study with ten synsets of 100 unique sentences from radiology reports deemed by domain experts to mean the same thing will be the text from which PAS structures are formed.en_US
dc.identifier.urihttps://hdl.handle.net/1805/3666
dc.identifier.urihttp://dx.doi.org/10.7912/C2/894
dc.language.isoen_USen_US
dc.subjectNatural Language Processingen_US
dc.subjectInformation Extractionen_US
dc.subjectPredicate-Argument Structureen_US
dc.subjectSemantic Similarityen_US
dc.subject.lcshComputational linguistics -- Analysisen_US
dc.subject.lcshNatural language processing (Computer science)en_US
dc.subject.lcshSemantic computing -- Researchen_US
dc.subject.lcshSemantics -- Data processingen_US
dc.subject.lcshDescription logicsen_US
dc.subject.lcshElectronic information resource searching -- Researchen_US
dc.subject.lcshData miningen_US
dc.subject.lcshSemantic Weben_US
dc.subject.lcshText processing (Computer science)en_US
dc.subject.lcshInformation storage and retrieval systems -- Researchen_US
dc.subject.lcshPredicate (Logic)en_US
dc.subject.lcshMedical informatics -- Data processingen_US
dc.titleAn exploratory study using the predicate-argument structure to develop methodology for measuring semantic similarity of radiology sentencesen_US
dc.typeThesisen
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Thesis_Eric_Newsom.pdf
Size:
3.51 MB
Format:
Adobe Portable Document Format
Description:
Main thesis document
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.88 KB
Format:
Item-specific license agreed upon to submission
Description: