BlogSum: A Query-based Summarization Approach to Make Sense of Social Media

dc.contributor.authorMithun, Shamima
dc.date.accessioned2016-04-18T20:10:26Z
dc.date.available2016-04-18T20:10:26Z
dc.date.issued2016-04-08
dc.description.abstractWith the rapid growth of the Social Web, a large amount of informal opinionated texts are available on numerous topics. However, people can be overwhelmed with this vast amount of information and they need help to find the information of their interests. Natural language tools for automatically analyzing these opinions become necessary to help individuals, organizations, and governments in making timely decisions. To address this need, I proposed a summarization approach for opinionated texts. To validate my approach, BlogSum is developed and evaluated experimentally using current benchmarks. Users can ask BlogSum any question (e.g. Why do people like Chrome better than Firefox?). To answer user's question, BlogSum first retrieves relevant blogs, reviews from the web then generates a concise summary that represents people opinions expressed towards the topic. Since blog summarization is a more recent endeavor, an error analysis was conducted by manually analyzing blog summaries to find there is any information processing difference needed for blogs compared to factual data. This analysis shows that question irrelevance and discourse incoherence, which decrease the overall quality of a summary and reduces the summary coherence, are two major issues for blog summaries. To address question irrelevance and discourse incoherence, in this work a domain-independent schema-based summarization approach is developed that utilizes discourse structures. This approach is based on the automatic identification of discourse relations within candidate sentences in order to instantiate the most appropriate discourse schema and filter and order candidate sentences in the most effective way. BlogSum also needs to deal with opinions, emotions effectively to be successful. BlogSum's overall performance as well as performance for question relevance and coherence was evaluated using various dataset. These results show that the proposed approach can effectively reduce question irrelevance and discourse incoherence and satisfy user's information need.en_US
dc.identifier.citationMithun, Shamima. 2016 April 8. BlogSum: A Query-based Summarization Approach to Make Sense of Social Media. Poster presented at IUPUI Research Day 2016. Indianapolis, Indiana.en_US
dc.identifier.urihttps://hdl.handle.net/1805/9320
dc.language.isoen_USen_US
dc.publisherOffice of the Vice Chancellor for Research, IUPUIen_US
dc.subjectsocial mediaen_US
dc.subjectnatural languageen_US
dc.subjectbloggingen_US
dc.subjectquestion irrelevanceen_US
dc.subjectdiscourse incoherenceen_US
dc.subjecttext miningen_US
dc.titleBlogSum: A Query-based Summarization Approach to Make Sense of Social Mediaen_US
dc.typePosteren_US
Files
Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
BlogSumPoster_SM.pdf
Size:
214.12 KB
Format:
Adobe Portable Document Format
Description:
Poster
Loading...
Thumbnail Image
Name:
Mithun.pdf
Size:
7.83 KB
Format:
Adobe Portable Document Format
Description:
Poster abstract
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.88 KB
Format:
Item-specific license agreed upon to submission
Description: