Danh mục

Báo cáo khoa học: WISDOM: A Web Information Credibility Analysis System

Số trang: 4      Loại file: pdf      Dung lượng: 548.35 KB      Lượt xem: 10      Lượt tải: 0    
Thư viện của tui

Xem trước 2 trang đầu tiên của tài liệu này:

Thông tin tài liệu:

We demonstrate an information credibility analysis system called WISDOM. The purpose of WISDOM is to evaluate the credibility of information available on the Web from multiple viewpoints. WISDOM considers the following to be the source of information credibility: information contents, information senders, and information appearances. We aim at analyzing and organizing these measures on the basis of semantics-oriented natural language processing (NLP) techniques.
Nội dung trích xuất từ tài liệu:
Báo cáo khoa học: "WISDOM: A Web Information Credibility Analysis System" WISDOM: A Web Information Credibility Analysis System Susumu Akamine† Daisuke Kawahara† Yoshikiyo Kato† Tetsuji Nakagawa† Kentaro Inui† Sadao Kurohashi†‡ Yutaka Kidawara† † National Institute of Information and Communications Technology ‡ Graduate School of Informatics, Kyoto University {akamine, dk, ykato, tnaka, inui, kidawara}@nict.go.jp, kuro@i.kyoto-u.ac.jp Abstract distribution for a given topic. For this purpose, syntactic and discourse structures must be ana- We demonstrate an information credibility lyzed, their types and relations must be extracted, analysis system called WISDOM. The purpose and synonymous and ambiguous expressions of WISDOM is to evaluate the credibility of in- should be handled properly. formation available on the Web from multiple Furthermore, it is important to determine the viewpoints. WISDOM considers the following identity of the information sender and his/her to be the source of information credibility: in- specialty as criteria for credibility, which require formation contents, information senders, and named entity recognition and total analysis of information appearances. We aim at analyzing documents. and organizing these measures on the basis of In this paper, we describe an information cre- semantics-oriented natural language processing dibility analysis system called WISDOM, which (NLP) techniques. automatically analyzes and organizes the above aspects on the basis of semantically oriented1. Introduction NLP techniques. WISDOM currently operates As computers and computer networks become over 100 million Japanese Web pages.increasingly sophisticated, a vast amount of in- 2. Overview of WISDOMformation and knowledge has been accumulated We consider the following three criteria for theand circulated on the Web. They provide people judgment of information credibility.with options regarding their daily lives and arestarting to have a strong influence on govern- (1) Credibility of information contents,mental policies and business management. How- (2) Credibility of the information sender, andever, a crucial problem is that the information (3) Credibility estimated from the documentavailable on the Web is not necessarily credible. style and superficial characteristics.It is actually very difficult for human beings to In order to help people judge the credibility ofjudge the credibility of the information and even information from these viewpoints, we have beenmore difficult for computers. However, comput- developing an information analysis system calleders can be used to develop a system that collects, WISDOM. Figure 1 shows the analysis result oforganizes, and relativises information and helps WISDOM on the analysis topic “Is bio-ethanolhuman beings view information from several good for the environment?” Figure 2 shows theviewpoints and judge the credibility of the in- system architecture of WISDOM.formation. Given an analysis topic (query), WISDOM Information organization is a promising en- sends the query to the search engine TSUBAKIdeavor in the area of next-generation Web search. (Shinzato et al., 2008), and TSUBAKI returns aThe search engine Clusty provides a search result list of the top N relevant Web pages (N is usuallyclustering 1 , and Cuil classifies a search result on set to 1000).the basis of query-related terms2. The persuasive Then, those pages are automatically analyzed,technology research project at Stanford Universi- and major and contradictory expressions and eva-ty discussed how websites can be designed to luative expressions are extracted. Furthermore,influence people’s perceptions (B. J. Fogg, 2003). the information senders of the Web pages, whichHowever, as per our knowledge, no research has were analyzed beforehand, are collected and thebeen carried out for supporting the human judg- distribution is calculated.ment on information credibility and information The WISDOM analysis results can be viewedorganization systems for this purpose. from several viewpoints by changing the tabs In order to support the judgment of informa- using a Web browser. The leftmost tab, “Sum-tion credibility, it is necessary to extract the mary,” shows the summary of the analysis, withbackground, facts, and various opinions and their major phrases and major/contradictory state- ments first.1 http://clusty.com/, http://clusty.jp/ 1 Proceedings of the ACL-IJCNLP 2009 Software Demons ...

Tài liệu được xem nhiều:

Tài liệu cùng danh mục:

Tài liệu mới: