Báo cáo khoa học: WISDOM: A Web Information Credibility Analysis System
Thông tin tài liệu:
Nội dung trích xuất từ tài liệu:
Báo cáo khoa học: "WISDOM: A Web Information Credibility Analysis System" WISDOM: A Web Information Credibility Analysis System Susumu Akamine† Daisuke Kawahara† Yoshikiyo Kato† Tetsuji Nakagawa† Kentaro Inui† Sadao Kurohashi†‡ Yutaka Kidawara† † National Institute of Information and Communications Technology ‡ Graduate School of Informatics, Kyoto University {akamine, dk, ykato, tnaka, inui, kidawara}@nict.go.jp, kuro@i.kyoto-u.ac.jp Abstract distribution for a given topic. For this purpose, syntactic and discourse structures must be ana- We demonstrate an information credibility lyzed, their types and relations must be extracted, analysis system called WISDOM. The purpose and synonymous and ambiguous expressions of WISDOM is to evaluate the credibility of in- should be handled properly. formation available on the Web from multiple Furthermore, it is important to determine the viewpoints. WISDOM considers the following identity of the information sender and his/her to be the source of information credibility: in- specialty as criteria for credibility, which require formation contents, information senders, and named entity recognition and total analysis of information appearances. We aim at analyzing documents. and organizing these measures on the basis of In this paper, we describe an information cre- semantics-oriented natural language processing dibility analysis system called WISDOM, which (NLP) techniques. automatically analyzes and organizes the above aspects on the basis of semantically oriented1. Introduction NLP techniques. WISDOM currently operates As computers and computer networks become over 100 million Japanese Web pages.increasingly sophisticated, a vast amount of in- 2. Overview of WISDOMformation and knowledge has been accumulated We consider the following three criteria for theand circulated on the Web. They provide people judgment of information credibility.with options regarding their daily lives and arestarting to have a strong influence on govern- (1) Credibility of information contents,mental policies and business management. How- (2) Credibility of the information sender, andever, a crucial problem is that the information (3) Credibility estimated from the documentavailable on the Web is not necessarily credible. style and superficial characteristics.It is actually very difficult for human beings to In order to help people judge the credibility ofjudge the credibility of the information and even information from these viewpoints, we have beenmore difficult for computers. However, comput- developing an information analysis system calleders can be used to develop a system that collects, WISDOM. Figure 1 shows the analysis result oforganizes, and relativises information and helps WISDOM on the analysis topic “Is bio-ethanolhuman beings view information from several good for the environment?” Figure 2 shows theviewpoints and judge the credibility of the in- system architecture of WISDOM.formation. Given an analysis topic (query), WISDOM Information organization is a promising en- sends the query to the search engine TSUBAKIdeavor in the area of next-generation Web search. (Shinzato et al., 2008), and TSUBAKI returns aThe search engine Clusty provides a search result list of the top N relevant Web pages (N is usuallyclustering 1 , and Cuil classifies a search result on set to 1000).the basis of query-related terms2. The persuasive Then, those pages are automatically analyzed,technology research project at Stanford Universi- and major and contradictory expressions and eva-ty discussed how websites can be designed to luative expressions are extracted. Furthermore,influence people’s perceptions (B. J. Fogg, 2003). the information senders of the Web pages, whichHowever, as per our knowledge, no research has were analyzed beforehand, are collected and thebeen carried out for supporting the human judg- distribution is calculated.ment on information credibility and information The WISDOM analysis results can be viewedorganization systems for this purpose. from several viewpoints by changing the tabs In order to support the judgment of informa- using a Web browser. The leftmost tab, “Sum-tion credibility, it is necessary to extract the mary,” shows the summary of the analysis, withbackground, facts, and various opinions and their major phrases and major/contradictory state- ments first.1 http://clusty.com/, http://clusty.jp/ 1 Proceedings of the ACL-IJCNLP 2009 Software Demons ...
Tìm kiếm theo từ khóa liên quan:
Web Information Credibility Analysis System Susumu Akamine Long Papers báo cáo khoa học báo cáo ngôn ngữ xử lý ngôn ngữ tự nhiênTài liệu cùng danh mục:
-
Đề tài nghiên cứu khoa học: Kỹ năng quản lý thời gian của sinh viên trường Đại học Nội vụ Hà Nội
80 trang 1526 4 0 -
Tiểu luận: Phương pháp Nghiên cứu Khoa học trong kinh doanh
27 trang 471 0 0 -
57 trang 333 0 0
-
44 trang 297 0 0
-
19 trang 289 0 0
-
63 trang 286 0 0
-
báo cáo chuyên đề GIÁO DỤC BẢO VỆ MÔI TRƯỜNG
78 trang 284 0 0 -
13 trang 261 0 0
-
95 trang 258 1 0
-
80 trang 254 0 0
Tài liệu mới:
-
Khảo sát tình trạng dinh dưỡng trước mổ ở người bệnh ung thư đại trực tràng
9 trang 21 0 0 -
94 trang 19 0 0
-
Tham vấn Thanh thiếu niên - ĐH Mở Bán công TP Hồ Chí Minh
276 trang 20 0 0 -
Kết hợp luân phiên sóng T và biến thiên nhịp tim trong tiên lượng bệnh nhân suy tim
10 trang 19 0 0 -
Đề thi giữa học kì 1 môn Ngữ văn lớp 9 năm 2024-2025 có đáp án - Trường THCS Nguyễn Trãi, Thanh Khê
14 trang 21 0 0 -
Đánh giá hiệu quả giải pháp phát triển thể chất cho sinh viên Trường Đại học Kiến trúc Hà Nội
8 trang 20 0 0 -
Tỉ lệ và các yếu tố liên quan đoạn chi dưới ở bệnh nhân đái tháo đường có loét chân
11 trang 20 0 0 -
39 trang 19 0 0
-
Đề thi học kì 1 môn Tiếng Anh lớp 6 năm 2024-2025 có đáp án - Trường TH&THCS Quang Trung, Hội An
6 trang 19 1 0 -
Tôm ram lá chanh vừa nhanh vừa dễRất dễ làm, nhanh gọn mà lại ngon. Nhà mình
7 trang 19 0 0