Luận án Tiến sĩ Khoa học máy tính: Advanced deep learning methods and applications in open domain question answering
Số trang: 67
Loại file: pdf
Dung lượng: 1.17 MB
Lượt xem: 27
Lượt tải: 0
Xem trước 7 trang đầu tiên của tài liệu này:
Thông tin tài liệu:
The resulting model is a Document Retriever, called QASA, which is then integrated with a machine reader to form a complete open-domain QA system. Our system is thoroughly evaluated using QUASAR-T dataset and shows surpassing results compared to other state-of-the-art methods.
Nội dung trích xuất từ tài liệu:
Luận án Tiến sĩ Khoa học máy tính: Advanced deep learning methods and applications in open domain question answering VIETNAM NATIONAL UNIVERSITY, HANOI UNIVERSITY OF ENGINEERING AND TECHNOLOGY Nguyen Minh TrangADVANCED DEEP LEARNING METHODS AND APPLICATIONS INOPEN-DOMAIN QUESTION ANSWERING MASTER THESIS Major: Computer Science HA NOI - 2019 VIETNAM NATIONAL UNIVERSITY, HANOI UNIVERSITY OF ENGINEERING AND TECHNOLOGY Nguyen Minh Trang ADVANCED DEEP LEARNING METHODS AND APPLICATIONS IN OPEN-DOMAIN QUESTION ANSWERING MASTER THESIS Major: Computer ScienceSupervisor: Assoc.Prof. Ha Quang Thuy Ph.D. Nguyen Ba Dat HA NOI - 2019 AbstractEver since the Internet has become ubiquitous, the amount of data accessible byinformation retrieval systems has increased exponentially. As for information con-sumers, being able to obtain a short and accurate answer for any query is one ofthe most desirable features. This motivation, along with the rise of deep learning,has led to a boom in open-domain Question Answering (QA) research. An open-domain QA system usually consists of two modules: retriever and reader. Eachis developed to solve a particular task. While the problem of document compre-hension has received multiple success with the help of large training corpora andthe emergence of attention mechanism, the development of document retrieval inopen-domain QA has not gain much progress. In this thesis, we propose a novelencoding method for learning question-aware self-attentive document represen-tations. Then, these representations are utilized by applying pair-wise rankingapproach to them. The resulting model is a Document Retriever, called QASA,which is then integrated with a machine reader to form a complete open-domainQA system. Our system is thoroughly evaluated using QUASAR-T dataset andshows surpassing results compared to other state-of-the-art methods.Keywords: Open-domain Question Answering, Document Retrieval, Learning toRank, Self-attention mechanism. iii AcknowledgementsForemost, I would like to express my sincere gratitude to my supervisor Assoc.Prof. Ha Quang Thuy for the continuous support of my Master study and research,for his patience, motivation, enthusiasm, and immense knowledge. His guidancehelped me in all the time of research and writing of this thesis. I would also like to thank my co-supervisor Ph.D. Nguyen Ba Dat who hasnot only provided me with valuable guidance but also generously funded my re-search. My sincere thanks also goes to Assoc. Prof. Chng Eng-Siong and M.Sc. VuThi Ly for offering me the summer internship opportunities in NTU, Singaporeand leading me working on diverse exciting projects. I thank my fellow labmates in KTLab: M.Sc. Le Hoang Quynh, B.Sc. CanDuy Cat, B.Sc. Tran Van Lien for the stimulating discussions, and for all the funwe have had in the last two years. Last but not the least, I would like to thank my parents for giving birth to meat the first place and supporting me spiritually throughout my life. iv DeclarationI declare that the thesis has been composed by myself and that the work has notbe submitted for any other degree or professional qualification. I confirm that thework submitted is my own, except where work which has formed part of jointly-authored publications has been included. My contribution and those of the other authors to this work have been ex-plicitly indicated below. I confirm that appropriate credit has been given withinthis thesis where reference has been made to the work of others. The work pre-sented in Chapter 3 was previously published in Proceedings of the 3rd ICMLSCas “QASA: Advanced Document Retriever for Open Domain Question Answeringby Learning to Rank Question-Aware Self-Attentive Document Representations”by Trang M. Nguyen (myself), Van-Lien Tran, Duy-Cat Can, Quang-Thuy Ha(my supervisor), Ly T. Vu, Eng-Siong Chng. This study was conceived by all ofthe authors. My contributions include: proposing the method, carrying out theexperiments, and writing the paper. Master student Nguyen Minh Trang vTable of ContentsAbstract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iiiAcknowledgements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ivDeclaration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ...
Nội dung trích xuất từ tài liệu:
Luận án Tiến sĩ Khoa học máy tính: Advanced deep learning methods and applications in open domain question answering VIETNAM NATIONAL UNIVERSITY, HANOI UNIVERSITY OF ENGINEERING AND TECHNOLOGY Nguyen Minh TrangADVANCED DEEP LEARNING METHODS AND APPLICATIONS INOPEN-DOMAIN QUESTION ANSWERING MASTER THESIS Major: Computer Science HA NOI - 2019 VIETNAM NATIONAL UNIVERSITY, HANOI UNIVERSITY OF ENGINEERING AND TECHNOLOGY Nguyen Minh Trang ADVANCED DEEP LEARNING METHODS AND APPLICATIONS IN OPEN-DOMAIN QUESTION ANSWERING MASTER THESIS Major: Computer ScienceSupervisor: Assoc.Prof. Ha Quang Thuy Ph.D. Nguyen Ba Dat HA NOI - 2019 AbstractEver since the Internet has become ubiquitous, the amount of data accessible byinformation retrieval systems has increased exponentially. As for information con-sumers, being able to obtain a short and accurate answer for any query is one ofthe most desirable features. This motivation, along with the rise of deep learning,has led to a boom in open-domain Question Answering (QA) research. An open-domain QA system usually consists of two modules: retriever and reader. Eachis developed to solve a particular task. While the problem of document compre-hension has received multiple success with the help of large training corpora andthe emergence of attention mechanism, the development of document retrieval inopen-domain QA has not gain much progress. In this thesis, we propose a novelencoding method for learning question-aware self-attentive document represen-tations. Then, these representations are utilized by applying pair-wise rankingapproach to them. The resulting model is a Document Retriever, called QASA,which is then integrated with a machine reader to form a complete open-domainQA system. Our system is thoroughly evaluated using QUASAR-T dataset andshows surpassing results compared to other state-of-the-art methods.Keywords: Open-domain Question Answering, Document Retrieval, Learning toRank, Self-attention mechanism. iii AcknowledgementsForemost, I would like to express my sincere gratitude to my supervisor Assoc.Prof. Ha Quang Thuy for the continuous support of my Master study and research,for his patience, motivation, enthusiasm, and immense knowledge. His guidancehelped me in all the time of research and writing of this thesis. I would also like to thank my co-supervisor Ph.D. Nguyen Ba Dat who hasnot only provided me with valuable guidance but also generously funded my re-search. My sincere thanks also goes to Assoc. Prof. Chng Eng-Siong and M.Sc. VuThi Ly for offering me the summer internship opportunities in NTU, Singaporeand leading me working on diverse exciting projects. I thank my fellow labmates in KTLab: M.Sc. Le Hoang Quynh, B.Sc. CanDuy Cat, B.Sc. Tran Van Lien for the stimulating discussions, and for all the funwe have had in the last two years. Last but not the least, I would like to thank my parents for giving birth to meat the first place and supporting me spiritually throughout my life. iv DeclarationI declare that the thesis has been composed by myself and that the work has notbe submitted for any other degree or professional qualification. I confirm that thework submitted is my own, except where work which has formed part of jointly-authored publications has been included. My contribution and those of the other authors to this work have been ex-plicitly indicated below. I confirm that appropriate credit has been given withinthis thesis where reference has been made to the work of others. The work pre-sented in Chapter 3 was previously published in Proceedings of the 3rd ICMLSCas “QASA: Advanced Document Retriever for Open Domain Question Answeringby Learning to Rank Question-Aware Self-Attentive Document Representations”by Trang M. Nguyen (myself), Van-Lien Tran, Duy-Cat Can, Quang-Thuy Ha(my supervisor), Ly T. Vu, Eng-Siong Chng. This study was conceived by all ofthe authors. My contributions include: proposing the method, carrying out theexperiments, and writing the paper. Master student Nguyen Minh Trang vTable of ContentsAbstract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iiiAcknowledgements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ivDeclaration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ...
Tìm kiếm theo từ khóa liên quan:
Luận án Tiến sĩ Công nghệ thông tin Advanced deep learning methods Khoa học máy tính Computer science Open-domain Question answering Document retrievalGợi ý tài liệu liên quan:
-
Tóm tắt Đồ án tốt nghiệp Khoa học máy tính: Xây dựng ứng dụng quản lý quán cà phê
15 trang 460 1 0 -
205 trang 413 0 0
-
52 trang 410 1 0
-
Luận án Tiến sĩ Tài chính - Ngân hàng: Phát triển tín dụng xanh tại ngân hàng thương mại Việt Nam
267 trang 376 1 0 -
Đề thi kết thúc học phần học kì 2 môn Cơ sở dữ liệu năm 2019-2020 có đáp án - Trường ĐH Đồng Tháp
5 trang 371 6 0 -
206 trang 299 2 0
-
174 trang 297 0 0
-
Top 10 mẹo 'đơn giản nhưng hữu ích' trong nhiếp ảnh
11 trang 291 0 0 -
Báo cáo thực tập thực tế: Nghiên cứu và xây dựng website bằng Wordpress
24 trang 286 0 0 -
74 trang 275 0 0