Báo cáo khoa học: A Web-Based Interactive Computer Aided Translation Tool
Số trang: 4
Loại file: pdf
Dung lượng: 129.91 KB
Lượt xem: 12
Lượt tải: 0
Xem trước 2 trang đầu tiên của tài liệu này:
Thông tin tài liệu:
We developed caitra, a novel tool that aids human translators by (a) making suggestions for sentence completion in an interactive machine translation setting, (b) providing alternative word and phrase translations, and (c) allowing them to postedit machine translation output. The tool uses the Moses decoder, is implemented in Ruby on Rails and C++ and delivered over the web. We are at the beginning of a research program to explore the benefits of these different types of aid to human translators, analyze user interaction behavior, and develop novel types of assistance. ...
Nội dung trích xuất từ tài liệu:
Báo cáo khoa học: "A Web-Based Interactive Computer Aided Translation Tool" A Web-Based Interactive Computer Aided Translation Tool Philipp Koehn School of Informatics University of Edinburgh pkoehn@inf.ed.ac.uk Abstract We are at the beginning of a research program to explore the benefits of these different types of We developed caitra, a novel tool that aids aid to human translators, analyze user interaction human translators by (a) making sugges- behavior, and develop novel types of assistance. tions for sentence completion in an inter- To have a testbed for this research, we developed active machine translation setting, (b) pro- an online, web-based tool for translators. viding alternative word and phrase trans- lations, and (c) allowing them to post- 2 Overview edit machine translation output. The tool uses the Moses decoder, is implemented in Caitra is implemented in Ruby on Rails (Thomas Ruby on Rails and C++ and delivered over and Hansson, 2008) as a web-based client-server the web. architecture, using Ajax-style Web 2.0 technolo-1 Introduction gies (Raymond, 2007) connected to a MySQL database-driven back-end. The machine trans-Today’s machine translation systems are mostly lation back-end is powered by the open sourceused for inbound translation (also called assim- Moses decoder (Koehn et al., 2007). The inter-ilation), where the reader accepts lower quality active machine translation prediction code is im-translation for instant access to foreign language plemented in C++ for speed. The tool is deliveredtext. The standards are much higher for outbound over the web to allow for easier user studies withtranslation (also called dissemination), where the remote users, but also to expose the tool to a widerreader is typically an unsuspecting customer or cit- community to gather additional feedback. You canizen who is seeking information about products or find caitra online at http://www.caitra.org/services, and human translators are required for Caitra allows the uploading of documents us-high-quality publication-ready translation. ing a simple text box. This text is then processed While machine translation has made tremen- by a back-end job to pre-compute all the neces-dous progress over the last years, this progress has sary data (machine translation output, translationmade little inroads into tools for human transla- options, search graphs). This process takes a fewtors. Although it has become common practice in minutes.the industry to provide human translators with ma- Finally, the user is presented with an interfacechine translation output that they have to post-edit, that includes all the different types of assistance.typically no deeper integration of machine transla- Each may be turned off, if the user finds it distract-tion and human translation is found in translation ing. The user translates one sentence at a time,agencies. while the context (both input and user transla- An interesting approach was pioneered by the tion, including the proceeding and following para-TransType project (Langlais et al., 2000). The ma- graph) is displayed for reference.chine translation system makes sentence comple- In the next three sections, we will describe eachtion predictions in an interactive machine trans- type of assistance in detail.lation setting. The users may accept them oroverride them by typing in their own translations, 3 Interactive Machine Translationwhich triggers new suggestions by the tool (Bar-rachina et al., 2009). The idea of interactive machine translation has But also other information that is generated dur- been greatly advanced by work carried out in theing the machine translation process may be useful TransType project (Langlais et ...
Nội dung trích xuất từ tài liệu:
Báo cáo khoa học: "A Web-Based Interactive Computer Aided Translation Tool" A Web-Based Interactive Computer Aided Translation Tool Philipp Koehn School of Informatics University of Edinburgh pkoehn@inf.ed.ac.uk Abstract We are at the beginning of a research program to explore the benefits of these different types of We developed caitra, a novel tool that aids aid to human translators, analyze user interaction human translators by (a) making sugges- behavior, and develop novel types of assistance. tions for sentence completion in an inter- To have a testbed for this research, we developed active machine translation setting, (b) pro- an online, web-based tool for translators. viding alternative word and phrase trans- lations, and (c) allowing them to post- 2 Overview edit machine translation output. The tool uses the Moses decoder, is implemented in Caitra is implemented in Ruby on Rails (Thomas Ruby on Rails and C++ and delivered over and Hansson, 2008) as a web-based client-server the web. architecture, using Ajax-style Web 2.0 technolo-1 Introduction gies (Raymond, 2007) connected to a MySQL database-driven back-end. The machine trans-Today’s machine translation systems are mostly lation back-end is powered by the open sourceused for inbound translation (also called assim- Moses decoder (Koehn et al., 2007). The inter-ilation), where the reader accepts lower quality active machine translation prediction code is im-translation for instant access to foreign language plemented in C++ for speed. The tool is deliveredtext. The standards are much higher for outbound over the web to allow for easier user studies withtranslation (also called dissemination), where the remote users, but also to expose the tool to a widerreader is typically an unsuspecting customer or cit- community to gather additional feedback. You canizen who is seeking information about products or find caitra online at http://www.caitra.org/services, and human translators are required for Caitra allows the uploading of documents us-high-quality publication-ready translation. ing a simple text box. This text is then processed While machine translation has made tremen- by a back-end job to pre-compute all the neces-dous progress over the last years, this progress has sary data (machine translation output, translationmade little inroads into tools for human transla- options, search graphs). This process takes a fewtors. Although it has become common practice in minutes.the industry to provide human translators with ma- Finally, the user is presented with an interfacechine translation output that they have to post-edit, that includes all the different types of assistance.typically no deeper integration of machine transla- Each may be turned off, if the user finds it distract-tion and human translation is found in translation ing. The user translates one sentence at a time,agencies. while the context (both input and user transla- An interesting approach was pioneered by the tion, including the proceeding and following para-TransType project (Langlais et al., 2000). The ma- graph) is displayed for reference.chine translation system makes sentence comple- In the next three sections, we will describe eachtion predictions in an interactive machine trans- type of assistance in detail.lation setting. The users may accept them oroverride them by typing in their own translations, 3 Interactive Machine Translationwhich triggers new suggestions by the tool (Bar-rachina et al., 2009). The idea of interactive machine translation has But also other information that is generated dur- been greatly advanced by work carried out in theing the machine translation process may be useful TransType project (Langlais et ...
Tìm kiếm theo từ khóa liên quan:
Web-Based Interactive Computer Aided Translation Tool Philipp Koehn long papers báo cáo khoa học báo cáo ngôn ngữ xử lý ngôn ngữ tự nhiênTài liệu liên quan:
-
63 trang 331 0 0
-
12 trang 319 0 0
-
Phương pháp tạo ra văn bản tiếng Việt có đề tài xác định
7 trang 276 0 0 -
13 trang 268 0 0
-
Báo cáo khoa học Bước đầu tìm hiểu văn hóa ẩm thực Trà Vinh
61 trang 255 0 0 -
Tóm tắt luận án tiến sỹ Một số vấn đề tối ưu hóa và nâng cao hiệu quả trong xử lý thông tin hình ảnh
28 trang 225 0 0 -
Đề tài nghiên cứu khoa học và công nghệ cấp trường: Hệ thống giám sát báo trộm cho xe máy
63 trang 214 0 0 -
NGHIÊN CỨU CHỌN TẠO CÁC GIỐNG LÚA CHẤT LƯỢNG CAO CHO VÙNG ĐỒNG BẰNG SÔNG CỬU LONG
9 trang 214 0 0 -
Giáo trình Lập trình logic trong prolog: Phần 1
114 trang 205 0 0 -
Đề tài nghiên cứu khoa học: Tội ác và hình phạt của Dostoevsky qua góc nhìn tâm lý học tội phạm
70 trang 193 0 0