Scientific report: A High-Speed Large-Capacity Dictionary System
Document information:
A system of dictionary organization is described which makes it possible for a computer with 32,000 words of core storage to accommodate a vocabulary of hundreds of thousands of words, with a look-up speed of over a hundred words per second. The central part of the look-up process involves using the first few letters of each word as addresses, one after another.
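The look-up idea summarized above, in which the first few letters of a word serve as successive addresses, can be illustrated with a small sketch. This is not the authors' implementation: the 26-letter lowercase Latin alphabet, the `PREFIX_LEN` of 2, the `Node` class, and the linear scan over the remaining letters are all assumptions made for illustration only.

```python
PREFIX_LEN = 2  # assumption: number of leading letters used as addresses


def letter_address(ch):
    """Map a lowercase letter a-z to a table offset 0-25 (illustrative alphabet)."""
    return ord(ch) - ord("a")


class Node:
    """One address table: a slot per letter plus the entries that end here."""

    def __init__(self):
        self.slots = [None] * 26  # next-level tables, addressed by letter
        self.entries = []         # (remaining letters, exposition) pairs


def insert(root, word, exposition):
    """Store an entry by following one table per leading letter."""
    node = root
    for ch in word[:PREFIX_LEN]:
        slot = letter_address(ch)
        if node.slots[slot] is None:
            node.slots[slot] = Node()
        node = node.slots[slot]
    node.entries.append((word[PREFIX_LEN:], exposition))


def look_up(root, word):
    """Follow the leading letters as addresses, then compare the remainder."""
    node = root
    for ch in word[:PREFIX_LEN]:
        node = node.slots[letter_address(ch)]
        if node is None:
            return None  # no table exists for this letter sequence
    rest = word[PREFIX_LEN:]
    for stored_rest, exposition in node.entries:
        if stored_rest == rest:
            return exposition
    return None


root = Node()
insert(root, "table", "exposition for 'table'")
insert(root, "tape", "exposition for 'tape'")
insert(root, "a", "exposition for 'a'")
```

Each leading letter costs one indexed table access instead of a comparison against many headings, so only the short list of entries sharing the same prefix needs to be scanned.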
Content extracted from the document:
[Mechanical Translation, Vol. 6, November 1961]

A High-Speed Large-Capacity Dictionary System
by Sydney M. Lamb and William H. Jacobsen, Jr.,* University of California, Berkeley

Introductory

This paper describes a method of adapting dictionaries for use by a computer in such a way that comprehensiveness of vocabulary coverage can be maximized while look-up time is minimized. Although the programming of the system has not yet been completed, it is estimated at the time of writing that it will allow for a dictionary of 20,000 entries or more, with a total look-up time of about 8 milliseconds (0.008 seconds) per word, when used on an IBM 704 computer with 32,000 words of core storage. With a proper system of segmentation, a dictionary of 20,000 entries can handle several hundred thousand different words, thus providing ample coverage for a single fairly broad field of science. Although the system has been designed specifically for purposes of machine translation of Russian, it is applicable to other areas of linguistic data processing in which dictionaries are needed.

Preliminary Definitions

An entity for which there is (or should be) a dictionary entry is a lexical item or lex. A text is made up of a sequence of lexes, for each of which we hope to find a di ...

A dictionary entry may be thought of as consisting of two parts, the heading and the exposition.¹ The heading is an instance (or coded representation) of the lex itself, and serves to identify the entry. The remainder of the entry, the exposition, is the information which is provided concerning that lex. If the dictionary is part of an automatic translation system, the exposition might contain the following three parts (not necessarily separated): (1) the syntactic-semantic code, signifying distributional and semantic properties about which information may be needed in dealing with other lexemes occurring in the environment of the one in question; (2) (highly compressed) instructions for selecting the appropriate target representation for any given environment; and (3) the target representations. In an efficient automatic dictionary system the target representations might be kept together on tape, to be brought into core storage as a body when needed, after the look-up and translation proper have been completed. In this case, the expositions would be split up, the target representations being separated from the rest; in their place would be put the addresses where the representations would be located after the “target-language tape” has been run into core storage. Then we ...
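The extract describes an entry as a heading plus an exposition, with the target representations stored separately and replaced in the entry by their addresses. A minimal sketch follows, under stated assumptions: the class names, field names, `split_targets`, `resolve`, and the sample words are hypothetical and do not come from the paper, and a Python list stands in for the target-language tape once it has been loaded into core storage.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Exposition:
    syntactic_semantic_code: str   # part (1) of the exposition
    selection_instructions: str    # part (2) of the exposition
    target_addresses: List[int]    # stands in for part (3): where targets live


@dataclass
class Entry:
    heading: str                   # coded representation of the lex
    exposition: Exposition


def split_targets(raw_entries) -> Tuple[Dict[str, Entry], List[str]]:
    """Move target representations into one table and keep only addresses.

    raw_entries: iterable of (heading, code, instructions, [target strings]).
    """
    target_table: List[str] = []
    entries: Dict[str, Entry] = {}
    for heading, code, instructions, targets in raw_entries:
        addresses = []
        for target in targets:
            addresses.append(len(target_table))  # address = position in table
            target_table.append(target)
        entries[heading] = Entry(heading, Exposition(code, instructions, addresses))
    return entries, target_table


def resolve(entry: Entry, target_table: List[str]) -> List[str]:
    """Fetch the target representations once the table is available."""
    return [target_table[a] for a in entry.exposition.target_addresses]


# Hypothetical sample data, for illustration only.
entries, target_table = split_targets([
    ("stol", "N", "pick-by-context", ["table", "desk"]),
    ("dom", "N", "pick-first", ["house"]),
])
```

Look-up and translation proper can then work with the compact entries alone, and `resolve` is called only afterwards, when the target table has been brought into memory as a body.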