Danh mục

Training for a 10k Event By Ben Wisbey

Số trang: 6      Loại file: pdf      Dung lượng: 45.56 KB      Lượt xem: 23      Lượt tải: 0    
tailieu_vip

Xem trước 1 trang đầu tiên của tài liệu này:

Thông tin tài liệu:

Hadoop [7] is an open-source implementation of Google’s Map/Reduce [5] framework that enables dis- tributed, data-intensive, parallel applications by decom- posing a massive job into smaller tasks and a massive data-set into smaller partitions, such that each task pro- cesses a different partition in parallel. The main abstrac- tions are (i) Map tasks that process the partitions of the data-set using key/value pairs to generate a set of inter- mediate results, and (ii) Reduce tasks that merge all in- termediate values associated with the same intermediate key. Hadoop uses the Hadoop Distributed File System (HDFS), an implementation of the Google Filesystem [16], to share data amongst the distributed tasks in the system. HDFS...
Nội dung trích xuất từ tài liệu:
Training for a 10k Event By Ben Wisbey

Tài liệu được xem nhiều: