Zhang Xiaohong (张霄宏),Feng Shengzhong,Fan Jianping,Huang Zhexue.[J].高技术通讯(英文),2012,18(4):343~349 |
|
Method for improving MapReduce performance by prefetching before scheduling |
|
DOI: |
中文关键词: |
英文关键词: cloud computing, distributed computing, prefetching, MapReduce, scheduling |
基金项目: |
Author Name | Affiliation | Zhang Xiaohong (张霄宏) | | Feng Shengzhong | | Fan Jianping | | Huang Zhexue | |
|
Hits: 830 |
Download times: 0 |
中文摘要: |
|
英文摘要: |
In this paper, a prefetching technique is proposed to solve the performance problem caused by remote data access delay. In the technique, the map tasks which will cause the delay are predicted first and then the input data of these tasks will be preloaded before the tasks are scheduled. During the execution, the input data can be read from local nodes. Therefore, the delay can be hidden. The technique has been implemented in Hadoop-0.20.1. The experiment results have shown that the technique reduces map tasks causing delay, and improves the performance of Hadoop MapReduce by 20%. |
View Full Text
View/Add Comment Download reader |
Close |
|
|
|