276 / 2017-01-31 16:43:14
Genome Sequencing using MapReduce and Hadoop – A Technical Review
5881,7212,12688,12689
全文录用
Divya Patel / Yeshwantrao Chavan College of Engineering
Kavita Singh / Yeshwantrao College of Engineering
The technique that allows for researchers to read and convert the genetic information found in the DNA of any organisms is called Genome Sequencing. Genome Sequencing involves determining the order of the nucleotide sub units found in DNA, which consists of a small number of bases called short reads. The human genome is approximately 3 billion bases in length, which would take months or years to be processed on a single machine. So large numbers of short reads are available in such sequencing. In these cases, the first step in the data analysis pipeline is the short read mapping problem. Speed is becoming significantly important and challenging due to the huge volume of data. In this paper, we have proposed an architecture that will take a dataset of DNA sequencing as an input and split them across the cluster machine by applying Map Reduce implementation of hadoop to make the search efficient for large scale genome sequencing applications.
重要日期
  • 会议日期

    03月22日

    2017

    03月24日

    2017

  • 02月15日 2017

    初稿截稿日期

  • 02月20日 2017

    初稿录用通知日期

  • 02月22日 2017

    终稿截稿日期

  • 03月24日 2017

    注册截止日期

移动端
在手机上打开
小程序
打开微信小程序
客服
扫码或点此咨询