Online Program Home
My Program

Abstract Details

Activity Number: 490 - Advances in Methods for the Accurate Measurement of High-Throughput Sequencing Data
Type: Contributed
Date/Time: Wednesday, August 1, 2018 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistics in Genomics and Genetics
Abstract #330093 Presentation
Title: Vi-HMM: a Novel HMM-Based Method for Sequence Variant Identification in Short Read Data
Author(s): Man Tang* and Mohammad Shabbir Hasan and Liqing Zhang and Hongxiao Zhu and Xiaowei Wu
Companies: Virginia Tech and Virginia Tech and Virginia Tech and Virginia Tech University and Virginia Tech
Keywords: Variant calling; HMM; Viterbi algorithm; SNP; Indel
Abstract:

Accurate and reliable identification of sequence variants, including Single Nucleotide Polymorphisms(SNPs) and Insertion-Deletion polymorphisms(INDELs), plays a fundamental role in next-generation sequencing (NGS) applications. Existing methods for calling these variants often make simplified assumptions of positional independence and fail to leverage spatial dependence of genotypes at nearby loci caused by Linkage Disequilibrium. We propose vi-HMM, a hidden Markov model(HMM) based method for calling SNPs and INDELs in aligned short read data. This method allows transitions between hidden states(defined as SNP, insertion, deletion, and match) on adjacent genomic bases, and determines an optimal hidden state path using the Viterbi algorithm. The inferred hidden state path provides a direct solution to the identification of SNPs and INDELs. Simulation experiments show that, under various sequencing depths, vi-HMM outperforms existing commonly-used variant calling methods in terms of sensitivity and F1 score. When applied to the human whole genome sequencing(WGS) data, vi-HMM achieves comparable results to the gold standard GATK callers and performs better than FreeBayes and Platypus.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2018 program