An Evaluation of Variant Annotation Tools – Alamut Batch, ENSEMBL Variant Effect Predictor (VEP), and ANNOVAR - for Clinical Next Generation Sequencing (NGS) based Genetic Testing

Session Number

Project ID: MEDH 01

Advisor(s)

Dr. Kai Lee Yap, PhD, FACMG, Director of Molecular Diagnostics, Ann and Robert H. Lurie Children’s Hospital of Chicago, Assistant Professor of Pathology, Feinberg School of Medicine, Northwestern University

Discipline

Medical and Health Sciences

Start Date

20-4-2022 11:55 AM

End Date

20-4-2022 12:20 PM

Abstract

Dramatically expanding our ability for clinical genetic testing for inherited conditions and complex diseases such as cancer, next generation sequencing (NGS) technologies are allowing for rapid interrogation of thousands of genes and identification of millions of variants. Variant annotation, the process of assigning functional information to DNA variants based on the standardized Human Genome Variation Society (HGVS) nomenclature, is a fundamental challenge in the analysis of NGS data that has led to the development of many empirically based tools. In this study, we evaluated the performance of three variant annotation tools: Alamut Batch, ENSEMBL Variant Effect Predictor (VEP) and ANNOVAR, benchmarked by a manually curated ground truth set of 298 variants from the medical exome database at the Molecular Diagnostics Laboratory at Lurie Children’s Hospital. Of the three tools, VEP produces most accurate variant annotations (HGVS nomenclature for 297 of the 298 variants) due to usage of updated gene transcript versions within the algorithm. Alamut Batch called 296 of the 298 variants correctly; strikingly, ANNOVAR exhibited the greatest number of discrepancies (20 of the 298 variants, 93.3% concordance with ground truth set). Adoption of validated methods of variant annotation is critical in post analytical phases of clinical testing.

Share

COinS
 
Apr 20th, 11:55 AM Apr 20th, 12:20 PM

An Evaluation of Variant Annotation Tools – Alamut Batch, ENSEMBL Variant Effect Predictor (VEP), and ANNOVAR - for Clinical Next Generation Sequencing (NGS) based Genetic Testing

Dramatically expanding our ability for clinical genetic testing for inherited conditions and complex diseases such as cancer, next generation sequencing (NGS) technologies are allowing for rapid interrogation of thousands of genes and identification of millions of variants. Variant annotation, the process of assigning functional information to DNA variants based on the standardized Human Genome Variation Society (HGVS) nomenclature, is a fundamental challenge in the analysis of NGS data that has led to the development of many empirically based tools. In this study, we evaluated the performance of three variant annotation tools: Alamut Batch, ENSEMBL Variant Effect Predictor (VEP) and ANNOVAR, benchmarked by a manually curated ground truth set of 298 variants from the medical exome database at the Molecular Diagnostics Laboratory at Lurie Children’s Hospital. Of the three tools, VEP produces most accurate variant annotations (HGVS nomenclature for 297 of the 298 variants) due to usage of updated gene transcript versions within the algorithm. Alamut Batch called 296 of the 298 variants correctly; strikingly, ANNOVAR exhibited the greatest number of discrepancies (20 of the 298 variants, 93.3% concordance with ground truth set). Adoption of validated methods of variant annotation is critical in post analytical phases of clinical testing.