IMR Press / FBL / Volume 12 / Issue 5 / DOI: 10.2741/2173

Frontiers in Bioscience-Landmark (FBL) is published by IMR Press from Volume 26 Issue 5 (2021). Previous articles were published by another publisher on a subscription basis, and they are hosted by IMR Press on as a courtesy and upon agreement with Frontiers in Bioscience.

Predicting single nucleotide polymorphisms (SNP) from DNA sequence by support vector machine
Show Less
1 Bioinformatics Group, Nanyang Polytechnic,180 Ang Mo Kio Ave 8, S(569830), Singapore
Front. Biosci. (Landmark Ed) 2007, 12(5), 1610–1614;
Published: 1 January 2007

Recently, SNP has gained substantial attention as genetic markers and is recognized as a key element in the development of personalized medicine. Computational prediction of SNP can be used as a guide for SNP discovery to reduce the cost and time needed for the development of personalized medicine. We have developed a method for SNP prediction based on support vector machines (SVMs) using different features extracted from the SNP data. Prediction rates of 60.9% was achieved by sequence feature, 59.1% by free-energy feature, 58.1% by GC content feature, 58.0% by melting temperature feature, 56.2% by enthalpy feature, 55.1% by entropy feature and 54.3% by the gene, exon and intron feature. We introduced a new feature, the SNP distribution score that achieved a prediction rate of 77.3%. Thus, the proposed SNP prediction algorithm can be used to in SNP discovery.

Back to top