Threshold Selection Study on Fisher Discriminant Analysis Used in Exon Prediction for Unbalanced Data Sets

Ma, Yutao and Fang, Yanbing and Liu, Ping and Teng, Jianfu (2013) Threshold Selection Study on Fisher Discriminant Analysis Used in Exon Prediction for Unbalanced Data Sets. Communications and Network, 05 (03). pp. 601-605. ISSN 1949-2421

[thumbnail of CN_2013100914375665.pdf] Text
CN_2013100914375665.pdf - Published Version

Download (400kB)

Abstract

In gene prediction, the Fisher discriminant analysis (FDA) is used to separate protein coding region (exon) from non-coding regions (intron). Usually, the positive data set and the negative data set are of the same size if the number of the data is big enough. But for some situations the data are not sufficient or not equal, the threshold used in FDA may have important influence on prediction results. This paper presents a study on the selection of the threshold. The eigen value of each exon/intron sequence is computed using the Z-curve method with 69 variables. The experiments results suggest that the size and the standard deviation of the data sets and the threshold are the three key elements to be taken into consideration to improve the prediction results.

Item Type: Article
Subjects: Euro Archives > Computer Science
Depositing User: Managing Editor
Date Deposited: 29 Mar 2023 04:00
Last Modified: 30 May 2024 05:40
URI: http://publish7promo.com/id/eprint/840

Actions (login required)

View Item
View Item