Robert R. McCormick School of Engineering and Applied Science Electrical Engineering and Computer Science Department Center for Ultra-scale Computing and Information Security at Northwestern University

At CUCIS, we are focused on developing sophisticated solutions to problems relating to scalable processing and I/O; computer security and information assurance; and high performance data mining.







Evaluation of pairwise statistical significance

Error per Query (EPQ) versus Coverage plots are often used to evaluate the accuracy of different approaches to estimating statistical significance. To create these plots, the list of pairwise comparisons is sorted by statistical significance and then examined from best to worst. Going down the list, the coverage count is increased by one if the two members of the pair are homologs, and the error count is increased by one if they are not. At a given point in the list, EPQ is the total number of errors incurred so far divided by the number of queries, and Coverage is the fraction of homolog pairs detected at that significance level. For each of the 86 queries, 2771 comparisons are done, and EPQ vs. Coverage curves are plotted.
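The sweep described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to produce the published curves: the function name `epq_coverage_curve`, the `(significance, is_homolog)` input layout, and the convention that a smaller significance value (for example a P-value) means a more significant hit are all assumptions made for the example. Ties in significance are not treated specially.

```python
from typing import List, Tuple


def epq_coverage_curve(
    pairs: List[Tuple[float, bool]],
    num_queries: int,
    total_homolog_pairs: int,
) -> List[Tuple[float, float]]:
    """Return one (EPQ, coverage) point per entry of the ranked comparison list.

    pairs: (significance, is_homolog) for every pairwise comparison.
    Assumption: a smaller significance value means a more significant hit.
    """
    if num_queries <= 0 or total_homolog_pairs <= 0:
        raise ValueError("num_queries and total_homolog_pairs must be positive")

    # Sort from most significant to least significant.
    ranked = sorted(pairs, key=lambda pair: pair[0])

    errors = 0    # non-homologous pairs seen so far (false positives)
    detected = 0  # homologous pairs seen so far
    curve = []
    for _, is_homolog in ranked:
        if is_homolog:
            detected += 1
        else:
            errors += 1
        curve.append((errors / num_queries, detected / total_homolog_pairs))
    return curve


if __name__ == "__main__":
    # Made-up toy data: 5 comparisons, 2 queries, 4 homolog pairs in total.
    toy = [(1e-9, True), (1e-6, True), (1e-4, False), (1e-3, True), (0.05, False)]
    for epq, coverage in epq_coverage_curve(toy, num_queries=2, total_homolog_pairs=4):
        print(f"EPQ={epq:.2f}  coverage={coverage:.2f}")
```

Plotting the returned points with EPQ on one axis and coverage on the other gives the curve; a method whose curve reaches higher coverage at the same EPQ ranks homologs ahead of non-homologs more reliably.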

The EPQ is defined as

EPQ = F_num / Q_num,    (1)

where F_num is the total number of non-homologous sequences detected as homologs (i.e., false positives) and Q_num is the total number of queries.

The Coverage is defined as

Coverage = H_d / H_t,    (2)

where H_d is the number of homologous pairs detected and H_t is the total number of homologous pairs present in the sequence database.
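As a purely illustrative calculation, take the 86 queries mentioned above and suppose that, at some point in the sorted list, 43 false positives and 250 of 1000 homologous pairs have been seen. These three counts are made up for the example and are not results from the study.

```latex
\[
\mathrm{EPQ} = \frac{F_{num}}{Q_{num}} = \frac{43}{86} = 0.5,
\qquad
\mathrm{Coverage} = \frac{H_d}{H_t} = \frac{250}{1000} = 0.25
\]
```

This point would be plotted at an EPQ of 0.5 and a coverage of 0.25.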

 

See Ref. 1 for more details on the concept of Error per Query (EPQ) versus Coverage plots.

 

Reference:

1. Agrawal, A., V. Brendel, et al. (2008). "Pairwise statistical significance versus database statistical significance for local alignment of protein sequences." Bioinformatics Research and Applications: 50-61.

 

© 2011 Robert R. McCormick School of Engineering and Applied Science, Northwestern University
"Tech": 2145 Sheridan Rd, Tech L359, Evanston IL 60208-3118  |  Phone: (847) 491-5410  |  Fax: (847) 491-4455
"Ford": 2133 Sheridan Rd, Ford Building, Rm 3-320, Evanston, IL 60208  |  Fax: (847) 491-5258

Last Updated: $LastChangedDate: 2014-11-23 19:04:45 -0600 (Sun, 23 Nov 2014) $