Similarity values obtained by G-SESAME
| | CDC19 | ENO1 | ENO2 | FBA1 | GPM1 | PFK1 | PFK2 | PGI1 | PGK1 | PYK2 | TDH1 | TDH2 | TDH3 | TPI1 |
| CDC19 | | 0.180 | 0.180 | 0.180 | 0.180 | 0.641 | 0.641 | 0.180 | 0.599 | 1 | 0.157 | 0.157 | 0.157 | 0.180 |
| ENO1 | | | 1 | 0.338 | 0.199 | 0.150 | 0.150 | 0.199 | 0.180 | 0.180 | 0.173 | 0.173 | 0.173 | 0.199 |
| ENO2 | | | | 0.338 | 0.199 | 0.150 | 0.150 | 0.199 | 0.180 | 0.180 | 0.173 | 0.173 | 0.173 | 0.199 |
| FBA1 | | | | | 0.199 | 0.150 | 0.150 | 0.199 | 0.180 | 0.180 | 0.173 | 0.173 | 0.173 | 0.199 |
| GPM1 | | | | | | 0.150 | 0.150 | 0.338 | 0.180 | 0.180 | 0.173 | 0.173 | 0.173 | 0.338 |
| PFK1 | | | | | | | 1 | 0.150 | 0.491 | 0.641 | 0.130 | 0.130 | 0.130 | 0.150 |
| PFK2 | | | | | | | | 0.150 | 0.491 | 0.641 | 0.130 | 0.130 | 0.130 | 0.150 |
| PGI1 | | | | | | | | | 0.180 | 0.180 | 0.173 | 0.173 | 0.173 | 0.728 |
| PGK1 | | | | | | | | | | 0.599 | 0.157 | 0.157 | 0.157 | 0.180 |
| PYK2 | | | | | | | | | | | 0.157 | 0.157 | 0.157 | 0.180 |
| TDH1 | | | | | | | | | | | | 1 | 1 | 0.173 |
| TDH2 | | | | | | | | | | | | | 1 | 0.173 |
| TDH3 | | | | | | | | | | | | | | 0.173 |
| TPI1 | | | | | | | | | | | | | | |
Clustering results obtained by G-SESAME
| Threshold | Initial | 1.000 | 0.720 | 0.640 | 0.590 | 0.330 | 0.190 | 0.180 | 0.170 |
Clustering Result |
|
|
|
|
PFK2 PFK1 PYK2 CDC19 PGK1
|
|
PFK2 PFK1 PYK2 CDC19 PGK1
|
|
TPI1 PGI1 GPM1 ENO2 ENO1 FBA1
|
PFK2 PFK1 PYK2 CDC19 PGK1
|
|
TPI1 PGI1 GPM1 ENO2 ENO1 FBA1 PFK2 PFK1 PYK2 CDC19 PGK1
|
|
TPI1 PGI1 GPM1 ENO2 ENO1 FBA1 PFK2 PFK1 PYK2 CDC19 PGK1 TDH3 TDH2 TDH1
|
|
Similarity values obtained by Resnik's method
| |
CDC19 |
ENO1 |
ENO2 |
FBA1 |
GPM1 |
PFK1 |
PFK2 |
PGI1 |
PGK1 |
PYK2 |
TDH1 |
TDH2 |
TDH3 |
TPI1 |
| CDC19 |
|
0.281 |
0.281 |
0.281 |
0.281 |
3.394 |
3.394 |
0.281 |
3.143 |
3.394 |
0.281 |
0.281 |
0.281 |
0.281 |
| ENO1 |
|
|
5.529 |
1.378 |
0.281 |
1.378 |
1.378 |
0.281 |
0.281 |
0.281 |
0.281 |
0.281 |
0.281 |
0.281 |
| ENO2 |
|
|
|
1.378 |
0.281 |
1.378 |
1.378 |
0.281 |
0.281 |
0.281 |
0.281 |
0.281 |
0.281 |
0.281 |
| FBA1 |
|
|
|
|
0.281 |
1.378 |
1.378 |
0.281 |
0.281 |
0.281 |
0.281 |
0.281 |
0.281 |
0.281 |
| GPM1 |
|
|
|
|
|
1.378 |
1.378 |
1.378 |
0.281 |
0.281 |
0.281 |
0.281 |
0.281 |
1.378 |
| PFK1 |
|
|
|
|
|
|
7.826 |
1.378 |
3.143 |
3.394 |
1.378 |
1.378 |
1.378 |
1.378 |
| PFK2 |
|
|
|
|
|
|
|
1.378 |
3.143 |
3.394 |
1.378 |
1.378 |
1.378 |
1.378 |
| PGI1 |
|
|
|
|
|
|
|
|
0.281 |
0.281 |
0.281 |
0.281 |
0.281 |
6.235 |
| PGK1 |
|
|
|
|
|
|
|
|
|
3.143 |
0.281 |
0.281 |
0.281 |
0.281 |
| PYK2 |
|
|
|
|
|
|
|
|
|
|
0.281 |
0.281 |
0.281 |
0.281 |
| TDH1 |
|
|
|
|
|
|
|
|
|
|
|
7.935 |
7.935 |
0.281 |
| TDH2 |
|
|
|
|
|
|
|
|
|
|
|
|
7.935 |
0.281 |
| TDH3 |
|
|
|
|
|
|
|
|
|
|
|
|
|
0.281 |
| TPI1 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Clustering results obtained by Resnik's method
| Threshold |
Initial |
7.935 |
7.735 |
6.135 |
5.435 |
3.335 |
3.135 |
1.335 |
Clustering
Result |
|
|
|
|
|
|
PFK2
PFK1
PYK2
CDC19
PGK1
|
|
PFK2
PFK1
PYK2
CDC19
PGK1
ENO2
ENO1
TPI1
PGI1
TDH3
TDH2
TDH1
GPM1
FBA1
|
|
Analysis:
Genes TDH1, TDH2, and TDH3 are annotated by the same molecular function term GO:0004365: glyceraldehyde-3-phosphate dehydrogenase (phosphorylating) activity.
Genes ENO1 and ENO2 are annotated by the same molecular function term GO:0004634: phosphopyruvate hydratase activity.
Genes PFK1 and PFK2 are annotated by the same molecular function term GO:0003872: 6-phosphofructokinase activity.
Genes PYK2 and CDC19 are annotated by the same molecular function term GO:0004743: pyruvate kinase activity.
Thus, TDH1, TDH2, and TDH3 have the same molecular function. ENO1 and ENO2 have the same molecular function.
PFK1 and PFK2 have the same molecular function. PYK2 and CDC19 have the same molecular function.
The similarity values and clustering results obtained by G-SESAME are consistent with these observations.
However, based on Resnik's method, the similarity
among genes TDH1, TDH2, and TDH3 is greater than the similarity between PFK1 and PFK2.
The similarity between PFK1 and PFK2 is greater than the similarity
between ENO1 and ENO2. The similarity between ENO1 and ENO2 is greater than the similarity between PYK2 and CDC19.
These results are inconsistent with the human perception.
Furthermore, there are other results obtained by Resnik's method are not consistent with human perspectives. For instance,
Genes TPI1 and PGI1 are annotated by two different terms, GO:0004807: triose-phosphate isomerase activity and GO:0004347: glucose-6-phosphate isomerase activity
respectively. Therefore, the difference between TPI1 and PGI1 should be greater than the difference between PYK2 and CDC19.
However, based on Resnik's method, the similarity value between TPI1 and PGI1 is greater than the similarity between
PYK2 and CDC19, two genes annotated by the same molecular function term.
Conclusions:
G-SESAME is better.