|
Most informative features for positive class, from
a set of 3-gram and 4-gram features. TF is the number of transcription
factor binding motifs containing the feature.
|
|
Feature |
Weight |
TF |
Feature |
Weight |
TF |
Feature |
Weight |
TF |
|
H3 |
TGGC |
2.35 |
13 |
GCGA |
2.18 |
15 |
CCTG |
2.16 |
13 |
|
|
CCT |
1.99 |
51 |
GGGA |
1.87 |
13 |
TGCG |
1.79 |
14 |
|
|
CAG |
1.78 |
35 |
TGTG |
1.74 |
16 |
CGTT |
1.68 |
20 |
|
H4 |
CAAA |
2.78 |
20 |
TATC |
2.41 |
15 |
ATC |
2.21 |
55 |
|
|
TTTG |
2.01 |
22 |
CCA |
1.87 |
57 |
GATA |
1.8 |
19 |
|
|
GGA |
1.57 |
52 |
TGG |
1.56 |
41 |
ACAG |
1.55 |
9 |
|
H3K9ac |
CCGG |
3.12 |
22 |
TATA |
2.56 |
21 |
GGTC |
2.2 |
8 |
|
|
CTTG |
2.13 |
7 |
GGCC |
1.93 |
12 |
GGGC |
1.73 |
7 |
|
|
CGGT |
1.7 |
12 |
GCCC |
1.63 |
11 |
TCGC |
1.59 |
7 |
|
H3K14ac |
TATA |
2.32 |
21 |
CCGG |
2.19 |
22 |
TCGC |
2.04 |
7 |
|
|
TAAG |
2.01 |
13 |
GTCC |
1.92 |
6 |
GGCC |
1.87 |
12 |
|
|
TAGT |
1.73 |
10 |
GGTC |
1.71 |
8 |
TTTT |
1.69 |
33 |
|
H4ac |
TCGC |
3.12 |
7 |
GCGA |
3 |
15 |
CCGG |
2.45 |
22 |
|
|
GGCC |
2.13 |
12 |
CCCG |
2.11 |
14 |
GTCC |
2.03 |
6 |
|
|
GGTC |
2.01 |
8 |
CGGT |
1.92 |
12 |
TATA |
1.75 |
21 |
|
H3K4me1 |
TATC |
2.26 |
15 |
GACG |
1.97 |
18 |
CCGC |
1.67 |
27 |
|
|
CAAA |
1.57 |
20 |
CGTC |
1.56 |
16 |
TTTG |
1.49 |
22 |
|
|
CCA |
1.47 |
57 |
CAT |
1.45 |
67 |
GATA |
1.42 |
19 |
|
H3K4me2 |
TAAG |
1.69 |
13 |
CGGT |
1.62 |
12 |
CGA |
1.32 |
60 |
|
|
GTCC |
1.28 |
6 |
CCCG |
1.23 |
14 |
TCGC |
1.23 |
7 |
|
|
TGAG |
1.2 |
9 |
CACT |
1.18 |
17 |
CCGG |
1.12 |
22 |
|
H3K4me3 |
CCGG |
3.09 |
22 |
TCGC |
3.04 |
7 |
GCGA |
2.79 |
15 |
|
|
CCCG |
2.78 |
14 |
TATA |
2.69 |
21 |
GTCC |
2.26 |
6 |
|
|
GGCC |
2.11 |
12 |
TAAG |
2.03 |
13 |
ACCC |
1.96 |
23 |
|
H3K36me3 |
CGTC |
2.24 |
16 |
CACC |
1.84 |
23 |
GACG |
1.78 |
18 |
|
|
ACGA |
1.77 |
23 |
ACC |
1.72 |
59 |
TGG |
1.59 |
41 |
|
|
CTTC |
1.52 |
15 |
CAAA |
1.51 |
20 |
GATA |
1.49 |
19 |
|
H3K79me3 |
CAAA |
2.44 |
20 |
ATC |
2.27 |
55 |
GATA |
2.24 |
19 |
|
|
TATC |
2.24 |
15 |
GGTA |
1.93 |
13 |
TTTG |
1.85 |
22 |
|
|
TACC |
1.66 |
15 |
ATCC |
1.56 |
13 |
TGGA |
1.55 |
11 |
|
Most informative features for negative class, from
a set of 3-gram and 4-gram features. TF is the number of transcription
factor binding motifs containing the feature.
|
|
Feature |
Weight |
TF |
Feature |
Weight |
TF |
Feature |
Weight |
TF |
|
H3 |
CGCG |
-5.33 |
18 |
GCGC |
-3.32 |
11 |
CGC |
-2.46 |
51 |
|
|
TTTT |
-2.4 |
33 |
GCG |
-2.11 |
53 |
CGTG |
-1.98 |
22 |
|
|
CGG |
-1.88 |
60 |
GCGG |
-1.87 |
18 |
CCG |
-1.86 |
80 |
|
H4 |
CGCG |
-3.57 |
18 |
TTTT |
-3.14 |
33 |
TATA |
-3.07 |
21 |
|
|
AAAA |
-2.96 |
32 |
GCG |
-2.84 |
53 |
CGC |
-2.49 |
51 |
|
|
CGTG |
-2.47 |
22 |
GCGC |
-2.15 |
11 |
GGCC |
-1.78 |
12 |
|
H3K9ac |
TTTG |
-2.43 |
22 |
CAAA |
-2.43 |
20 |
GGGG |
-2.14 |
18 |
|
|
CCGC |
-2.1 |
27 |
TACC |
-2.05 |
15 |
AAT |
-2.04 |
86 |
|
|
GCGG |
-1.81 |
18 |
CCCC |
-1.81 |
15 |
CGTC |
-1.77 |
16 |
|
H3K14ac |
CGGC |
-2.64 |
12 |
CCCC |
-2.35 |
15 |
TATC |
-2.28 |
15 |
|
|
CAAA |
-2.27 |
20 |
CGTC |
-2.18 |
16 |
TTTG |
-2.17 |
22 |
|
|
CCGC |
-1.98 |
27 |
GGGG |
-1.8 |
18 |
CGCC |
-1.51 |
20 |
|
H4ac |
CGGC |
-2.94 |
12 |
GCGG |
-2.62 |
18 |
CCCC |
-2.46 |
15 |
|
|
CCGC |
-2.43 |
27 |
GCCG |
-2.34 |
26 |
CGTC |
-2.17 |
16 |
|
|
GGGG |
-2.02 |
18 |
CGCC |
-1.96 |
20 |
AAT |
-1.86 |
86 |
|
H3K4me1 |
CGCG |
-2.68 |
18 |
TATA |
-2.29 |
21 |
CACG |
-2.19 |
20 |
|
|
CCGG |
-1.81 |
22 |
CGTG |
-1.58 |
22 |
TCGC |
-1.55 |
7 |
|
|
TTTT |
-1.52 |
33 |
CTTA |
-1.39 |
15 |
CCCG |
-1.34 |
14 |
|
H3K4me2 |
CGGC |
-1.68 |
12 |
CCGC |
-1.6 |
27 |
ATAT |
-1.59 |
28 |
|
|
CCCC |
-1.47 |
15 |
ATT |
-1.39 |
102 |
TCAA |
-1.28 |
15 |
|
|
ACAT |
-1.14 |
23 |
TTAA |
-1.05 |
22 |
ATTA |
-1.04 |
39 |
|
H3K4me3 |
CGGC |
-3.44 |
12 |
CCCC |
-3.26 |
15 |
CCGC |
-3.18 |
27 |
|
|
GGGG |
-2.69 |
18 |
GCGG |
-2.28 |
18 |
TCGG |
-2.06 |
12 |
|
|
GCCG |
-2.06 |
26 |
CGTC |
-1.94 |
16 |
CAAA |
-1.68 |
20 |
|
H3K36me3 |
CTTA |
-2.08 |
15 |
TCGC |
-2.03 |
7 |
TAAG |
-1.96 |
13 |
|
|
TCTC |
-1.8 |
16 |
GCCA |
-1.69 |
11 |
TATA |
-1.66 |
21 |
|
|
GATC |
-1.63 |
7 |
CCCT |
-1.6 |
11 |
GCG |
-1.54 |
53 |
|
H3K79me3 |
TATA |
-3.9 |
21 |
CGC |
-2.73 |
51 |
GCG |
-2.57 |
53 |
|
|
CGCG |
-2.01 |
18 |
AAAA |
-1.89 |
32 |
TTTT |
-1.8 |
33 |
|
|
ATGT |
-1.76 |
24 |
CCC |
-1.65 |
50 |
CATG |
-1.57 |
11 |
|
|
Most informative features for positive class, from
a set of 4-gram and 5-gram features. TF is the number of transcription
factor binding motifs containing the feature.
|
|
Feature |
Weight |
TF |
Feature |
Weight |
TF |
Feature |
Weight |
TF |
|
H3 |
CCTG |
1.98 |
13 |
GCATT |
1.7 |
4 |
AGTGC |
1.6 |
3 |
|
|
ACCTG |
1.58 |
5 |
TCCTG |
1.56 |
1 |
CTCTT |
1.55 |
5 |
|
|
TTCAC |
1.54 |
4 |
GGGTT |
1.54 |
8 |
ACAGC |
1.52 |
4 |
|
H4 |
CCAGT |
1.85 |
4 |
TCAGG |
1.73 |
1 |
TGGA |
1.65 |
11 |
|
|
TTGGG |
1.55 |
2 |
CGTTA |
1.51 |
3 |
TATC |
1.49 |
15 |
|
|
GGAT |
1.49 |
14 |
GGATC |
1.48 |
1 |
TATTG |
1.47 |
5 |
|
H3K9ac |
CCGG |
2.04 |
22 |
ACCCG |
1.97 |
6 |
GCCCA |
1.87 |
3 |
|
|
GGTGG |
1.78 |
2 |
CAGCG |
1.77 |
3 |
GCGAG |
1.76 |
7 |
|
|
CCGGG |
1.71 |
8 |
GCCGG |
1.71 |
7 |
GGCCG |
1.66 |
5 |
|
H3K14ac |
GCGTG |
2.21 |
1 |
ACCCG |
2.06 |
6 |
TAGTC |
2.02 |
2 |
|
|
GGTGG |
1.85 |
2 |
ATGAG |
1.85 |
1 |
GGGTA |
1.85 |
5 |
|
|
CTCGA |
1.84 |
1 |
TAGGA |
1.72 |
4 |
AAGGC |
1.7 |
1 |
|
H4ac |
GCCGG |
2.42 |
7 |
CTCAT |
2.35 |
4 |
ACCCG |
2.35 |
6 |
|
|
GGGTA |
1.96 |
5 |
ATGAG |
1.9 |
1 |
GGTGG |
1.9 |
2 |
|
|
ACCGC |
1.89 |
5 |
GCGA |
1.87 |
15 |
TCGC |
1.86 |
7 |
|
H3K4me1 |
GGGTC |
2.31 |
2 |
CGGGA |
1.96 |
2 |
AATTC |
1.93 |
4 |
|
|
CGGAG |
1.87 |
3 |
AGTTT |
1.75 |
2 |
TCGTA |
1.71 |
1 |
|
|
AACCC |
1.68 |
4 |
ACCCA |
1.63 |
13 |
ATCGA |
1.62 |
0 |
|
H3K4me2 |
ACCAC |
2.23 |
1 |
GTTGC |
2.18 |
2 |
GCGAG |
2.03 |
7 |
|
|
CCAGC |
1.97 |
4 |
CACTT |
1.94 |
1 |
ACCGC |
1.9 |
5 |
|
|
ATTCC |
1.73 |
5 |
GAGGG |
1.68 |
3 |
GTCCA |
1.64 |
1 |
|
H3K4me3 |
ACCCG |
3.39 |
6 |
CGGGT |
2.18 |
6 |
ATGAG |
2.13 |
1 |
|
|
TGCGA |
2.06 |
3 |
GGTGG |
1.92 |
2 |
CTCGC |
1.91 |
4 |
|
|
CTCAT |
1.9 |
4 |
GTTGC |
1.84 |
2 |
GCGA |
1.82 |
15 |
|
H3K36me3 |
GGCGA |
1.82 |
3 |
CGTCC |
1.72 |
3 |
ACCCA |
1.68 |
13 |
|
|
GGAAC |
1.59 |
1 |
AATTC |
1.58 |
4 |
GATTT |
1.58 |
5 |
|
|
CAACG |
1.51 |
7 |
TGGAT |
1.48 |
3 |
CGCAG |
1.48 |
0 |
|
H3K79me3 |
GATTT |
2.09 |
5 |
TAATG |
1.85 |
3 |
CATTA |
1.81 |
6 |
|
|
GCGTA |
1.66 |
1 |
CTCTA |
1.63 |
5 |
ACAGC |
1.55 |
4 |
|
|
CAGCA |
1.54 |
1 |
CAATA |
1.47 |
7 |
TAGGG |
1.47 |
5 |
|
Most informative features for positive class, from
a set of 4-gram and 5-gram features. TF is the number of transcription
factor binding motifs containing the feature.
|
|
Feature |
Weight |
TF |
Feature |
Weight |
TF |
Feature |
Weight |
TF |
|
H3 |
CGCG |
-4.57 |
18 |
GCGC |
-2.94 |
11 |
GCGCG |
-2.41 |
2 |
|
|
CGGGC |
-2.21 |
3 |
GGCCG |
-2.11 |
5 |
GCGG |
-2.05 |
18 |
|
|
CGCGC |
-2.01 |
3 |
GCGGC |
-1.87 |
4 |
GCAGC |
-1.67 |
1 |
|
H4 |
CGCG |
-3.46 |
18 |
GCGC |
-2.42 |
11 |
TTTTT |
-2.31 |
8 |
|
|
AAAAA |
-1.96 |
13 |
TATA |
-1.93 |
21 |
CGTG |
-1.88 |
22 |
|
|
CCCGG |
-1.82 |
6 |
GCGCG |
-1.7 |
2 |
TAATT |
-1.64 |
18 |
|
H3K9ac |
GCCGC |
-3.45 |
16 |
CCATA |
-2.03 |
5 |
ACCCC |
-2 |
2 |
|
|
AAACC |
-1.97 |
5 |
GCGGC |
-1.86 |
4 |
CAATA |
-1.84 |
7 |
|
|
GGCGG |
-1.76 |
5 |
GATTT |
-1.74 |
5 |
TGTAA |
-1.67 |
5 |
|
H3K14ac |
GCCGC |
-3.3 |
16 |
ACCCC |
-2.13 |
2 |
TACCA |
-1.88 |
1 |
|
|
GACGT |
-1.87 |
6 |
TGGTA |
-1.85 |
2 |
AATTC |
-1.8 |
4 |
|
|
TCTAA |
-1.78 |
2 |
GATCC |
-1.76 |
2 |
CTCGG |
-1.66 |
2 |
|
H4ac |
GCCGC |
-4.95 |
16 |
GCGGC |
-2.61 |
4 |
ACCCC |
-2.35 |
2 |
|
|
GTTGT |
-1.92 |
1 |
GCGTC |
-1.92 |
5 |
GTGGG |
-1.88 |
5 |
|
|
AATTC |
-1.78 |
4 |
TACGA |
-1.78 |
3 |
TACCA |
-1.77 |
1 |
|
H3K4me1 |
GGGTA |
-2.2 |
5 |
TCCTA |
-2.05 |
6 |
TACAC |
-2.01 |
5 |
|
|
TGCGA |
-1.94 |
3 |
TGTCC |
-1.87 |
0 |
ACCCG |
-1.85 |
6 |
|
|
CTCGA |
-1.83 |
1 |
TACCC |
-1.82 |
6 |
ATCGC |
-1.8 |
0 |
|
H3K4me2 |
GCCGC |
-2.73 |
16 |
CCGCC |
-2.32 |
13 |
GGTGC |
-2.13 |
3 |
|
|
GACCG |
-2.1 |
2 |
GGGAT |
-1.93 |
2 |
CGCGG |
-1.82 |
4 |
|
|
GTTCC |
-1.82 |
3 |
ACCGG |
-1.64 |
4 |
CCGAG |
-1.63 |
4 |
|
H3K4me3 |
GCCGC |
-4.27 |
16 |
ACCCC |
-2.74 |
2 |
CGCGG |
-2.62 |
4 |
|
|
CACGC |
-2.15 |
3 |
GCGTC |
-2.13 |
5 |
GTGGG |
-2.09 |
5 |
|
|
GCGGC |
-2.05 |
4 |
CCGC |
-2.03 |
27 |
AGCCG |
-1.98 |
12 |
|
H3K36me3 |
TAGGA |
-2.58 |
4 |
ACCCG |
-2.39 |
6 |
CTCGA |
-2.32 |
1 |
|
|
AACCG |
-2.13 |
5 |
CCCTT |
-1.82 |
3 |
CTCAT |
-1.68 |
4 |
|
|
CATCG |
-1.6 |
3 |
ATGCG |
-1.59 |
6 |
GTATA |
-1.53 |
2 |
|
H3K36me3 |
CGCG |
-2.36 |
18 |
GCGC |
-2.22 |
11 |
TATA |
-2.22 |
21 |
|
|
ACATA |
-2.2 |
2 |
ACGTA |
-2.19 |
| |