Most informative features for positive class, from a set of 3-gram and 4-gram features. TF is the number of transcription factor binding motifs containing the feature.

  Feature Weight TF    Feature Weight TF    Feature Weight TF   
H3 TGGC 2.35 13 GCGA 2.18 15 CCTG 2.16 13
  CCT 1.99 51 GGGA 1.87 13 TGCG 1.79 14
  CAG 1.78 35 TGTG 1.74 16 CGTT 1.68 20
H4 CAAA 2.78 20 TATC 2.41 15 ATC 2.21 55
  TTTG 2.01 22 CCA 1.87 57 GATA 1.8 19
  GGA 1.57 52 TGG 1.56 41 ACAG 1.55 9
H3K9ac CCGG 3.12 22 TATA 2.56 21 GGTC 2.2 8
  CTTG 2.13 7 GGCC 1.93 12 GGGC 1.73 7
  CGGT 1.7 12 GCCC 1.63 11 TCGC 1.59 7
H3K14ac TATA 2.32 21 CCGG 2.19 22 TCGC 2.04 7
  TAAG 2.01 13 GTCC 1.92 6 GGCC 1.87 12
  TAGT 1.73 10 GGTC 1.71 8 TTTT 1.69 33
H4ac TCGC 3.12 7 GCGA 3 15 CCGG 2.45 22
  GGCC 2.13 12 CCCG 2.11 14 GTCC 2.03 6
  GGTC 2.01 8 CGGT 1.92 12 TATA 1.75 21
H3K4me1 TATC 2.26 15 GACG 1.97 18 CCGC 1.67 27
  CAAA 1.57 20 CGTC 1.56 16 TTTG 1.49 22
  CCA 1.47 57 CAT 1.45 67 GATA 1.42 19
H3K4me2 TAAG 1.69 13 CGGT 1.62 12 CGA 1.32 60
  GTCC 1.28 6 CCCG 1.23 14 TCGC 1.23 7
  TGAG 1.2 9 CACT 1.18 17 CCGG 1.12 22
H3K4me3 CCGG 3.09 22 TCGC 3.04 7 GCGA 2.79 15
  CCCG 2.78 14 TATA 2.69 21 GTCC 2.26 6
  GGCC 2.11 12 TAAG 2.03 13 ACCC 1.96 23
H3K36me3 CGTC 2.24 16 CACC 1.84 23 GACG 1.78 18
  ACGA 1.77 23 ACC 1.72 59 TGG 1.59 41
  CTTC 1.52 15 CAAA 1.51 20 GATA 1.49 19
H3K79me3 CAAA 2.44 20 ATC 2.27 55 GATA 2.24 19
  TATC 2.24 15 GGTA 1.93 13 TTTG 1.85 22
  TACC 1.66 15 ATCC 1.56 13 TGGA 1.55 11

 

Most informative features for negative class, from a set of 3-gram and 4-gram features. TF is the number of transcription factor binding motifs containing the feature.

  Feature Weight TF    Feature Weight TF   Feature Weight TF   
H3 CGCG -5.33 18 GCGC -3.32 11 CGC -2.46 51
  TTTT -2.4 33 GCG -2.11 53 CGTG -1.98 22
  CGG -1.88 60 GCGG -1.87 18 CCG -1.86 80
H4 CGCG -3.57 18 TTTT -3.14 33 TATA -3.07 21
  AAAA -2.96 32 GCG -2.84 53 CGC -2.49 51
  CGTG -2.47 22 GCGC -2.15 11 GGCC -1.78 12
H3K9ac TTTG -2.43 22 CAAA -2.43 20 GGGG -2.14 18
  CCGC -2.1 27 TACC -2.05 15 AAT -2.04 86
  GCGG -1.81 18 CCCC -1.81 15 CGTC -1.77 16
H3K14ac CGGC -2.64 12 CCCC -2.35 15 TATC -2.28 15
  CAAA -2.27 20 CGTC -2.18 16 TTTG -2.17 22
  CCGC -1.98 27 GGGG -1.8 18 CGCC -1.51 20
H4ac CGGC -2.94 12 GCGG -2.62 18 CCCC -2.46 15
  CCGC -2.43 27 GCCG -2.34 26 CGTC -2.17 16
  GGGG -2.02 18 CGCC -1.96 20 AAT -1.86 86
H3K4me1 CGCG -2.68 18 TATA -2.29 21 CACG -2.19 20
  CCGG -1.81 22 CGTG -1.58 22 TCGC -1.55 7
  TTTT -1.52 33 CTTA -1.39 15 CCCG -1.34 14
H3K4me2 CGGC -1.68 12 CCGC -1.6 27 ATAT -1.59 28
  CCCC -1.47 15 ATT -1.39 102 TCAA -1.28 15
  ACAT -1.14 23 TTAA -1.05 22 ATTA -1.04 39
H3K4me3 CGGC -3.44 12 CCCC -3.26 15 CCGC -3.18 27
  GGGG -2.69 18 GCGG -2.28 18 TCGG -2.06 12
  GCCG -2.06 26 CGTC -1.94 16 CAAA -1.68 20
H3K36me3 CTTA -2.08 15 TCGC -2.03 7 TAAG -1.96 13
  TCTC -1.8 16 GCCA -1.69 11 TATA -1.66 21
  GATC -1.63 7 CCCT -1.6 11 GCG -1.54 53
H3K79me3 TATA -3.9 21 CGC -2.73 51 GCG -2.57 53
  CGCG -2.01 18 AAAA -1.89 32 TTTT -1.8 33
  ATGT -1.76 24 CCC -1.65 50 CATG -1.57 11

 

Most informative features for positive class, from a set of 4-gram and 5-gram features. TF is the number of transcription factor binding motifs containing the feature.

  Feature Weight TF   Feature Weight TF   Feature Weight TF  
H3 CCTG 1.98 13 GCATT 1.7 4 AGTGC 1.6 3
  ACCTG 1.58 5 TCCTG 1.56 1 CTCTT 1.55 5
  TTCAC 1.54 4 GGGTT 1.54 8 ACAGC 1.52 4
H4 CCAGT 1.85 4 TCAGG 1.73 1 TGGA 1.65 11
  TTGGG 1.55 2 CGTTA 1.51 3 TATC 1.49 15
  GGAT 1.49 14 GGATC 1.48 1 TATTG 1.47 5
H3K9ac CCGG 2.04 22 ACCCG 1.97 6 GCCCA 1.87 3
  GGTGG 1.78 2 CAGCG 1.77 3 GCGAG 1.76 7
  CCGGG 1.71 8 GCCGG 1.71 7 GGCCG 1.66 5
H3K14ac GCGTG 2.21 1 ACCCG 2.06 6 TAGTC 2.02 2
  GGTGG 1.85 2 ATGAG 1.85 1 GGGTA 1.85 5
  CTCGA 1.84 1 TAGGA 1.72 4 AAGGC 1.7 1
H4ac GCCGG 2.42 7 CTCAT 2.35 4 ACCCG 2.35 6
  GGGTA 1.96 5 ATGAG 1.9 1 GGTGG 1.9 2
  ACCGC 1.89 5 GCGA 1.87 15 TCGC 1.86 7
H3K4me1 GGGTC 2.31 2 CGGGA 1.96 2 AATTC 1.93 4
  CGGAG 1.87 3 AGTTT 1.75 2 TCGTA 1.71 1
  AACCC 1.68 4 ACCCA 1.63 13 ATCGA 1.62 0
H3K4me2 ACCAC 2.23 1 GTTGC 2.18 2 GCGAG 2.03 7
  CCAGC 1.97 4 CACTT 1.94 1 ACCGC 1.9 5
  ATTCC 1.73 5 GAGGG 1.68 3 GTCCA 1.64 1
H3K4me3 ACCCG 3.39 6 CGGGT 2.18 6 ATGAG 2.13 1
  TGCGA 2.06 3 GGTGG 1.92 2 CTCGC 1.91 4
  CTCAT 1.9 4 GTTGC 1.84 2 GCGA 1.82 15
H3K36me3 GGCGA 1.82 3 CGTCC 1.72 3 ACCCA 1.68 13
  GGAAC 1.59 1 AATTC 1.58 4 GATTT 1.58 5
  CAACG 1.51 7 TGGAT 1.48 3 CGCAG 1.48 0
H3K79me3 GATTT 2.09 5 TAATG 1.85 3 CATTA 1.81 6
  GCGTA 1.66 1 CTCTA 1.63 5 ACAGC 1.55 4
  CAGCA 1.54 1 CAATA 1.47 7 TAGGG 1.47 5

 

Most informative features for positive class, from a set of 4-gram and 5-gram features. TF is the number of transcription factor binding motifs containing the feature.

  Feature Weight TF   Feature Weight TF   Feature Weight TF  
H3 CGCG -4.57 18 GCGC -2.94 11 GCGCG -2.41 2
  CGGGC -2.21 3 GGCCG -2.11 5 GCGG -2.05 18
  CGCGC -2.01 3 GCGGC -1.87 4 GCAGC -1.67 1
H4 CGCG -3.46 18 GCGC -2.42 11 TTTTT -2.31 8
  AAAAA -1.96 13 TATA -1.93 21 CGTG -1.88 22
  CCCGG -1.82 6 GCGCG -1.7 2 TAATT -1.64 18
H3K9ac GCCGC -3.45 16 CCATA -2.03 5 ACCCC -2 2
  AAACC -1.97 5 GCGGC -1.86 4 CAATA -1.84 7
  GGCGG -1.76 5 GATTT -1.74 5 TGTAA -1.67 5
H3K14ac GCCGC -3.3 16 ACCCC -2.13 2 TACCA -1.88 1
  GACGT -1.87 6 TGGTA -1.85 2 AATTC -1.8 4
  TCTAA -1.78 2 GATCC -1.76 2 CTCGG -1.66 2
H4ac GCCGC -4.95 16 GCGGC -2.61 4 ACCCC -2.35 2
  GTTGT -1.92 1 GCGTC -1.92 5 GTGGG -1.88 5
  AATTC -1.78 4 TACGA -1.78 3 TACCA -1.77 1
H3K4me1 GGGTA -2.2 5 TCCTA -2.05 6 TACAC -2.01 5
  TGCGA -1.94 3 TGTCC -1.87 0 ACCCG -1.85 6
  CTCGA -1.83 1 TACCC -1.82 6 ATCGC -1.8 0
H3K4me2 GCCGC -2.73 16 CCGCC -2.32 13 GGTGC -2.13 3
  GACCG -2.1 2 GGGAT -1.93 2 CGCGG -1.82 4
  GTTCC -1.82 3 ACCGG -1.64 4 CCGAG -1.63 4
H3K4me3 GCCGC -4.27 16 ACCCC -2.74 2 CGCGG -2.62 4
  CACGC -2.15 3 GCGTC -2.13 5 GTGGG -2.09 5
  GCGGC -2.05 4 CCGC -2.03 27 AGCCG -1.98 12
H3K36me3 TAGGA -2.58 4 ACCCG -2.39 6 CTCGA -2.32 1
  AACCG -2.13 5 CCCTT -1.82 3 CTCAT -1.68 4
  CATCG -1.6 3 ATGCG -1.59 6 GTATA -1.53 2
H3K36me3 CGCG -2.36 18 GCGC -2.22 11 TATA -2.22 21
  ACATA -2.2 2 ACGTA -2.19