Microexon ID At_2:18556312-18556319:-
Species Arabidopsis thaliana
Coordinates 2:18556312..18556319
Microexon Cluster ID MEP16
Size 8
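The ID, coordinate and size fields above are mutually consistent if the coordinates are read as 1-based and inclusive. The following is a minimal sketch of that check, assuming the ID layout is `<species>_<chromosome>:<start>-<end>:<strand>`; the class and function names are hypothetical and not part of the database.

```python
import re
from dataclasses import dataclass

# Assumed ID layout: <species>_<chrom>:<start>-<end>:<strand>
ID_PATTERN = re.compile(
    r"^(?P<species>[A-Za-z]+)_(?P<chrom>[^:]+):"
    r"(?P<start>\d+)-(?P<end>\d+):(?P<strand>[+-])$"
)


@dataclass(frozen=True)
class MicroexonLocus:
    species: str
    chrom: str
    start: int
    end: int
    strand: str

    @property
    def size(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_microexon_id(microexon_id: str) -> MicroexonLocus:
    """Split a microexon ID string into its components."""
    match = ID_PATTERN.match(microexon_id)
    if match is None:
        raise ValueError(f"unrecognised microexon ID: {microexon_id!r}")
    return MicroexonLocus(
        species=match["species"],
        chrom=match["chrom"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


locus = parse_microexon_id("At_2:18556312-18556319:-")
print(locus.chrom, locus.strand, locus.size)  # 2 - 8
```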
Phase 1
Pfam Domain Motif SNF2_N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,8,51
Microexon location in the Microexon-tag 2
Microexon-tag consensus DNA Seq (IUPAC ambiguity codes) BTWWMHTGGCTYATACGAHKRTATSAGMWYGGYRTMAAYGKMATTCTYGSWGATGARATGGGWCTKGGRAARACWYTKCAARCYATHTCHTTSYTGRGYTAYYTRMAT
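The sequence above uses IUPAC nucleotide ambiguity codes (for example `Y` for C or T, `W` for A or T). As a rough sketch of how such a degenerate sequence can be compared with a concrete one, the helper below lists the 1-based positions where a base is not allowed by the code at that position. The function name is hypothetical, and the demo uses only the first ten bases of the degenerate sequence and of this record's own microexon-tag.

```python
# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def iupac_mismatches(degenerate: str, sequence: str) -> list[int]:
    """Return 1-based positions where `sequence` is not covered by `degenerate`."""
    if len(degenerate) != len(sequence):
        raise ValueError("sequences must have the same length")
    return [
        pos
        for pos, (code, base) in enumerate(
            zip(degenerate.upper(), sequence.upper()), start=1
        )
        if base not in IUPAC[code]
    ]


# First ten bases of the degenerate tag and of this record's tag.
print(iupac_mismatches("BTWWMHTGGC", "GTCTCTTGGC"))  # [3]
```

A non-empty result is not necessarily an error in the record: a degenerate sequence summarising a cluster need not cover every base of every member.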
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GAGACGAG
Microexon Amino Acid seq GDE
Microexon-tag DNA Seq GTCTCTTGGCTTATACAGAAATATCTACTCGGCGTCAATGTCGTTCTCGGAGACGAGATGGGATTGGGAAAGACTCTCCAAGCAATCTCTTTCCTGAGTTATTTGAAG
Microexon-tag Amino Acid Seq VSWLIQKYLLGVNVVLGDEMGLGKTLQAISFLSYLK
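The structure field (49,8,51) and the tag sequences can be cross-checked against each other. The sketch below, with hypothetical helper names, splits the tag DNA by the listed sizes, translates it with the standard genetic code, and extracts the codons that the microexon touches. It assumes the tag is given in coding orientation starting at codon position 1, which is consistent with the amino acid sequence listed above.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(tag: str, structure: tuple[int, int, int]) -> tuple[str, str, str]:
    """Split a tag into (upstream flank, microexon, downstream flank)."""
    up, micro, down = structure
    if up + micro + down != len(tag):
        raise ValueError("structure sizes do not add up to the tag length")
    return tag[:up], tag[up:up + micro], tag[up + micro:]


def microexon_codons(tag: str, structure: tuple[int, int, int]) -> str:
    """Translate every codon that contains at least one microexon base."""
    up, micro, _ = structure
    first = up // 3
    last = (up + micro - 1) // 3
    return translate(tag[first * 3:(last + 1) * 3])


TAG = (
    "GTCTCTTGGCTTATACAGAAATATCTACTCGGCGTCAATGTCGTTCTCG"  # 49 nt upstream flank
    "GAGACGAG"                                           # 8 nt microexon
    "ATGGGATTGGGAAAGACTCTCCAAGCAATCTCTTTCCTGAGTTATTTGAAG"  # 51 nt downstream flank
)
STRUCTURE = (49, 8, 51)

upstream, microexon, downstream = split_tag(TAG, STRUCTURE)
# Codon bases contributed by the upstream flank; 1 here, which matches the
# listed Phase 1 if phase is defined that way (an assumption).
upstream_codon_bases = len(upstream) % 3

print(microexon)                         # GAGACGAG
print(microexon_codons(TAG, STRUCTURE))  # GDE
print(translate(TAG))                    # VSWLIQKYLLGVNVVLGDEMGLGKTLQAISFLSYLK
```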
Microexon-tag spanning region 18556182-18556495
Microexon-tag prediction score 0.9406
Overlapped with the annotated transcript (%) 100
New Transcript ID AT2G44980.3x
Reference Transcript ID AT2G44980.3
Gene ID AT2G44980
Gene Name CHR10
Transcript ID AT2G44980.3
Protein ID AT2G44980.3
Pfam domain motif SNF2_N
Motif E-value 3.5e-49
Motif start 71
Motif end 356
Protein seq >AT2G44980.3
MSKESSPPKVPSTTMEYERRLEAAAEIILEKEAKFSNTPPDCSEFGVTATLKPHQVEGVSWLIQKYLLGVNVVLGDEMGL
GKTLQAISFLSYLKFRQGLPGPFLVLCPLSVTDGWVSEINRFTPNLEVLRYVGDKYCRLDMRKSMYDHVKKSSKGHFLPF
DVLLTTYDIALVDQDFLSQIPWQYAIIDEAQRLKNPNSVLYNVLLEQFLIPRRLLITGTPIQNNLTELWALMHFCMPLVF
GTLDQFLSAFKETGDGLSGLDVSNDKETYKSLKFILGAFMLRRTKSLLIESGNLVLPPLTELTVMVPLVSLQKKIYTSIL
RKELPGLLELSSGGSNHTSLQNIVIQLRKACSHPYLFPGIEPEPFEEGEHLVQASGKLLVLDQLLKRLHDSGHRVLLFSQ
MTSTLDILQDFMELRRYSYERLDGSVRAEERFAAIKNFSAKTERGLDSEVDGSNAFVFMISTRAGGVGLNLVAADTVIFY
EQDWNPQVDKQALQRAHRIGQISHVLSINLVTEHSVEEVILRRAERKLQLSHNVVGDNMEEKEEDGGDLRSLVFGLQRFD
PEEIHNEESDNLKMVEISSLAEKVVAIRQNVEPDKEERRFEINSSDTLLGNTSSASLDSELDEASYLSWVEKLKEAARSS
KDEKIIELGNRKNLSEERNLRIEAARKKAEEKKLATWGAHGYQSLSVEEPILPDDVDSSSDAGSVNFVFGDCTNPSTVSH
EPAIIFSCVDDSGNWGRGGMFDALSKLSNTVPTAYHRASEFKDLHLGDLHLIKIDDNDDQQNTQASKPLWVAVAVTQSYN
SRRKVPRSSISIPDLESCLAKASFSASQKSASLHMPRIGYQDGSDRSQWYTVERLLRKYSSIFTVKIFVYYYRRSP*
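The motif start and end values can be related to the microexon by locating the microexon-tag peptide in the protein. The sketch below does this under two assumptions: that Motif start/end are 1-based, inclusive residue positions, and that the tag peptide occurs once in the protein. Only the first 160 residues of the protein are embedded, which is enough to contain the tag; the helper names are hypothetical.

```python
from typing import Optional

# First two lines (160 residues) of the AT2G44980.3 protein sequence above.
PROTEIN_PREFIX = (
    "MSKESSPPKVPSTTMEYERRLEAAAEIILEKEAKFSNTPPDCSEFGVTATLKPHQVEGVSWLIQKYLLGVNVVLGDEMGL"
    "GKTLQAISFLSYLKFRQGLPGPFLVLCPLSVTDGWVSEINRFTPNLEVLRYVGDKYCRLDMRKSMYDHVKKSSKGHFLPF"
)
TAG_PEPTIDE = "VSWLIQKYLLGVNVVLGDEMGLGKTLQAISFLSYLK"
MOTIF = (71, 356)          # SNF2_N motif start and end from this record
UPSTREAM_FLANK_NT = 49     # from the microexon-tag structure 49,8,51
MICROEXON_NT = 8

Span = tuple[int, int]


def locate(protein: str, peptide: str) -> Span:
    """Return the 1-based, inclusive span of the first match of `peptide`."""
    idx = protein.find(peptide)
    if idx < 0:
        raise ValueError("peptide not found in protein")
    return idx + 1, idx + len(peptide)


def overlap(a: Span, b: Span) -> Optional[Span]:
    """Return the intersection of two inclusive spans, or None if disjoint."""
    start, end = max(a[0], b[0]), min(a[1], b[1])
    return (start, end) if start <= end else None


tag_span = locate(PROTEIN_PREFIX, TAG_PEPTIDE)

# Residues whose codons contain at least one microexon base.
first_codon = UPSTREAM_FLANK_NT // 3
last_codon = (UPSTREAM_FLANK_NT + MICROEXON_NT - 1) // 3
microexon_span = (tag_span[0] + first_codon, tag_span[0] + last_codon)

print(tag_span)                         # (59, 94)
print(microexon_span)                   # (75, 77)
print(overlap(tag_span, MOTIF))         # (71, 94)
print(overlap(microexon_span, MOTIF))   # (75, 77)
```

Under these assumptions the tag begins 12 residues before the annotated motif start, while the three residues encoded across the microexon (GDE) lie inside the motif.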
CDS seq >AT2G44980.3
ATGTCAAAAGAGAGTTCACCGCCAAAAGTCCCTTCCACGACGATGGAGTATGAACGGCGACTTGAAGCGGCGGCGGAAAT
TATACTCGAGAAGGAGGCGAAATTTAGCAACACACCTCCGGATTGTAGTGAATTTGGGGTCACGGCGACTCTAAAGCCTC
ACCAAGTCGAAGGTGTCTCTTGGCTTATACAGAAATATCTACTCGGCGTCAATGTCGTTCTCGGAGACGAGATGGGATTG
GGAAAGACTCTCCAAGCAATCTCTTTCCTGAGTTATTTGAAGTTTCGTCAAGGCTTGCCTGGGCCATTTCTGGTGCTATG
TCCCTTAAGTGTAACGGATGGTTGGGTATCAGAGATAAATAGGTTCACTCCAAATCTTGAAGTTCTTAGATATGTTGGCG
ATAAGTATTGCCGTCTGGATATGCGTAAGTCAATGTATGATCATGTCAAGAAGAGTTCTAAGGGACATTTCTTACCATTT
GATGTATTGCTGACAACATATGACATAGCATTGGTGGATCAAGACTTTCTTTCTCAAATTCCTTGGCAATATGCTATAAT
TGATGAAGCTCAAAGACTCAAGAATCCTAACAGTGTTCTGTATAATGTCTTACTTGAGCAATTCCTCATTCCACGACGTC
TCTTGATTACTGGCACACCTATCCAGAACAATCTTACCGAACTTTGGGCTCTGATGCATTTTTGCATGCCACTAGTTTTT
GGGACACTGGACCAGTTCCTTTCCGCATTCAAAGAGACTGGGGACGGTCTATCAGGTCTTGATGTATCTAATGACAAGGA
AACATATAAGAGTCTGAAGTTCATCTTAGGTGCCTTTATGCTGAGGCGTACAAAATCTTTGCTCATTGAGTCTGGAAATC
TGGTGCTGCCACCTCTCACGGAGCTAACTGTAATGGTTCCGCTCGTCAGTCTTCAAAAGAAGATATATACTTCCATTTTG
AGGAAAGAGCTTCCAGGGCTTCTTGAACTGTCTTCTGGAGGATCAAATCATACGTCTTTACAAAATATTGTGATACAGCT
AAGAAAAGCATGTAGCCACCCTTACTTATTTCCAGGTATCGAACCAGAGCCTTTTGAAGAGGGTGAACACCTTGTTCAGG
CAAGTGGTAAGCTTTTGGTTTTGGATCAATTGCTCAAAAGGCTCCATGATAGTGGCCATCGTGTCCTCCTCTTTTCCCAA
ATGACCTCAACACTTGACATTTTGCAGGATTTTATGGAGCTTCGTAGGTATTCGTATGAGCGCCTTGATGGATCAGTACG
GGCTGAAGAACGTTTTGCTGCGATAAAGAATTTCAGCGCGAAAACAGAAAGAGGATTGGATTCTGAAGTTGATGGAAGCA
ATGCCTTTGTTTTCATGATCTCTACAAGAGCAGGGGGAGTCGGTTTGAATCTTGTTGCTGCTGATACTGTTATATTTTAT
GAACAAGACTGGAACCCACAGGTGGATAAACAAGCTTTGCAACGAGCTCATCGAATTGGACAAATCAGTCATGTGCTGTC
TATAAACCTCGTTACTGAACATTCTGTGGAAGAGGTTATTTTGAGGAGGGCAGAGAGAAAGTTGCAGCTTAGTCATAATG
TTGTGGGAGATAACATGGAAGAGAAAGAAGAAGATGGTGGTGATTTGCGGTCTCTAGTGTTTGGTTTACAAAGGTTTGAT
CCCGAGGAGATTCACAATGAAGAGTCTGATAACCTGAAAATGGTAGAAATAAGTTCTCTTGCTGAGAAAGTTGTTGCAAT
ACGTCAAAACGTTGAACCAGACAAGGAAGAAAGAAGGTTTGAAATTAATTCGAGCGACACTTTGCTAGGAAATACATCAT
CTGCAAGTCTGGATTCCGAGCTTGATGAAGCTTCTTACCTTTCATGGGTTGAGAAATTGAAGGAAGCTGCACGATCAAGC
AAGGATGAAAAAATTATTGAGTTGGGAAATAGAAAGAACTTATCTGAGGAAAGAAATCTGAGAATTGAGGCTGCCAGGAA
AAAGGCAGAAGAGAAGAAGTTAGCCACGTGGGGAGCACACGGTTATCAGTCTTTGTCCGTGGAAGAACCGATTTTGCCTG
ATGATGTTGATTCCAGCTCAGATGCAGGATCTGTTAACTTTGTCTTTGGAGACTGTACAAATCCATCAACTGTCTCTCAT
GAGCCAGCAATCATATTCAGCTGTGTTGATGACTCTGGAAACTGGGGACGTGGTGGAATGTTTGATGCTCTGTCAAAACT
GTCTAATACTGTACCCACTGCTTATCACCGAGCATCCGAGTTTAAAGACCTTCATCTTGGAGACCTGCATCTCATAAAAA
TTGATGACAATGATGACCAACAAAACACACAAGCAAGCAAGCCTCTTTGGGTGGCAGTTGCAGTTACACAATCTTACAAT
TCGAGGCGTAAAGTCCCACGTAGTAGCATTTCAATTCCAGACCTTGAAAGTTGCTTAGCCAAAGCTTCATTCTCTGCTTC
TCAGAAATCAGCCTCACTCCACATGCCACGTATTGGTTACCAAGACGGATCAGACCGATCTCAATGGTACACTGTTGAGC
GTCTTCTTCGAAAATACTCCTCCATCTTCACCGTCAAAATATTCGTGTACTATTATCGTCGATCTCCCTGA
Transcript ID AT2G44980.3
Gene ID At.11525