Microexon ID At_2:19626814-19626828:+
Species Arabidopsis thaliana
Coordinates 2:19626814..19626828
Microexon Cluster ID MEP41
Size 15
Phase 0
Pfam Domain Motif DUF974
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,15,45
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq CARTTYTTCAAGTTYATTGTTKCWAAYCCACTTTCWGTTAGRACAAAGGTYCGYRYTRTCAAGGAAACTACMTWTYTRGARGCTTGYATWGARAAYCATACAAAATCA
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GTTCGAGTTGTAAAG
Microexon Amino Acid seq VRVVK
Microexon-tag DNA Seq CAGTTTTTCAAGTTTGTCGTTGCAAACCCACTCTCAGTGAGGACCAAGGTTCGAGTTGTAAAGGAGACGACATTTTTAGAGGCTTGCATTGAGAACCATACAAAAGCA
Microexon-tag Amino Acid Seq QFFKFVVANPLSVRTKVRVVKETTFLEACIENHTKA
Microexon-tag spanning region19626667-19627278
Microexon-tag prediction score0.9638
Overlapped with the annotated transcript (%) 100
New Transcript ID AT2G47960.1x
Reference Transcript ID AT2G47960.1
Gene ID AT2G47960
Gene Name NA
Transcript ID AT2G47960.1
Protein ID AT2G47960.1
Gene ID AT2G47960
Gene Name NA
Pfam domain motif DUF974
Motif E-value 2.6e-66
Motif start 90
Motif end 321
Protein seq >AT2G47960.1
MSATQTQTHGPHSLAFRVMRLCKPSFHVDPPLRIDPFDLLAGEDFSDDPSSASLFRRHVSSADAVDSDLSYRNRFLLNHP
TDPIGLSGLLLLPQSFGAIYLGETFCSYISVNNSSTSEVRDVTIKAEIQTERQRILLLDTSKSPVESIRTGGRYDFIVEH
DVKELGAHTLVCSALYNDADGERKYLPQFFKFVVANPLSVRTKVRVVKETTFLEACIENHTKANLFMDQVDFEPAKQWSA
VRLQNEDSTEDPPTSGLSGLIPKPPVIIRSGGGIHNYLYKLNPSADVSGQTKFQGSNILGKFQITWRTNLGEPGRLQTQQ
ILGAPVSRKEINMRVVEVPAVIHLNRPFRAYLNLTNQTDRQLGPFEVSLSQDETQLEKPVGINGLQTLMLPRIEAFGSND
FQLNLIASKLGVQKIAGITALDTREKKTYELVPDMEIFVETD*
CDS seq >AT2G47960.1
ATGAGCGCGACGCAGACGCAGACGCATGGACCGCACTCGCTGGCCTTCAGGGTGATGCGACTCTGCAAACCGTCTTTCCA
CGTCGATCCTCCCCTCCGCATTGACCCTTTCGATCTTCTCGCCGGCGAGGATTTCTCCGACGATCCCTCTTCCGCCTCCT
TGTTCCGCCGCCACGTATCTTCTGCCGACGCTGTCGACTCCGATCTCAGCTACCGAAATAGATTCCTCCTCAACCATCCT
ACCGATCCAATCGGTCTCTCTGGTCTTCTGCTTCTTCCTCAGTCCTTCGGAGCGATCTATCTCGGAGAAACATTCTGTAG
CTATATCAGCGTCAACAATAGTTCCACTTCAGAAGTTCGGGATGTTACCATTAAGGCAGAAATTCAGACAGAGAGGCAGA
GGATTCTACTTCTGGATACATCCAAGTCACCTGTTGAATCGATACGGACAGGTGGACGCTATGATTTCATAGTTGAACAC
GATGTGAAAGAGCTTGGAGCTCACACGTTGGTTTGCTCTGCATTGTACAATGACGCTGATGGTGAGCGTAAATATCTTCC
CCAGTTTTTCAAGTTTGTCGTTGCAAACCCACTCTCAGTGAGGACCAAGGTTCGAGTTGTAAAGGAGACGACATTTTTAG
AGGCTTGCATTGAGAACCATACAAAAGCAAACCTCTTTATGGACCAAGTTGATTTTGAACCTGCTAAGCAATGGAGTGCT
GTAAGATTACAGAATGAAGACTCAACAGAAGATCCTCCAACCAGTGGTCTTAGTGGACTGATACCTAAACCGCCAGTTAT
AATCCGATCTGGCGGGGGTATCCACAACTATCTCTACAAGTTAAACCCATCTGCAGATGTTTCCGGGCAAACAAAATTCC
AGGGAAGTAATATCCTGGGTAAATTTCAAATAACATGGCGCACAAATTTGGGTGAACCTGGTCGCTTACAAACACAACAA
ATTCTCGGCGCTCCAGTAAGCCGTAAAGAGATTAATATGCGAGTTGTAGAGGTTCCAGCTGTTATACACTTAAATAGACC
CTTTCGGGCATACTTGAATCTCACAAACCAAACTGATAGGCAACTGGGACCCTTCGAGGTCTCACTGTCACAAGATGAAA
CACAATTGGAGAAGCCGGTTGGTATTAATGGTCTCCAGACTCTGATGTTACCCAGGATTGAAGCATTTGGCTCCAATGAT
TTCCAATTGAATCTTATTGCCTCCAAGCTGGGAGTCCAGAAAATCGCTGGGATCACGGCCTTGGACACAAGAGAGAAAAA
AACCTACGAACTTGTTCCAGATATGGAGATATTTGTAGAGACAGACTAA
Microexon DNA seq GTTCGAGTTGTAAAG
Microexon Amino Acid seq VRVVK
Microexon-tag DNA Seq CAGTTTTTCAAGTTTGTCGTTGCAAACCCACTCTCAGTGAGGACCAAGGTTCGAGTTGTAAAGGAGACGACATTTTTAGAGGCTTGCATTGAGAACCATACAAAAGCA
Microexon-tag Amino Acid seq QFFKFVVANPLSVRTKVRVVKETTFLEACIENHTKA
Transcript ID AT2G47960.1
Gene ID At.11855
Gene Name NA
Pfam domain motif DUF974
Motif E-value 2.6e-66
Motif start 90
Motif end 321
Protein seq >AT2G47960.1
MSATQTQTHGPHSLAFRVMRLCKPSFHVDPPLRIDPFDLLAGEDFSDDPSSASLFRRHVSSADAVDSDLSYRNRFLLNHP
TDPIGLSGLLLLPQSFGAIYLGETFCSYISVNNSSTSEVRDVTIKAEIQTERQRILLLDTSKSPVESIRTGGRYDFIVEH
DVKELGAHTLVCSALYNDADGERKYLPQFFKFVVANPLSVRTKVRVVKETTFLEACIENHTKANLFMDQVDFEPAKQWSA
VRLQNEDSTEDPPTSGLSGLIPKPPVIIRSGGGIHNYLYKLNPSADVSGQTKFQGSNILGKFQITWRTNLGEPGRLQTQQ
ILGAPVSRKEINMRVVEVPAVIHLNRPFRAYLNLTNQTDRQLGPFEVSLSQDETQLEKPVGINGLQTLMLPRIEAFGSND
FQLNLIASKLGVQKIAGITALDTREKKTYELVPDMEIFVETD*
CDS seq >AT2G47960.1
ATGAGCGCGACGCAGACGCAGACGCATGGACCGCACTCGCTGGCCTTCAGGGTGATGCGACTCTGCAAACCGTCTTTCCA
CGTCGATCCTCCCCTCCGCATTGACCCTTTCGATCTTCTCGCCGGCGAGGATTTCTCCGACGATCCCTCTTCCGCCTCCT
TGTTCCGCCGCCACGTATCTTCTGCCGACGCTGTCGACTCCGATCTCAGCTACCGAAATAGATTCCTCCTCAACCATCCT
ACCGATCCAATCGGTCTCTCTGGTCTTCTGCTTCTTCCTCAGTCCTTCGGAGCGATCTATCTCGGAGAAACATTCTGTAG
CTATATCAGCGTCAACAATAGTTCCACTTCAGAAGTTCGGGATGTTACCATTAAGGCAGAAATTCAGACAGAGAGGCAGA
GGATTCTACTTCTGGATACATCCAAGTCACCTGTTGAATCGATACGGACAGGTGGACGCTATGATTTCATAGTTGAACAC
GATGTGAAAGAGCTTGGAGCTCACACGTTGGTTTGCTCTGCATTGTACAATGACGCTGATGGTGAGCGTAAATATCTTCC
CCAGTTTTTCAAGTTTGTCGTTGCAAACCCACTCTCAGTGAGGACCAAGGTTCGAGTTGTAAAGGAGACGACATTTTTAG
AGGCTTGCATTGAGAACCATACAAAAGCAAACCTCTTTATGGACCAAGTTGATTTTGAACCTGCTAAGCAATGGAGTGCT
GTAAGATTACAGAATGAAGACTCAACAGAAGATCCTCCAACCAGTGGTCTTAGTGGACTGATACCTAAACCGCCAGTTAT
AATCCGATCTGGCGGGGGTATCCACAACTATCTCTACAAGTTAAACCCATCTGCAGATGTTTCCGGGCAAACAAAATTCC
AGGGAAGTAATATCCTGGGTAAATTTCAAATAACATGGCGCACAAATTTGGGTGAACCTGGTCGCTTACAAACACAACAA
ATTCTCGGCGCTCCAGTAAGCCGTAAAGAGATTAATATGCGAGTTGTAGAGGTTCCAGCTGTTATACACTTAAATAGACC
CTTTCGGGCATACTTGAATCTCACAAACCAAACTGATAGGCAACTGGGACCCTTCGAGGTCTCACTGTCACAAGATGAAA
CACAATTGGAGAAGCCGGTTGGTATTAATGGTCTCCAGACTCTGATGTTACCCAGGATTGAAGCATTTGGCTCCAATGAT
TTCCAATTGAATCTTATTGCCTCCAAGCTGGGAGTCCAGAAAATCGCTGGGATCACGGCCTTGGACACAAGAGAGAAAAA
AACCTACGAACTTGTTCCAGATATGGAGATATTTGTAGAGACAGACTAA