Microexon ID At_1:23660722-23660733:-
Species Arabidopsis thaliana
Coordinates 1:23660722..23660733
Microexon Cluster ID MEP28
Size 12
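The Size field can be cross-checked against the Microexon ID. The following is a minimal Python sketch, assuming the ID follows the layout `<prefix>_<chromosome>:<start>-<end>:<strand>` with 1-based inclusive coordinates. That layout is inferred from this record rather than taken from a documented format, and `parse_microexon_id` is a hypothetical helper name.

```python
import re

def parse_microexon_id(microexon_id):
    """Split an ID such as 'At_1:23660722-23660733:-' into its parts.

    Assumed layout (inferred from this record): <prefix>_<chrom>:<start>-<end>:<strand>
    """
    m = re.fullmatch(r"([A-Za-z]+)_([^:]+):(\d+)-(\d+):([+-])", microexon_id)
    if m is None:
        raise ValueError(f"unrecognised microexon ID: {microexon_id!r}")
    prefix, chrom, start, end, strand = m.groups()
    return {
        "prefix": prefix,
        "chrom": chrom,
        "start": int(start),
        "end": int(end),
        "strand": strand,
    }

rec = parse_microexon_id("At_1:23660722-23660733:-")
# Inclusive coordinates: size = end - start + 1
size = rec["end"] - rec["start"] + 1
print(rec["chrom"], rec["strand"], size)
```

For this record the computed size is 12, which agrees with the Size field.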
Phase 0
Pfam Domain Motif Peptidase_M1
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 24,19,5,12,48
Microexon location in the Microexon-tag 4
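The structure and location fields together give the position of the microexon inside the tag. The sketch below assumes that the location is a 1-based index into the comma-separated size list and that the tag is read in frame from its first base. Both are assumptions inferred from this record, and the variable names are illustrative.

```python
# Exon segment sizes of the microexon-tag, as listed in the record
exon_sizes = [24, 19, 5, 12, 48]
# Microexon location in the tag (assumed to be a 1-based index)
microexon_index = 4

# Number of tag bases that precede the microexon
offset = sum(exon_sizes[:microexon_index - 1])
# Length of the microexon itself
microexon_len = exon_sizes[microexon_index - 1]
# Total tag length in nucleotides
tag_len = sum(exon_sizes)
# Position of the microexon start within a codon (0 means it starts a codon)
frame_offset = offset % 3

print(offset, microexon_len, tag_len, frame_offset)
```

Under these assumptions the microexon starts 48 bases into a 108-nucleotide tag, is 12 bases long, and begins on a codon boundary, which is consistent with the Size and Phase values above.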
Microexon-tag DNA Seq (degenerate; IUPAC ambiguity codes) GTTMGRCCWCAYTCTTAYATYAAGATGGACAACTTCTAYACAGTRACGGTKTATGARAAGGGWGCTGAAGTTGTCMGRATGTACAARACMTTRYTKGGRAGTYMAGGR
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GTTTATGAAAAG
Microexon Amino Acid seq VYEK
Microexon-tag DNA Seq GTTCGCCCACATTCATACATCAAGATGGATAACTTCTACACAGTGACGGTTTATGAAAAGGGAGCTGAGGTCGTCAGGATGTACAAAACTCTACTAGGAACTCAGGGG
Microexon-tag Amino Acid Seq VRPHSYIKMDNFYTVTVYEKGAEVVRMYKTLLGTQG
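The DNA and amino acid fields above can be checked against each other by translating with the standard genetic code. This is a minimal, self-contained sketch. The `translate` helper and the compact codon table construction are illustrative and are not part of the database, and the sequences are copied from this record.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate an in-frame DNA string, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

microexon = "GTTTATGAAAAG"
tag = ("GTTCGCCCACATTCATACATCAAGATGGATAACTTCTACACAGTGACG"
       "GTTTATGAAAAG"
       "GGAGCTGAGGTCGTCAGGATGTACAAAACTCTACTAGGAACTCAGGGG")

microexon_peptide = translate(microexon)
tag_peptide = translate(tag)
# 0-based position of the microexon within the tag
microexon_start = tag.find(microexon)

print(microexon_peptide)
print(tag_peptide)
print(microexon_start)
```

The translations reproduce the VYEK and 36-residue tag peptides listed in the record, and the microexon is found 48 bases into the tag.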
Microexon-tag spanning region 23660597-23661112
Microexon-tag prediction score 0.975
Overlapped with the annotated transcript (%) 100
New Transcript ID AT1G63770.5x
Reference Transcript ID AT1G63770.5
Gene ID AT1G63770
Gene Name NA
Transcript ID AT1G63770.5
Protein ID AT1G63770.5
Gene ID AT1G63770
Gene Name NA
Pfam domain motif Peptidase_M1
Motif E-value 1.6e-50
Motif start 323
Motif end 534
Protein seq >AT1G63770.5
MARLIIPCRSSSLARVNLLGLLSRAPVPVRSSCLRSSANRLTQHRPFLTSEAICLRKNRFLPHSVDTHKQNSRRLICSVA
TESVPDKAEDSKMDAPKEIFLKNYTKPDYYFETVDLSFSLGEEKTIVSSKIKVSPRVKGSSAALVLDGHDLKLLSVKVEG
KLLKEGDYQLDSRHLTLPSLPAEESFVLEIDTEIYPHKNTSLEGLYKSSGNFCTQCEAEGFRKITFYQDRPDIMAKYTCR
VEGDKTLYPVLLSNGNLISQGDIEGGRHYALWEDPFKKPCYLFALVAGQLVSRDDTFTTRSGRQVSLKIWTPAEDLPKTA
HAMYSLKAAMKWDEDVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSKLVLASPETATDADYAAILGVIGHEYFHNWT
GNRVTCRDWFQLSLKEGLTVFRDQEFSSDMGSRTVKRIADVSKLRIYQFPQDAGPMAHPVRPHSYIKMDNFYTVTVYEKG
AEVVRMYKTLLGTQGFRKGIDLYFERHDEQAVTCEDFFAAMRDANNADFANFLQWYSQAGTPVVKVVSSYNADARTFSLK
FSQEIPPTPGQPTKEPTFIPVVVGLLDSSGKDITLSSVHHDGTVQTISGSSTILRVTKKEEEFVFSDIPERPVPSLFRGF
SAPVRVETDLSNDDLFFLLAHDSDEFNRWEAGQVLARKLMLNLVSDFQQNKPLALNPKFVQGLGSVLSDSSLDKEFIAKA
ITLPGEGEIMDMMAVADPDAVHAVRKFVRKQLASELKEELLKIVENNRSTEAYVFDHSNMARRALKNTALAYLASLEDPA
YMELALNEYKMATNLTDQFAALAALSQNPGKTRDDILADFYNKWQDDYLVVNKWFLLQSTSDIPGNVENVKKLLDHPAFD
LRNPNKVYSLIGGFCGSPVNFHAKDGSGYKFLGDIVVQLDKLNPQVASRMVSAFSRWKRYDETRQGLAKAQLEMIMSANG
LSENVFEIASKSLAA*
CDS seq >AT1G63770.5
ATGGCTCGGTTGATAATTCCTTGTCGAAGTTCGTCTCTGGCGAGAGTCAATCTTCTGGGTTTGCTCTCTCGTGCTCCTGT
TCCTGTTAGAAGCAGTTGTCTTCGTAGTTCGGCAAACAGACTTACTCAACACAGACCGTTTCTTACTTCCGAGGCCATTT
GTTTGAGGAAGAATCGGTTTCTACCCCATTCTGTTGATACACATAAGCAAAACAGCAGGAGGCTCATTTGTTCTGTTGCC
ACAGAATCAGTTCCGGATAAAGCTGAAGATTCCAAAATGGATGCACCTAAAGAAATATTTCTCAAGAACTACACAAAGCC
TGATTACTACTTTGAAACTGTGGATCTGAGCTTCTCTCTAGGTGAAGAGAAAACAATTGTTAGCTCTAAAATCAAGGTTT
CTCCTCGAGTTAAAGGATCTTCTGCTGCATTGGTCTTGGATGGACATGACTTGAAGCTACTTTCTGTCAAGGTTGAGGGG
AAGCTTCTTAAGGAAGGGGATTACCAGTTGGACTCTCGCCATCTAACTCTTCCTTCGCTTCCAGCTGAGGAGTCCTTTGT
TTTGGAGATTGATACTGAGATATACCCCCACAAGAATACTTCACTTGAGGGGCTTTACAAGTCTTCTGGGAATTTCTGCA
CACAGTGTGAAGCAGAAGGTTTCCGCAAAATTACATTTTACCAGGATCGTCCTGATATAATGGCTAAGTACACGTGTCGT
GTGGAAGGTGACAAGACACTTTATCCTGTGTTATTGTCCAATGGAAACCTCATTTCTCAAGGAGATATAGAGGGTGGTCG
ACACTATGCCTTATGGGAGGATCCATTCAAGAAACCGTGCTATCTATTTGCTTTGGTGGCTGGACAGCTAGTGAGCAGAG
ATGATACATTTACCACACGCTCTGGTAGGCAGGTTTCCCTGAAAATCTGGACTCCTGCAGAAGATCTACCAAAGACTGCC
CATGCCATGTATTCCCTAAAAGCGGCCATGAAGTGGGATGAGGATGTGTTCGGGCTTGAGTATGACCTGGATCTCTTCAA
CATTGTTGCTGTTCCAGATTTTAACATGGGAGCCATGGAAAACAAGAGTTTGAATATTTTTAATTCCAAGCTTGTCCTAG
CATCTCCAGAAACAGCAACAGATGCAGATTACGCTGCAATTTTGGGTGTTATTGGTCATGAGTACTTTCACAATTGGACA
GGGAACAGGGTGACATGCCGTGACTGGTTCCAACTCAGTCTAAAGGAAGGACTTACTGTCTTCCGTGACCAGGAATTTTC
ATCTGACATGGGAAGCCGAACTGTAAAGCGAATTGCTGATGTTTCAAAGCTTCGGATCTATCAATTTCCGCAGGATGCTG
GTCCTATGGCACATCCTGTTCGCCCACATTCATACATCAAGATGGATAACTTCTACACAGTGACGGTTTATGAAAAGGGA
GCTGAGGTCGTCAGGATGTACAAAACTCTACTAGGAACTCAGGGGTTCCGAAAGGGAATTGATCTCTATTTTGAAAGACA
TGATGAGCAAGCAGTGACTTGTGAAGACTTCTTTGCTGCTATGCGTGATGCAAATAATGCAGACTTTGCTAATTTCTTGC
AATGGTACTCTCAAGCTGGAACGCCCGTTGTCAAAGTGGTATCCTCTTACAATGCTGACGCTCGTACTTTCTCTTTAAAA
TTCAGTCAGGAGATACCTCCAACTCCAGGCCAGCCAACAAAAGAACCTACATTTATTCCAGTGGTTGTTGGTCTTTTGGA
CTCAAGTGGGAAAGACATTACTCTTTCCTCTGTTCATCATGATGGTACAGTGCAGACCATTTCAGGCAGCAGCACAATAC
TTCGAGTGACTAAGAAAGAAGAAGAGTTTGTGTTTTCTGATATACCAGAAAGACCTGTTCCATCCCTATTTAGGGGATTC
AGTGCCCCAGTTCGTGTTGAAACTGATCTCTCTAATGATGACTTATTCTTCCTCCTAGCACATGATTCAGATGAATTCAA
TAGGTGGGAGGCCGGTCAAGTTCTGGCAAGAAAGCTGATGCTGAACTTAGTTTCTGATTTCCAGCAAAATAAACCGTTGG
CTCTAAACCCAAAATTTGTGCAAGGTCTCGGCAGTGTGCTTTCTGACTCAAGCTTGGACAAGGAATTTATAGCCAAAGCA
ATAACACTACCTGGGGAGGGAGAGATAATGGACATGATGGCCGTGGCGGATCCTGATGCTGTTCATGCTGTTAGAAAGTT
TGTACGAAAGCAGCTTGCATCTGAACTTAAGGAGGAGCTTCTAAAGATAGTCGAGAACAATAGGAGCACAGAAGCTTATG
TCTTTGACCACTCAAACATGGCGAGGCGTGCTTTGAAGAATACAGCTCTAGCTTATCTTGCATCTCTCGAAGATCCAGCA
TATATGGAACTTGCACTGAACGAATACAAGATGGCCACCAATTTGACCGACCAATTTGCTGCTTTGGCAGCTCTATCCCA
GAACCCTGGTAAAACCCGTGACGACATTCTTGCCGACTTCTATAACAAGTGGCAGGACGATTACTTGGTTGTTAATAAAT
GGTTCCTCCTTCAATCAACATCCGACATTCCTGGCAATGTAGAGAATGTCAAGAAGCTTTTGGATCACCCAGCTTTCGAT
CTGCGCAACCCAAACAAGGTTTATTCGCTTATTGGAGGGTTCTGCGGTTCCCCAGTGAATTTCCATGCCAAGGATGGATC
AGGTTACAAGTTCTTGGGTGACATTGTTGTCCAGTTAGACAAATTGAACCCTCAGGTTGCTTCTCGTATGGTGTCTGCCT
TTTCGAGGTGGAAGCGGTACGATGAAACCCGTCAAGGTCTAGCCAAGGCACAATTGGAAATGATAATGTCTGCTAATGGG
CTGTCTGAGAATGTCTTTGAGATTGCCTCCAAGAGTTTGGCTGCTTGA
Microexon DNA seq GTTTATGAAAAG
Microexon Amino Acid seq VYEK
Microexon-tag DNA Seq GTTCGCCCACATTCATACATCAAGATGGATAACTTCTACACAGTGACGGTTTATGAAAAGGGAGCTGAGGTCGTCAGGATGTACAAAACTCTACTAGGAACTCAGGGG
Microexon-tag Amino Acid seq VRPHSYIKMDNFYTVTVYEKGAEVVRMYKTLLGTQG
Transcript ID At.5504.1
Gene ID At.5504
Gene Name NA
Pfam domain motif Peptidase_M1
Motif E-value 1.3e-50
Motif start 231
Motif end 442
Protein seq >At.5504.1
MDAPKEIFLKNYTKPDYYFETVDLSFSLGEEKTIVSSKIKVSPRVKGSSAALVLDGHDLKLLSVKVEGKLLKEGDYQLDS
RHLTLPSLPAEESFVLEIDTEIYPHKNTSLEGLYKSSGNFCTQCEAEGFRKITFYQDRPDIMAKYTCRVEGDKTLYPVLL
SNGNLISQGDIEGGRHYALWEDPFKKPCYLFALVAGQLVSRDDTFTTRSGRQVSLKIWTPAEDLPKTAHAMYSLKAAMKW
DEDVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSKLVLASPETATDADYAAILGVIGHEYFHNWTGNRVTCRDWFQL
SLKEGLTVFRDQEFSSDMGSRTVKRIADVSKLRIYQFPQDAGPMAHPVRPHSYIKMDNFYTVTVYEKGAEVVRMYKTLLG
TQGFRKGIDLYFERHDEQAVTCEDFFAAMRDANNADFANFLQWYSQAGTPVVKVVSSYNADARTFSLKFSQEIPPTPGQP
TKEPTFIPVVVGLLDSSGKDITLSSVHHDGTVQTISGSSTILRVTKKEEEFVFSDIPERPVPSLFRGFSAPVRVETDLSN
DDLFFLLAHDSDEFNRWEAGQVLARKLMLNLVSDFQQNKPLALNPKFVQGLGSVLSDSSLDKEFIAKAITLPGEGEIMDM
MAVADPDAVHAVRKFVRKQLASELKEELLKIVENNRSTEAYVFDHSNMARRALKNTALAYLASLEDPAYMELALNEYKMA
TNLTDQFAALAALSQNPGKTRDDILADFYNKWQDDYLVVNKWFLLQSTSDIPGNVENVKKLLDHPAFDLRNPNKVYSLIG
GFCGSPVNFHAKDGSGYKFLGDIVVQLDKLNPQVASRMVSAFSRWKRYDETRQGLAKAQLEMIMSANGLSENVFEIASKS
LAA*
CDS seq >At.5504.1
ATGGATGCACCTAAAGAAATATTTCTCAAGAACTACACAAAGCCTGATTACTACTTTGAAACTGTGGATCTGAGCTTCTC
TCTAGGTGAAGAGAAAACAATTGTTAGCTCTAAAATCAAGGTTTCTCCTCGAGTTAAAGGATCTTCTGCTGCATTGGTCT
TGGATGGACATGACTTGAAGCTACTTTCTGTCAAGGTTGAGGGGAAGCTTCTTAAGGAAGGGGATTACCAGTTGGACTCT
CGCCATCTAACTCTTCCTTCGCTTCCAGCTGAGGAGTCCTTTGTTTTGGAGATTGATACTGAGATATACCCCCACAAGAA
TACTTCACTTGAGGGGCTTTACAAGTCTTCTGGGAATTTCTGCACACAGTGTGAAGCAGAAGGTTTCCGCAAAATTACAT
TTTACCAGGATCGTCCTGATATAATGGCTAAGTACACGTGTCGTGTGGAAGGTGACAAGACACTTTATCCTGTGTTATTG
TCCAATGGAAACCTCATTTCTCAAGGAGATATAGAGGGTGGTCGACACTATGCCTTATGGGAGGATCCATTCAAGAAACC
GTGCTATCTATTTGCTTTGGTGGCTGGACAGCTAGTGAGCAGAGATGATACATTTACCACACGCTCTGGTAGGCAGGTTT
CCCTGAAAATCTGGACTCCTGCAGAAGATCTACCAAAGACTGCCCATGCCATGTATTCCCTAAAAGCGGCCATGAAGTGG
GATGAGGATGTGTTCGGGCTTGAGTATGACCTGGATCTCTTCAACATTGTTGCTGTTCCAGATTTTAACATGGGAGCCAT
GGAAAACAAGAGTTTGAATATTTTTAATTCCAAGCTTGTCCTAGCATCTCCAGAAACAGCAACAGATGCAGATTACGCTG
CAATTTTGGGTGTTATTGGTCATGAGTACTTTCACAATTGGACAGGGAACAGGGTGACATGCCGTGACTGGTTCCAACTC
AGTCTAAAGGAAGGACTTACTGTCTTCCGTGACCAGGAATTTTCATCTGACATGGGAAGCCGAACTGTAAAGCGAATTGC
TGATGTTTCAAAGCTTCGGATCTATCAATTTCCGCAGGATGCTGGTCCTATGGCACATCCTGTTCGCCCACATTCATACA
TCAAGATGGATAACTTCTACACAGTGACGGTTTATGAAAAGGGAGCTGAGGTCGTCAGGATGTACAAAACTCTACTAGGA
ACTCAGGGGTTCCGAAAGGGAATTGATCTCTATTTTGAAAGACATGATGAGCAAGCAGTGACTTGTGAAGACTTCTTTGC
TGCTATGCGTGATGCAAATAATGCAGACTTTGCTAATTTCTTGCAATGGTACTCTCAAGCTGGAACGCCCGTTGTCAAAG
TGGTATCCTCTTACAATGCTGACGCTCGTACTTTCTCTTTAAAATTCAGTCAGGAGATACCTCCAACTCCAGGCCAGCCA
ACAAAAGAACCTACATTTATTCCAGTGGTTGTTGGTCTTTTGGACTCAAGTGGGAAAGACATTACTCTTTCCTCTGTTCA
TCATGATGGTACAGTGCAGACCATTTCAGGCAGCAGCACAATACTTCGAGTGACTAAGAAAGAAGAAGAGTTTGTGTTTT
CTGATATACCAGAAAGACCTGTTCCATCCCTATTTAGGGGATTCAGTGCCCCAGTTCGTGTTGAAACTGATCTCTCTAAT
GATGACTTATTCTTCCTCCTAGCACATGATTCAGATGAATTCAATAGGTGGGAGGCCGGTCAAGTTCTGGCAAGAAAGCT
GATGCTGAACTTAGTTTCTGATTTCCAGCAAAATAAACCGTTGGCTCTAAACCCAAAATTTGTGCAAGGTCTCGGCAGTG
TGCTTTCTGACTCAAGCTTGGACAAGGAATTTATAGCCAAAGCAATAACACTACCTGGGGAGGGAGAGATAATGGACATG
ATGGCCGTGGCGGATCCTGATGCTGTTCATGCTGTTAGAAAGTTTGTACGAAAGCAGCTTGCATCTGAACTTAAGGAGGA
GCTTCTAAAGATAGTCGAGAACAATAGGAGCACAGAAGCTTATGTCTTTGACCACTCAAACATGGCGAGGCGTGCTTTGA
AGAATACAGCTCTAGCTTATCTTGCATCTCTCGAAGATCCAGCATATATGGAACTTGCACTGAACGAATACAAGATGGCC
ACCAATTTGACCGACCAATTTGCTGCTTTGGCAGCTCTATCCCAGAACCCTGGTAAAACCCGTGACGACATTCTTGCCGA
CTTCTATAACAAGTGGCAGGACGATTACTTGGTTGTTAATAAATGGTTCCTCCTTCAATCAACATCCGACATTCCTGGCA
ATGTAGAGAATGTCAAGAAGCTTTTGGATCACCCAGCTTTCGATCTGCGCAACCCAAACAAGGTTTATTCGCTTATTGGA
GGGTTCTGCGGTTCCCCAGTGAATTTCCATGCCAAGGATGGATCAGGTTACAAGTTCTTGGGTGACATTGTTGTCCAGTT
AGACAAATTGAACCCTCAGGTTGCTTCTCGTATGGTGTCTGCCTTTTCGAGGTGGAAGCGGTACGATGAAACCCGTCAAG
GTCTAGCCAAGGCACAATTGGAAATGATAATGTCTGCTAATGGGCTGTCTGAGAATGTCTTTGAGATTGCCTCCAAGAGT
TTGGCTGCTTGA