| Microexon ID | At_1:23660722-23660733:- |
| Species | Arabidopsis thaliana |
| Coordinates | 1:23660722..23660733 |
| Microexon Cluster ID | MEP28 |
| Size (bp) | 12 |
| Phase | 0 |
| Pfam Domain Motif | Peptidase_M1 |
| Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 24,19,5,12,48 |
| Microexon location in the Microexon-tag | 4 |
| Microexon-tag DNA Seq (degenerate, IUPAC ambiguity codes) | GTTMGRCCWCAYTCTTAYATYAAGATGGACAACTTCTAYACAGTRACGGTKTATGARAAGGGWGCTGAAGTTGTCMGRATGTACAARACMTTRYTKGGRAGTYMAGGR |
| Logo of Microexon-tag DNA Seq | (image not available) |
| Alignment of exons | (image not available) |
| Microexon DNA seq | GTTTATGAAAAG |
| Microexon Amino Acid seq | VYEK |
| Microexon-tag DNA Seq | GTTCGCCCACATTCATACATCAAGATGGATAACTTCTACACAGTGACGGTTTATGAAAAGGGAGCTGAGGTCGTCAGGATGTACAAAACTCTACTAGGAACTCAGGGG |
| Microexon-tag Amino Acid Seq | VRPHSYIKMDNFYTVTVYEKGAEVVRMYKTLLGTQG |
| Microexon-tag spanning region | 23660597-23661112 |
| Microexon-tag prediction score | 0.975 |
| Overlapped with the annotated transcript (%) | 100 |
| New Transcript ID | AT1G63770.5x |
| Reference Transcript ID | AT1G63770.5 |
| Gene ID | AT1G63770 |
| Gene Name | NA |
| Transcript ID | AT1G63770.5 |
| Protein ID | AT1G63770.5 |
| Gene ID | AT1G63770 |
| Gene Name | NA |
| Pfam domain motif | Peptidase_M1 |
| Motif E-value | 1.6e-50 |
| Motif start | 323 |
| Motif end | 534 |
| Protein seq | >AT1G63770.5 MARLIIPCRSSSLARVNLLGLLSRAPVPVRSSCLRSSANRLTQHRPFLTSEAICLRKNRFLPHSVDTHKQNSRRLICSVA TESVPDKAEDSKMDAPKEIFLKNYTKPDYYFETVDLSFSLGEEKTIVSSKIKVSPRVKGSSAALVLDGHDLKLLSVKVEG KLLKEGDYQLDSRHLTLPSLPAEESFVLEIDTEIYPHKNTSLEGLYKSSGNFCTQCEAEGFRKITFYQDRPDIMAKYTCR VEGDKTLYPVLLSNGNLISQGDIEGGRHYALWEDPFKKPCYLFALVAGQLVSRDDTFTTRSGRQVSLKIWTPAEDLPKTA HAMYSLKAAMKWDEDVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSKLVLASPETATDADYAAILGVIGHEYFHNWT GNRVTCRDWFQLSLKEGLTVFRDQEFSSDMGSRTVKRIADVSKLRIYQFPQDAGPMAHPVRPHSYIKMDNFYTVTVYEKG AEVVRMYKTLLGTQGFRKGIDLYFERHDEQAVTCEDFFAAMRDANNADFANFLQWYSQAGTPVVKVVSSYNADARTFSLK FSQEIPPTPGQPTKEPTFIPVVVGLLDSSGKDITLSSVHHDGTVQTISGSSTILRVTKKEEEFVFSDIPERPVPSLFRGF SAPVRVETDLSNDDLFFLLAHDSDEFNRWEAGQVLARKLMLNLVSDFQQNKPLALNPKFVQGLGSVLSDSSLDKEFIAKA ITLPGEGEIMDMMAVADPDAVHAVRKFVRKQLASELKEELLKIVENNRSTEAYVFDHSNMARRALKNTALAYLASLEDPA YMELALNEYKMATNLTDQFAALAALSQNPGKTRDDILADFYNKWQDDYLVVNKWFLLQSTSDIPGNVENVKKLLDHPAFD LRNPNKVYSLIGGFCGSPVNFHAKDGSGYKFLGDIVVQLDKLNPQVASRMVSAFSRWKRYDETRQGLAKAQLEMIMSANG LSENVFEIASKSLAA* |
| CDS seq | >AT1G63770.5 ATGGCTCGGTTGATAATTCCTTGTCGAAGTTCGTCTCTGGCGAGAGTCAATCTTCTGGGTTTGCTCTCTCGTGCTCCTGT TCCTGTTAGAAGCAGTTGTCTTCGTAGTTCGGCAAACAGACTTACTCAACACAGACCGTTTCTTACTTCCGAGGCCATTT GTTTGAGGAAGAATCGGTTTCTACCCCATTCTGTTGATACACATAAGCAAAACAGCAGGAGGCTCATTTGTTCTGTTGCC ACAGAATCAGTTCCGGATAAAGCTGAAGATTCCAAAATGGATGCACCTAAAGAAATATTTCTCAAGAACTACACAAAGCC TGATTACTACTTTGAAACTGTGGATCTGAGCTTCTCTCTAGGTGAAGAGAAAACAATTGTTAGCTCTAAAATCAAGGTTT CTCCTCGAGTTAAAGGATCTTCTGCTGCATTGGTCTTGGATGGACATGACTTGAAGCTACTTTCTGTCAAGGTTGAGGGG AAGCTTCTTAAGGAAGGGGATTACCAGTTGGACTCTCGCCATCTAACTCTTCCTTCGCTTCCAGCTGAGGAGTCCTTTGT TTTGGAGATTGATACTGAGATATACCCCCACAAGAATACTTCACTTGAGGGGCTTTACAAGTCTTCTGGGAATTTCTGCA CACAGTGTGAAGCAGAAGGTTTCCGCAAAATTACATTTTACCAGGATCGTCCTGATATAATGGCTAAGTACACGTGTCGT GTGGAAGGTGACAAGACACTTTATCCTGTGTTATTGTCCAATGGAAACCTCATTTCTCAAGGAGATATAGAGGGTGGTCG ACACTATGCCTTATGGGAGGATCCATTCAAGAAACCGTGCTATCTATTTGCTTTGGTGGCTGGACAGCTAGTGAGCAGAG ATGATACATTTACCACACGCTCTGGTAGGCAGGTTTCCCTGAAAATCTGGACTCCTGCAGAAGATCTACCAAAGACTGCC CATGCCATGTATTCCCTAAAAGCGGCCATGAAGTGGGATGAGGATGTGTTCGGGCTTGAGTATGACCTGGATCTCTTCAA CATTGTTGCTGTTCCAGATTTTAACATGGGAGCCATGGAAAACAAGAGTTTGAATATTTTTAATTCCAAGCTTGTCCTAG CATCTCCAGAAACAGCAACAGATGCAGATTACGCTGCAATTTTGGGTGTTATTGGTCATGAGTACTTTCACAATTGGACA GGGAACAGGGTGACATGCCGTGACTGGTTCCAACTCAGTCTAAAGGAAGGACTTACTGTCTTCCGTGACCAGGAATTTTC ATCTGACATGGGAAGCCGAACTGTAAAGCGAATTGCTGATGTTTCAAAGCTTCGGATCTATCAATTTCCGCAGGATGCTG GTCCTATGGCACATCCTGTTCGCCCACATTCATACATCAAGATGGATAACTTCTACACAGTGACGGTTTATGAAAAGGGA GCTGAGGTCGTCAGGATGTACAAAACTCTACTAGGAACTCAGGGGTTCCGAAAGGGAATTGATCTCTATTTTGAAAGACA TGATGAGCAAGCAGTGACTTGTGAAGACTTCTTTGCTGCTATGCGTGATGCAAATAATGCAGACTTTGCTAATTTCTTGC AATGGTACTCTCAAGCTGGAACGCCCGTTGTCAAAGTGGTATCCTCTTACAATGCTGACGCTCGTACTTTCTCTTTAAAA TTCAGTCAGGAGATACCTCCAACTCCAGGCCAGCCAACAAAAGAACCTACATTTATTCCAGTGGTTGTTGGTCTTTTGGA CTCAAGTGGGAAAGACATTACTCTTTCCTCTGTTCATCATGATGGTACAGTGCAGACCATTTCAGGCAGCAGCACAATAC TTCGAGTGACTAAGAAAGAAGAAGAGTTTGTGTTTTCTGATATACCAGAAAGACCTGTTCCATCCCTATTTAGGGGATTC 
AGTGCCCCAGTTCGTGTTGAAACTGATCTCTCTAATGATGACTTATTCTTCCTCCTAGCACATGATTCAGATGAATTCAA TAGGTGGGAGGCCGGTCAAGTTCTGGCAAGAAAGCTGATGCTGAACTTAGTTTCTGATTTCCAGCAAAATAAACCGTTGG CTCTAAACCCAAAATTTGTGCAAGGTCTCGGCAGTGTGCTTTCTGACTCAAGCTTGGACAAGGAATTTATAGCCAAAGCA ATAACACTACCTGGGGAGGGAGAGATAATGGACATGATGGCCGTGGCGGATCCTGATGCTGTTCATGCTGTTAGAAAGTT TGTACGAAAGCAGCTTGCATCTGAACTTAAGGAGGAGCTTCTAAAGATAGTCGAGAACAATAGGAGCACAGAAGCTTATG TCTTTGACCACTCAAACATGGCGAGGCGTGCTTTGAAGAATACAGCTCTAGCTTATCTTGCATCTCTCGAAGATCCAGCA TATATGGAACTTGCACTGAACGAATACAAGATGGCCACCAATTTGACCGACCAATTTGCTGCTTTGGCAGCTCTATCCCA GAACCCTGGTAAAACCCGTGACGACATTCTTGCCGACTTCTATAACAAGTGGCAGGACGATTACTTGGTTGTTAATAAAT GGTTCCTCCTTCAATCAACATCCGACATTCCTGGCAATGTAGAGAATGTCAAGAAGCTTTTGGATCACCCAGCTTTCGAT CTGCGCAACCCAAACAAGGTTTATTCGCTTATTGGAGGGTTCTGCGGTTCCCCAGTGAATTTCCATGCCAAGGATGGATC AGGTTACAAGTTCTTGGGTGACATTGTTGTCCAGTTAGACAAATTGAACCCTCAGGTTGCTTCTCGTATGGTGTCTGCCT TTTCGAGGTGGAAGCGGTACGATGAAACCCGTCAAGGTCTAGCCAAGGCACAATTGGAAATGATAATGTCTGCTAATGGG CTGTCTGAGAATGTCTTTGAGATTGCCTCCAAGAGTTTGGCTGCTTGA |
| Transcript ID | At.5504.1 |
| Gene ID | At.5504 |
| Gene Name | NA |
| Pfam domain motif | Peptidase_M1 |
| Motif E-value | 1.3e-50 |
| Motif start | 231 |
| Motif end | 442 |
| Protein seq | >At.5504.1 MDAPKEIFLKNYTKPDYYFETVDLSFSLGEEKTIVSSKIKVSPRVKGSSAALVLDGHDLKLLSVKVEGKLLKEGDYQLDS RHLTLPSLPAEESFVLEIDTEIYPHKNTSLEGLYKSSGNFCTQCEAEGFRKITFYQDRPDIMAKYTCRVEGDKTLYPVLL SNGNLISQGDIEGGRHYALWEDPFKKPCYLFALVAGQLVSRDDTFTTRSGRQVSLKIWTPAEDLPKTAHAMYSLKAAMKW DEDVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSKLVLASPETATDADYAAILGVIGHEYFHNWTGNRVTCRDWFQL SLKEGLTVFRDQEFSSDMGSRTVKRIADVSKLRIYQFPQDAGPMAHPVRPHSYIKMDNFYTVTVYEKGAEVVRMYKTLLG TQGFRKGIDLYFERHDEQAVTCEDFFAAMRDANNADFANFLQWYSQAGTPVVKVVSSYNADARTFSLKFSQEIPPTPGQP TKEPTFIPVVVGLLDSSGKDITLSSVHHDGTVQTISGSSTILRVTKKEEEFVFSDIPERPVPSLFRGFSAPVRVETDLSN DDLFFLLAHDSDEFNRWEAGQVLARKLMLNLVSDFQQNKPLALNPKFVQGLGSVLSDSSLDKEFIAKAITLPGEGEIMDM MAVADPDAVHAVRKFVRKQLASELKEELLKIVENNRSTEAYVFDHSNMARRALKNTALAYLASLEDPAYMELALNEYKMA TNLTDQFAALAALSQNPGKTRDDILADFYNKWQDDYLVVNKWFLLQSTSDIPGNVENVKKLLDHPAFDLRNPNKVYSLIG GFCGSPVNFHAKDGSGYKFLGDIVVQLDKLNPQVASRMVSAFSRWKRYDETRQGLAKAQLEMIMSANGLSENVFEIASKS LAA* |
| CDS seq | >At.5504.1 ATGGATGCACCTAAAGAAATATTTCTCAAGAACTACACAAAGCCTGATTACTACTTTGAAACTGTGGATCTGAGCTTCTC TCTAGGTGAAGAGAAAACAATTGTTAGCTCTAAAATCAAGGTTTCTCCTCGAGTTAAAGGATCTTCTGCTGCATTGGTCT TGGATGGACATGACTTGAAGCTACTTTCTGTCAAGGTTGAGGGGAAGCTTCTTAAGGAAGGGGATTACCAGTTGGACTCT CGCCATCTAACTCTTCCTTCGCTTCCAGCTGAGGAGTCCTTTGTTTTGGAGATTGATACTGAGATATACCCCCACAAGAA TACTTCACTTGAGGGGCTTTACAAGTCTTCTGGGAATTTCTGCACACAGTGTGAAGCAGAAGGTTTCCGCAAAATTACAT TTTACCAGGATCGTCCTGATATAATGGCTAAGTACACGTGTCGTGTGGAAGGTGACAAGACACTTTATCCTGTGTTATTG TCCAATGGAAACCTCATTTCTCAAGGAGATATAGAGGGTGGTCGACACTATGCCTTATGGGAGGATCCATTCAAGAAACC GTGCTATCTATTTGCTTTGGTGGCTGGACAGCTAGTGAGCAGAGATGATACATTTACCACACGCTCTGGTAGGCAGGTTT CCCTGAAAATCTGGACTCCTGCAGAAGATCTACCAAAGACTGCCCATGCCATGTATTCCCTAAAAGCGGCCATGAAGTGG GATGAGGATGTGTTCGGGCTTGAGTATGACCTGGATCTCTTCAACATTGTTGCTGTTCCAGATTTTAACATGGGAGCCAT GGAAAACAAGAGTTTGAATATTTTTAATTCCAAGCTTGTCCTAGCATCTCCAGAAACAGCAACAGATGCAGATTACGCTG CAATTTTGGGTGTTATTGGTCATGAGTACTTTCACAATTGGACAGGGAACAGGGTGACATGCCGTGACTGGTTCCAACTC AGTCTAAAGGAAGGACTTACTGTCTTCCGTGACCAGGAATTTTCATCTGACATGGGAAGCCGAACTGTAAAGCGAATTGC TGATGTTTCAAAGCTTCGGATCTATCAATTTCCGCAGGATGCTGGTCCTATGGCACATCCTGTTCGCCCACATTCATACA TCAAGATGGATAACTTCTACACAGTGACGGTTTATGAAAAGGGAGCTGAGGTCGTCAGGATGTACAAAACTCTACTAGGA ACTCAGGGGTTCCGAAAGGGAATTGATCTCTATTTTGAAAGACATGATGAGCAAGCAGTGACTTGTGAAGACTTCTTTGC TGCTATGCGTGATGCAAATAATGCAGACTTTGCTAATTTCTTGCAATGGTACTCTCAAGCTGGAACGCCCGTTGTCAAAG TGGTATCCTCTTACAATGCTGACGCTCGTACTTTCTCTTTAAAATTCAGTCAGGAGATACCTCCAACTCCAGGCCAGCCA ACAAAAGAACCTACATTTATTCCAGTGGTTGTTGGTCTTTTGGACTCAAGTGGGAAAGACATTACTCTTTCCTCTGTTCA TCATGATGGTACAGTGCAGACCATTTCAGGCAGCAGCACAATACTTCGAGTGACTAAGAAAGAAGAAGAGTTTGTGTTTT CTGATATACCAGAAAGACCTGTTCCATCCCTATTTAGGGGATTCAGTGCCCCAGTTCGTGTTGAAACTGATCTCTCTAAT GATGACTTATTCTTCCTCCTAGCACATGATTCAGATGAATTCAATAGGTGGGAGGCCGGTCAAGTTCTGGCAAGAAAGCT GATGCTGAACTTAGTTTCTGATTTCCAGCAAAATAAACCGTTGGCTCTAAACCCAAAATTTGTGCAAGGTCTCGGCAGTG TGCTTTCTGACTCAAGCTTGGACAAGGAATTTATAGCCAAAGCAATAACACTACCTGGGGAGGGAGAGATAATGGACATG 
ATGGCCGTGGCGGATCCTGATGCTGTTCATGCTGTTAGAAAGTTTGTACGAAAGCAGCTTGCATCTGAACTTAAGGAGGA GCTTCTAAAGATAGTCGAGAACAATAGGAGCACAGAAGCTTATGTCTTTGACCACTCAAACATGGCGAGGCGTGCTTTGA AGAATACAGCTCTAGCTTATCTTGCATCTCTCGAAGATCCAGCATATATGGAACTTGCACTGAACGAATACAAGATGGCC ACCAATTTGACCGACCAATTTGCTGCTTTGGCAGCTCTATCCCAGAACCCTGGTAAAACCCGTGACGACATTCTTGCCGA CTTCTATAACAAGTGGCAGGACGATTACTTGGTTGTTAATAAATGGTTCCTCCTTCAATCAACATCCGACATTCCTGGCA ATGTAGAGAATGTCAAGAAGCTTTTGGATCACCCAGCTTTCGATCTGCGCAACCCAAACAAGGTTTATTCGCTTATTGGA GGGTTCTGCGGTTCCCCAGTGAATTTCCATGCCAAGGATGGATCAGGTTACAAGTTCTTGGGTGACATTGTTGTCCAGTT AGACAAATTGAACCCTCAGGTTGCTTCTCGTATGGTGTCTGCCTTTTCGAGGTGGAAGCGGTACGATGAAACCCGTCAAG GTCTAGCCAAGGCACAATTGGAAATGATAATGTCTGCTAATGGGCTGTCTGAGAATGTCTTTGAGATTGCCTCCAAGAGT TTGGCTGCTTGA |
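
The microexon-tag structure (24,19,5,12,48) and the microexon location (4) reported in this record can be checked against the Arabidopsis microexon-tag DNA sequence. The following is a minimal sketch, not the database's own pipeline. It assumes that the sizes are listed in 5'-to-3' order along the tag, that the location is a 1-based exon index, and that the reading frame begins at the first base of the tag. The helper names `split_exons` and `translate` are hypothetical and introduced only for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (hypothetical helper table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate DNA in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_exons(seq, sizes):
    """Cut seq into consecutive pieces whose lengths are given by sizes."""
    if sum(sizes) != len(seq):
        raise ValueError("exon sizes do not add up to the sequence length")
    exons, start = [], 0
    for size in sizes:
        exons.append(seq[start:start + size])
        start += size
    return exons


# Values copied from the record above.
TAG_DNA = (
    "GTTCGCCCACATTCATACATCAAGATGGATAACTTCTACACAGTGACG"
    "GTTTATGAAAAG"
    "GGAGCTGAGGTCGTCAGGATGTACAAAACTCTACTAGGAACTCAGGGG"
)
SIZES = [24, 19, 5, 12, 48]
MICROEXON_INDEX = 4  # assumed to be 1-based

exons = split_exons(TAG_DNA, SIZES)
microexon = exons[MICROEXON_INDEX - 1]
offset = sum(SIZES[:MICROEXON_INDEX - 1])  # bases preceding the microexon

print(microexon, translate(microexon))  # microexon DNA and its peptide
print(offset % 3)                       # frame offset of the microexon start
print(translate(TAG_DNA))               # peptide encoded by the whole tag
```

Under these assumptions the fourth piece is the 12 bp microexon, it starts on a codon boundary (consistent with the reported phase 0), and the translations agree with the amino acid sequences listed in the record.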

