Microexon ID At_2:12747870-12747880:+
Species Arabidopsis thaliana
Coordinates 2:12747870..12747880
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATGTACTCAAG
Microexon Amino Acid seq DVLK
Microexon-tag DNA Seq AGAAAACAAATAGAAGAACCACATTTATTTACATGTTCATGCAGTTCAGATGTACTCAAGGTGAAAGAAATATACAATTTTGTGCAAGATGATTTAACTACTGAAGAT
Microexon-tag Amino Acid Seq RKQIEEPHLFTCSCSSDVLKVKEIYNFVQDDLTTED
Microexon-tag spanning region12747705-12748028
Microexon-tag prediction score0.8721
Overlapped with the annotated transcript (%) 100
New Transcript ID AT2G29890.1x
Reference Transcript ID AT2G29890.1
Gene ID AT2G29890
Gene Name VLN1
Transcript ID AT2G29890.1
Protein ID AT2G29890.1
Gene ID AT2G29890
Gene Name VLN1
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >AT2G29890.1
MSRLSKDIDSAFQGVGTKSGLEIWCVYNKQLISIPKSSFGKFHSGNAYLVLRTFLRKIESPQYDIHYWLGIDANEVDSIL
ASDKALDLDAALGCCTVQYREVQGQETEKFLSYFKPCIIPVEGKYSPKTGIAGETYQVTLLRCKGDHVVRVKEVPFLRSS
LNHDDVFILDTASKVFLFAGCNSSTQEKAKAMEVVEYIKDNKHDGRCEVATIEDGKFSGDSDAGEFWSFFGGYAPIPKLS
SSTTQEQTQTPCAELFWIDTKGNLHPTGTSSLDKDMLEKNKCYMLDCHSEVFVWMGRNTSLTERKTSISSSEEFLRKEGR
STTTSLVLLTEGLENARFRSFFNKWPQTVESSLYNEGREKVAALFKQKGYDVEELPDEEDDPLYTNCRDNLKVWRVDGDD
VSLLSIPDQTKLFTGDCYLVQYKYTYKERTEHLLYVWIGCESIQQDRADAITNASAIVGTTKGESVLCHIYQGNEPSRFF
PMFQSLVVFKGGLSRRYKVLLAEKEKIGEEYNENKASLFRVVGTSPRNMQAIQVNLVATSLNSSYSYILQYGASAFTWIG
KLSSDSDHEVLDRMLYFLDTSCQPIYIREGNETDTFWNLLGGKSEYPKEKEMRKQIEEPHLFTCSCSSDVLKVKEIYNFV
QDDLTTEDVFLLDCQSEVYVWIGSNSNIKSKEEALTLGLKFLEMDILEEGLTMRTPVYVVTEGHEPPFFTRFFEWVPEKA
NMHGNSFERKLASLKGKKTSTKRSSGSQYRSQSKDNASRDLQSRSVSSNGSERGVSPCSSEKLLSLSSAEDMTNSSNSTP
VVKKLFSESLLVDPNDGVARQESSSKSDISKQKPRVGINSDLSSLESLAYSYEQLRVDSQKPVTDIDATRREAYLTEKEF
EERFGMAKSEFYALPKWKQNKLKISLHLF*
CDS seq >AT2G29890.1
ATGTCTAGGCTAAGTAAAGACATTGATTCAGCGTTTCAAGGTGTTGGAACCAAAAGTGGCTTGGAGATTTGGTGCGTTTA
TAATAAACAGCTTATTTCTATCCCTAAGTCTTCTTTTGGAAAATTTCACTCTGGAAATGCATACCTTGTTCTTAGAACGT
TTTTGCGGAAGATCGAATCTCCCCAGTATGACATTCACTATTGGCTAGGAATTGATGCAAATGAGGTGGATTCAATTTTG
GCATCAGACAAAGCATTAGATTTAGATGCCGCACTTGGATGTTGTACGGTGCAATACCGTGAAGTTCAAGGTCAAGAGAC
TGAGAAGTTTCTCTCTTACTTCAAACCTTGTATTATACCTGTTGAAGGCAAGTATTCCCCAAAAACTGGAATTGCTGGCG
AGACATACCAAGTCACCTTGCTAAGGTGCAAGGGAGATCATGTTGTTCGTGTTAAAGAGGTGCCTTTTCTTCGGTCATCA
CTGAACCATGACGATGTCTTCATTCTGGATACTGCGTCAAAGGTTTTTCTTTTTGCTGGTTGTAACTCAAGCACTCAGGA
AAAAGCTAAAGCGATGGAGGTTGTGGAATATATAAAGGATAACAAGCATGACGGAAGATGTGAGGTCGCGACTATCGAGG
ATGGAAAATTTTCAGGTGATTCAGATGCTGGGGAATTCTGGTCTTTCTTTGGTGGTTATGCTCCCATTCCCAAGCTTTCA
TCTTCTACCACCCAAGAACAAACTCAGACTCCATGTGCAGAATTGTTCTGGATAGATACTAAGGGAAATCTACATCCAAC
TGGAACAAGTTCTTTGGACAAGGACATGCTTGAGAAGAACAAATGCTACATGCTGGACTGTCACAGTGAAGTATTTGTTT
GGATGGGAAGAAACACATCACTTACAGAAAGGAAGACATCCATTTCTTCCTCAGAAGAATTTCTACGAAAGGAGGGACGC
TCGACGACCACAAGTTTAGTACTTCTAACAGAAGGACTAGAAAATGCCAGATTCAGGTCATTTTTTAACAAATGGCCTCA
GACCGTGGAGTCTAGCCTCTACAATGAAGGTCGAGAAAAAGTGGCTGCGTTGTTCAAACAAAAAGGATATGACGTTGAGG
AGCTTCCTGATGAAGAAGATGACCCTCTCTACACAAACTGCCGAGACAACCTCAAGGTTTGGCGTGTAGATGGTGATGAC
GTCTCGCTTCTCTCTATTCCTGACCAGACAAAGCTATTCACCGGCGATTGCTATCTTGTGCAGTATAAATATACTTATAA
AGAAAGAACCGAACATCTTTTATATGTATGGATCGGTTGTGAAAGCATACAGCAAGATAGAGCTGATGCCATAACCAATG
CTAGTGCCATTGTTGGTACAACCAAGGGTGAATCTGTACTGTGTCATATATATCAGGGAAACGAACCTTCTCGGTTTTTT
CCAATGTTCCAGTCACTGGTTGTTTTTAAGGGCGGTTTGAGTAGACGGTACAAAGTGCTTCTAGCAGAGAAGGAAAAGAT
AGGGGAAGAATATAATGAGAACAAGGCTTCTCTTTTCCGTGTTGTAGGAACAAGCCCAAGAAACATGCAAGCAATCCAAG
TGAATCTAGTTGCAACCTCCTTGAACTCATCCTACTCTTACATTTTACAATATGGAGCTTCTGCCTTCACTTGGATTGGG
AAACTTTCATCAGACTCTGATCATGAAGTTCTTGACAGAATGCTATATTTCCTTGATACATCTTGTCAACCTATATACAT
CAGGGAAGGAAATGAAACAGACACATTTTGGAATTTGCTTGGTGGTAAGTCAGAGTACCCAAAAGAAAAGGAAATGAGAA
AACAAATAGAAGAACCACATTTATTTACATGTTCATGCAGTTCAGATGTACTCAAGGTGAAAGAAATATACAATTTTGTG
CAAGATGATTTAACTACTGAAGATGTATTTCTATTGGACTGCCAAAGTGAAGTATATGTCTGGATTGGATCAAACTCAAA
CATAAAGTCGAAGGAAGAAGCTCTTACTCTTGGTCTGAAATTCCTAGAGATGGATATACTGGAAGAAGGTCTAACCATGA
GGACTCCTGTATATGTTGTCACAGAAGGCCACGAGCCACCATTTTTCACCCGTTTCTTTGAGTGGGTTCCTGAAAAGGCA
AACATGCATGGTAATTCATTTGAAAGGAAGCTTGCTAGTTTGAAAGGAAAGAAGACAAGCACTAAGAGATCTAGTGGAAG
CCAGTATAGATCACAGTCAAAGGATAATGCATCACGTGATTTACAAAGTCGATCTGTGAGCTCAAACGGATCGGAGCGAG
GAGTATCACCTTGCTCCAGCGAAAAGCTTTTGAGTTTGAGCTCTGCAGAAGACATGACAAACAGCAGTAACTCAACTCCA
GTTGTCAAAAAGCTTTTCTCAGAATCTCTTTTAGTGGATCCTAATGATGGAGTGGCGAGACAAGAGTCGAGTTCCAAGTC
GGACATTTCTAAACAGAAGCCACGCGTTGGAATCAATAGCGATCTTAGTAGTCTAGAGTCACTTGCATATTCATATGAAC
AGCTCAGAGTTGATTCTCAGAAGCCAGTGACGGATATAGATGCAACAAGAAGAGAGGCGTACTTAACAGAGAAAGAGTTT
GAAGAGAGATTTGGAATGGCGAAATCTGAGTTCTATGCACTTCCAAAGTGGAAACAGAATAAACTCAAAATATCTCTTCA
TCTTTTCTAA
Microexon DNA seq ATGTACTCAAG
Microexon Amino Acid seq DVLK
Microexon-tag DNA Seq AGAAAACAAATAGAAGAACCACATTTATTTACATGTTCATGCAGTTCAGATGTACTCAAGGTGAAAGAAATATACAATTTTGTGCAAGATGATTTAACTACTGAAGAT
Microexon-tag Amino Acid seq RKQIEEPHLFTCSCSSDVLKVKEIYNFVQDDLTTED
Transcript ID AT2G29890.1
Gene ID At.9773
Gene Name VLN1
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >AT2G29890.1
MSRLSKDIDSAFQGVGTKSGLEIWCVYNKQLISIPKSSFGKFHSGNAYLVLRTFLRKIESPQYDIHYWLGIDANEVDSIL
ASDKALDLDAALGCCTVQYREVQGQETEKFLSYFKPCIIPVEGKYSPKTGIAGETYQVTLLRCKGDHVVRVKEVPFLRSS
LNHDDVFILDTASKVFLFAGCNSSTQEKAKAMEVVEYIKDNKHDGRCEVATIEDGKFSGDSDAGEFWSFFGGYAPIPKLS
SSTTQEQTQTPCAELFWIDTKGNLHPTGTSSLDKDMLEKNKCYMLDCHSEVFVWMGRNTSLTERKTSISSSEEFLRKEGR
STTTSLVLLTEGLENARFRSFFNKWPQTVESSLYNEGREKVAALFKQKGYDVEELPDEEDDPLYTNCRDNLKVWRVDGDD
VSLLSIPDQTKLFTGDCYLVQYKYTYKERTEHLLYVWIGCESIQQDRADAITNASAIVGTTKGESVLCHIYQGNEPSRFF
PMFQSLVVFKGGLSRRYKVLLAEKEKIGEEYNENKASLFRVVGTSPRNMQAIQVNLVATSLNSSYSYILQYGASAFTWIG
KLSSDSDHEVLDRMLYFLDTSCQPIYIREGNETDTFWNLLGGKSEYPKEKEMRKQIEEPHLFTCSCSSDVLKVKEIYNFV
QDDLTTEDVFLLDCQSEVYVWIGSNSNIKSKEEALTLGLKFLEMDILEEGLTMRTPVYVVTEGHEPPFFTRFFEWVPEKA
NMHGNSFERKLASLKGKKTSTKRSSGSQYRSQSKDNASRDLQSRSVSSNGSERGVSPCSSEKLLSLSSAEDMTNSSNSTP
VVKKLFSESLLVDPNDGVARQESSSKSDISKQKPRVGINSDLSSLESLAYSYEQLRVDSQKPVTDIDATRREAYLTEKEF
EERFGMAKSEFYALPKWKQNKLKISLHLF*
CDS seq >AT2G29890.1
ATGTCTAGGCTAAGTAAAGACATTGATTCAGCGTTTCAAGGTGTTGGAACCAAAAGTGGCTTGGAGATTTGGTGCGTTTA
TAATAAACAGCTTATTTCTATCCCTAAGTCTTCTTTTGGAAAATTTCACTCTGGAAATGCATACCTTGTTCTTAGAACGT
TTTTGCGGAAGATCGAATCTCCCCAGTATGACATTCACTATTGGCTAGGAATTGATGCAAATGAGGTGGATTCAATTTTG
GCATCAGACAAAGCATTAGATTTAGATGCCGCACTTGGATGTTGTACGGTGCAATACCGTGAAGTTCAAGGTCAAGAGAC
TGAGAAGTTTCTCTCTTACTTCAAACCTTGTATTATACCTGTTGAAGGCAAGTATTCCCCAAAAACTGGAATTGCTGGCG
AGACATACCAAGTCACCTTGCTAAGGTGCAAGGGAGATCATGTTGTTCGTGTTAAAGAGGTGCCTTTTCTTCGGTCATCA
CTGAACCATGACGATGTCTTCATTCTGGATACTGCGTCAAAGGTTTTTCTTTTTGCTGGTTGTAACTCAAGCACTCAGGA
AAAAGCTAAAGCGATGGAGGTTGTGGAATATATAAAGGATAACAAGCATGACGGAAGATGTGAGGTCGCGACTATCGAGG
ATGGAAAATTTTCAGGTGATTCAGATGCTGGGGAATTCTGGTCTTTCTTTGGTGGTTATGCTCCCATTCCCAAGCTTTCA
TCTTCTACCACCCAAGAACAAACTCAGACTCCATGTGCAGAATTGTTCTGGATAGATACTAAGGGAAATCTACATCCAAC
TGGAACAAGTTCTTTGGACAAGGACATGCTTGAGAAGAACAAATGCTACATGCTGGACTGTCACAGTGAAGTATTTGTTT
GGATGGGAAGAAACACATCACTTACAGAAAGGAAGACATCCATTTCTTCCTCAGAAGAATTTCTACGAAAGGAGGGACGC
TCGACGACCACAAGTTTAGTACTTCTAACAGAAGGACTAGAAAATGCCAGATTCAGGTCATTTTTTAACAAATGGCCTCA
GACCGTGGAGTCTAGCCTCTACAATGAAGGTCGAGAAAAAGTGGCTGCGTTGTTCAAACAAAAAGGATATGACGTTGAGG
AGCTTCCTGATGAAGAAGATGACCCTCTCTACACAAACTGCCGAGACAACCTCAAGGTTTGGCGTGTAGATGGTGATGAC
GTCTCGCTTCTCTCTATTCCTGACCAGACAAAGCTATTCACCGGCGATTGCTATCTTGTGCAGTATAAATATACTTATAA
AGAAAGAACCGAACATCTTTTATATGTATGGATCGGTTGTGAAAGCATACAGCAAGATAGAGCTGATGCCATAACCAATG
CTAGTGCCATTGTTGGTACAACCAAGGGTGAATCTGTACTGTGTCATATATATCAGGGAAACGAACCTTCTCGGTTTTTT
CCAATGTTCCAGTCACTGGTTGTTTTTAAGGGCGGTTTGAGTAGACGGTACAAAGTGCTTCTAGCAGAGAAGGAAAAGAT
AGGGGAAGAATATAATGAGAACAAGGCTTCTCTTTTCCGTGTTGTAGGAACAAGCCCAAGAAACATGCAAGCAATCCAAG
TGAATCTAGTTGCAACCTCCTTGAACTCATCCTACTCTTACATTTTACAATATGGAGCTTCTGCCTTCACTTGGATTGGG
AAACTTTCATCAGACTCTGATCATGAAGTTCTTGACAGAATGCTATATTTCCTTGATACATCTTGTCAACCTATATACAT
CAGGGAAGGAAATGAAACAGACACATTTTGGAATTTGCTTGGTGGTAAGTCAGAGTACCCAAAAGAAAAGGAAATGAGAA
AACAAATAGAAGAACCACATTTATTTACATGTTCATGCAGTTCAGATGTACTCAAGGTGAAAGAAATATACAATTTTGTG
CAAGATGATTTAACTACTGAAGATGTATTTCTATTGGACTGCCAAAGTGAAGTATATGTCTGGATTGGATCAAACTCAAA
CATAAAGTCGAAGGAAGAAGCTCTTACTCTTGGTCTGAAATTCCTAGAGATGGATATACTGGAAGAAGGTCTAACCATGA
GGACTCCTGTATATGTTGTCACAGAAGGCCACGAGCCACCATTTTTCACCCGTTTCTTTGAGTGGGTTCCTGAAAAGGCA
AACATGCATGGTAATTCATTTGAAAGGAAGCTTGCTAGTTTGAAAGGAAAGAAGACAAGCACTAAGAGATCTAGTGGAAG
CCAGTATAGATCACAGTCAAAGGATAATGCATCACGTGATTTACAAAGTCGATCTGTGAGCTCAAACGGATCGGAGCGAG
GAGTATCACCTTGCTCCAGCGAAAAGCTTTTGAGTTTGAGCTCTGCAGAAGACATGACAAACAGCAGTAACTCAACTCCA
GTTGTCAAAAAGCTTTTCTCAGAATCTCTTTTAGTGGATCCTAATGATGGAGTGGCGAGACAAGAGTCGAGTTCCAAGTC
GGACATTTCTAAACAGAAGCCACGCGTTGGAATCAATAGCGATCTTAGTAGTCTAGAGTCACTTGCATATTCATATGAAC
AGCTCAGAGTTGATTCTCAGAAGCCAGTGACGGATATAGATGCAACAAGAAGAGAGGCGTACTTAACAGAGAAAGAGTTT
GAAGAGAGATTTGGAATGGCGAAATCTGAGTTCTATGCACTTCCAAAGTGGAAACAGAATAAACTCAAAATATCTCTTCA
TCTTTTCTAA