Microexon ID At_4:14757778-14757788:+
Species Arabidopsis thaliana
Coordinates 4:14757778..14757788
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq AAGTTCTCAAG
Microexon Amino Acid seq EVLK
Microexon-tag DNA Seq AAGGAACCCGAGCGTGACCCTCACTTGTTCTCTTGTACATTCACAAAAGAAGTTCTCAAGGTGACAGAGATATATAACTTTACACAGGATGACTTGATGACCGAAGAT
Microexon-tag Amino Acid Seq KEPERDPHLFSCTFTKEVLKVTEIYNFTQDDLMTED
Microexon-tag spanning region14757621-14757952
Microexon-tag prediction score0.951
Overlapped with the annotated transcript (%) 100
New Transcript ID AT4G30160.1x
Reference Transcript ID AT4G30160.1
Gene ID AT4G30160
Gene Name VLN4
Transcript ID AT4G30160.1
Protein ID AT4G30160.1
Gene ID AT4G30160
Gene Name VLN4
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >AT4G30160.1
MSVSMRDLDPAFQGAGQKAGIEIWRIENFIPTPIPKSSIGKFFTGDSYIVLKTTALKTGALRHDIHYWLGKDTSQDEAGT
AAVKTVELDAALGGRAVQYREVQGHETEKFLSYFKPCIIPQEGGVASGFKHVVAEEHITRLFVCRGKHVVHVKEVPFARS
SLNHDDIYILDTKSKIFQFNGSNSSIQERAKALEVVQYIKDTYHDGTCEVATVEDGKLMADADSGEFWGFFGGFAPLPRK
TANDEDKTYNSDITRLFCVEKGQANPVEGDTLKREMLDTNKCYILDCGIEVFVWMGRTTSLDDRKIASKAAEEMIRSSER
PKSQMIRIIEGFETVPFRSKFESWTQETNTTVSEDGRGRVAALLQRQGVNVRGLMKAAPPKEEPQVFIDCTGNLQVWRVN
GQAKTLLQAADHSKFYSGDCYVFQYSYPGEEKEEVLIGTWFGKQSVEEERGSAVSMASKMVESMKFVPAQARIYEGKEPI
QFFVIMQSFIVFKGGISSGYKKYIAEKEVDDDTYNENGVALFRIQGSGPENMQAIQVDPVAASLNSSYYYILHNDSSVFT
WAGNLSTATDQELAERQLDLIKPNQQSRAQKEGSESEQFWELLGGKAEYSSQKLTKEPERDPHLFSCTFTKEVLKVTEIY
NFTQDDLMTEDIFIIDCHSEIFVWVGQEVVPKNKLLALTIGEKFIEKDSLLEKLSPEAPIYVIMEGGEPSFFTRFFTSWD
SSKSAMHGNSFQRKLKIVKNGGTPVADKPKRRTPASYGGRASVPDKSQQRSRSMSFSPDRVRVRGRSPAFNALAATFESQ
NARNLSTPPPVVRKLYPRSVTPDSSKFAPAPKSSAIASRSALFEKIPPQEPSIPKPVKASPKTPESPAPESNSKEQEEKK
ENDKEEGSMSSRIESLTIQEDAKEGVEDEEDLPAHPYDRLKTTSTDPVSDIDVTRREAYLSSEEFKEKFGMTKEAFYKLP
KWKQNKFKMAVQLF*
CDS seq >AT4G30160.1
ATGTCTGTTTCTATGAGAGATTTAGATCCAGCTTTCCAAGGAGCTGGACAGAAAGCTGGTATTGAGATATGGCGTATAGA
GAATTTCATCCCTACCCCAATTCCAAAATCTTCTATTGGGAAGTTTTTCACCGGAGATTCTTACATAGTATTGAAGACAA
CGGCGTTAAAAACTGGTGCATTGCGCCATGATATCCATTACTGGCTTGGTAAAGATACCTCTCAGGATGAAGCCGGTACT
GCTGCAGTTAAGACAGTTGAATTAGATGCTGCTCTAGGAGGTCGTGCAGTGCAGTATCGGGAAGTTCAAGGCCACGAGAC
TGAGAAATTCTTGTCTTATTTTAAGCCATGTATCATTCCTCAAGAAGGTGGAGTAGCATCAGGATTCAAGCATGTCGTAG
CTGAAGAACATATTACCCGCTTGTTCGTCTGCAGAGGAAAACATGTTGTCCATGTCAAAGAGGTTCCTTTTGCTCGGAGT
TCATTAAACCATGACGATATTTACATTCTTGACACAAAGTCCAAGATTTTCCAATTCAATGGATCCAATTCTAGTATCCA
AGAGAGAGCAAAAGCACTGGAAGTGGTTCAGTACATCAAAGATACTTACCATGATGGGACATGTGAAGTTGCTACAGTTG
AGGATGGGAAACTTATGGCTGATGCTGATAGTGGAGAATTTTGGGGTTTCTTTGGTGGGTTTGCTCCGCTACCTAGGAAA
ACAGCTAATGATGAAGACAAAACTTATAATTCAGATATCACCAGATTATTTTGTGTCGAGAAGGGACAGGCAAATCCTGT
TGAAGGCGATACATTGAAGAGGGAGATGCTGGATACAAACAAGTGTTACATTCTTGATTGTGGAATTGAAGTGTTTGTTT
GGATGGGAAGAACCACTTCTCTTGATGATAGAAAAATTGCGAGTAAAGCAGCAGAAGAAATGATCCGTTCATCTGAACGA
CCGAAATCGCAAATGATCCGCATAATAGAAGGGTTTGAAACAGTACCATTCCGATCAAAGTTTGAATCTTGGACTCAAGA
AACTAATACAACCGTGTCAGAAGATGGTAGAGGCAGAGTTGCTGCTCTTTTGCAACGACAAGGAGTAAATGTCAGAGGCC
TGATGAAAGCTGCTCCGCCTAAAGAAGAGCCTCAGGTTTTCATCGACTGCACGGGAAATCTGCAGGTTTGGCGTGTGAAT
GGTCAGGCAAAGACTCTCCTTCAAGCTGCTGATCATTCAAAATTCTACAGTGGAGATTGCTATGTTTTCCAGTATTCTTA
TCCCGGAGAAGAAAAAGAAGAAGTTCTTATAGGAACGTGGTTTGGCAAACAAAGTGTGGAGGAAGAAAGAGGTTCTGCAG
TCTCTATGGCAAGCAAAATGGTTGAGTCAATGAAATTTGTCCCAGCCCAAGCTCGCATTTATGAAGGAAAGGAACCAATT
CAATTCTTCGTGATTATGCAAAGCTTTATCGTTTTCAAGGGTGGTATTAGCAGTGGATACAAGAAATACATAGCCGAGAA
AGAAGTTGATGATGATACATACAATGAGAATGGTGTTGCTCTATTCCGAATTCAAGGGTCTGGTCCGGAAAATATGCAAG
CTATACAAGTTGACCCGGTTGCTGCATCACTGAACTCCTCGTACTATTACATACTACATAATGATTCTTCCGTCTTTACT
TGGGCTGGAAATTTATCAACCGCAACTGACCAAGAACTGGCGGAAAGGCAGCTAGATCTGATTAAGCCAAATCAACAATC
TAGAGCACAAAAGGAAGGTTCAGAATCAGAACAGTTCTGGGAGTTATTAGGAGGCAAAGCTGAATATTCGAGCCAAAAGC
TCACAAAGGAACCCGAGCGTGACCCTCACTTGTTCTCTTGTACATTCACAAAAGAAGTTCTCAAGGTGACAGAGATATAT
AACTTTACACAGGATGACTTGATGACCGAAGATATATTTATCATAGACTGTCACTCAGAAATCTTTGTCTGGGTTGGCCA
AGAAGTAGTCCCAAAGAACAAGTTACTAGCTTTAACTATTGGAGAGAAATTCATCGAGAAAGATTCTCTCCTGGAGAAGT
TATCCCCTGAAGCCCCTATTTATGTGATCATGGAAGGCGGTGAGCCGTCATTCTTCACCCGGTTCTTCACTTCTTGGGAT
TCCTCAAAATCCGCTATGCATGGAAACTCATTCCAAAGAAAACTTAAAATTGTCAAAAATGGTGGAACTCCAGTGGCAGA
TAAACCAAAACGAAGAACTCCAGCTTCATATGGTGGCCGTGCCAGCGTTCCTGACAAGTCGCAGCAGCGGTCAAGAAGCA
TGTCATTTAGTCCAGACAGGGTTCGCGTGAGGGGCAGATCTCCGGCGTTCAATGCACTCGCAGCAACATTTGAGAGCCAA
AATGCAAGAAACCTGTCAACTCCTCCCCCAGTAGTTAGGAAACTCTACCCAAGATCTGTTACTCCTGACTCCTCAAAGTT
TGCTCCCGCTCCCAAGTCTTCAGCCATCGCTTCTCGAAGTGCACTTTTCGAAAAAATACCTCCACAAGAACCTTCAATTC
CAAAACCAGTCAAAGCGAGCCCGAAGACACCTGAGTCTCCAGCGCCAGAATCCAATTCAAAAGAACAAGAAGAGAAAAAG
GAAAATGACAAGGAGGAGGGATCAATGAGCAGCCGGATAGAATCTCTTACGATTCAAGAAGATGCTAAAGAAGGAGTCGA
AGACGAGGAAGATTTACCAGCTCACCCTTATGATCGTCTCAAGACAACTTCCACTGATCCTGTCTCTGACATTGATGTAA
CAAGGAGAGAGGCTTACCTTTCATCAGAAGAGTTCAAGGAGAAATTTGGCATGACGAAAGAAGCTTTCTACAAGCTGCCT
AAATGGAAACAGAACAAATTCAAAATGGCTGTTCAGCTTTTCTGA
Microexon DNA seq AAGTTCTCAAG
Microexon Amino Acid seq EVLK
Microexon-tag DNA Seq AAGGAACCCGAGCGTGACCCTCACTTGTTCTCTTGTACATTCACAAAAGAAGTTCTCAAGGTGACAGAGATATATAACTTTACACAGGATGACTTGATGACCGAAGAT
Microexon-tag Amino Acid seq KEPERDPHLFSCTFTKEVLKVTEIYNFTQDDLMTED
Transcript ID AT4G30160.1
Gene ID At.20530
Gene Name VLN4
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >AT4G30160.1
MSVSMRDLDPAFQGAGQKAGIEIWRIENFIPTPIPKSSIGKFFTGDSYIVLKTTALKTGALRHDIHYWLGKDTSQDEAGT
AAVKTVELDAALGGRAVQYREVQGHETEKFLSYFKPCIIPQEGGVASGFKHVVAEEHITRLFVCRGKHVVHVKEVPFARS
SLNHDDIYILDTKSKIFQFNGSNSSIQERAKALEVVQYIKDTYHDGTCEVATVEDGKLMADADSGEFWGFFGGFAPLPRK
TANDEDKTYNSDITRLFCVEKGQANPVEGDTLKREMLDTNKCYILDCGIEVFVWMGRTTSLDDRKIASKAAEEMIRSSER
PKSQMIRIIEGFETVPFRSKFESWTQETNTTVSEDGRGRVAALLQRQGVNVRGLMKAAPPKEEPQVFIDCTGNLQVWRVN
GQAKTLLQAADHSKFYSGDCYVFQYSYPGEEKEEVLIGTWFGKQSVEEERGSAVSMASKMVESMKFVPAQARIYEGKEPI
QFFVIMQSFIVFKGGISSGYKKYIAEKEVDDDTYNENGVALFRIQGSGPENMQAIQVDPVAASLNSSYYYILHNDSSVFT
WAGNLSTATDQELAERQLDLIKPNQQSRAQKEGSESEQFWELLGGKAEYSSQKLTKEPERDPHLFSCTFTKEVLKVTEIY
NFTQDDLMTEDIFIIDCHSEIFVWVGQEVVPKNKLLALTIGEKFIEKDSLLEKLSPEAPIYVIMEGGEPSFFTRFFTSWD
SSKSAMHGNSFQRKLKIVKNGGTPVADKPKRRTPASYGGRASVPDKSQQRSRSMSFSPDRVRVRGRSPAFNALAATFESQ
NARNLSTPPPVVRKLYPRSVTPDSSKFAPAPKSSAIASRSALFEKIPPQEPSIPKPVKASPKTPESPAPESNSKEQEEKK
ENDKEEGSMSSRIESLTIQEDAKEGVEDEEDLPAHPYDRLKTTSTDPVSDIDVTRREAYLSSEEFKEKFGMTKEAFYKLP
KWKQNKFKMAVQLF*
CDS seq >AT4G30160.1
ATGTCTGTTTCTATGAGAGATTTAGATCCAGCTTTCCAAGGAGCTGGACAGAAAGCTGGTATTGAGATATGGCGTATAGA
GAATTTCATCCCTACCCCAATTCCAAAATCTTCTATTGGGAAGTTTTTCACCGGAGATTCTTACATAGTATTGAAGACAA
CGGCGTTAAAAACTGGTGCATTGCGCCATGATATCCATTACTGGCTTGGTAAAGATACCTCTCAGGATGAAGCCGGTACT
GCTGCAGTTAAGACAGTTGAATTAGATGCTGCTCTAGGAGGTCGTGCAGTGCAGTATCGGGAAGTTCAAGGCCACGAGAC
TGAGAAATTCTTGTCTTATTTTAAGCCATGTATCATTCCTCAAGAAGGTGGAGTAGCATCAGGATTCAAGCATGTCGTAG
CTGAAGAACATATTACCCGCTTGTTCGTCTGCAGAGGAAAACATGTTGTCCATGTCAAAGAGGTTCCTTTTGCTCGGAGT
TCATTAAACCATGACGATATTTACATTCTTGACACAAAGTCCAAGATTTTCCAATTCAATGGATCCAATTCTAGTATCCA
AGAGAGAGCAAAAGCACTGGAAGTGGTTCAGTACATCAAAGATACTTACCATGATGGGACATGTGAAGTTGCTACAGTTG
AGGATGGGAAACTTATGGCTGATGCTGATAGTGGAGAATTTTGGGGTTTCTTTGGTGGGTTTGCTCCGCTACCTAGGAAA
ACAGCTAATGATGAAGACAAAACTTATAATTCAGATATCACCAGATTATTTTGTGTCGAGAAGGGACAGGCAAATCCTGT
TGAAGGCGATACATTGAAGAGGGAGATGCTGGATACAAACAAGTGTTACATTCTTGATTGTGGAATTGAAGTGTTTGTTT
GGATGGGAAGAACCACTTCTCTTGATGATAGAAAAATTGCGAGTAAAGCAGCAGAAGAAATGATCCGTTCATCTGAACGA
CCGAAATCGCAAATGATCCGCATAATAGAAGGGTTTGAAACAGTACCATTCCGATCAAAGTTTGAATCTTGGACTCAAGA
AACTAATACAACCGTGTCAGAAGATGGTAGAGGCAGAGTTGCTGCTCTTTTGCAACGACAAGGAGTAAATGTCAGAGGCC
TGATGAAAGCTGCTCCGCCTAAAGAAGAGCCTCAGGTTTTCATCGACTGCACGGGAAATCTGCAGGTTTGGCGTGTGAAT
GGTCAGGCAAAGACTCTCCTTCAAGCTGCTGATCATTCAAAATTCTACAGTGGAGATTGCTATGTTTTCCAGTATTCTTA
TCCCGGAGAAGAAAAAGAAGAAGTTCTTATAGGAACGTGGTTTGGCAAACAAAGTGTGGAGGAAGAAAGAGGTTCTGCAG
TCTCTATGGCAAGCAAAATGGTTGAGTCAATGAAATTTGTCCCAGCCCAAGCTCGCATTTATGAAGGAAAGGAACCAATT
CAATTCTTCGTGATTATGCAAAGCTTTATCGTTTTCAAGGGTGGTATTAGCAGTGGATACAAGAAATACATAGCCGAGAA
AGAAGTTGATGATGATACATACAATGAGAATGGTGTTGCTCTATTCCGAATTCAAGGGTCTGGTCCGGAAAATATGCAAG
CTATACAAGTTGACCCGGTTGCTGCATCACTGAACTCCTCGTACTATTACATACTACATAATGATTCTTCCGTCTTTACT
TGGGCTGGAAATTTATCAACCGCAACTGACCAAGAACTGGCGGAAAGGCAGCTAGATCTGATTAAGCCAAATCAACAATC
TAGAGCACAAAAGGAAGGTTCAGAATCAGAACAGTTCTGGGAGTTATTAGGAGGCAAAGCTGAATATTCGAGCCAAAAGC
TCACAAAGGAACCCGAGCGTGACCCTCACTTGTTCTCTTGTACATTCACAAAAGAAGTTCTCAAGGTGACAGAGATATAT
AACTTTACACAGGATGACTTGATGACCGAAGATATATTTATCATAGACTGTCACTCAGAAATCTTTGTCTGGGTTGGCCA
AGAAGTAGTCCCAAAGAACAAGTTACTAGCTTTAACTATTGGAGAGAAATTCATCGAGAAAGATTCTCTCCTGGAGAAGT
TATCCCCTGAAGCCCCTATTTATGTGATCATGGAAGGCGGTGAGCCGTCATTCTTCACCCGGTTCTTCACTTCTTGGGAT
TCCTCAAAATCCGCTATGCATGGAAACTCATTCCAAAGAAAACTTAAAATTGTCAAAAATGGTGGAACTCCAGTGGCAGA
TAAACCAAAACGAAGAACTCCAGCTTCATATGGTGGCCGTGCCAGCGTTCCTGACAAGTCGCAGCAGCGGTCAAGAAGCA
TGTCATTTAGTCCAGACAGGGTTCGCGTGAGGGGCAGATCTCCGGCGTTCAATGCACTCGCAGCAACATTTGAGAGCCAA
AATGCAAGAAACCTGTCAACTCCTCCCCCAGTAGTTAGGAAACTCTACCCAAGATCTGTTACTCCTGACTCCTCAAAGTT
TGCTCCCGCTCCCAAGTCTTCAGCCATCGCTTCTCGAAGTGCACTTTTCGAAAAAATACCTCCACAAGAACCTTCAATTC
CAAAACCAGTCAAAGCGAGCCCGAAGACACCTGAGTCTCCAGCGCCAGAATCCAATTCAAAAGAACAAGAAGAGAAAAAG
GAAAATGACAAGGAGGAGGGATCAATGAGCAGCCGGATAGAATCTCTTACGATTCAAGAAGATGCTAAAGAAGGAGTCGA
AGACGAGGAAGATTTACCAGCTCACCCTTATGATCGTCTCAAGACAACTTCCACTGATCCTGTCTCTGACATTGATGTAA
CAAGGAGAGAGGCTTACCTTTCATCAGAAGAGTTCAAGGAGAAATTTGGCATGACGAAAGAAGCTTTCTACAAGCTGCCT
AAATGGAAACAGAACAAATTCAAAATGGCTGTTCAGCTTTTCTGA