Microexon ID At_1:23660834-23660838:-
Species Arabidopsis thaliana
Coordinates 1:23660834..23660838
Microexon Cluster ID MEP07
Size 5
Phase 1
Pfam Domain Motif Peptidase_M1
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 33,19,5,12,39
Microexon location in the Microexon-tag 3
Microexon-tag DNA Seq GCWCAYCCTGTTCGRCCWCAYTCTTAYATYAAGATGGACAACTTCTAYACAGTRACGGTKTATGARAAGGGWGCTGAAGTTGTCMGRATGTACAARACMTTRYTKGGR
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TGACG
Microexon Amino Acid seq VT
Microexon-tag DNA Seq GCACATCCTGTTCGCCCACATTCATACATCAAGATGGATAACTTCTACACAGTGACGGTTTATGAAAAGGGAGCTGAGGTCGTCAGGATGTACAAAACTCTACTAGGA
Microexon-tag Amino Acid Seq AHPVRPHSYIKMDNFYTVTVYEKGAEVVRMYKTLLG
Microexon-tag spanning region23660606-23661121
Microexon-tag prediction score0.9778
Overlapped with the annotated transcript (%) 100
New Transcript ID AT1G63770.5x
Reference Transcript ID AT1G63770.5
Gene ID AT1G63770
Gene Name NA
Transcript ID AT1G63770.3
Protein ID AT1G63770.3
Gene ID AT1G63770
Gene Name NA
Pfam domain motif Peptidase_M1
Motif E-value 1.1e-44
Motif start 323
Motif end 546
Protein seq >AT1G63770.3
MARLIIPCRSSSLARVNLLGLLSRAPVPVRSSCLRSSANRLTQHRPFLTSEAICLRKNRFLPHSVDTHKQNSRRLICSVA
TESVPDKAEDSKMDAPKEIFLKNYTKPDYYFETVDLSFSLGEEKTIVSSKIKVSPRVKGSSAALVLDGHDLKLLSVKVEG
KLLKEGDYQLDSRHLTLPSLPAEESFVLEIDTEIYPHKNTSLEGLYKSSGNFCTQCEAEGFRKITFYQDRPDIMAKYTCR
VEGDKTLYPVLLSNGNLISQGDIEGGRHYALWEDPFKKPCYLFALVAGQLVSRDDTFTTRSGRQVSLKIWTPAEDLPKTA
HAMYSLKAAMKWDEDVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSKLVLASPETATDADYAAILGVIGHEYFHNWT
GNRVTCRDWFQLSLKEGLTVFRDQEFSSDMGSRTVKRIADVSKLRIYQFPQDAGPMAHPVRPHSYIKMDNFYTVTVYEKV
WLFTNSVLLYAGAEVVRMYKTLLGTQGFRKGIDLYFERHDEQAVTCEDFFAAMRDANNADFANFLQWYSQAGTPVVKVVS
SYNADARTFSLKFSQEIPPTPGQPTKEPTFIPVVVGLLDSSGKDITLSSVHHDGTVQTISGSSTILRVTKKEEEFVFSDI
PERPVPSLFRGFSAPVRVETDLSNDDLFFLLAHDSDEFNRWEAGQVLARKLMLNLVSDFQQNKPLALNPKFVQGLGSVLS
DSSLDKEFIAKAITLPGEGEIMDMMAVADPDAVHAVRKFVRKQLASELKEELLKIVENNRSTEAYVFDHSNMARRALKNT
ALAYLASLEDPAYMELALNEYKMATNLTDQFAALAALSQNPGKTRDDILADFYNKWQDDYLVVNKWFLLQSTSDIPGNVE
NVKKLLDHPAFDLRNPNKVYSLIGGFCGSPVNFHAKDGSGYKFLGDIVVQLDKLNPQVASRMVSAFSRWKRYDETRQGLA
KAQLEMIMSANGLSENVFEIASKSLAA*
CDS seq >AT1G63770.3
ATGGCTCGGTTGATAATTCCTTGTCGAAGTTCGTCTCTGGCGAGAGTCAATCTTCTGGGTTTGCTCTCTCGTGCTCCTGT
TCCTGTTAGAAGCAGTTGTCTTCGTAGTTCGGCAAACAGACTTACTCAACACAGACCGTTTCTTACTTCCGAGGCCATTT
GTTTGAGGAAGAATCGGTTTCTACCCCATTCTGTTGATACACATAAGCAAAACAGCAGGAGGCTCATTTGTTCTGTTGCC
ACAGAATCAGTTCCGGATAAAGCTGAAGATTCCAAAATGGATGCACCTAAAGAAATATTTCTCAAGAACTACACAAAGCC
TGATTACTACTTTGAAACTGTGGATCTGAGCTTCTCTCTAGGTGAAGAGAAAACAATTGTTAGCTCTAAAATCAAGGTTT
CTCCTCGAGTTAAAGGATCTTCTGCTGCATTGGTCTTGGATGGACATGACTTGAAGCTACTTTCTGTCAAGGTTGAGGGG
AAGCTTCTTAAGGAAGGGGATTACCAGTTGGACTCTCGCCATCTAACTCTTCCTTCGCTTCCAGCTGAGGAGTCCTTTGT
TTTGGAGATTGATACTGAGATATACCCCCACAAGAATACTTCACTTGAGGGGCTTTACAAGTCTTCTGGGAATTTCTGCA
CACAGTGTGAAGCAGAAGGTTTCCGCAAAATTACATTTTACCAGGATCGTCCTGATATAATGGCTAAGTACACGTGTCGT
GTGGAAGGTGACAAGACACTTTATCCTGTGTTATTGTCCAATGGAAACCTCATTTCTCAAGGAGATATAGAGGGTGGTCG
ACACTATGCCTTATGGGAGGATCCATTCAAGAAACCGTGCTATCTATTTGCTTTGGTGGCTGGACAGCTAGTGAGCAGAG
ATGATACATTTACCACACGCTCTGGTAGGCAGGTTTCCCTGAAAATCTGGACTCCTGCAGAAGATCTACCAAAGACTGCC
CATGCCATGTATTCCCTAAAAGCGGCCATGAAGTGGGATGAGGATGTGTTCGGGCTTGAGTATGACCTGGATCTCTTCAA
CATTGTTGCTGTTCCAGATTTTAACATGGGAGCCATGGAAAACAAGAGTTTGAATATTTTTAATTCCAAGCTTGTCCTAG
CATCTCCAGAAACAGCAACAGATGCAGATTACGCTGCAATTTTGGGTGTTATTGGTCATGAGTACTTTCACAATTGGACA
GGGAACAGGGTGACATGCCGTGACTGGTTCCAACTCAGTCTAAAGGAAGGACTTACTGTCTTCCGTGACCAGGAATTTTC
ATCTGACATGGGAAGCCGAACTGTAAAGCGAATTGCTGATGTTTCAAAGCTTCGGATCTATCAATTTCCGCAGGATGCTG
GTCCTATGGCACATCCTGTTCGCCCACATTCATACATCAAGATGGATAACTTCTACACAGTGACGGTTTATGAAAAGGTT
TGGCTCTTTACCAATAGTGTTTTGTTATATGCAGGAGCTGAGGTCGTCAGGATGTACAAAACTCTACTAGGAACTCAGGG
GTTCCGAAAGGGAATTGATCTCTATTTTGAAAGACATGATGAGCAAGCAGTGACTTGTGAAGACTTCTTTGCTGCTATGC
GTGATGCAAATAATGCAGACTTTGCTAATTTCTTGCAATGGTACTCTCAAGCTGGAACGCCCGTTGTCAAAGTGGTATCC
TCTTACAATGCTGACGCTCGTACTTTCTCTTTAAAATTCAGTCAGGAGATACCTCCAACTCCAGGCCAGCCAACAAAAGA
ACCTACATTTATTCCAGTGGTTGTTGGTCTTTTGGACTCAAGTGGGAAAGACATTACTCTTTCCTCTGTTCATCATGATG
GTACAGTGCAGACCATTTCAGGCAGCAGCACAATACTTCGAGTGACTAAGAAAGAAGAAGAGTTTGTGTTTTCTGATATA
CCAGAAAGACCTGTTCCATCCCTATTTAGGGGATTCAGTGCCCCAGTTCGTGTTGAAACTGATCTCTCTAATGATGACTT
ATTCTTCCTCCTAGCACATGATTCAGATGAATTCAATAGGTGGGAGGCCGGTCAAGTTCTGGCAAGAAAGCTGATGCTGA
ACTTAGTTTCTGATTTCCAGCAAAATAAACCGTTGGCTCTAAACCCAAAATTTGTGCAAGGTCTCGGCAGTGTGCTTTCT
GACTCAAGCTTGGACAAGGAATTTATAGCCAAAGCAATAACACTACCTGGGGAGGGAGAGATAATGGACATGATGGCCGT
GGCGGATCCTGATGCTGTTCATGCTGTTAGAAAGTTTGTACGAAAGCAGCTTGCATCTGAACTTAAGGAGGAGCTTCTAA
AGATAGTCGAGAACAATAGGAGCACAGAAGCTTATGTCTTTGACCACTCAAACATGGCGAGGCGTGCTTTGAAGAATACA
GCTCTAGCTTATCTTGCATCTCTCGAAGATCCAGCATATATGGAACTTGCACTGAACGAATACAAGATGGCCACCAATTT
GACCGACCAATTTGCTGCTTTGGCAGCTCTATCCCAGAACCCTGGTAAAACCCGTGACGACATTCTTGCCGACTTCTATA
ACAAGTGGCAGGACGATTACTTGGTTGTTAATAAATGGTTCCTCCTTCAATCAACATCCGACATTCCTGGCAATGTAGAG
AATGTCAAGAAGCTTTTGGATCACCCAGCTTTCGATCTGCGCAACCCAAACAAGGTTTATTCGCTTATTGGAGGGTTCTG
CGGTTCCCCAGTGAATTTCCATGCCAAGGATGGATCAGGTTACAAGTTCTTGGGTGACATTGTTGTCCAGTTAGACAAAT
TGAACCCTCAGGTTGCTTCTCGTATGGTGTCTGCCTTTTCGAGGTGGAAGCGGTACGATGAAACCCGTCAAGGTCTAGCC
AAGGCACAATTGGAAATGATAATGTCTGCTAATGGGCTGTCTGAGAATGTCTTTGAGATTGCCTCCAAGAGTTTGGCTGC
TTGA
Microexon DNA seq TGACG
Microexon Amino Acid seq VT
Microexon-tag DNA Seq GCACATCCTGTTCGCCCACATTCATACATCAAGATGGATAACTTCTACACAGTGACGGTTTATGAAAAGGGAGCTGAGGTCGTCAGGATGTACAAAACTCTACTAGGA
Microexon-tag Amino Acid seq AHPVRPHSYIKMDNFYTVTVYEKGAEVVRMYKTLLG
Transcript ID At.5504.1
Gene ID At.5504
Gene Name NA
Pfam domain motif Peptidase_M1
Motif E-value 1.3e-50
Motif start 231
Motif end 442
Protein seq >At.5504.1
MDAPKEIFLKNYTKPDYYFETVDLSFSLGEEKTIVSSKIKVSPRVKGSSAALVLDGHDLKLLSVKVEGKLLKEGDYQLDS
RHLTLPSLPAEESFVLEIDTEIYPHKNTSLEGLYKSSGNFCTQCEAEGFRKITFYQDRPDIMAKYTCRVEGDKTLYPVLL
SNGNLISQGDIEGGRHYALWEDPFKKPCYLFALVAGQLVSRDDTFTTRSGRQVSLKIWTPAEDLPKTAHAMYSLKAAMKW
DEDVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSKLVLASPETATDADYAAILGVIGHEYFHNWTGNRVTCRDWFQL
SLKEGLTVFRDQEFSSDMGSRTVKRIADVSKLRIYQFPQDAGPMAHPVRPHSYIKMDNFYTVTVYEKGAEVVRMYKTLLG
TQGFRKGIDLYFERHDEQAVTCEDFFAAMRDANNADFANFLQWYSQAGTPVVKVVSSYNADARTFSLKFSQEIPPTPGQP
TKEPTFIPVVVGLLDSSGKDITLSSVHHDGTVQTISGSSTILRVTKKEEEFVFSDIPERPVPSLFRGFSAPVRVETDLSN
DDLFFLLAHDSDEFNRWEAGQVLARKLMLNLVSDFQQNKPLALNPKFVQGLGSVLSDSSLDKEFIAKAITLPGEGEIMDM
MAVADPDAVHAVRKFVRKQLASELKEELLKIVENNRSTEAYVFDHSNMARRALKNTALAYLASLEDPAYMELALNEYKMA
TNLTDQFAALAALSQNPGKTRDDILADFYNKWQDDYLVVNKWFLLQSTSDIPGNVENVKKLLDHPAFDLRNPNKVYSLIG
GFCGSPVNFHAKDGSGYKFLGDIVVQLDKLNPQVASRMVSAFSRWKRYDETRQGLAKAQLEMIMSANGLSENVFEIASKS
LAA*
CDS seq >At.5504.1
ATGGATGCACCTAAAGAAATATTTCTCAAGAACTACACAAAGCCTGATTACTACTTTGAAACTGTGGATCTGAGCTTCTC
TCTAGGTGAAGAGAAAACAATTGTTAGCTCTAAAATCAAGGTTTCTCCTCGAGTTAAAGGATCTTCTGCTGCATTGGTCT
TGGATGGACATGACTTGAAGCTACTTTCTGTCAAGGTTGAGGGGAAGCTTCTTAAGGAAGGGGATTACCAGTTGGACTCT
CGCCATCTAACTCTTCCTTCGCTTCCAGCTGAGGAGTCCTTTGTTTTGGAGATTGATACTGAGATATACCCCCACAAGAA
TACTTCACTTGAGGGGCTTTACAAGTCTTCTGGGAATTTCTGCACACAGTGTGAAGCAGAAGGTTTCCGCAAAATTACAT
TTTACCAGGATCGTCCTGATATAATGGCTAAGTACACGTGTCGTGTGGAAGGTGACAAGACACTTTATCCTGTGTTATTG
TCCAATGGAAACCTCATTTCTCAAGGAGATATAGAGGGTGGTCGACACTATGCCTTATGGGAGGATCCATTCAAGAAACC
GTGCTATCTATTTGCTTTGGTGGCTGGACAGCTAGTGAGCAGAGATGATACATTTACCACACGCTCTGGTAGGCAGGTTT
CCCTGAAAATCTGGACTCCTGCAGAAGATCTACCAAAGACTGCCCATGCCATGTATTCCCTAAAAGCGGCCATGAAGTGG
GATGAGGATGTGTTCGGGCTTGAGTATGACCTGGATCTCTTCAACATTGTTGCTGTTCCAGATTTTAACATGGGAGCCAT
GGAAAACAAGAGTTTGAATATTTTTAATTCCAAGCTTGTCCTAGCATCTCCAGAAACAGCAACAGATGCAGATTACGCTG
CAATTTTGGGTGTTATTGGTCATGAGTACTTTCACAATTGGACAGGGAACAGGGTGACATGCCGTGACTGGTTCCAACTC
AGTCTAAAGGAAGGACTTACTGTCTTCCGTGACCAGGAATTTTCATCTGACATGGGAAGCCGAACTGTAAAGCGAATTGC
TGATGTTTCAAAGCTTCGGATCTATCAATTTCCGCAGGATGCTGGTCCTATGGCACATCCTGTTCGCCCACATTCATACA
TCAAGATGGATAACTTCTACACAGTGACGGTTTATGAAAAGGGAGCTGAGGTCGTCAGGATGTACAAAACTCTACTAGGA
ACTCAGGGGTTCCGAAAGGGAATTGATCTCTATTTTGAAAGACATGATGAGCAAGCAGTGACTTGTGAAGACTTCTTTGC
TGCTATGCGTGATGCAAATAATGCAGACTTTGCTAATTTCTTGCAATGGTACTCTCAAGCTGGAACGCCCGTTGTCAAAG
TGGTATCCTCTTACAATGCTGACGCTCGTACTTTCTCTTTAAAATTCAGTCAGGAGATACCTCCAACTCCAGGCCAGCCA
ACAAAAGAACCTACATTTATTCCAGTGGTTGTTGGTCTTTTGGACTCAAGTGGGAAAGACATTACTCTTTCCTCTGTTCA
TCATGATGGTACAGTGCAGACCATTTCAGGCAGCAGCACAATACTTCGAGTGACTAAGAAAGAAGAAGAGTTTGTGTTTT
CTGATATACCAGAAAGACCTGTTCCATCCCTATTTAGGGGATTCAGTGCCCCAGTTCGTGTTGAAACTGATCTCTCTAAT
GATGACTTATTCTTCCTCCTAGCACATGATTCAGATGAATTCAATAGGTGGGAGGCCGGTCAAGTTCTGGCAAGAAAGCT
GATGCTGAACTTAGTTTCTGATTTCCAGCAAAATAAACCGTTGGCTCTAAACCCAAAATTTGTGCAAGGTCTCGGCAGTG
TGCTTTCTGACTCAAGCTTGGACAAGGAATTTATAGCCAAAGCAATAACACTACCTGGGGAGGGAGAGATAATGGACATG
ATGGCCGTGGCGGATCCTGATGCTGTTCATGCTGTTAGAAAGTTTGTACGAAAGCAGCTTGCATCTGAACTTAAGGAGGA
GCTTCTAAAGATAGTCGAGAACAATAGGAGCACAGAAGCTTATGTCTTTGACCACTCAAACATGGCGAGGCGTGCTTTGA
AGAATACAGCTCTAGCTTATCTTGCATCTCTCGAAGATCCAGCATATATGGAACTTGCACTGAACGAATACAAGATGGCC
ACCAATTTGACCGACCAATTTGCTGCTTTGGCAGCTCTATCCCAGAACCCTGGTAAAACCCGTGACGACATTCTTGCCGA
CTTCTATAACAAGTGGCAGGACGATTACTTGGTTGTTAATAAATGGTTCCTCCTTCAATCAACATCCGACATTCCTGGCA
ATGTAGAGAATGTCAAGAAGCTTTTGGATCACCCAGCTTTCGATCTGCGCAACCCAAACAAGGTTTATTCGCTTATTGGA
GGGTTCTGCGGTTCCCCAGTGAATTTCCATGCCAAGGATGGATCAGGTTACAAGTTCTTGGGTGACATTGTTGTCCAGTT
AGACAAATTGAACCCTCAGGTTGCTTCTCGTATGGTGTCTGCCTTTTCGAGGTGGAAGCGGTACGATGAAACCCGTCAAG
GTCTAGCCAAGGCACAATTGGAAATGATAATGTCTGCTAATGGGCTGTCTGAGAATGTCTTTGAGATTGCCTCCAAGAGT
TTGGCTGCTTGA