Microexon ID Ha_1:19012649-19012663:-
Species Helianthus annuus
Coordinates 1:19012649..19012663
Microexon Cluster ID MEP41
Size 15
Phase 0
Pfam Domain Motif DUF974
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,15,45
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq CARTTYTTCAAGTTYATTGTTKCWAAYCCACTTTCWGTTAGRACAAAGGTYCGYRYTRTCAAGGAAACTACMTWTYTRGARGCTTGYATWGARAAYCATACAAAATCA
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GTCCGCGTTGTGAAG
Microexon Amino Acid seq VRVVK
Microexon-tag DNA Seq CAGTATTTCAAGTTTATAGTTTCAAATCCGCTTTCTGTCAGGACAAAGGTCCGCGTTGTGAAGGAAACGACATACTTGGAGGCATGTTTAGAAAATAATACAAAATCA
Microexon-tag Amino Acid Seq QYFKFIVSNPLSVRTKVRVVKETTYLEACLENNTKS
Microexon-tag spanning region19012276-19012837
Microexon-tag prediction score0.9639
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG36106x
Reference Transcript ID OTG36106
Gene ID HannXRQ_Chr01g0003951
Gene Name TPC13
Transcript ID OTG36106
Protein ID OTG36106
Gene ID HannXRQ_Chr01g0003951
Gene Name TPC13
Pfam domain motif DUF974
Motif E-value 3.5e-63
Motif start 82
Motif end 311
Protein seq >OTG36106
MSTQSLAFRVMRLCRPTFQVETPLRFELSDLILGEDLLDDPSAAPQLRHLLHTIDSSTDLTYTNRFLLRDDPSDAMGLPG
MLVLPQSFGAIYLGETFCSYISINNSSSFEVRDIIIKAEIQTERQRILLLDTSKTPVETIRAGGRYDFIVEHDVKELGAH
TLVCTAQYSDGDAERKYLPQYFKFIVSNPLSVRTKVRVVKETTYLEACLENNTKSNLYMDQVDFEPTSNWSATLLKADSH
LSEKSVLTREILKPPILIKSGGGIHNYLYQLKSLLDGSAPSKFEGSNVLGKLQITWRTNLGEPGRLQTQQIIGNIITQRE
IELKATKVPSVIILEKPFTVCLSFTNLTENKVGPFEVLLSPSDSQEDKTLVGIGLKKMVLPQVEAFKSLDFQMNLIPMEA
GMQKISGITVFNTTERKSYDPLPDIEIYVDTY*
CDS seq >OTG36106
ATGAGCACGCAGTCGTTAGCCTTCCGGGTGATGCGCCTATGCCGTCCCACATTTCAGGTGGAAACCCCACTCCGATTCGA
GCTTTCCGATCTCATACTCGGTGAAGATCTACTCGACGATCCATCCGCCGCACCTCAACTCCGTCATCTCCTCCACACCA
TTGACTCCTCCACCGATCTCACCTACACCAATCGCTTCCTTCTCCGTGATGACCCATCCGATGCCATGGGCCTTCCCGGC
ATGCTCGTCCTCCCTCAGTCATTCGGTGCCATATATCTTGGAGAAACATTTTGCAGCTATATAAGCATCAACAACAGCTC
CAGTTTTGAAGTGAGGGACATTATAATCAAGGCAGAAATACAAACAGAAAGGCAGAGAATACTGCTTTTAGATACAAGTA
AAACACCAGTTGAAACAATAAGAGCAGGAGGACGCTACGATTTTATTGTTGAGCATGATGTTAAGGAGCTTGGTGCACAC
ACTCTGGTCTGCACTGCGCAATATAGTGATGGCGATGCTGAGCGCAAATATCTCCCACAGTATTTCAAGTTTATAGTTTC
AAATCCGCTTTCTGTCAGGACAAAGGTCCGCGTTGTGAAGGAAACGACATACTTGGAGGCATGTTTAGAAAATAATACAA
AATCAAACCTGTATATGGACCAAGTGGATTTTGAGCCTACTTCAAACTGGAGTGCAACATTACTAAAAGCCGACAGTCAC
CTTTCTGAGAAGAGCGTTTTGACAAGAGAGATATTGAAGCCACCCATTTTGATTAAATCTGGGGGTGGAATTCACAACTA
TCTCTACCAGTTGAAGTCATTATTGGATGGATCTGCACCAAGCAAATTTGAGGGGAGTAATGTTCTTGGCAAACTTCAGA
TAACATGGCGTACGAATTTGGGTGAACCTGGCCGCTTGCAAACACAACAGATCATTGGTAATATCATTACACAAAGAGAG
ATTGAATTGAAAGCAACGAAGGTGCCATCTGTCATCATCTTAGAAAAACCCTTTACAGTATGCTTGAGTTTCACAAACCT
TACTGAGAACAAAGTTGGCCCGTTTGAAGTTTTGTTATCTCCGAGCGATAGCCAAGAGGATAAAACCCTTGTTGGTATTG
GGCTTAAAAAGATGGTTTTACCTCAGGTGGAGGCATTCAAATCTCTGGATTTTCAAATGAACCTAATTCCTATGGAAGCT
GGAATGCAGAAAATCAGTGGTATCACAGTGTTTAACACGACAGAGAGGAAAAGTTATGATCCATTGCCCGATATCGAGAT
TTATGTTGATACATATTGA
Microexon DNA seq GTCCGCGTTGTGAAG
Microexon Amino Acid seq VRVVK
Microexon-tag DNA Seq CAGTATTTCAAGTTTATAGTTTCAAATCCGCTTTCTGTCAGGACAAAGGTCCGCGTTGTGAAGGAAACGACATACTTGGAGGCATGTTTAGAAAATAATACAAAATCA
Microexon-tag Amino Acid seq QYFKFIVSNPLSVRTKVRVVKETTYLEACLENNTKS
Transcript ID OTG36106
Gene ID Ha.426
Gene Name TPC13
Pfam domain motif DUF974
Motif E-value 3.5e-63
Motif start 82
Motif end 311
Protein seq >OTG36106
MSTQSLAFRVMRLCRPTFQVETPLRFELSDLILGEDLLDDPSAAPQLRHLLHTIDSSTDLTYTNRFLLRDDPSDAMGLPG
MLVLPQSFGAIYLGETFCSYISINNSSSFEVRDIIIKAEIQTERQRILLLDTSKTPVETIRAGGRYDFIVEHDVKELGAH
TLVCTAQYSDGDAERKYLPQYFKFIVSNPLSVRTKVRVVKETTYLEACLENNTKSNLYMDQVDFEPTSNWSATLLKADSH
LSEKSVLTREILKPPILIKSGGGIHNYLYQLKSLLDGSAPSKFEGSNVLGKLQITWRTNLGEPGRLQTQQIIGNIITQRE
IELKATKVPSVIILEKPFTVCLSFTNLTENKVGPFEVLLSPSDSQEDKTLVGIGLKKMVLPQVEAFKSLDFQMNLIPMEA
GMQKISGITVFNTTERKSYDPLPDIEIYVDTY*
CDS seq >OTG36106
ATGAGCACGCAGTCGTTAGCCTTCCGGGTGATGCGCCTATGCCGTCCCACATTTCAGGTGGAAACCCCACTCCGATTCGA
GCTTTCCGATCTCATACTCGGTGAAGATCTACTCGACGATCCATCCGCCGCACCTCAACTCCGTCATCTCCTCCACACCA
TTGACTCCTCCACCGATCTCACCTACACCAATCGCTTCCTTCTCCGTGATGACCCATCCGATGCCATGGGCCTTCCCGGC
ATGCTCGTCCTCCCTCAGTCATTCGGTGCCATATATCTTGGAGAAACATTTTGCAGCTATATAAGCATCAACAACAGCTC
CAGTTTTGAAGTGAGGGACATTATAATCAAGGCAGAAATACAAACAGAAAGGCAGAGAATACTGCTTTTAGATACAAGTA
AAACACCAGTTGAAACAATAAGAGCAGGAGGACGCTACGATTTTATTGTTGAGCATGATGTTAAGGAGCTTGGTGCACAC
ACTCTGGTCTGCACTGCGCAATATAGTGATGGCGATGCTGAGCGCAAATATCTCCCACAGTATTTCAAGTTTATAGTTTC
AAATCCGCTTTCTGTCAGGACAAAGGTCCGCGTTGTGAAGGAAACGACATACTTGGAGGCATGTTTAGAAAATAATACAA
AATCAAACCTGTATATGGACCAAGTGGATTTTGAGCCTACTTCAAACTGGAGTGCAACATTACTAAAAGCCGACAGTCAC
CTTTCTGAGAAGAGCGTTTTGACAAGAGAGATATTGAAGCCACCCATTTTGATTAAATCTGGGGGTGGAATTCACAACTA
TCTCTACCAGTTGAAGTCATTATTGGATGGATCTGCACCAAGCAAATTTGAGGGGAGTAATGTTCTTGGCAAACTTCAGA
TAACATGGCGTACGAATTTGGGTGAACCTGGCCGCTTGCAAACACAACAGATCATTGGTAATATCATTACACAAAGAGAG
ATTGAATTGAAAGCAACGAAGGTGCCATCTGTCATCATCTTAGAAAAACCCTTTACAGTATGCTTGAGTTTCACAAACCT
TACTGAGAACAAAGTTGGCCCGTTTGAAGTTTTGTTATCTCCGAGCGATAGCCAAGAGGATAAAACCCTTGTTGGTATTG
GGCTTAAAAAGATGGTTTTACCTCAGGTGGAGGCATTCAAATCTCTGGATTTTCAAATGAACCTAATTCCTATGGAAGCT
GGAATGCAGAAAATCAGTGGTATCACAGTGTTTAACACGACAGAGAGGAAAAGTTATGATCCATTGCCCGATATCGAGAT
TTATGTTGATACATATTGA