Microexon ID Ha_1:146422881-146422893:-
Species Helianthus annuus
Coordinates 1:146422881..146422893
Microexon Cluster ID MEP31
Size 13
Phase 0
Pfam Domain Motif TPT
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,13,47
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GCTGTYATGTCTGGKTTYCGYTGGTSYATGACTCARATTCTTYTGCAGAAAGAARMHTAYGGTYTRAARAATCCAYTTACCTTGATGAGYTATGTKACYCCAGTGATG
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq AAAGAGGTCTATG
Microexon Amino Acid seq KEVYG
Microexon-tag DNA Seq GCTGTTATGTCGGGTTTTCGTTGGACCATGACTCAGATCCTTCTTCAGAAAGAGGTCTATGGTTTGAAAAATCCCCTGACGTTAATGAGCTATGTAACCCCTGTTATG
Microexon-tag Amino Acid Seq AVMSGFRWTMTQILLQKEVYGLKNPLTLMSYVTPVM
Microexon-tag spanning region146422707-146423336
Microexon-tag prediction score0.961
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG38338x
Reference Transcript ID OTG38338
Gene ID HannXRQ_Chr01g0028621
Gene Name NA
Transcript ID OTG38338
Protein ID OTG38338
Gene ID HannXRQ_Chr01g0028621
Gene Name NA
Pfam domain motif TPT
Motif E-value 4e-31
Motif start 67
Motif end 366
Protein seq >OTG38338
MSGGSSTEMVSAASPRRTHTNEKRVSFDIENDTNGGLKYSNTTDFVDRNREGLASNKSPIVVVDILKTLFFILVWYTFSI
LLTLYNKKLLGDDQGKFPAPLLMNTIHFGMQAVFSKAIVYFWSERFQPTVTMSWKDYFMRVVPTAIGTALDINLSNASLV
FISVTFATMCKSAAPIFLLIFAFAFRLESPSFKLLGIMLIISAGVLLTVAKETEFDLWGFIFVMLAAVMSGFRWTMTQIL
LQKEVYGLKNPLTLMSYVTPVMALLTALLSLILDPWGEFRTSSYFDSKLHIMRSCFLMLFGGTLAFFMVLTEYILVSVTS
AVTVTIAGIVKEAVTILVAVFYFHDEFTWLKGGGLVTIIFGVSLFNLYKYQKLHNEKSSPDELAGSQTSVHAKYVILEEM
DDEETEETRP*
CDS seq >OTG38338
ATGAGTGGTGGTAGCAGTACTGAAATGGTTTCTGCTGCTAGCCCGAGAAGAACACACACAAACGAGAAACGGGTTTCTTT
CGATATTGAAAATGATACAAACGGAGGGTTGAAATATTCAAACACAACAGATTTTGTTGATAGAAATAGAGAGGGTTTGG
CAAGCAACAAGAGCCCGATTGTCGTTGTCGATATACTCAAAACGTTGTTTTTTATACTCGTTTGGTACACATTCAGCATA
TTACTCACATTGTATAATAAGAAACTATTGGGAGATGATCAGGGGAAATTCCCTGCTCCTTTGTTAATGAATACGATTCA
TTTTGGAATGCAAGCCGTTTTTTCAAAGGCGATTGTATACTTTTGGTCCGAAAGATTCCAACCGACAGTAACTATGTCAT
GGAAAGACTACTTTATGAGAGTTGTGCCAACTGCTATTGGAACAGCACTGGATATAAATTTGAGCAATGCATCACTCGTT
TTCATATCGGTTACATTTGCAACCATGTGTAAATCTGCTGCTCCTATATTTCTCCTTATATTTGCTTTTGCATTCAGGTT
GGAATCTCCAAGTTTTAAACTTTTAGGCATTATGTTGATAATATCTGCTGGAGTTTTATTAACAGTTGCAAAAGAGACCG
AATTTGATTTGTGGGGTTTCATTTTTGTTATGCTTGCTGCTGTTATGTCGGGTTTTCGTTGGACCATGACTCAGATCCTT
CTTCAGAAAGAGGTCTATGGTTTGAAAAATCCCCTGACGTTAATGAGCTATGTAACCCCTGTTATGGCCCTATTAACCGC
TTTGCTATCGCTAATTTTGGATCCTTGGGGCGAATTTAGAACTAGCAGTTATTTTGATAGTAAATTGCATATCATGAGGA
GTTGCTTCTTGATGCTCTTTGGGGGAACTCTTGCCTTTTTTATGGTATTAACAGAGTACATCCTTGTATCTGTGACAAGT
GCAGTTACAGTTACCATAGCAGGAATCGTAAAGGAAGCTGTCACTATTCTGGTAGCAGTATTTTATTTCCATGATGAATT
TACATGGCTGAAAGGAGGTGGTCTTGTCACCATCATTTTTGGTGTGAGTTTGTTCAACTTGTACAAATATCAGAAGTTGC
ACAACGAAAAATCGAGTCCGGATGAGCTGGCGGGATCTCAAACCTCAGTCCATGCAAAATATGTTATTCTGGAGGAAATG
GACGATGAAGAAACCGAAGAAACTAGACCTTGA
Microexon DNA seq AAAGAGGTCTATG
Microexon Amino Acid seq KEVYG
Microexon-tag DNA Seq GCTGTTATGTCGGGTTTTCGTTGGACCATGACTCAGATCCTTCTTCAGAAAGAGGTCTATGGTTTGAAAAATCCCCTGACGTTAATGAGCTATGTAACCCCTGTTATG
Microexon-tag Amino Acid seq AVMSGFRWTMTQILLQKEVYGLKNPLTLMSYVTPVM
Transcript ID Ha.2929.1
Gene ID Ha.2929
Gene Name NA
Pfam domain motif TPT
Motif E-value 4e-31
Motif start 67
Motif end 366
Protein seq >Ha.2929.1
MSGGSSTEMVSAASPRRTHTNEKRVSFDIENDTNGGLKYSNTTDFVDRNREGLASNKSPIVVVDILKTLFFILVWYTFSI
LLTLYNKKLLGDDQGKFPAPLLMNTIHFGMQAVFSKAIVYFWSERFQPTVTMSWKDYFMRVVPTAIGTALDINLSNASLV
FISVTFATMCKSAAPIFLLIFAFAFRLESPSFKLLGIMLIISAGVLLTVAKETEFDLWGFIFVMLAAVMSGFRWTMTQIL
LQKEVYGLKNPLTLMSYVTPVMALLTALLSLILDPWGEFRTSSYFDSKLHIMRSCFLMLFGGTLAFFMVLTEYILVSVTS
AVTVTIAGIVKEAVTILVAVFYFHDEFTWLKGGGLVTIIFGVSLFNLYKYQKLHNEKSSPDELAGSQTSVHAKYVILEEM
DDEETEETRP*
CDS seq >Ha.2929.1
ATGAGTGGTGGTAGCAGTACTGAAATGGTTTCTGCTGCTAGCCCGAGAAGAACACACACAAACGAGAAACGGGTTTCTTT
CGATATTGAAAATGATACAAACGGAGGGTTGAAATATTCAAACACAACAGATTTTGTTGATAGAAATAGAGAGGGTTTGG
CAAGCAACAAGAGCCCGATTGTCGTTGTCGATATACTCAAAACGTTGTTTTTTATACTCGTTTGGTACACATTCAGCATA
TTACTCACATTGTATAATAAGAAACTATTGGGAGATGATCAGGGGAAATTCCCTGCTCCTTTGTTAATGAATACGATTCA
TTTTGGAATGCAAGCCGTTTTTTCAAAGGCGATTGTATACTTTTGGTCCGAAAGATTCCAACCGACAGTAACTATGTCAT
GGAAAGACTACTTTATGAGAGTTGTGCCAACTGCTATTGGAACAGCACTGGATATAAATTTGAGCAATGCATCACTCGTT
TTCATATCGGTTACATTTGCAACCATGTGTAAATCTGCTGCTCCTATATTTCTCCTTATATTTGCTTTTGCATTCAGGTT
GGAATCTCCAAGTTTTAAACTTTTAGGCATTATGTTGATAATATCTGCTGGAGTTTTATTAACAGTTGCAAAAGAGACCG
AATTTGATTTGTGGGGTTTCATTTTTGTTATGCTTGCTGCTGTTATGTCGGGTTTTCGTTGGACCATGACTCAGATCCTT
CTTCAGAAAGAGGTCTATGGTTTGAAAAATCCCCTGACGTTAATGAGCTATGTAACCCCTGTTATGGCCCTATTAACCGC
TTTGCTATCGCTAATTTTGGATCCTTGGGGCGAATTTAGAACTAGCAGTTATTTTGATAGTAAATTGCATATCATGAGGA
GTTGCTTCTTGATGCTCTTTGGGGGAACTCTTGCCTTTTTTATGGTATTAACAGAGTACATCCTTGTATCTGTGACAAGT
GCAGTTACAGTTACCATAGCAGGAATCGTAAAGGAAGCTGTCACTATTCTGGTAGCAGTATTTTATTTCCATGATGAATT
TACATGGCTGAAAGGAGGTGGTCTTGTCACCATCATTTTTGGTGTGAGTTTGTTCAACTTGTACAAATATCAGAAGTTGC
ACAACGAAAAATCGAGTCCGGATGAGCTGGCGGGATCTCAAACCTCAGTCCATGCAAAATATGTTATTCTGGAGGAAATG
GACGATGAAGAAACCGAAGAAACTAGACCTTGA