Microexon ID At_5:23216003-23216013:+
Species Arabidopsis thaliana
Coordinates 5:23216003..23216013
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq AAAGTCTGAAG
Microexon Amino Acid seq ESLK
Microexon-tag DNA Seq AGGGATGGAGAGAGTGATCCTCATCTCTTTTCTTGCACATATACAAACGAAAGTCTGAAGGCTACAGAGATCTTCAACTTCACTCAAGATGATTTGATGACTGAAGAT
Microexon-tag Amino Acid Seq RDGESDPHLFSCTYTNESLKATEIFNFTQDDLMTED
Microexon-tag spanning region23215820-23216171
Microexon-tag prediction score0.9427
Overlapped with the annotated transcript (%) 100
New Transcript ID AT5G57320.1x
Reference Transcript ID AT5G57320.1
Gene ID AT5G57320
Gene Name VLN5
Transcript ID AT5G57320.1
Protein ID AT5G57320.1
Gene ID AT5G57320
Gene Name VLN5
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >AT5G57320.1
MTFSMRDLDQALQGAGQKSGIEIWRIENFKPVTVPQESHGKFFTGDSYIVLKTTASRSGSLHHDIHYWLGKDSSQDEAGA
VAVMTVELDSALGGRAVQYREVQGHETEKFLSYFKPCIIPQEGGVASGFNHVKPEEHQTRLYICKGKHVVRVKEVPFVRS
TLNHEDVFILDTESKIFQFSGSKSSIQERAKALEVVQYIKDTYHDGKCDIAAVEDGRMMADAEAGEFWGLFGGFAPLPKK
PAVNDDETAASDGIKLFSVEKGQTDAVEAECLTKELLDTNKCYILDCGLELFVWKGRSTSIDQRKSATEAAEEFFRSSEP
PKSNLVSVMEGYETVMFRSKFDSWPASSTIAEPQQGRGKVAALLQRQGVNVQGLVKTSSSSSKDEPKPYIDGTGNLQVWR
INCEEKILLEAAEQSKFYSGDCYILQYSYPGEDREEHLVGTWFGKQSVEEDRASAISLANKMVESMKFVPAQARINEGKE
PIQFFVIMQSFITFKGGVSDAFKKYIAENDIPDTTYEAEGVALFRVQGSGPENMQAIQIEAASAGLNSSHCYILHGDSTV
FTWCGNLTSSEDQELMERMLDLIKPNEPTKAQKEGSESEQFWELLGGKSEYPSQKIKRDGESDPHLFSCTYTNESLKATE
IFNFTQDDLMTEDIFILDCHTEVFVWVGQQVDPKKKPQALDIGENFLKHDFLLENLASETPIYIVTEGNEPPFFTRFFTW
DSSKSGMHGDSFQRKLAILTNKGKPLLDKPKRRVPAYSSRSTVPDKSQPRSRSMTFSPDRARVRGRSPAFNALAANFEKL
NIRNQSTPPPMVSPMVRKLYPKSHAPDLSKIAPKSAIAARTALFEKPTPTSQEPPTSPSSSEATNQAEAPKSTSETNEEE
AMSSINEDSKEEEAEEESSLPTFPYERLKTDSEDPVSDVDLTRREAYLTSVEFKEKFEMTKNEFYKLPKWKQNKLKMSVN
LF*
CDS seq >AT5G57320.1
ATGACGTTTTCCATGAGAGATTTAGATCAGGCTCTTCAAGGAGCTGGCCAGAAATCCGGGATTGAAATATGGCGCATCGA
GAACTTCAAACCTGTCACTGTTCCACAAGAGTCTCATGGGAAATTTTTCACCGGGGATTCCTATATAGTCTTAAAGACCA
CAGCGTCAAGAAGCGGTTCTTTGCATCATGATATTCACTACTGGCTCGGTAAAGATTCCAGCCAGGATGAAGCAGGTGCT
GTAGCCGTCATGACTGTTGAATTAGATTCAGCATTAGGTGGACGTGCTGTTCAGTACCGAGAAGTACAGGGTCACGAAAC
CGAGAAGTTTCTTTCCTACTTCAAACCTTGCATAATACCTCAAGAAGGTGGAGTTGCTTCAGGGTTCAACCATGTAAAGC
CCGAGGAGCATCAGACGCGTCTGTATATCTGCAAAGGCAAACATGTCGTCCGTGTTAAAGAGGTTCCGTTTGTTCGATCG
ACTCTCAACCACGAAGACGTTTTTATTCTTGATACAGAGTCTAAAATTTTTCAATTCAGTGGTTCCAAGTCGAGTATTCA
AGAAAGAGCAAAAGCTCTTGAGGTTGTTCAGTACATTAAAGACACTTACCATGATGGAAAGTGCGATATCGCAGCTGTTG
AGGATGGGAGGATGATGGCTGATGCTGAAGCTGGAGAGTTTTGGGGCTTGTTTGGTGGGTTTGCTCCGCTTCCTAAGAAA
CCAGCAGTCAATGATGACGAAACCGCTGCATCTGATGGTATCAAACTTTTCAGTGTCGAGAAGGGACAGACAGATGCCGT
AGAGGCTGAGTGTTTGACGAAAGAGCTTCTGGACACTAACAAATGTTATATTCTCGATTGCGGTCTTGAATTGTTCGTTT
GGAAGGGACGAAGTACTTCAATCGACCAAAGAAAGAGCGCAACTGAAGCTGCAGAAGAATTTTTCCGTTCGTCTGAACCG
CCAAAATCAAACCTGGTCAGTGTGATGGAAGGGTATGAAACAGTGATGTTCCGATCTAAGTTTGATTCATGGCCGGCTTC
AAGTACCATAGCTGAGCCCCAACAAGGCAGAGGCAAAGTCGCAGCTCTTTTGCAGCGGCAAGGAGTTAACGTTCAAGGCC
TGGTTAAGACTTCTTCTTCTTCTTCTAAAGACGAACCGAAGCCATACATTGATGGTACAGGAAATCTCCAGGTCTGGCGA
ATCAATTGTGAAGAAAAGATCCTTCTCGAAGCAGCAGAGCAATCAAAGTTCTATAGCGGGGATTGTTATATACTCCAGTA
CTCATACCCAGGAGAAGACAGAGAGGAACATCTAGTGGGTACTTGGTTTGGCAAGCAAAGCGTTGAGGAAGACAGAGCAT
CTGCTATTTCTTTGGCAAACAAGATGGTTGAATCCATGAAGTTTGTGCCGGCTCAGGCTCGCATTAATGAGGGAAAAGAG
CCAATTCAGTTTTTTGTGATCATGCAAAGCTTCATCACATTTAAGGGTGGTGTAAGTGATGCTTTCAAAAAGTACATAGC
TGAGAACGACATCCCTGATACTACTTACGAAGCAGAAGGTGTTGCACTGTTTCGGGTTCAAGGTTCTGGACCCGAGAACA
TGCAAGCCATACAGATAGAAGCGGCTTCCGCAGGACTAAATTCTTCGCACTGTTATATATTACACGGTGATTCTACTGTT
TTCACTTGGTGTGGCAATCTTACTTCCTCGGAAGATCAAGAACTTATGGAAAGAATGTTGGATTTGATTAAGCCAAATGA
ACCCACTAAGGCACAAAAGGAAGGCTCAGAGTCTGAACAGTTTTGGGAATTATTAGGAGGCAAATCAGAATATCCGAGCC
AAAAGATCAAAAGGGATGGAGAGAGTGATCCTCATCTCTTTTCTTGCACATATACAAACGAAAGTCTGAAGGCTACAGAG
ATCTTCAACTTCACTCAAGATGATTTGATGACTGAAGATATCTTCATACTTGATTGTCACACTGAGGTCTTTGTCTGGGT
GGGACAACAAGTTGACCCAAAGAAGAAACCTCAAGCTTTGGACATTGGAGAGAACTTTCTTAAGCATGATTTCCTCCTAG
AGAATCTAGCAAGTGAAACACCGATATACATTGTAACAGAGGGAAACGAACCTCCATTTTTCACTCGGTTCTTCACCTGG
GACTCTTCTAAATCTGGAATGCATGGAGACTCTTTCCAGAGAAAGCTTGCGATCCTGACAAATAAAGGAAAGCCGCTTCT
AGATAAACCTAAAAGGAGAGTCCCTGCATACAGTAGCAGGTCCACCGTTCCAGACAAATCGCAGCCTCGGTCCAGAAGTA
TGACTTTTAGTCCAGACAGGGCCCGCGTGAGGGGACGGTCTCCAGCTTTCAACGCACTTGCAGCAAACTTTGAGAAACTA
AATATAAGAAACCAATCAACTCCACCACCTATGGTTAGTCCAATGGTCCGGAAGCTTTACCCGAAATCTCATGCCCCAGA
CCTCTCGAAGATAGCTCCTAAATCCGCTATAGCTGCCCGTACAGCGCTTTTTGAAAAACCTACTCCTACTTCTCAAGAAC
CACCTACTAGCCCCAGTTCTTCAGAAGCTACAAACCAAGCAGAAGCACCAAAATCGACATCAGAGACAAACGAGGAAGAA
GCAATGAGCAGCATCAATGAAGACTCAAAAGAAGAAGAAGCAGAAGAAGAGAGCAGCCTCCCTACTTTCCCATATGAACG
CCTTAAAACCGACTCAGAGGATCCTGTTTCAGATGTCGACCTCACTAGAAGAGAGGCATACTTGACTTCAGTAGAGTTCA
AGGAGAAGTTTGAGATGACAAAGAATGAGTTCTACAAACTGCCTAAATGGAAACAAAACAAGCTCAAAATGTCTGTCAAT
CTCTTCTAA
Microexon DNA seq AAAGTCTGAAG
Microexon Amino Acid seq ESLK
Microexon-tag DNA Seq AGGGATGGAGAGAGTGATCCTCATCTCTTTTCTTGCACATATACAAACGAAAGTCTGAAGGCTACAGAGATCTTCAACTTCACTCAAGATGATTTGATGACTGAAGAT
Microexon-tag Amino Acid seq RDGESDPHLFSCTYTNESLKATEIFNFTQDDLMTED
Transcript ID At.26873.1
Gene ID At.26873
Gene Name VLN5
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >At.26873.1
MTFSMRDLDQALQGAGQKSGIEIWRIENFKPVTVPQESHGKFFTGDSYIVLKTTASRSGSLHHDIHYWLGKDSSQDEAGA
VAVMTVELDSALGGRAVQYREVQGHETEKFLSYFKPCIIPQEGGVASGFNHVKPEEHQTRLYICKGKHVVRVKEVPFVRS
TLNHEDVFILDTESKIFQFSGSKSSIQERAKALEVVQYIKDTYHDGKCDIAAVEDGRMMADAEAGEFWGLFGGFAPLPKK
PAVNDDETAASDGIKLFSVEKGQTDAVEAECLTKELLDTNKCYILDCGLELFVWKGRSTSIDQRKSATEAAEEFFRSSEP
PKSNLVSVMEGYETVMFRSKFDSWPASSTIAEPQQGRGKVAALLQRQGVNVQGLVKTSSSSSKDEPKPYIDGTGNLQVWR
INCEEKILLEAAEQSKFYSGDCYILQYSYPGEDREEHLVGTWFGKQSVEEDRASAISLANKMVESMKFVPAQARINEGKE
PIQFFVIMQSFITFKGGVSDAFKKYIAENDIPDTTYEAEGVALFRVQGSGPENMQAIQIEAASAGLNSSHCYILHGDSTV
FTWCGNLTSSEDQELMERMLDLIKPNEPTKAQKEGSESEQFWELLGGKSEYPSQKIKRDGESDPHLFSCTYTNESLKATE
IFNFTQDDLMTEDIFILDCHTEVFVWVGQQVDPKKKPQALDIGENFLKHDFLLENLASETPIYIVTEGNEPPFFTRFFTW
DSSKSGMHGDSFQRKLAILTNKGKPLLDKPKRRVPAYSSRSTVPDKSQPRSRSMTFSPDRARVRGRSPAFNALAANFEKL
NIRNQSTPPPMVSPMVRKLYPKSHAPDLSKIAPKSAIAARTALFEKPTPTSQEPPTSPSSSEATNQAEAPKSTSETNEEE
AMSSINEDSKEEEAEEESSLPTFPYERLKTDSEDPVSDVDLTRREAYLTSVEFKEKFEMTKNEFYKLPKWKQNKLKMSVN
LF*
CDS seq >At.26873.1
ATGACGTTTTCCATGAGAGATTTAGATCAGGCTCTTCAAGGAGCTGGCCAGAAATCCGGGATTGAAATATGGCGCATCGA
GAACTTCAAACCTGTCACTGTTCCACAAGAGTCTCATGGGAAATTTTTCACCGGGGATTCCTATATAGTCTTAAAGACCA
CAGCGTCAAGAAGCGGTTCTTTGCATCATGATATTCACTACTGGCTCGGTAAAGATTCCAGCCAGGATGAAGCAGGTGCT
GTAGCCGTCATGACTGTTGAATTAGATTCAGCATTAGGTGGACGTGCTGTTCAGTACCGAGAAGTACAGGGTCACGAAAC
CGAGAAGTTTCTTTCCTACTTCAAACCTTGCATAATACCTCAAGAAGGTGGAGTTGCTTCAGGGTTCAACCATGTAAAGC
CCGAGGAGCATCAGACGCGTCTGTATATCTGCAAAGGCAAACATGTCGTCCGTGTTAAAGAGGTTCCGTTTGTTCGATCG
ACTCTCAACCACGAAGACGTTTTTATTCTTGATACAGAGTCTAAAATTTTTCAATTCAGTGGTTCCAAGTCGAGTATTCA
AGAAAGAGCAAAAGCTCTTGAGGTTGTTCAGTACATTAAAGACACTTACCATGATGGAAAGTGCGATATCGCAGCTGTTG
AGGATGGGAGGATGATGGCTGATGCTGAAGCTGGAGAGTTTTGGGGCTTGTTTGGTGGGTTTGCTCCGCTTCCTAAGAAA
CCAGCAGTCAATGATGACGAAACCGCTGCATCTGATGGTATCAAACTTTTCAGTGTCGAGAAGGGACAGACAGATGCCGT
AGAGGCTGAGTGTTTGACGAAAGAGCTTCTGGACACTAACAAATGTTATATTCTCGATTGCGGTCTTGAATTGTTCGTTT
GGAAGGGACGAAGTACTTCAATCGACCAAAGAAAGAGCGCAACTGAAGCTGCAGAAGAATTTTTCCGTTCGTCTGAACCG
CCAAAATCAAACCTGGTCAGTGTGATGGAAGGGTATGAAACAGTGATGTTCCGATCTAAGTTTGATTCATGGCCGGCTTC
AAGTACCATAGCTGAGCCCCAACAAGGCAGAGGCAAAGTCGCAGCTCTTTTGCAGCGGCAAGGAGTTAACGTTCAAGGCC
TGGTTAAGACTTCTTCTTCTTCTTCTAAAGACGAACCGAAGCCATACATTGATGGTACAGGAAATCTCCAGGTCTGGCGA
ATCAATTGTGAAGAAAAGATCCTTCTCGAAGCAGCAGAGCAATCAAAGTTCTATAGCGGGGATTGTTATATACTCCAGTA
CTCATACCCAGGAGAAGACAGAGAGGAACATCTAGTGGGTACTTGGTTTGGCAAGCAAAGCGTTGAGGAAGACAGAGCAT
CTGCTATTTCTTTGGCAAACAAGATGGTTGAATCCATGAAGTTTGTGCCGGCTCAGGCTCGCATTAATGAGGGAAAAGAG
CCAATTCAGTTTTTTGTGATCATGCAAAGCTTCATCACATTTAAGGGTGGTGTAAGTGATGCTTTCAAAAAGTACATAGC
TGAGAACGACATCCCTGATACTACTTACGAAGCAGAAGGTGTTGCACTGTTTCGGGTTCAAGGTTCTGGACCCGAGAACA
TGCAAGCCATACAGATAGAAGCGGCTTCCGCAGGACTAAATTCTTCGCACTGTTATATATTACACGGTGATTCTACTGTT
TTCACTTGGTGTGGCAATCTTACTTCCTCGGAAGATCAAGAACTTATGGAAAGAATGTTGGATTTGATTAAGCCAAATGA
ACCCACTAAGGCACAAAAGGAAGGCTCAGAGTCTGAACAGTTTTGGGAATTATTAGGAGGCAAATCAGAATATCCGAGCC
AAAAGATCAAAAGGGATGGAGAGAGTGATCCTCATCTCTTTTCTTGCACATATACAAACGAAAGTCTGAAGGCTACAGAG
ATCTTCAACTTCACTCAAGATGATTTGATGACTGAAGATATCTTCATACTTGATTGTCACACTGAGGTCTTTGTCTGGGT
GGGACAACAAGTTGACCCAAAGAAGAAACCTCAAGCTTTGGACATTGGAGAGAACTTTCTTAAGCATGATTTCCTCCTAG
AGAATCTAGCAAGTGAAACACCGATATACATTGTAACAGAGGGAAACGAACCTCCATTTTTCACTCGGTTCTTCACCTGG
GACTCTTCTAAATCTGGAATGCATGGAGACTCTTTCCAGAGAAAGCTTGCGATCCTGACAAATAAAGGAAAGCCGCTTCT
AGATAAACCTAAAAGGAGAGTCCCTGCATACAGTAGCAGGTCCACCGTTCCAGACAAATCGCAGCCTCGGTCCAGAAGTA
TGACTTTTAGTCCAGACAGGGCCCGCGTGAGGGGACGGTCTCCAGCTTTCAACGCACTTGCAGCAAACTTTGAGAAACTA
AATATAAGAAACCAATCAACTCCACCACCTATGGTTAGTCCAATGGTCCGGAAGCTTTACCCGAAATCTCATGCCCCAGA
CCTCTCGAAGATAGCTCCTAAATCCGCTATAGCTGCCCGTACAGCGCTTTTTGAAAAACCTACTCCTACTTCTCAAGAAC
CACCTACTAGCCCCAGTTCTTCAGAAGCTACAAACCAAGCAGAAGCACCAAAATCGACATCAGAGACAAACGAGGAAGAA
GCAATGAGCAGCATCAATGAAGACTCAAAAGAAGAAGAAGCAGAAGAAGAGAGCAGCCTCCCTACTTTCCCATATGAACG
CCTTAAAACCGACTCAGAGGATCCTGTTTCAGATGTCGACCTCACTAGAAGAGAGGCATACTTGACTTCAGTAGAGTTCA
AGGAGAAGTTTGAGATGACAAAGAATGAGTTCTACAAACTGCCTAAATGGAAACAAAACAAGCTCAAAATGTCTGTCAAT
CTCTTCTAA