UCADceraas BIOINFORMATICS
Sesamum indicum Genetic Discovery Database (SiGeDiD)
Gene ID
LG
mRNA sequence
Peptide sequence
function predicted
SIN_1007935

15

+ ATGGGAGTTCATGGGCTTTGGGAACTCCTCGCCCCCGTTGGCCGCCGAGT CTCTGTTGAAACCCTTGCTGGGAAAAAACTCGCAATTGATGCGAGCATAT GGATGATACAGTTTATGAAGGCAATGCGAGATGAGAAGGGAGAGATGGTT CGGAATGCTCACATATTGGGATTCTTCCGTCGAATTTGCAAGCTTCTCTA CCTCAGGACTAAGCCTGTGTTCGTATTTGACGGCGGCACGCCTGCGCTTA AGCGGCGAACAGTCATTGCTCGCCGCCGCCAGCGCGAGAATGCCCAGGCT AAGATTAGAAAGACTGCGGAGAAATTACTTCTCAATCACCTTAAGGCGAT GAGGTTGAAAGAATTGGCAGCGGATCTTGAGAAGCAGAGGCAAGAGAATG ATACTAAAGGGAAAAGACCCATCATACAGGAACCAAATATTCTACAGAAC ACGGAGAAAGGAAATGATGCTGAGGCCGTTAATTACAATCAGGAAGAAGT GGATGAGATGTTGGCAGCTTCTATTGCTGTAGAGGAGAATGAGGGTTTTA GTTTTGATGCATCGACATCAGGTGCTGGTGATCCAGATAACAAAGTATTT GATGGAGAAGAGGATGAGGATGATGATGAAGATGAAGAAATGATACTACC AGAGATGCATGGAAAAGTCGATCCTGCTGTCTTGGCAGCTCTTCCTCCAT CCATGCAACTTGACCTCCTTGTTCAGATGAGAGAGAGACTGATGGCTGAG AACAGGCAGAAGTATCAAAAAGTGAAAAAGGCTCCAGCAAGGTTTTCAGA ACTACAAATACAGGCTTATCTCAAAACAGTGGCTTTCCGCCGTGAGATAG ATGGGGTACAGAAGTCTGCTGCAGGGAGGGGAATAGGTGGCGTGCAGACT TCACGAATTGCATCTGAATCTAAAAGAGAATTTATTTTCTCCTCATCATT TACTGGGGACAAACAAGCCCTTACATCTGTTGGACAAGAGGTCGTAGGAG CTGATCAAAGTCAGCCTGAGCCAGTTAATTGCTCTACCGATGCTGTCAAT GAAATTCTGTCGACCAGTGGTGCAGTTGGACCAACAGTGGTTGAAACTGA AAAGGCGTTTCATGATGATGTGGAGACATATCTGGATGAGAGAGGTCGTG TTCGGGTCAGTAGGGTAAGAGCGCTGGGTATTCGTATGACTCGAGACCTG CAGAGGAATTTAGATTTGATGAAGGAGATTGATCAGGAGAAAGCAGATAC AGATCAAGAGAAAAATAAGGAATCCACCACTGCAGAACTGGTTGATGACC TAGAAAGGTCATCTGACAGGATCCAGCATCGAGAAGTTTCTGATAAGATT AATAACAGAATGAATGATGAAATTGACAAAACAGATGAGCCTGCAGTGGT AAATGGAACTTCTATCGAGATTTCCTTTGAAGATACATTGGATAACGAAT GTGACAATGATGATGATAAGTTGTTTGCTAGTCTGGTGGCCGGAAACCCC GTAATGGACTTCGCTGTTGATAATTCTGCCTCAGTGAAACAAACTTCAGA CCACTCTGCTTCAGATTTTGAGTGGGAGGAAGGTGTCATTGAAGAGAAAA GGTCGGCGTATCTGTTCGAAGGAGGCATGAGAGGTGAGGGAGAAGTAGAG TGGGAGGAAGAAGTTCAGGACATTCAATTGAAGTCCTCATCTTGTCCAGA CGAAAGCCAGAAAACTGTTAGAAAGGGTGCTCTGCAGGAAGAATCTGATA TTCAGGAGGCAATTAGAAGAAGTCTTGAGGATACAAGGGGTTGCAGATCC ATGAATAACTTTCATGAAAATAGCATATGCGAAAGAGGTAGAGAAGTGGT TACCAAAGAGCACATGACACATGCATGTCAGGTGCAGTCTGTTTATGAAG GGAAAGAAGGCCCTGAGGTTGATGCCTCAATGATCAATGTAAGGCAACCA TTTGGCTCATCGAATATCCTTGAGAACAACTGTTCAGAAGCTAAATCTGC TGCGTTCATGGATTTGAAACATGAAAAAAGCAGCTTAGACCTGAAGCTTT CGAGTGAAGATGCTGGCATAAGCGGAAATTTGACTGGAGAAAAACTTGTG ACTCCAGATACTATTCCTGAAGAAGAGGAGTTGTGTGTGACTGAGAAGCA GCCAATAGACACTTGTAGTGAAGATGGCAATAGCCATGCAGCAAACAAAC TAGAGGATACTTGTAGCGGGCTGGCTGCTCATAATGTTAGTGGCTCTGCG TTTAGTTCTGTTATTCATGAACTGAATGACAGAGCCCTTGACTCAGGTTC TGCTGATGCTCAACACATGTTTCAGGCAGCATCAGACGACCATGCTTGTG ACACTGCAAAAATTGGAAAAATTTCAACTGATGATTCAATAACTGATTTA GATGGTGTGAAAGATTTGGGCAAGGAGAAAATTTATGGTAACTTTTCAAT GGAAAAGGAGGAAACTACGAGAAATTCTTCATTTATGGATGATGACAAGG AGCAGGAGATCATGGAGGCTCATCTGGAGGAAGAAATGCTGTTTCTAGGT ATAGAACGTGAAGAGCTAGGAAGTGAGCAACGGAAACTGGAGAGAAATGC AGAATCAGTAAGCAATGAAATGTTTGCTGAATGCCAGGAGTTGCTTCAAA TGTTTGGCATACCATATATTATTGCACCAATGGAAGCTGAAGCTCAATGT GCTTTCATGGAGCAATCAAATCTTGTTGATGGTGTGGTCACAGATGACTC TGATGCATTCTTGTTTGGAGCACGAAGTGTGTACAAAAATATCTTTGATG ATCGCAAATATGTGGAAACATACTTAATGAAGGACATCGAAAATGAGCTT GGGTTAGATCGAGAAAAATTAATCCATATGGCACTGCTTCTTGGGAGTGA TTATACCGAAGGCATAAGTGGCATTGGAATTGTCAATGCAATTGAGGTTG TAAATGCATTTCCTAAGAAAGATGGTCTTCGTGAATTCCGAGAATGGATT GAATCGCCAGATCCCACTATTCTTGGAAAATTAGATGTGGAAGCAGGGGG TAACTCAAGAAGGAAAGGATCAAAAGGTAGTGAGAGTATGATGGGTGGCT CAAGTAGCAACACAGAAGGGAGGTCTTGTGATCAAAGTGAACCACAACCT GTGGATGAGGCCAAAAGGATAAAGCAGATTTTCATGGATAAGCATAGAAA TGTGAGCAAAAACTGGCATATTCCTGCTACTTTTCCAAGTGATGCTGTGA TTTTAGCATATGCTTCTCCACAGGTGGATAAATCAACAGATCCGTTTTCA TGGGGAAAGCCGGATCTTTTTGTGCTTCGCAAGTTGTGCTGGGAGAAGTT TGGGTGGGGCATGTCAAAATCAGATGAATTGCTGTTGCCAGTTCTAAAGG AGTACAACAAGCATGAGACGCAATTGAGATTGGAAGCATTCTACACATTC AATGAGAGATTTGCCAAGATCCGAAGTAAGAGAATAAAGAAGGCTGTCAA AGGAATAACAGGGGAGAAATCTTCTGACTTGATGGATGACACTACACCAC AATCTGGGAGTGGAAAGAAACGAAAAGTGAGGCCTAGTGAGAATGAAGCC AACCAATCAGGAGGAGGTTCAGAAGGATTGGATGGTTGCGGTACTAGTGA TAACACCATAAAAAAAACAACTGTGAGGCGGTTAAAAGGAGGACAGACAA AAGAAAAGACTTCGCGGAGGAACTTAGAACTATCAACTAATGTAGATAAC CATCTTCTCACCAGGAAAGAATCGCATATAAGAGGACATCTTAGTGGAAA AGGGAGGAGGAAACAAAGGAATTCTTCTGGTGAAGATACTGAAACTGGCA GTGATGATGGCACCTATAGTGGAAGTGACAAGGAAAAGCAACTTGACATA TCGAAAGAATCTTTCCAAGTACGACGGTCCGGGCGAATTAGAAAGACTGT GAATTATACTGTTGCTGATGTATTTGACAACTGTGAAGAGGAGAGTCCTA ATTGCCTCGAAGAAGGTGCTGTGACTAAGGAGTCGTTAATGGATCAAGTA GTTGGTAGTGTTGATAAGAGCAATGTAAATGAACACAAAGAAGGTAATGA TGTTGGGATGGGAAGTAGACTTTGTGTGGATGAAACTGAACAAGCATCCA GGATGGATGAGATGAGGACCAGTCAGTTTAGTGATAGTCAAATTGATGAT CCTGTCAACCAGAGCCATTTATCGAAAGATTATCTGCAGTTTGGGGGTGG ATTCTGCATGGAGGAGGATGAAGAGGTGGAGCTTAATGGGCACGCATCAT GTCCTCCAAAAGAAGTGATCCCTGAGAAACCAGATGTTCTTGATAGTGTT GATTCAGCAGTGGAAGAAAATCATGAAAGTGATCGGTCGATTATAACTCC AATACCATCGTATGGGAAGACAGAGTCGTCTGATACTGAGGTAATTATGG CCGATAAAAGGCCAAGTGATGATATTTCATCAGATGGTACCAAGAATGAT CCTGGGAGTTCCTTACCAAAATCTCTTCGAGCAATGCCTAATCTGAGAAG GAAGAAGAGAAAAACTTGA

standard] MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMIQFMKAMRDEKGEMV RNAHILGFFRRICKLLYLRTKPVFVFDGGTPALKRRTVIARRRQRENAQA KIRKTAEKLLLNHLKAMRLKELAADLEKQRQENDTKGKRPIIQEPNILQN TEKGNDAEAVNYNQEEVDEMLAASIAVEENEGFSFDASTSGAGDPDNKVF DGEEDEDDDEDEEMILPEMHGKVDPAVLAALPPSMQLDLLVQMRERLMAE NRQKYQKVKKAPARFSELQIQAYLKTVAFRREIDGVQKSAAGRGIGGVQT SRIASESKREFIFSSSFTGDKQALTSVGQEVVGADQSQPEPVNCSTDAVN EILSTSGAVGPTVVETEKAFHDDVETYLDERGRVRVSRVRALGIRMTRDL QRNLDLMKEIDQEKADTDQEKNKESTTAELVDDLERSSDRIQHREVSDKI NNRMNDEIDKTDEPAVVNGTSIEISFEDTLDNECDNDDDKLFASLVAGNP VMDFAVDNSASVKQTSDHSASDFEWEEGVIEEKRSAYLFEGGMRGEGEVE WEEEVQDIQLKSSSCPDESQKTVRKGALQEESDIQEAIRRSLEDTRGCRS MNNFHENSICERGREVVTKEHMTHACQVQSVYEGKEGPEVDASMINVRQP FGSSNILENNCSEAKSAAFMDLKHEKSSLDLKLSSEDAGISGNLTGEKLV TPDTIPEEEELCVTEKQPIDTCSEDGNSHAANKLEDTCSGLAAHNVSGSA FSSVIHELNDRALDSGSADAQHMFQAASDDHACDTAKIGKISTDDSITDL DGVKDLGKEKIYGNFSMEKEETTRNSSFMDDDKEQEIMEAHLEEEMLFLG IEREELGSEQRKLERNAESVSNEMFAECQELLQMFGIPYIIAPMEAEAQC AFMEQSNLVDGVVTDDSDAFLFGARSVYKNIFDDRKYVETYLMKDIENEL GLDREKLIHMALLLGSDYTEGISGIGIVNAIEVVNAFPKKDGLREFREWI ESPDPTILGKLDVEAGGNSRRKGSKGSESMMGGSSSNTEGRSCDQSEPQP VDEAKRIKQIFMDKHRNVSKNWHIPATFPSDAVILAYASPQVDKSTDPFS WGKPDLFVLRKLCWEKFGWGMSKSDELLLPVLKEYNKHETQLRLEAFYTF NERFAKIRSKRIKKAVKGITGEKSSDLMDDTTPQSGSGKKRKVRPSENEA NQSGGGSEGLDGCGTSDNTIKKTTVRRLKGGQTKEKTSRRNLELSTNVDN HLLTRKESHIRGHLSGKGRRKQRNSSGEDTETGSDDGTYSGSDKEKQLDI SKESFQVRRSGRIRKTVNYTVADVFDNCEEESPNCLEEGAVTKESLMDQV VGSVDKSNVNEHKEGNDVGMGSRLCVDETEQASRMDEMRTSQFSDSQIDD PVNQSHLSKDYLQFGGGFCMEEDEEVELNGHASCPPKEVIPEKPDVLDSV DSAVEENHESDRSIITPIPSYGKTESSDTEVIMADKRPSDDISSDGTKND PGSSLPKSLRAMPNLRRKKRKT

IPR001044; Xeroderma pigmentosum group G protein IPR006084; DNA repair protein (XPGC)/yeast Rad IPR006085; XPG N-terminal IPR006086; XPG/RAD2 endonuclease IPR008918; Helix-hairpin-helix motif, class 2