UCADceraas BIOINFORMATICS
Sesamum indicum Genetic Discovery Database (SiGeDiD)
Gene ID
LG
mRNA sequence
Peptide sequence
function predicted
SIN_1004850

15

+ ATGGATTTGCGGTTCCCCTATTCGCCGGCCGAGGTTGCCAAGATCCGGGT GGTCCAGTTTGGCATCCTCAGCCCGGATGAAATTAGGCAAATGTCGGTCG TGCATATAGAACATAGTGAAACGACAGAGAGAGGTAAGCCGAAGCCAGGT GGGTTGAGTGACCCTCGTCTGGGGACTATTGATAGGAAGATGAAGTGCGA GACTTGTATGGCTAACATGGCCGACTGCCCCGGCCACTTTGGTCACCTCG AGCTCGCAAAACCCATGTTTCACATTGGCTTCATGAAGACTGTGCTCAGC ATCCTTCGTTGTGTTTGCTTCAATTGCTCCAAAATTTTGGCAGACGAGGA GGATCCGAAGTTCAAGCAAGCATTGAGAATAAGAAACCCTAAGAATAGGC TAAAAAAGATATTAGATGCCTGCAAAAACAAATCCAAGTGTGAAGGAGGC GATGAGATTGATGTGCAAGGTCAAGACACTGATGAACCTGTGAAGAAGAC TAGGGGAGGGTGTGGTGCTCAGCAGCCAAAAATTTCTATTGATGGCATGA AAATGGTTGCTGAGTACAAGATTCAAAAGAAGAAGAACGATGACCCGGAG CAAATGCCTGAACCAGTCGAAAGGAAACAACAACTTTCTGCAGAGAAGGT TCTTAGCATCTTGAAGAGAATAACTGATGAAGATTGTCAATTGCTGGGTT TGAATCCAAAATATGCCCGCCCCGATTGGATGATACTTCAAGTTCTTCCT ATTCCTCCGCCTCCTGTTAGACCTTCTGTGATGATGGATACTTCTTCTAG AAGTGAGGATGATTTAACTCATCAATTGGCGATGATCATAAGGCACAATG AGAACTTAAAGAGGCAGGAGCGGAATGGGGCTCCTGCACACATCATCTCT GAGTTTGCACAGTTACTGCAGTTCCACATAGCCACCTACTTTGATAACGA ATTGCCTGGACAGCCAAGGGCTACACAAAGATCTGGTAGACCAATCAAAT CAATATGTAGCAGGCTCAAAGCAAAAGAAGGTAGGATCAGAGGTAACTTG ATGGGTAAGAGAGTTGATTTCTCAGCTCGGACTGTGATCACACCTGATCC AACCATCAACATTGACGAACTGGGAGTACCTTGGAGTATTGCCCTGAATC TGACATATCCAGAAACTGTGACGCCTTATAATATAGAAAGGTTGAAAGAG CTTGTGGAATATGGACCTCATCCTCCTCCAGGTAAAACTGGTGCTAAATA TATAATTAGGGATGATGGGCAAAGGCTTGATCTTAGATATTTGAAGAAGA GTAGTGATCAGCATCTAGAACTTGGCTATAAGGTGGAGCGCCACTTGAAT GACGGTGACTTTGTCCTGTTCAATCGGCAACCGAGTTTGCATAAAATGTC TATAATGGGGCATAAAATAAAGATTATGCCTTACTCAACCTTCCGTCTGA ATCTGTCTGTAACCTCACCCTATAATGCTGATTTTGATGGGGATGAAATG AACATGCACGTTCCCCAGTCATTTGAAACTAGAGCTGAAGTGTTGGAACT GATGATGGTTCCGAAATGCATCGTGTCACCTCAATCAAACCGGCCTGTAA TGGGTATAGTCCAGGATACACTTTTAGGTTGCCGCAAAATCACCAAAAGA GATACATTTATTGAAAAGGATGTTTTCATGAATATTCTGATGTGGTGGGA GGATTTTGATGGAAAAGTGCCTGCACCGGCAATTTTGAAACCTAGGCCAC TTTGGACTGGAAAACAAGTGTTCAATTTAATAATACCAAAGCAGATAAAC CTTTTGAGATATTCAGCATGGCATCAGGAAAGTGAAAAGGGATTTATAAC TCCTGGGGATACTCAAGTCCGAATAGAAAAAGGGGAATTGCTATCTGGCA CTCTTTGCAAGAAGACGCTTGGGACATCCACGGGTAGTCTTATACATGTT ATTTGGGAAGAGGTTGGTCCGGATGCAGCTCGAAAATTCTTGGGACATAC GCAATGGCTTGTCAACTATTGGCTTTTGCAGAATGCTTTCAGCATTGGAA TTGGTGATACAATTGCTGATGCTGCAACAATGGAAAAGATCAATGAAACT ATTTCAAATGCGAAGAATGAGGTGAAAGAACTTATTAGGGCTGCTCAAGA AAAGCAGTTAGAGGCTGAACCTGGTCGAACGATGATGGAATCATTTGAAA ACAGAGTGAACCAGGTGTTGAATAAGGCCCGTGATGATGCTGGAAGCAGT GCTCAGAAGAGCTTGTCAGAAAGTAACAATCTTAAGGCTATGGTTACTGC AGGATCTAAGGGAAGTTTTATCAACATTTCTCAGATGACTGCTTGTGTGG GGCAGCAGAATGTTGAAGGCAAACGAATTCCTTTTGGATTCATAGATCGT ACTCTGCCACACTTCACCAAAGATGATTATGGTCCAGAGAGTCGTGGGTT TGTAGAAAATTCATATCTTAGGGGTTTGACGCCTCAGGAGTTCTTCTTCC ATGCTATGGGTGGTAGGGAAGGTCTAATAGATACAGCAGTGAAAACTTCG GAGACCGGGTACATTCAGAGGCGTTTGGTAAAAGCAATGGAGGATATCAT GGTTAAGTATGATGGGACTGTCCGGAACTCCTTGGGGGATGTGATTCAGT TTCTTTACGGGGAAGATGGTATGGATGCTGTTTGGATTGAATCGCAGCCC CTAGAATCTTTGAAGCTGAAGAAGGCTGATTTTAATGATATGTACAGATA TGAAATTGATAATGCAAATTGGAATCCTAATTACATGTTGCCTGAAGCTG TTGAAGATCTGAAAACGATTCGTGAAATCCGCAGTGTATTTGATGCGGAG GTACAGAAACTTGAAGCTGACAGATACCAACTTGGGACAGAGATTGCGAC CACTGGTGACAATTCTTGGCCCCTGCCTGTTAACATTAGGAGGCTTGTCT TAAATGCACAAAAGACATTTAAGGTTGATTTTCGGCGGCCTTCTGATATG CATCCTATGGAGATTGTGGAAGCTGTTGACAAGTTACAGGAAAGGCTTAA GGTGGTGGTTGGTGATGATTATTTGAGTATGGAAGCCCAAAAGAATGCTA CTCTTTTCTTTAATATATTGCTTCGGAGTGCCTTAGCAAGTAAGAGGGTT TTAAAGGAATATAGACTTACCCGAGAAGCTTTTGAATGGGTTATTGGTGA GATAGAATCTCGTTTTCTGCAGTCACTTGTGGCTCCTGGGGAAATGATAG GGTGTGTCGCTGCACAGTCAATTGGTGAGCCTGCCACACAGATGACCCTT AATACTTTCCACTATGCTGGTGTGAGTGCGAAAAACGTCACATTAGGTGT TCCAAGGTTGAGGGAGATCATTAATGTTGCTAAAAAAATCAAGACACCCT CTCTTTCAGTATATTTGAAGCCTGATGTTAGTAAAACAAAGGAGAGGGCC AAAAATGTCCAGTGTGCTCTTGAGTACACTACTTTACGCAGTGTGACACA AGCCACAGAAGTATGGTATGATCCAGATCCCATGAGCACAATTATTGAGG AAGATGTGGAATTTGTGAAGTCCTATTATGAGATGCCTGATGAAGAGATT GACCCTGACAAAATCTCCCCTTGGTTGCTTCGCATAGAGTTGAACCGGGA GATGATGGTGGATAAGAAACTCAGCATGGCAGATATTGCCGAGAAGATAA ATCTTGAATTTGATGATGATTTGACATGTATATTCAATGATGACAATGCT GAGAAACTGATTCTTCGTATTCGTATCATGAATGATGAAGCCCCCAAGGG TGAATTGAATGATGAATCAGCTGAGGATGATGTATTTCTCAAGAAGATTG AGAGTAACATGCTCACAGAAATGGCTCTTCGAGGCATACCAGATATCAAT AAGGTGTTCATAAAGAACAGTAAGCTAAATAAGTTTGATGAGAATGAAGG ATTCAAGGCAGAGAATGAGTGGATGCTGGACACTGAAGGTGTTAACCTAC TGGCTGTCATGTGCCATGAAGATGTTGATGCAAGGAGGACAACAAGCAAT CACTTGATTGAAGTTATCGAAGTTCTAGGAATTGAGGCAGTTCGTAAAGC TTTGCTGGATGAATTGCGTGTTGTCATATCCTTTGACGGATCTTATGTGA ACTACCGACACTTGGCAATATTGTGCGATACCATGACCTATCGTGGTCAC TTGATGGCCATAACTCGTCATGGGATTAATCGCAATGATACTGGCCCGAT GATGAGATGCTCCTTCGAGGAAACGGTGGACATTCTGCTTGATGCTGCTG TCTATGCAGAAACGGACCATCTAAGGGGTGTCACTGAAAACATTATGTTG GGTCAGCTTGCACCAATTGGCACTGGAGACTGCGCCTTGTATCTGAACGA GGAGATGTTAAAACAAGCTATTGAGATCCCTTTACCCAGTTACATGGAAG GTGGTGGCCTGGAATTTGGCATGACACCTGCTCGTTCCCCACTTACAGGA ACCCCGTATCATGACGGCATGATGTCACCAAGTTATTTGCTCAGCCCAAA TCTGCGGTTATCTCCTGTTACAGATGCCCAGTTTTCTCCTTATGTTGGTG GAATGGCCTTCTCTCCTACATCATCTCCAGGTTACAGCCCGTCCTCTCCT GGATACAGCCCATCATCTCCTGGTTATAGCCCCACTTCTCCTGGTTACAG CCCCACTTCCCCTGGTTATAGCCCCACTTCCCCAACATATAGTCCCAGTT CACCTGGATATAGCCCAACAAGTCCTGCATATTCTCCAACTAGCCCATCC TACTCACCTACTTCTCCAACCTACAGCCCAACTTCTCCTAGCTATAGTCC GACTTCACCAAGTTATAGTCCTACATCTCCAAGTTACAGTCCGACATCTC CGAGCTATAGCCCAACCTCCCCTGCATACAGCCCAACATCCCCTGCATAC AGCCCAACATCCCCTGCTTACAGCCCCACTTCACCCTCTTACAGCCCAAC GTCACCCTCTTACAGCCCAACCTCTCCATCTTACAGCCCAACCTCACCTT CTTACAGTCCCACCTCTCCATCATACAGCCCAACTTCTCCGTCATATAGC CCAACCTCACCTGCCTACAGCCCGACCTCCCCTGGTTATAGCCCAACCTC TCCAAGTTACAGTCCTACCTCACCAAGTTATAGCCCTACATCTCCAAGTT ACAATCCTTCAGCAAGATACAGCCCTTCTCTTGCATATTCACCTACAAGT CCAAAGATCTCACCTTCTAGTCCTTACAGTCCGTCCTCTCCAAGTTACAG TCCAACCTCACCTTCATATTCTCCAACATCTCCATCGTATTCTCCATCTA GTCCAAGTTATAGCCCAAGCAGCCCCTACAATTCTGGTGCAAGCCCTGAT TATAGTCCTAGCTCCCCGCAGTACAGTCCAAGTGCAGGATACTCTCCTAG TGCACCAGGGTACTCCCCATCTTCAACCAGCCAGTACACTCCCCGCACAA ATGACAGAGACGACAAGAGTGTCAAGGATGAGAAGAGCAAGCGTTGA

standard] MDLRFPYSPAEVAKIRVVQFGILSPDEIRQMSVVHIEHSETTERGKPKPG GLSDPRLGTIDRKMKCETCMANMADCPGHFGHLELAKPMFHIGFMKTVLS ILRCVCFNCSKILADEEDPKFKQALRIRNPKNRLKKILDACKNKSKCEGG DEIDVQGQDTDEPVKKTRGGCGAQQPKISIDGMKMVAEYKIQKKKNDDPE QMPEPVERKQQLSAEKVLSILKRITDEDCQLLGLNPKYARPDWMILQVLP IPPPPVRPSVMMDTSSRSEDDLTHQLAMIIRHNENLKRQERNGAPAHIIS EFAQLLQFHIATYFDNELPGQPRATQRSGRPIKSICSRLKAKEGRIRGNL MGKRVDFSARTVITPDPTINIDELGVPWSIALNLTYPETVTPYNIERLKE LVEYGPHPPPGKTGAKYIIRDDGQRLDLRYLKKSSDQHLELGYKVERHLN DGDFVLFNRQPSLHKMSIMGHKIKIMPYSTFRLNLSVTSPYNADFDGDEM NMHVPQSFETRAEVLELMMVPKCIVSPQSNRPVMGIVQDTLLGCRKITKR DTFIEKDVFMNILMWWEDFDGKVPAPAILKPRPLWTGKQVFNLIIPKQIN LLRYSAWHQESEKGFITPGDTQVRIEKGELLSGTLCKKTLGTSTGSLIHV IWEEVGPDAARKFLGHTQWLVNYWLLQNAFSIGIGDTIADAATMEKINET ISNAKNEVKELIRAAQEKQLEAEPGRTMMESFENRVNQVLNKARDDAGSS AQKSLSESNNLKAMVTAGSKGSFINISQMTACVGQQNVEGKRIPFGFIDR TLPHFTKDDYGPESRGFVENSYLRGLTPQEFFFHAMGGREGLIDTAVKTS ETGYIQRRLVKAMEDIMVKYDGTVRNSLGDVIQFLYGEDGMDAVWIESQP LESLKLKKADFNDMYRYEIDNANWNPNYMLPEAVEDLKTIREIRSVFDAE VQKLEADRYQLGTEIATTGDNSWPLPVNIRRLVLNAQKTFKVDFRRPSDM HPMEIVEAVDKLQERLKVVVGDDYLSMEAQKNATLFFNILLRSALASKRV LKEYRLTREAFEWVIGEIESRFLQSLVAPGEMIGCVAAQSIGEPATQMTL NTFHYAGVSAKNVTLGVPRLREIINVAKKIKTPSLSVYLKPDVSKTKERA KNVQCALEYTTLRSVTQATEVWYDPDPMSTIIEEDVEFVKSYYEMPDEEI DPDKISPWLLRIELNREMMVDKKLSMADIAEKINLEFDDDLTCIFNDDNA EKLILRIRIMNDEAPKGELNDESAEDDVFLKKIESNMLTEMALRGIPDIN KVFIKNSKLNKFDENEGFKAENEWMLDTEGVNLLAVMCHEDVDARRTTSN HLIEVIEVLGIEAVRKALLDELRVVISFDGSYVNYRHLAILCDTMTYRGH LMAITRHGINRNDTGPMMRCSFEETVDILLDAAVYAETDHLRGVTENIML GQLAPIGTGDCALYLNEEMLKQAIEIPLPSYMEGGGLEFGMTPARSPLTG TPYHDGMMSPSYLLSPNLRLSPVTDAQFSPYVGGMAFSPTSSPGYSPSSP GYSPSSPGYSPTSPGYSPTSPGYSPTSPTYSPSSPGYSPTSPAYSPTSPS YSPTSPTYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPAYSPTSPAY SPTSPAYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYS PTSPAYSPTSPGYSPTSPSYSPTSPSYSPTSPSYNPSARYSPSLAYSPTS PKISPSSPYSPSSPSYSPTSPSYSPTSPSYSPSSPSYSPSSPYNSGASPD YSPSSPQYSPSAGYSPSAPGYSPSSTSQYTPRTNDRDDKSVKDEKSKR

IPR000684; RNA polymerase II, heptapeptide repeat, eukaryotic IPR000722; RNA polymerase, alpha subunit IPR006592; RNA polymerase, N-terminal IPR007066; RNA polymerase Rpb1, domain 3 IPR007073; RNA pol