UCADceraas BIOINFORMATICS
Sesamum indicum Genetic Discovery Database (SiGeDiD)
Gene ID
LG
mRNA sequence
Peptide sequence
Predicted function
SIN_1011081

11

+ ATGGTGGGAGTGATGGCGGCGGTTTCCAGAGCGAAACTCGCTAGCCGAAT CAACTCAATTAAGCAAAGCGGTTCCGCTGACATGGGGACGAAGCTGGACA AGTTGCGGAGGCTGCGTGACGAGCTCTTAGCCGCGGATTCTGTGCTGCTC GTGGACTTCCTCTCCCCTATTCTTGACCTCCTTTCAGACCGTTCTAGCCC AGTCCGCAAATTCATTATTCAGATGATTGGTGAAATTGGTTTGAAGCACT CGGAGTTGTTACCTGATATTATACCAGCGCTGATAGCTGCTTTGAAAGAT GATACACCAGCTGTTGCTCGACAGGCCATCACATGCGGTGTTGATATATT CCGCTGTTCTCTGGTCAAAGTTGCCATCCAGGGCTTGTACTCAAGTGAGT TCAACGAATCACTGAAATCATCATGGGAATGTGTGCTAAAGTTTAGAGAT GAAATATACTCCATGGCCTTTAAGGTGGGGAATGATGGTAGAAGATTGCC TGCATTGAAGTTTGTGGAATCAATGGTGCTACTTTATACCCCTGATCCTA ATGGCTCTTTGGAGCCACCACCTGATCATGTTTCTGAAGGGAAGTTTGAA GAATTCAATGTCTCATGGCTGCGCGGTGGACATCCAATACTCAATGTCAG AGATCTATCAGCCGAAGCAAGCCAAAATTTGGGTCTACTGCTGGATCAGC TGAGATTCCCTTCTCTGAAATCACATAGTTATTTGGTGATGATTGTGCTA ATTAAAAGTCTTTCAACAGTTGCAAGGAAGAGGCCTGCCTTTTATGGTCG CATACTGCCAGTTTTACTTGGATTGGACCCTTCAAGCTGTACTAGCAAAG GCTTGCATCTTGCTGGGGTTCATCATGCTCTGAGAAGTGCTTTTGAGTCT TGTTTGAACTGTACACACCCTGGTGCTGCACCGTGGCGGGATCGCCTTGT TAGTGCACTTAAAGAAATTAAAGTTGGAAGACCAACTGAACAGGCTAGAA ATGAAATATCAGAGAACAAGGGAAGAGAAGAGTGGCCAGGGGATGCATAT GTGGTTCAAATTCATGAGAATGAAAAACCTTCTGTAGCATTTGTAACTGA ACATAAAAATGCTGGTAGAAAAAGAACTGGAGTGCTAGATTCATCAGAAT TTACTCAGGATGACATGTCTGGAAAGCGTGCCAGGTCAACACCTGACAAC TTAAAGGAACCAGGACATGAAATCAGCGGGAGACAGGAGGGGGTCTCTTC TAGTGGGCAAACACCATCCAGAGAGGATTCAGATAGTGGACCTGTACAGC AGCTTGTGGCAATGTTTGCTGCATTGGTTGCTCAGGGCGAGAAAGCCAGT GCTTCTTTGGAGATTCTCATCTCTAGCATATCAGCTGACTTGTTAGCTGA AGTAGTTATGGTTAATTTGCGGAATCTTCCTCTGCAAACCCCTACATCTG AAGCGGATGAAGAGCCACTTACAGATATGGTTGCCTTTCCTGATACCCAC ATCAAACATTTGTCTTTGTTGCTTAGGGACATACTTTCAGAATCTATTCC TCTGGAGAAAGAAACGGGAACCGAGGATCCTCATCACTCGCAAACTCAAG AAGAAGAGGAACCGCCGGCGACTATTGCTGATAGTAATGTTGCATATGAT GATTTGAATCGTGCAAGGCAAGAAACAGTGCATGTCAATGAGTCTGTTTC CCCTGAAGAGATTCCATCTGCAATGGAGGCTGGTTATGGAGCGATCACTT CTGTAGTCATTGAAAATGAAGGTGTGGGAAATGAAATACCTGGACTTGCT TTATCCACTGAGGATGATGCTTTGCCTGAAGATGCTGCTGTTTTTCCAAG GGCCTTGACTGAGTTAGAAGATGCCAATCTGACTGATTTAAATGATGCCA ATCAGGAAACATTTACTAATTTAGGTAGGATGCCAATAGAATTAGACAAG 
ACGCAGATAGAATTAGCTCAATCATTCTCTACCGATCGGTCTGAGGAGCT TAGCCCTAAAGCAGCAATCACGGACACAAATAACATGAATTCCTCTACTG CAACTTCTGTTGGGTTATCTTCCCAGTTGGTTCTGCCCAAGATATCTGCC CCTGTTATCTGCCTTGCGGATGAACAAAAAGACCAGTTACAACAACTGGC TTTTGTGCGCATTGTTGATGCTTATAAGCAGGTTACTGTTGCTGGAGGGT CTGAAGTCCGTTTTTCGATTCTTGCCCATTCAGGAATGGAGTTTCCATTG GAGCTCGACCCATGGAAGTTACTGAAAACACATATACTGTCAGACTATGT AAATCATGAGGGGCACGAGTTAACATTGCGCGTGCTATACAGGTTGTTTG GTGAGGCAGAAGAAGACCGTGACTTTTTTACTTCGACAACTGCTACATCT GTATATGAAACTTTTCTTCTCCAAGTGGCAGAAACACTGAGAGATTCTTT TCCAGCTTCTGACAAATCTTTGAGTAGATTGCTTGGTGAAGTCCCATATC TGCCAAAGTCGATATTTGAAATGTTGGAGTCCTTATGTTCTCCTGGTAGT AGTGATAATGATGATAGGGAGATGCAGGGTGGAGATCGAGTTACCCAAGG GCTGAGTACTGTATGGAGCCTGATTTTGACGAGACCTCCTATTCGAGATG CTTGCCTCAAAATTGCTTTGAAGAGTGCAGTTCATCACTTGGAAGAAGTA CGAATGAAAGCTATACGTCTGGTGGCGAATAAGCTTTATCCTTTATCATC CATATCTGAAAAAATCGAAGATTTTGCCAAGGAAATGTTGCTATCAGTTG TAGGTGATAATCAAATTGAAGTGGAAAAAGAGGCTGATGGAATCCATGCT GAACTACAAAAGGATGAGAATCCTTCAAGTGAAAAACAGTCGGTGAGTTT GGCAGTTAAAGAGATCGCCGTTGGTAATCATCAGAACTCAGCATCTGAAA GCATTCCATTGTCTATGATAGCTGAGAAACATTCCCTTTTCCGGCAAATA TTTGATGTCTATAAAGGCACATCCAAGGCCGCAAAGCAGGCAGTTCATCA TCAAATCCCCTTACTTGTTCGAACTATTGGCTCGTCCAGAGAGCTCCTTG ATATTTTATCAGATCCACCAACTGGAAGTGAAGGGCTTATAACTCAGGTT GTGCATACACTTACAGATGGAACAGTTCCTTCTCCGGATTTATTAACTAC TGTTAAGAGGTTATACGATACAAAGCTAAAGGATATAGATATTCTTATTC CAATATTAGCATTCCTCCCAAAAGATGAGGTTTTACTCCTTTTCCCTCAG CTTGTTAATGCACCTTTGGATAAGTTCCAAGTTGCGCTTACTCGTGTTCT TCAGGGGCTAAATCATTCTCCACCAGTGCTCACTCCAGCCGAAGCACTAA TTGCGATCCATGGGATTGATCCTGATAGAGATGGAATTCCATTGAAAAAG GCAATGATTGTCACAGATGCCTGCAATGCTTGTTTTGAGCAGCGGCATAT ATTTTCTCAGCAAGTTCTGGCCAAAGTCTTGAATCAGTTGGTTGAGCAAA TTCCTCTTCCCTTGTTATTTATGCGCACGGTATTGCAGGCAATTGGTGCT TTTCCTTCTTTGGTGGAATTCATTATGGAGATCCTTTCCCGTCTTGTAAG CAAGCAGATATGGAAATATCCAAAGCTGTGGGTGGGATTCGTGAAGTGCG CTCTTTTGACAAAGCCACAGTCTTTCAGTGTTCTGCTTCAGCTACCTACA GCACAGCTTGAAAATGCTTTAAATAGAACTCCTGCTCTCAAGGCTCCCTT AGTTGCCCATGCAAGCCAACCTCACATAAGATCTTCGCTTCCAAGGTCTA CCTTAGTGGCTCTTGGCTTAGTGTCAGAGCCTCAAACGTCTAATCAGACA 
CAGCCAACTCAAACTCAGACTGCAGAGACAGGCAATTCAGAGATGGAAGC AGCGACGGATAAGTCGAAAGAATCATCTACTGCCAGTTGA

standard] MVGVMAAVSRAKLASRINSIKQSGSADMGTKLDKLRRLRDELLAADSVLL VDFLSPILDLLSDRSSPVRKFIIQMIGEIGLKHSELLPDIIPALIAALKD DTPAVARQAITCGVDIFRCSLVKVAIQGLYSSEFNESLKSSWECVLKFRD EIYSMAFKVGNDGRRLPALKFVESMVLLYTPDPNGSLEPPPDHVSEGKFE EFNVSWLRGGHPILNVRDLSAEASQNLGLLLDQLRFPSLKSHSYLVMIVL IKSLSTVARKRPAFYGRILPVLLGLDPSSCTSKGLHLAGVHHALRSAFES CLNCTHPGAAPWRDRLVSALKEIKVGRPTEQARNEISENKGREEWPGDAY VVQIHENEKPSVAFVTEHKNAGRKRTGVLDSSEFTQDDMSGKRARSTPDN LKEPGHEISGRQEGVSSSGQTPSREDSDSGPVQQLVAMFAALVAQGEKAS ASLEILISSISADLLAEVVMVNLRNLPLQTPTSEADEEPLTDMVAFPDTH IKHLSLLLRDILSESIPLEKETGTEDPHHSQTQEEEEPPATIADSNVAYD DLNRARQETVHVNESVSPEEIPSAMEAGYGAITSVVIENEGVGNEIPGLA LSTEDDALPEDAAVFPRALTELEDANLTDLNDANQETFTNLGRMPIELDK TQIELAQSFSTDRSEELSPKAAITDTNNMNSSTATSVGLSSQLVLPKISA PVICLADEQKDQLQQLAFVRIVDAYKQVTVAGGSEVRFSILAHSGMEFPL ELDPWKLLKTHILSDYVNHEGHELTLRVLYRLFGEAEEDRDFFTSTTATS VYETFLLQVAETLRDSFPASDKSLSRLLGEVPYLPKSIFEMLESLCSPGS SDNDDREMQGGDRVTQGLSTVWSLILTRPPIRDACLKIALKSAVHHLEEV RMKAIRLVANKLYPLSSISEKIEDFAKEMLLSVVGDNQIEVEKEADGIHA ELQKDENPSSEKQSVSLAVKEIAVGNHQNSASESIPLSMIAEKHSLFRQI FDVYKGTSKAAKQAVHHQIPLLVRTIGSSRELLDILSDPPTGSEGLITQV VHTLTDGTVPSPDLLTTVKRLYDTKLKDIDILIPILAFLPKDEVLLLFPQ LVNAPLDKFQVALTRVLQGLNHSPPVLTPAEALIAIHGIDPDRDGIPLKK AMIVTDACNACFEQRHIFSQQVLAKVLNQLVEQIPLPLLFMRTVLQAIGA FPSLVEFIMEILSRLVSKQIWKYPKLWVGFVKCALLTKPQSFSVLLQLPT AQLENALNRTPALKAPLVAHASQPHIRSSLPRSTLVALGLVSEPQTSNQT QPTQTQTAETGNSEMEAATDKSKESSTAS

IPR021850; Protein of unknown function DUF3453
IPR022075; Symplekin tight junction protein C-terminal