{"ymdb_id":"YMDB00769","created_at":"2011-05-29T18:51:29.000Z","updated_at":"2016-09-08T18:35:52.000Z","name":"squalene","cas":"111-02-4","state":"Liquid","melting_point":"-75 oC","description":"Squalene is the biochemical precursor to the whole family of steroids. It is found in yeast lipids, but also in fish liver oils and many vegetable oils such as palm oil, cottonseed oil and rapeseed oil.","experimental_water_solubility":"","experimental_logp_hydrophobicity":"","location":"endoplasmic reticulum;cytoplasm","synthesis_reference":"Peng, Wanxi; Li, Kaifu.  Method of preparation of squalene.    Faming Zhuanli Shenqing Gongkai Shuomingshu  (2006), 9pp. ","chebi_id":"15440","hmdb_id":"HMDB00256","kegg_id":"C00751","pubchem_id":"1105","cs_id":"25995494","foodb_id":null,"wikipedia_link":"Squalene","biocyc_id":"SQUALENE","iupac":"(6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene","traditional_iupac":"squalene","logp":"10.422011552666666","pka":null,"alogps_solubility":"5.02e-04 g/l","alogps_logp":"8.64","alogps_logs":"-5.91","acceptor_count":"0","donor_count":"0","rotatable_bond_count":"15","polar_surface_area":"0.0","refractivity":"144.61780000000005","polarizability":"56.16810415012681","formal_charge":"0","physiological_charge":"0","pka_strongest_basic":null,"pka_strongest_acidic":null,"bioavailability":"0","number_of_rings":"0","rule_of_five":"0","ghose_filter":"0","veber_rule":"0","mddr_like_rule":"0","synonyms":["(all-E)-2,6,10,15,19,23-hexamethyl-2,6,10,14,18,22-Tetracosahexaene","(E,E,E,E)-Squalene","2,6,10,15,19,23-Hexamethyl-2,6,10,14,18,22-tetracosahexaene","all-trans-Squalene","Nikko Squalane EX","Spinacene","Squalene","trans-Squalene"],"pathways":[{"name":"Steroid biosynthesis","kegg_map_id":"00100"},{"name":"Cholesterol biosynthesis and metabolism CE(10:0)","kegg_map_id":null},{"name":"Cholesterol biosynthesis and metabolism CE(12:0)","kegg_map_id":null},{"name":"Cholesterol biosynthesis and metabolism CE(14:0)","kegg_map_id":null},{"name":"Cholesterol biosynthesis and metabolism CE(16:0)","kegg_map_id":null},{"name":"Cholesterol biosynthesis and metabolism CE(18:0)","kegg_map_id":null}],"growth_conditions":[],"references":[{"pubmed_id":21051339,"citation":"UniProt Consortium (2011). \"Ongoing and future developments at the Universal Protein Resource.\" Nucleic Acids Res 39:D214-D219."},{"pubmed_id":21062828,"citation":"Scheer, M., Grote, A., Chang, A., Schomburg, I., Munaretto, C., Rother, M., Sohngen, C., Stelzer, M., Thiele, J., Schomburg, D. (2011). \"BRENDA, the enzyme information system in 2011.\" Nucleic Acids Res 39:D670-D676."},{"pubmed_id":18846089,"citation":"Herrgard, M. J., Swainston, N., Dobson, P., Dunn, W. B., Arga, K. Y., Arvas, M., Bluthgen, N., Borger, S., Costenoble, R., Heinemann, M., Hucka, M., Le Novere, N., Li, P., Liebermeister, W., Mo, M. L., Oliveira, A. P., Petranovic, D., Pettifer, S., Simeonidis, E., Smallbone, K., Spasic, I., Weichart, D., Brent, R., Broomhead, D. S., Westerhoff, H. V., Kirdar, B., Penttila, M., Klipp, E., Palsson, B. O., Sauer, U., Oliver, S. G., Mendes, P., Nielsen, J., Kell, D. B. (2008). \"A consensus yeast metabolic network reconstruction obtained from a community approach to systems biology.\" Nat Biotechnol 26:1155-1160."},{"pubmed_id":7896706,"citation":"Loubbardi, A., Marcireau, C., Karst, F., Guilloton, M. (1995). \"Sterol uptake induced by an impairment of pyridoxal phosphate synthesis in Saccharomyces cerevisiae: cloning and sequencing of the PDX3 gene encoding pyridoxine (pyridoxamine) phosphate oxidase.\" J Bacteriol 177:1817-1823."},{"pubmed_id":10622712,"citation":"Lamb, D. C., Kelly, D. E., Manning, N. J., Kaderbhai, M. A., Kelly, S. L. (1999). \"Biodiversity of the P450 catalytic cycle: yeast cytochrome b5/NADH cytochrome b5 reductase complex efficiently drives the entire sterol 14-demethylation (CYP51) reaction.\" FEBS Lett 462:283-288."}],"proteins":[{"created_at":"2011-05-27T02:31:30.000Z","updated_at":"2011-05-27T15:01:18.000Z","name":"Squalene monooxygenase","uniprot_id":"P32476","uniprot_name":"ERG1_YEAST","enzyme":true,"transporter":false,"gene_name":"ERG1","num_residues":496,"molecular_weight":"55125.39844","theoretical_pi":"6.44","general_function":"Involved in oxidoreductase activity","specific_function":"Catalyzes the first oxygenation step in sterol biosynthesis and is suggested to be one of the rate-limiting enzymes in this pathway","reactions":[{"id":2678,"direction":"\u003e","locations":"Microsome membrane; Multi-pass membrane protein. Endoplasmic reticulum membrane; Multi-pass membrane protein","altext":"Squalene + AH(2) + O(2) = (S)-squalene-2,3-epoxide + A + H(2)O.","export":false,"pw_reaction_id":null,"source":null},{"id":14379,"direction":"\u003e","locations":null,"altext":null,"export":true,"pw_reaction_id":"PW_R006894","source":"Smpdb"}],"signal_regions":"None","transmembrane_regions":"17-37;475-495","pdb_id":null,"cellular_location":"Microsome membrane; Multi-pass membrane protein. Endoplasmic reticulum membrane; Multi-pass membrane protein","genbank_gene_id":"M64994","genbank_protein_id":"171471","gene_card_id":"ERG1","chromosome_location":"chromosome 7","locus":"YGR175C","synonyms":["Squalene epoxidase","SE"],"enzyme_classes":["1.14.99.7"],"go_classes":[{"category":"Component","description":" cell part"},{"category":"Component","description":" membrane part"},{"category":"Component","description":" intrinsic to membrane"},{"category":"Component","description":" integral to membrane"},{"category":"Function","description":" binding"},{"category":"Function","description":" nucleoside binding"},{"category":"Function","description":" purine nucleoside binding"},{"category":"Function","description":" adenyl nucleotide binding"},{"category":"Function","description":" FAD or FADH2 binding"},{"category":"Function","description":" oxidoreductase activity"},{"category":"Function","description":" monooxygenase activity"},{"category":"Function","description":" squalene monooxygenase activity"},{"category":"Function","description":" catalytic activity"},{"category":"Process","description":" oxidation reduction"},{"category":"Process","description":" metabolic process"}],"pfams":[{"name":"SE","identifier":"PF08491"}],"pathways":[{"name":"Steroid biosynthesis","kegg_map_id":"00100"},{"name":"Cholesterol biosynthesis and metabolism CE(10:0)","kegg_map_id":null},{"name":"Cholesterol biosynthesis and metabolism CE(12:0)","kegg_map_id":null},{"name":"Cholesterol biosynthesis and metabolism CE(14:0)","kegg_map_id":null},{"name":"Cholesterol biosynthesis and metabolism CE(16:0)","kegg_map_id":null},{"name":"Cholesterol biosynthesis and metabolism CE(18:0)","kegg_map_id":null}],"gene_sequence":"ATGTCTGCTGTTAACGTTGCACCTGAATTGATTAATGCCGACAACACAATTACCTACGATGCGATTGTCATCGGTGCTGGTGTTATCGGTCCATGTGTTGCTACTGGTCTAGCAAGAAAGGGTAAGAAAGTTCTTATCGTAGAACGTGACTGGGCTATGCCTGATAGAATTGTTGGTGAATTGATGCAACCAGGTGGTGTTAGAGCATTGAGAAGTCTGGGTATGATTCAATCTATCAACAACATCGAAGCATATCCTGTTACCGGTTATACCGTCTTTTTCAACGGCGAACAAGTTGATATTCCATACCCTTACAAGGCCGATATCCCTAAAGTTGAAAAATTGAAGGACTTGGTCAAAGATGGTAATGACAAGGTCTTGGAAGACAGCACTATTCACATCAAGGATTACGAAGATGATGAAAGAGAAAGGGGTGTTGCTTTTGTTCATGGTAGATTCTTGAACAACTTGAGAAACATTACTGCTCAAGAGCCAAATGTTACTAGAGTGCAAGGTAACTGTATTGAGATATTGAAGGATGAAAAGAATGAGGTTGTTGGTGCCAAGGTTGACATTGATGGCCGTGGCAAGGTGGAATTCAAAGCCCACTTGACATTTATCTGTGACGGTATCTTTTCACGTTTCAGAAAGGAATTGCACCCAGACCATGTTCCAACTGTCGGTTCTTCGTTTGTCGGTATGTCTTTGTTCAATGCTAAGAATCCTGCTCCTATGCACGGTCACGTTATTTTTGGTAGTGATCATATGCCAATCTTGGTTTACCAAATCAGTCCAGAAGAAACAAGAATCCTTTGTGCTTACAACTCTCCAAAGGTCCCAGCTGATATCAAGAGTTGGATGATTAAGGATGTCCAACCTTTCATTCCAAAGAGTCTACGTCCTTCATTTGATGAAGCCGTCAGCCAAGGTAAATTTAGAGCTATGCCAAACTCCTACTTGCCAGCTAGACAAAACGACGTCACTGGTATGTGTGTTATCGGTGACGCTCTAAATATGAGACATCCATTGACTGGTGGTGGTATGACTGTCGGTTTGCATGATGTTGTCTTGTTGATTAAGAAAATAGGTGACCTAGACTTCAGCGACCGTGAAAAGGTTTTGGATGAATTACTAGACTACCATTTCGAAAGAAAGAGTTACGATTCCGTTATTAACGTTTTGTCAGTGGCTTTGTATTCTTTGTTCGCTGCTGACAGCGATAACTTGAAGGCATTACAAAAAGGTTGTTTCAAATATTTCCAAAGAGGTGGCGATTGTGTCAACAAACCCGTTGAATTTCTGTCTGGTGTCTTGCCAAAGCCTTTGCAATTGACCAGGGTTTTCTTCGCTGTCGCTTTTTACACCATTTACTTGAACATGGAAGAACGTGGTTTCTTGGGATTACCAATGGCTTTATTGGAAGGTATTATGATTTTGATCACAGCTATTAGAGTATTCACCCCATTTTTGTTTGGTGAGTTGATTGGTTAA","protein_sequence":"MSAVNVAPELINADNTITYDAIVIGAGVIGPCVATGLARKGKKVLIVERDWAMPDRIVGELMQPGGVRALRSLGMIQSINNIEAYPVTGYTVFFNGEQVDIPYPYKADIPKVEKLKDLVKDGNDKVLEDSTIHIKDYEDDERERGVAFVHGRFLNNLRNITAQEPNVTRVQGNCIEILKDEKNEVVGAKVDIDGRGKVEFKAHLTFICDGIFSRFRKELHPDHVPTVGSSFVGMSLFNAKNPAPMHGHVILGSDHMPILVYQISPEETRILCAYNSPKVPADIKSWMIKDVQPFIPKSLRPSFDEAVSQGKFRAMPNSYLPARQNDVTGMCVIGDALNMRHPLTGGGMTVGLHDVVLLIKKIGDLDFSDREKVLDELLDYHFERKSYDSVINVLSVALYSLFAADSDNLKALQKGCFKYFQRGGDCVNKPVEFLSGVLPKPLQLTRVFFAVAFYTIYLNMEERGFLGLPMALLEGIMILITAIRVFTPFLFGELIG"},{"created_at":"2011-05-27T04:20:38.000Z","updated_at":"2011-05-27T15:01:22.000Z","name":"Squalene synthase","uniprot_id":"P29704","uniprot_name":"FDFT_YEAST","enzyme":true,"transporter":false,"gene_name":"ERG9","num_residues":444,"molecular_weight":"51719.39844","theoretical_pi":"5.75","general_function":"Involved in transferase activity","specific_function":"May regulate the flux of isoprene intermediates through the sterol pathway. Squalene synthase is crucial for balancing the incorporation of farnesyl diphosphate (FPP) into sterol and nonsterol isoprene synthesis. ERG9 is also essential for cell growth in yeast","reactions":[{"id":1980,"direction":"\u003e","locations":"cytoplasm","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":2762,"direction":"\u003e","locations":"Endoplasmic reticulum membrane; Multi-pass membrane protein","altext":"2 farnesyl diphosphate = diphosphate + presqualene diphosphate.","export":false,"pw_reaction_id":null,"source":null},{"id":2763,"direction":"\u003e","locations":"Endoplasmic reticulum membrane; Multi-pass membrane protein","altext":"Presqualene diphosphate + NAD(P)H = squalene + diphosphate + NAD(P)(+).","export":false,"pw_reaction_id":null,"source":null},{"id":14378,"direction":"\u003e","locations":null,"altext":null,"export":true,"pw_reaction_id":"PW_R006893","source":"Smpdb"}],"signal_regions":"None","transmembrane_regions":"291-311;421-441","pdb_id":null,"cellular_location":"Endoplasmic reticulum membrane; Multi-pass membrane protein","genbank_gene_id":"M63979","genbank_protein_id":"171481","gene_card_id":"ERG9","chromosome_location":"chromosome 8","locus":"YHR190W","synonyms":["SQS","SS","FPP:FPP farnesyltransferase","Farnesyl-diphosphate farnesyltransferase"],"enzyme_classes":["2.5.1.21"],"go_classes":[{"category":"Component","description":" cell part"},{"category":"Component","description":" membrane part"},{"category":"Component","description":" intrinsic to membrane"},{"category":"Component","description":" integral to membrane"},{"category":"Function","description":" transferase activity, transferring alkyl or aryl (other than methyl) groups"},{"category":"Function","description":" prenyltransferase activity"},{"category":"Function","description":" farnesyltranstransferase activity"},{"category":"Function","description":" farnesyl-diphosphate farnesyltransferase activity"},{"category":"Function","description":" catalytic activity"},{"category":"Function","description":" transferase activity"},{"category":"Process","description":" metabolic process"},{"category":"Process","description":" biosynthetic process"},{"category":"Process","description":" primary metabolic process"},{"category":"Process","description":" lipid metabolic process"},{"category":"Process","description":" lipid biosynthetic process"}],"pfams":[{"name":"SQS_PSY","identifier":"PF00494"}],"pathways":[{"name":"Steroid biosynthesis","kegg_map_id":"00100"},{"name":"Cholesterol biosynthesis and metabolism CE(10:0)","kegg_map_id":null},{"name":"Cholesterol biosynthesis and metabolism CE(12:0)","kegg_map_id":null},{"name":"Cholesterol biosynthesis and metabolism CE(14:0)","kegg_map_id":null},{"name":"Cholesterol biosynthesis and metabolism CE(16:0)","kegg_map_id":null},{"name":"Cholesterol biosynthesis and metabolism CE(18:0)","kegg_map_id":null}],"gene_sequence":"ATGGGAAAGCTATTACAATTGGCATTGCATCCGGTCGAGATGAAGGCAGCTTTGAAGCTGAAGTTTTGCAGAACACCGCTATTCTCCATCTATGATCAGTCCACGTCTCCATATCTCTTGCACTGTTTCGAACTGTTGAACTTGACCTCCAGATCGTTTGCTGCTGTGATCAGAGAGCTGCATCCAGAATTGAGAAACTGTGTTACTCTCTTTTATTTGATTTTAAGGGCTTTGGATACCATCGAAGACGATATGTCCATCGAACACGATTTGAAAATTGACTTGTTGCGTCACTTCCACGAGAAATTGTTGTTAACTAAATGGAGTTTCGACGGAAATGCCCCCGATGTGAAGGACAGAGCCGTTTTGACAGATTTCGAATCGATTCTTATTGAATTCCACAAATTGAAACCAGAATATCAAGAAGTCATCAAGGAGATCACCGAGAAAATGGGTAATGGTATGGCCGACTACATCTTGGATGAAAATTACAACTTGAATGGGTTGCAAACCGTCCACGACTACGACGTGTACTGTCACTACGTAGCTGGTTTGGTCGGTGATGGTTTGACCCGTTTGATTGTCATTGCCAAGTTTGCCAACGAATCTTTGTATTCTAATGAGCAATTGTATGAAAGCATGGGTCTTTTCCTACAAAAAACCAACATCATCAGAGACTACAATGAAGATTTGGTCGATGGTAGATCCTTCTGGCCCAAGGAAATCTGGTCACAATACGCTCCTCAGTTGAAGGACTTCATGAAACCTGAAAACGAACAACTGGGGTTGGACTGTATAAACCACCTCGTCTTAAACGCATTGAGTCATGTTATCGATGTGTTGACTTATTTGGCCAGTATCCACGAGCAATCCACTTTCCAATTTTGTGCCATTCCCCAAGTTATGGCCATTGCAACCTTGGCTTTGGTATTCAACAACCGTGAAGTGCTACATGGCAATGTAAAGATTCGTAAGGGTACTACCTGCTATTTAATTTTGAAATCAAGGACTTTGCGTGGCTGTGTCGAGATTTTTGACTATTACTTACGTGATATCAAATCTAAATTGGCTGTGCAAGATCCAAATTTCTTAAAATTGAACATTCAAATCTCCAAGATCGAACAATTCATGGAAGAAATGTACCAGGATAAATTACCTCCTAACGTGAAGCCAAATGAAACTCCAATTTTCTTGAAAGTTAAAGAAAGATCCAGATACGATGATGAATTGGTCCCAACCCAACAAGAAGAAGAGTACAAGTTCAATATGGTTTTATCTATCATCTTGTCCGTTCTTCTTGGGTTTTATTATATATACACTTTACACAGAGCGTGA","protein_sequence":"MGKLLQLALHPVEMKAALKLKFCRTPLFSIYDQSTSPYLLHCFELLNLTSRSFAAVIRELHPELRNCVTLFYLILRALDTIEDDMSIEHDLKIDLLRHFHEKLLLTKWSFDGNAPDVKDRAVLTDFESILIEFHKLKPEYQEVIKEITEKMGNGMADYILDENYNLNGLQTVHDYDVYCHYVAGLVGDGLTRLIVIAKFANESLYSNEQLYESMGLFLQKTNIIRDYNEDLVDGRSFWPKEIWSQYAPQLKDFMKPENEQLGLDCINHLVLNALSHVIDVLTYLAGIHEQSTFQFCAIPQVMAIATLALVFNNREVLHGNVKIRKGTTCYLILKSRTLRGCVEIFDYYLRDIKSKLAVQDPNFLKLNIQISKIEQFMEEMYQDKLPPNVKPNETPIFLKVKERSRYDDELVPTQQEEEYKFNMVLSIILSVLLGFYYIYTLHRA"},{"created_at":"2011-05-27T07:22:30.000Z","updated_at":"2011-05-27T15:01:28.000Z","name":"NADPH--cytochrome P450 reductase","uniprot_id":"P16603","uniprot_name":"NCPR_YEAST","enzyme":true,"transporter":false,"gene_name":"NCP1","num_residues":691,"molecular_weight":"76771.10156","theoretical_pi":"4.8","general_function":"Involved in oxidoreductase activity","specific_function":"This enzyme is required for electron transfer from NADP to cytochrome P450 in microsomes. It can also provide electron transfer to heme oxygenase and cytochrome B5","reactions":[{"id":2811,"direction":"\u003e","locations":"Endoplasmic reticulum membrane; Single-pass membrane protein","altext":"NADPH + n oxidized hemoprotein = NADP(+) + n reduced hemoprotein.","export":false,"pw_reaction_id":null,"source":null}],"signal_regions":"None","transmembrane_regions":"8-24","pdb_id":null,"cellular_location":"Endoplasmic reticulum membrane; Single-pass membrane protein","genbank_gene_id":"AY693091","genbank_protein_id":"51013633","gene_card_id":"NCP1","chromosome_location":"chromosome 8","locus":"YHR042W","synonyms":["CPR","P450R"],"enzyme_classes":["1.6.2.4"],"go_classes":[{"category":"Component","description":" Not Available"},{"category":"Function","description":" binding"},{"category":"Function","description":" nucleotide binding"},{"category":"Function","description":" FMN binding"},{"category":"Function","description":" ion binding"},{"category":"Function","description":" cation binding"},{"category":"Function","description":" metal ion binding"},{"category":"Function","description":" transition metal ion binding"},{"category":"Function","description":" iron ion binding"},{"category":"Function","description":" oxidoreductase activity"},{"category":"Function","description":" catalytic activity"},{"category":"Process","description":" oxidation reduction"},{"category":"Process","description":" metabolic process"}],"pfams":[{"name":"NAD_binding_1","identifier":"PF00175"},{"name":"Flavodoxin_1","identifier":"PF00258"},{"name":"FAD_binding_1","identifier":"PF00667"}],"pathways":[],"gene_sequence":"ATGCCGTTTGGAATAGACAACACCGACTTCACTGTCCTGGCGGGGCTAGTGCTTGCCGTGCTACTGTACGTAAAGAGAAACTCCATCAAGGAACTGCTGATGTCCGATGACGGAGATATCACAGCTGTCAGCTCGGGCAACAGGGACATTGCTCAGGTGGTGACCGAAAACAACAAGAACTACTTGGTGTTGTATGCGTCGCAGACTGGGACTGCCGAGGATTACGCCAAAAAGTTTTCCAAGGAGCTGGTGGCCAAGTTCAACCTAAACGTGATGTGCGCAGATGTTGAGAACTACGACTTTGAGTCGCTAAACGATGTGCCCGTCATAGTCTCGATTTTTATCTCTACATATGGTGAAGGAGACTTCCCCGACGGGGCGGTCAACTTTGAAGACTTTATTTGTAATGCGGAAGCGGGTGCACTATCGAACCTGAGGTATAATATGTTTGGTCTGGGAAATTCTACTTATGAATTCTTTAATGGTGCCGCCAAGAAGGCCGAGAAGCATCTCTCCGCCGCGGGCGCTATCAGACTAGGCAAGCTCGGTGAAGCTGATGATGGTGCAGGAACTACAGACGAAGATTACATGGCCTGGAAGGACTCCATCCTGGAGGTTTTGAAAGACGAACTGCATTTGGACGAACAGGAAGCCAAGTTCACCTCTCAATTCCAGTACACTGTGTTGAACGAAATCACTGACTCCATGTCGCTTGGTGAACCCTCTGCTCACTATTTGCCCTCGCATCAGTTGAACCGCAACGCAGACGGCATCCAATTGGGTCCCTTCGATTTGTCTCAACCGTATATTGCACCCATCGTGAAATCTCGCGAACTGTTCTCTTCCAATGACCGTAATTGCATCCACTCTGAATTTGACTTGTCCGGCTCTAACATCAAGTACTCCACTGGTGACCATCTTGCTGTTTGGCCTTCCAACCCATTGGAAAAGGTCGAACAGTTCTTATCCATATTCAACCTGGACCCTGAAACCATTTTTGACTTGAAGCCCCTGGATCCCACCGTCAAAGTGCCCTTCCCAACGCCAACTACTATTGGCGCTGCTATTAAACACTATTTGGAAATTACAGGACCTGTCTCCAGACAATTGTTTTCATCTTTGATTCAGTTCGCCCCCAACGCTGACGTCAAGGAAAAATTGACTCTGCTTTCGAAAGACAAGGACCAATTCGCCGTCGAGATAACCTCCAAATATTTCAACATCGCAGATGCTCTGAAATATTTGTCTGATGGCGCCAAATGGGACACCGTACCCATGCAATTCTTGGTCGAATCAGTTCCCCAAATGACTCCTCGTTACTACTCTATCTCTTCCTCTTCTCTGTCTGAAAAGCAAACCGTCCATGTCACCTCCATTGTGGAAAACTTTCCTAACCCAGAATTGCCTGATGCTCCTCCAGTTGTTGGTGTTACGACTAACTTGTTAAGAAACATTCAATTGGCTCAAAACAATGTTAACATTGCCGAAACTAACCTACCTGTTCACTACGATTTAAATGGCCCACGTAAACTTTTCGCCAATTACAAATTGCCCGTCCACGTTCGTCGTTCTAACTTCAGATTGCCTTCCAACCCTTCCACCCCAGTTATCATGATCGGTCCAGGTACCGGTGTTGCCCCATTCCGTGGGTTTATCAGAGAGCGTGTCGCGTTCCTCGAATCACAAAAGAAGGGCGGTAACAACGTTTCGCTAGGTAAGCATATACTGTTTTATGGATCCCGTAACACTGATGATTTCTTGTACCAGGACGAATGGCCAGAATACGCCAAAAAATTGGATGGTTCGTTCGAAATGGTCGTGGCCCATTCCAGGTTGCCAAACACCAAAAAAGTCTATGTTCAAGATAAATTAAAGGATTACGAAGACCAAGTATTTGAAATGATTAACAACGGTGCATTTATCTACGTCTGTGGTGATGCAAAGGGTATGGCCAAGGGTGTGTCAACCGCATTGGTTGGCATCTTATCCCGTGGTAAATCCATTACCACTGATGAAGCAACAGAGCTAATCAAGATGCTCAAGACTTCAGGTAGATACCAAGAAGATGTCTGGTAA","protein_sequence":"MPFGIDNTDFTVLAGLVLAVLLYVKRNSIKELLMSDDGDITAVSSGNRDIAQVVTENNKNYLVLYASQTGTAEDYAKKFSKELVAKFNLNVMCADVENYDFESLNDVPVIVSIFISTYGEGDFPDGAVNFEDFICNAEAGALSNLRYNMFGLGNSTYEFFNGAAKKAEKHLSAAGAIRLGKLGEADDGAGTTDEDYMAWKDSILEVLKDELHLDEQEAKFTSQFQYTVLNEITDSMSLGEPSAHYLPSHQLNRNADGIQLGPFDLSQPYIAPIVKSRELFSSNDRNCIHSEFDLSGSNIKYSTGDHLAVWPSNPLEKVEQFLSIFNLDPETIFDLKPLDPTVKVPFPTPTTIGAAIKHYLEITGPVSRQLFSSLIQFAPNADVKEKLTLLSKDKDQFAVEITSKYFNIADALKYLSDGAKWDTVPMQFLVESVPQMTPRYYSISSSSLSEKQTVHVTSIVENFPNPELPDAPPVVGVTTNLLRNIQLAQNNVNIAETNLPVHYDLNGPRKLFANYKLPVHVRRSNFRLPSNPSTPVIMIGPGTGVAPFRGFIRERVAFLESQKKGGNNVSLGKHILFYGSRNTDDFLYQDEWPEYAKKLDGSFEMVVAHSRLPNTKKVYVQDKLKDYEDQVFEMINNGAFIYVCGDAKGMAKGVSTALVGILSRGKSITTDEATELIKMLKTSGRYQEDVW"}]}