{"ymdb_id":"YMDB00758","created_at":"2011-05-29T18:50:30.000Z","updated_at":"2016-09-08T18:35:52.000Z","name":"2-formamido-N(1)-(5-phospho-D-ribosyl)acetamidine","cas":"6157-85-3","state":"Solid","melting_point":null,"description":"2-(Formamido)-N1-(5-phospho-D-ribosyl)acetamidine is an intermediate in purine metabolism. The production of this metabolite from N2-formyl-N1-(5-phospho-D-ribosyl)glycinamide is catalyzed by the enzyme phosphoribosylformylglycinamidine synthase [EC:6.3.5.3]. [Biocyc PWY-6277]","experimental_water_solubility":null,"experimental_logp_hydrophobicity":null,"location":"Cytoplasm","synthesis_reference":null,"chebi_id":"18413","hmdb_id":"HMDB06211","kegg_id":"C04640","pubchem_id":"9552078","cs_id":"21864854","foodb_id":null,"wikipedia_link":null,"biocyc_id":"5-PHOSPHORIBOSYL-N-FORMYLGLYCINEAMIDINE","iupac":"{[(2R,3S,4R)-3,4-dihydroxy-5-(2-formamidoethanimidamido)oxolan-2-yl]methoxy}phosphonic acid","traditional_iupac":"FGAM","logp":"-4.139211972332413","pka":"6.2179514527651865","alogps_solubility":"8.65e+00 g/l","alogps_logp":"-2.58","alogps_logs":"-1.56","acceptor_count":"9","donor_count":"7","rotatable_bond_count":"6","polar_surface_area":"181.42999999999998","refractivity":"72.72009999999999","polarizability":"26.409320837922355","formal_charge":"0","physiological_charge":"-1","pka_strongest_basic":"7.381668547772353","pka_strongest_acidic":"1.225940476160587","bioavailability":"1","number_of_rings":"1","rule_of_five":"0","ghose_filter":"0","veber_rule":"0","mddr_like_rule":"0","synonyms":["1-(5-phosphoribosyl)-n-formylglycinamidine","1-(5'-Phosphoribosyl)-N-formylglycinamidine","2-(formamido)-N(1)-(5-phospho-D-ribosyl)acetamidine","2-(formamido)-N(1)-(5'-phosphoribosyl)acetamidine","2-(Formamido)-N1-(5-phospho-D-ribosyl)acetamidine","2-(formamido)-n1-(5-phosphoribosyl)acetamidine","2-(Formamido)-N1-(5'-phosphoribosyl)acetamidine","5-phosphoribosyl-n-formylglycinamidine","5-phosphoribosylformylglycinamidine","5'-Phosphoribosyl-N-formylglycinamidine","5'-Phosphoribosylformylglycinamidine","FGAM"],"pathways":[{"name":"Purine metabolism","kegg_map_id":"00230"},{"name":"purine nucleotides de novo biosynthesis","kegg_map_id":null}],"growth_conditions":[],"references":[{"pubmed_id":18846089,"citation":"Herrgard, M. J., Swainston, N., Dobson, P., Dunn, W. B., Arga, K. Y., Arvas, M., Bluthgen, N., Borger, S., Costenoble, R., Heinemann, M., Hucka, M., Le Novere, N., Li, P., Liebermeister, W., Mo, M. L., Oliveira, A. P., Petranovic, D., Pettifer, S., Simeonidis, E., Smallbone, K., Spasic, I., Weichart, D., Brent, R., Broomhead, D. S., Westerhoff, H. V., Kirdar, B., Penttila, M., Klipp, E., Palsson, B. O., Sauer, U., Oliver, S. G., Mendes, P., Nielsen, J., Kell, D. B. (2008). \"A consensus yeast metabolic network reconstruction obtained from a community approach to systems biology.\" Nat Biotechnol 26:1155-1160."},{"pubmed_id":8600995,"citation":"Tret'iakov, O. I. u., Ryzhova, T. A., Velichutina, I. V., Kostikova, T. R., Miasnikov, A. N., Smirnov, M. N., Domkin, V. D. (1995). \"[Glycine amide ribonucleotide synthetase (EC 6.3.4.13)--is aminoimidazole ribonucleotide synthetase (EC 6.3.3.1) from Saccharomyces cerevisiae].\" Biokhimiia 60:2011-2021."}],"proteins":[{"created_at":"2011-05-24T19:21:34.000Z","updated_at":"2011-05-27T14:55:57.000Z","name":"Phosphoribosylformylglycinamidine synthase","uniprot_id":"P38972","uniprot_name":"PUR4_YEAST","enzyme":true,"transporter":false,"gene_name":"ADE6","num_residues":1358,"molecular_weight":"148904.0","theoretical_pi":"4.92","general_function":"Involved in catalytic activity","specific_function":"ATP + N(2)-formyl-N(1)-(5-phospho-D- ribosyl)glycinamide + L-glutamine + H(2)O = ADP + phosphate + 2- (formamido)-N(1)-(5-phospho-D-ribosyl)acetamidine + L-glutamate","reactions":[{"id":1889,"direction":"\u003e","locations":"cytoplasm","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":2248,"direction":"\u003e","locations":"Cytoplasm","altext":"ATP + N(2)-formyl-N(1)-(5-phospho-D-ribosyl)glycinamide + L-glutamine + H(2)O = ADP + phosphate + 2-(formamido)-N(1)-(5-phospho-D-ribosyl)acetamidine + L-glutamate.","export":false,"pw_reaction_id":null,"source":null}],"signal_regions":"None","transmembrane_regions":"None","pdb_id":null,"cellular_location":"Cytoplasm","genbank_gene_id":"Z72846","genbank_protein_id":"1323079","gene_card_id":"ADE6","chromosome_location":"chromosome 7","locus":"YGR061C","synonyms":["FGAM synthase","FGAMS","Formylglycinamide ribotide amidotransferase","FGARAT","Formylglycinamide ribotide synthetase"],"enzyme_classes":["6.3.5.3"],"go_classes":[{"category":"Component","description":" Not Available"},{"category":"Function","description":" ligase activity"},{"category":"Function","description":" ligase activity, forming carbon-nitrogen bonds"},{"category":"Function","description":" carbon-nitrogen ligase activity, with glutamine as amido-N-donor"},{"category":"Function","description":" phosphoribosylformylglycinamidine synthase activity"},{"category":"Function","description":" catalytic activity"},{"category":"Process","description":" nitrogen compound metabolic process"},{"category":"Process","description":" cellular nitrogen compound metabolic process"},{"category":"Process","description":" nucleobase, nucleoside, nucleotide and nucleic acid metabolic process"},{"category":"Process","description":" nucleobase, nucleoside and nucleotide metabolic process"},{"category":"Process","description":" nucleoside phosphate metabolic process"},{"category":"Process","description":" nucleotide metabolic process"},{"category":"Process","description":" purine nucleotide metabolic process"},{"category":"Process","description":" purine nucleotide biosynthetic process"},{"category":"Process","description":" purine nucleoside monophosphate biosynthetic process"},{"category":"Process","description":" purine ribonucleoside monophosphate biosynthetic process"},{"category":"Process","description":" IMP biosynthetic process"},{"category":"Process","description":" metabolic process"},{"category":"Process","description":" 'de novo' IMP biosynthetic process"}],"pfams":[{"name":"AIRS","identifier":"PF00586"},{"name":"AIRS_C","identifier":"PF02769"}],"pathways":[{"name":"Purine metabolism","kegg_map_id":"00230"}],"gene_sequence":"ATGACTGATTATATTTTGCCGGGTCCCAAGGCCTTATCTCAGTTCAGAGTCGATAATCTAATTAAAGATATAAACTCCTATACAAACAGTACTTCTGTCATCAATGAATTGCGTTCGTGTTACATTCACTATGTCAACGGCATCGCTCAAAATTTGTCTGAACAGGACACTAAATTGCTAGAAGTTTTGTTGACTTACGATTCTGCTTTAGATATTGCTAACGATCCATTAGCAAGACAATTAAACGATGCTGTCGCTAATAATTTACCCAGTTCAGCTCTTGGCGAAGACACATATTTGATTAGAGTTGTTCCTAGATCAGGCACTATCTCTCCTTGGTCTTCCAAGGCTACTAATATTGCTCATGTATGCGGGCTACAAGACAAAGTTCAACGTATTGAAAGAGGTTTAGCCTTACTCATAAAGACTGTTCCAGGTTTCCCTCTTTTGGAAAATCTAAATGATATTTCATTGAAGTGTGTCTACGATAGGATGACACAACAATTATATCTGACCGAACCACCAAATACGATGAGTATTTTCACACATGAAGAGCCAAAGCCATTAGTTCACGTTCCTTTAACTCCTAAGGACACTAAACAGTCTCCAAAGGATATTTTATCCAAAGCTAATACGGAATTGGGTTTAGCTCTAGATAGTGGAGAAATGGAATATTTGATTCATGCATTCGTCGAAACTATGAAAAGAGATCCTACTGATGTTGAGTTATTTATGTTCGCTCAAGTTAATTCTGAACATTGTCGTCACAAGATCTTCAATGCTGATTGGACCATTGATGGAATAAAACAACAATTCACCTTGTTTCAAATGATTAGAAATACCCATAAATTAAACCCAGAATATACTATTAGCGCCTATTCTGATAATGCAGCCGTTTTGGATAGTGAAAATGATGCCTTTTTCTTTGCACCAAATTCAACTACAAAGGAATGGACCTCTACAAAGGAAAGAATTCCATTACTTATCAAAGTCGAAACTCACAACCATCCAACAGCCGTGTCTCCTTTCCCAGGTGCTGCTACAGGTTCTGGTGGTGAAATCAGAGACGAGGGTGCTACAGGCAGAGGTTCCAAGACTAAGTGTGGTTTGAGTGGATTCTCTGTCAGCGACCTTTTGATACCAGGTAATGAACAACCTTGGGAGTTGAATATTGGTAAGCCTTACCATATTGCATCTGCATTAGATATTATGATTGAGGCTCCTTTGGGTTCAGCTGCATTTAACAATGAGTTTGGTAGACCTTGTATAAACGGTTACTTCAGAACTTTAACTACAAAGGTTTTGAATCACCAAGGGAAGGAGGAAATCAGAGGGTTCCACAAGCCAATTATGATTGCGGGTGGTTTCGGTACTGTTAGACCTCAATTTGCTTTGAAGAACACCCCAATAACTCCAGGCTCTTGTTTAATTGTACTTGGTGGTCAATCTATGCTGATTGGTTTAGGTGGTGGTGCTGCTTCTTCTGTAGCTTCCGGTGAAGGTTCCGCCGATTTGGATTTTGCTTCTGTACAAAGAGGGAACCCCGAAATGGAACGTCGTTGCCAACAAGTGATTGACGCTTGTGTTGCCTTAGGTAACAATAATCCTATCCAATCTATTCACGATGTTGGTGCTGGTGGGTTATCCAACGCTTTGCCAGAATTGGTTCATGACAATGACTTGGGTGCTAAATTCGATATTAGAAAGGTCCTCTCCTTAGAACCTGGTATGTCACCAATGGAAATTTGGTGTAATGAATCACAAGAACGTTATGTTCTTGGTGTTTCTCCTCAAGACTTATCCATTTTCGAGGAAATCTGTAAGAGAGAAAGAGCACCATTTGCTGTCGTCGGTCACGCAACCGCTGAACAAAAATTGATTGTAGAAGATCCTCTTTTGAAAACAACTCCAATTGATTTAGAAATGCCAATTTTATTTGGTAAGCCTCCAAAGATGTCAAGAGAAACCATAACTGAAGCACTAAATCTACCAGAGGCAAATTTGAGCGAAATTCCTTCCCTACAAGATGCTATTCAAAGAGTTCTAAACTTACCATCTGTTGGCTCAAAGTCATTTTTGATTACTATTGGTGACAGATCCGTCACAGGTCTAATTGATAGGGATCAATTTGTTGGTCCTTGGCAAGTACCTGTTGCGGATGTCGGTGTTACCGGTACCTCTTTGGGTGAGACAATAATTTCCACAGGTGAAGCCATGGCTATGGGTGAAAAACCAGTTAACGCCCTAATCTCCGCATCTGCTTCCGCTAAATTATCTGTGGCAGAATCTTTATTGAACATATTCGCTGCTGATGTGAAATCTTTAAATCATATCAAGCTATCTGCTAACTGGATGTCTCCAGCCTCTCATCAAGGTGAGGGTTCTAAGTTATATGAAGCCGTTCAAGCATTAGGTCTGGATTTATGTCCTGCATTAGGTGTTGCTATCCCTGTTGGTAAGGATTCCATGTCCATGAAGATGAAATGGGATGATAAGGAAGTTACTGCACCATTGTCATTGAATATCACAGCATTTGCACCAGTTTTCAACACTAGTAAAACGTGGACTCCATTGCTAAATAGAAACACAGATGATTCTGTCCTTGTCTTGGTTGATCTATCAGCTAAACAAGAGACTAAGTCACTAGGTGCCTCTGCTTTGTTGCAAGTTTACAACCAAGTTGGTAACAAGTCGCCTACTGTGTATGATAACGCCATTTTGAAGGGTTTCTTGGAAAGTTTAATCCAATTGCATCAACAAAAGGAGGATATAGTGCTTGCCTACCATGATAGGTCTGATGGTGGTCTACTAATCACTTTACTGGAAATGGCATTTGCTTCCAGATGCGGCTTAGAAATCAACATTGACGGTGGAGACCTAGAAAGTCAATTAACAAACCTATTCAATGAAGAATTAGGTGCAGTATTCCAGATTTCTGCTAAGAACTTGAGCAAGTTCGAAAAAATCTTGAACGAGAACGGAGTTGCTAAAGAATATATTTCTATTGTTGGTAAGCCATCCTTCCAAAGCCAGGAAATCAAGATTATTAACTCCACAACAAATGATGTAATTTACGCCAATTCGAGATCTGAATTGGAGCAAACTTGGAGTAAGACATCTTACGAAATGCAGAAATTGAGAGACAATCCAAAAACAGCCGAAGAAGAGTTTGCCAGTATCACGGACGATAGAGATCCCGGTTTGCAGTATGCCCTAACATACAACCCAGCCGATGATATGAAGATCGGATTAGAATTATCCAGTCAAAGGCCAAAGGTTGCTATCTTAAGAGAGCAAGGTGTGAACGGTCAAATGGAAATGGCATGGTGCTTCCAACAAGCTGGATTCAACTCAGTGGATGTCACTATGACAGATTTGCTAGAAGGTAGGTTCCATTTGGATGACTTCATCGGTCTTGCCGCATGTGGTGGTTTCTCTTATGGTGATGTCTTAGGTGCAGGTGCGGGTTGGGCTAAATCCGTATTGTATCACGAAGGTGTGCGCTCGCAATTTTCTAAGTTCTTCAATGAAAGACAAGATACATTTGCTTTTGGTGCTTGTAATGGTTGTCAATTCTTGAGTAGATTAAAAGATATCATACCCGGGTGTGAAAACTGGCCAAGTTTCGAAAGAAATGTTAGTGAACAATATGAAGCCCGTGTATGTATGGTGCAAATATCTCAAGAAAAGGACAATTCTAGCGAGGAATCTGTTTTCTTGAATGGCATGGCAGGATCCAAATTGCCAATTGCTGTCGCACATGGTGAAGGTAAAGCAACATTTTCTAAAAGCGCTGAACAACTGGAAAAGTTCGAAAAGGATGGTTTATGTTGTATAAGGTATGTGGACAACTACGGTAACGTCACCGAAAGGTTCCCCTTCAACCCCAATGGGTCGACCAATGGTATTGCCGGTATCAAGTCACCAAATGGTAGAGTGCTTGCCATGATGCCACATCCTGAAAGAGTTTGCAGATTGGAGGCCAATTCCTGGTATCCAGAGGGCAAATACGAAGAGTGGGGTGGATACGGTCCATGGATTAGATTATTCAGATCTGCCAGAAGATGGGTCGGTTGA","protein_sequence":"MTDYILPGPKALSQFRVDNLIKDINSYTNSTSVINELRSCYIHYVNGIAQNLSEQDTKLLEVLLTYDSALDIANDPLARQLNDAVANNLPSSALGEDTYLIRVVPRSGTISPWSSKATNIAHVCGLQDKVQRIERGLALLIKTVPGFPLLENLNDISLKCVYDRMTQQLYLTEPPNTMSIFTHEEPKPLVHVPLTPKDTKQSPKDILSKANTELGLALDSGEMEYLIHAFVETMKRDPTDVELFMFAQVNSEHCRHKIFNADWTIDGIKQQFTLFQMIRNTHKLNPEYTISAYSDNAAVLDSENDAFFFAPNSTTKEWTSTKERIPLLIKVETHNHPTAVSPFPGAATGSGGEIRDEGATGRGSKTKCGLSGFSVSDLLIPGNEQPWELNIGKPYHIASALDIMIEAPLGSAAFNNEFGRPCINGYFRTLTTKVLNHQGKEEIRGFHKPIMIAGGFGTVRPQFALKNTPITPGSCLIVLGGQSMLIGLGGGAASSVASGEGSADLDFASVQRGNPEMERRCQQVIDACVALGNNNPIQSIHDVGAGGLSNALPELVHDNDLGAKFDIRKVLSLEPGMSPMEIWCNESQERYVLGVSPQDLSIFEEICKRERAPFAVVGHATAEQKLIVEDPLLKTTPIDLEMPILFGKPPKMSRETITEALNLPEANLSEIPSLQDAIQRVLNLPSVGSKSFLITIGDRSVTGLIDRDQFVGPWQVPVADVGVTGTSLGETIISTGEAMAMGEKPVNALISASASAKLSVAESLLNIFAADVKSLNHIKLSANWMSPASHQGEGSKLYEAVQALGLDLCPALGVAIPVGKDSMSMKMKWDDKEVTAPLSLNITAFAPVFNTSKTWTPLLNRNTDDSVLVLVDLSAKQETKSLGASALLQVYNQVGNKSPTVYDNAILKGFLESLIQLHQQKEDIVLAYHDRSDGGLLITLLEMAFASRCGLEINIDGGDLESQLTNLFNEELGAVFQISAKNLSKFEKILNENGVAKEYISIVGKPSFQSQEIKIINSTTNDVIYANSRSELEQTWSKTSYEMQKLRDNPKTAEEEFASITDDRDPGLQYALTYNPADDMKIGLELSSQRPKVAILREQGVNGQMEMAWCFQQAGFNSVDVTMTDLLEGRFHLDDFIGLAACGGFSYGDVLGAGAGWAKSVLYHEGVRSQFSKFFNERQDTFAFGACNGCQFLSRLKDIIPGCENWPSFERNVSEQYEARVCMVQISQEKDNSSEESVFLNGMAGSKLPIAVAHGEGKATFSKSAEQLEKFEKDGLCCIRYVDNYGNVTERFPFNPNGSTNGIAGIKSPNGRVLAMMPHPERVCRLEANSWYPEGKYEEWGGYGPWIRLFRSARRWVG"},{"created_at":"2011-05-24T20:12:36.000Z","updated_at":"2011-05-24T20:12:36.000Z","name":"Bifunctional purine biosynthetic protein ADE5,7","uniprot_id":"P07244","uniprot_name":"PUR2_YEAST","enzyme":true,"transporter":false,"gene_name":"ADE5","num_residues":802,"molecular_weight":"86067.39844","theoretical_pi":"4.89","general_function":"Involved in catalytic activity","specific_function":"ATP + 5-phospho-D-ribosylamine + glycine = ADP + phosphate + N(1)-(5-phospho-D-ribosyl)glycinamide","reactions":[{"id":1885,"direction":"\u003e","locations":"cytoplasm","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":1891,"direction":"\u003c\u003e","locations":"cytoplasm","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":2304,"direction":"\u003e","locations":null,"altext":"ATP + 5-phospho-D-ribosylamine + glycine = ADP + phosphate + N(1)-(5-phospho-D-ribosyl)glycinamide.","export":false,"pw_reaction_id":null,"source":null},{"id":2305,"direction":"\u003e","locations":null,"altext":"ATP + 2-(formamido)-N(1)-(5-phospho-D-ribosyl)acetamidine = ADP + phosphate + 5-amino-1-(5-phospho-D-ribosyl)imidazole.","export":false,"pw_reaction_id":null,"source":null}],"signal_regions":"None","transmembrane_regions":"None","pdb_id":null,"cellular_location":null,"genbank_gene_id":"X04337","genbank_protein_id":"3335","gene_card_id":"ADE5","chromosome_location":null,"locus":"YGL234W","synonyms":["Phosphoribosylamine--glycine ligase","Glycinamide ribonucleotide synthetase","GARS","Phosphoribosylglycinamide synthetase","Phosphoribosylformylglycinamidine cyclo-ligase","AIR synthase","AIRS","Phosphoribosyl-aminoimidazole synthetase"],"enzyme_classes":["6.3.4.13","6.3.3.1"],"go_classes":[{"category":"Component","description":" cell part"},{"category":"Component","description":" intracellular part"},{"category":"Component","description":" cytoplasm"},{"category":"Function","description":" binding"},{"category":"Function","description":" ligase activity"},{"category":"Function","description":" ligase activity, forming carbon-nitrogen bonds"},{"category":"Function","description":" nucleoside binding"},{"category":"Function","description":" purine nucleoside binding"},{"category":"Function","description":" adenyl nucleotide binding"},{"category":"Function","description":" adenyl ribonucleotide binding"},{"category":"Function","description":" ATP binding"},{"category":"Function","description":" cyclo-ligase activity"},{"category":"Function","description":" phosphoribosylformylglycinamidine cyclo-ligase activity"},{"category":"Function","description":" catalytic activity"},{"category":"Function","description":" phosphoribosylamine-glycine ligase activity"},{"category":"Process","description":" nitrogen compound metabolic process"},{"category":"Process","description":" cellular nitrogen compound metabolic process"},{"category":"Process","description":" nucleobase, nucleoside, nucleotide and nucleic acid metabolic process"},{"category":"Process","description":" nucleobase, nucleoside and nucleotide metabolic process"},{"category":"Process","description":" nucleoside phosphate metabolic process"},{"category":"Process","description":" nucleotide metabolic process"},{"category":"Process","description":" cellular aromatic compound metabolic process"},{"category":"Process","description":" purine nucleotide metabolic process"},{"category":"Process","description":" nucleobase metabolic process"},{"category":"Process","description":" purine nucleotide biosynthetic process"},{"category":"Process","description":" purine base metabolic process"},{"category":"Process","description":" purine nucleoside monophosphate biosynthetic process"},{"category":"Process","description":" purine base biosynthetic process"},{"category":"Process","description":" purine ribonucleoside monophosphate biosynthetic process"},{"category":"Process","description":" IMP biosynthetic process"},{"category":"Process","description":" 'de novo' IMP biosynthetic process"},{"category":"Process","description":" metabolic process"},{"category":"Process","description":" cellular metabolic process"}],"pfams":[{"name":"AIRS","identifier":"PF00586"},{"name":"AIRS_C","identifier":"PF02769"},{"name":"GARS_A","identifier":"PF01071"},{"name":"GARS_C","identifier":"PF02843"},{"name":"GARS_N","identifier":"PF02844"}],"pathways":[{"name":"Purine metabolism","kegg_map_id":"00230"}],"gene_sequence":"ATGCTCAACATTCTCGTTTTAGGAAACGGTGCAAGAGAACACGTTCTTGTCACCAAGCTGGCTCAGTCACCCACCGTGGGTAAGATCTATGTCGCTCCAGGTAATGGAGGGACCGCAACCATGGATCCTTCGCGTGTGATAAACTGGGATATTACGCCAGATGTCGCCAATTTTGCTCGTTTGCAGTCGATGGCTGTGGAACATAAGATCAACTTGGTCGTTCCTGGTCCAGAATTACCTCTAGTCAACGGCATCACCTCCGTGTTCCATAGCGTTGGTATTCCCGTTTTTGGACCTTCCGTCAAAGCCGCTCAGTTGGAAGCTTCCAAGGCTTTCTCCAAGAGATTTATGTCAAAACACAATATTCCAACCGCGTCTTATGATGTCTTCACTAATCCAGAAGAAGCCATTTCATTCTTGCAAGCTCATACTGACAAAGCTTTTGTCATCAAGGCCGACGGGATCGCTGCTGGGAAAGGTGTTATTATCCCATCTAGCATCGACGAGTCCGTCCAAGCTATCAAGGACATAATGGTCACCAAGCAATTCGGTGAAGAAGCGGGCAAGCAGGTTGTGATAGAACAATTCTTGGAAGGTGATGAAATCTCTCTACTCACCATTGTTGACGGGTACTCTCACTTCAATCTCCCCGTCGCACAAGATCACAAGAGGATCTTTGATGGCGACAAGGGCTTGAACACCGGTGGGATGGGTGCCTATGCCCCCGCTCCTGTGGCCACACCATCTTTGTTGAAGACCATAGATTCACAGATTGTGAAGCCTACGATTGATGGGATGAGACGTGATGGTATGCCCTTTGTTGGTGTGCTGTTCACCGGGATGATTTTGGTGAAGGATTCTAAGACAAATCAACTTGTTCCCGAAGTGTTAGAATATAATGTCAGATTCGGTGACCCAGAGACACAGGCTGTTTTGAGTTTACTTGATGATCAAACCGATTTGGCGCAAGTGTTTTTGGCTGCTGCTGAACATCGTTTGGATTCCGTAAACATAGGAATCGATGACACAAGATCTGCCGTTACTGTCGTAGTGGCTGCAGGTGGTTATCCTGAATCATACGCCAAAGGTGACAAAATTACCTTGGATACCGATAAATTACCTCCACATACACAAATCTTCCAAGCAGGTACCAAATACGATTCCGCCACCGATTCTTTATTGACCAATGGTGGTAGAGTTCTTTCTGTGACCTCCACTGCTCAGGACTTGAGAACAGCAGTAGATACAGTATATGAAGCCGTCAAATGCGTCCATTTCCAAAATTCTTACTACAGAAAGGACATCGCATACCGTGCGTTCCAAAACTCAGAATCATCAAAAGTTGCCATCACATACGCAGACTCAGGTGTCTCTGTTGATAATGGTAACAATCTCGTACAAACTATCAAAGAAATGGTCAGATCCACAAGAAGGCCAGGTGCAGACTCTGATATTGGTGGTTTTGGTGGTTTATTCGATTTGGCTCAAGCAGGTTTCCGTCAAAACGAAGATACCTTACTAGTAGGTGCTACAGATGGTGTCGGTACTAAATTAATCATTGCCCAAGAGACCGGGATTCATAATACTGTCGGTATTGACCTGGTGGCCATGAATGTTAACGATTTGGTGGTACAAGGTGCTGAGCCTCTATTCTTTTTGGACTACTTTGCCACTGGTGCTCTTGACATTCAAGTTGCCTCTGATTTTGTGTCCGGTGTTGCTAATGGTTGTATTCAAAGTGGTTGTGCTCTTGTGGGTGGTGAAACTTCGGAAATGCCCGGTATGTATCCACCCGGCCACTACGATACTAATGGTACCGCTGTTGGTGCTGTATTAAGACAAGATATCTTGCCCAAGATAAATGAAATGGCCGCAGGAGATGTTCTTCTGGGTCTCGCCTCTAGCGGTGTTCATTCTAATGGTTTCTCTTTGGTTAGAAAAATTATTCAACATGTAGCATTACCATGGGACGCTCCATGTCCATGGGATGAATCTAAGACGTTAGGTGAAGGTATTCTTGAACCAACAAAAATTTACGTCAAGCAATTATTGCCATCAATTAGACAAAGACTACTACTAGGTTTAGCTCATATAACAGGTGGTGGTTTAGTAGAGAATATCCCAAGAGCTATTCCAGACCACCTACAGGCCCGCGTTGATATGTCAACCTGGGAAGTACCCCGTGTCTTCAAATGGTTTGGTCAAGCAGGTAATGTTCCACACGATGACATTTTAAGAACCTTCAACATGGGTGTTGGTATGGTTTTGATTGTCAAGAGAGAAAACGTCAAGGCTGTTTGTGATTCATTGACTGAAGAAGGTGAAATTATTTGGGAGCTTGGTTCTTTGCAAGAAAGACCAAAGGATGCTCCCGGTTGTGTGATTGAAAACGGAACTAAGCTTTACTAA","protein_sequence":"MLNILVLGNGAREHVLVTKLAQSPTVGKIYVAPGNGGTATMDPSRVINWDITPDVANFARLQSMAVEHKINLVVPGPELPLVNGITSVFHSVGIPVFGPSVKAAQLEASKAFSKRFMSKHNIPTASYDVFTNPEEAISFLQAHTDKAFVIKADGIAAGKGVIIPSSIDESVQAIKDIMVTKQFGEEAGKQVVIEQFLEGDEISLLTIVDGYSHFNLPVAQDHKRIFDGDKGLNTGGMGAYAPAPVATPSLLKTIDSQIVKPTIDGMRRDGMPFVGVLFTGMILVKDSKTNQLVPEVLEYNVRFGDPETQAVLSLLDDQTDLAQVFLAAAEHRLDSVNIGIDDTRSAVTVVVAAGGYPESYAKGDKITLDTDKLPPHTQIFQAGTKYDSATDSLLTNGGRVLSVTSTAQDLRTAVDTVYEAVKCVHFQNSYYRKDIAYRAFQNSESSKVAITYADSGVSVDNGNNLVQTIKEMVRSTRRPGADSDIGGFGGLFDLAQAGFRQNEDTLLVGATDGVGTKLIIAQETGIHNTVGIDLVAMNVNDLVVQGAEPLFFLDYFATGALDIQVASDFVSGVANGCIQSGCALVGGETSEMPGMYPPGHYDTNGTAVGAVLRQDILPKINEMAAGDVLLGLASSGVHSNGFSLVRKIIQHVALPWDAPCPWDESKTLGEGILEPTKIYVKQLLPSIRQRLLLGLAHITGGGLVENIPRAIPDHLQARVDMSTWEVPRVFKWFGQAGNVPHDDILRTFNMGVGMVLIVKRENVKAVCDSLTEEGEIIWELGSLQERPKDAPGCVIENGTKLY"}]}