{"ymdb_id":"YMDB00398","created_at":"2011-05-29T18:12:47.000Z","updated_at":"2016-09-08T18:35:27.000Z","name":"2-(Formamido)-N1-(5-phospho-D-ribosyl)acetamidine","cas":"6157-85-3","state":"Solid","melting_point":"","description":"2-(Formamido)-N1-(5-phospho-D-ribosyl)acetamidine is an intermediate in purine metabolism. The enzyme phosphoribosylformylglycinamidine synthase [EC:6.3.5.3] catalyzes the production of this metabolite from N2-formyl-N1-(5-phospho-D-ribosyl)glycinamide.","experimental_water_solubility":"","experimental_logp_hydrophobicity":"","location":"cytoplasm","synthesis_reference":null,"chebi_id":"18413","hmdb_id":"HMDB06211","kegg_id":"C04640","pubchem_id":"9552078","cs_id":null,"foodb_id":null,"wikipedia_link":null,"biocyc_id":"5-PHOSPHORIBOSYL-N-FORMYLGLYCINEAMIDINE","iupac":"{[(2R,3S,4R,5R)-3,4-dihydroxy-5-[(formamidomethanimidoyl)amino]oxolan-2-yl]oxy}phosphonic acid","traditional_iupac":"[(2R,3S,4R,5R)-3,4-dihydroxy-5-[(formamidomethanimidoyl)amino]oxolan-2-yl]oxyphosphonic acid","logp":"-3.117339851729372","pka":"6.265948157804224","alogps_solubility":"7.66e+00 g/l","alogps_logp":"-2.07","alogps_logs":"-1.57","acceptor_count":"9","donor_count":"7","rotatable_bond_count":"3","polar_surface_area":"181.42999999999998","refractivity":"63.2266","polarizability":"22.68962556497434","formal_charge":"0","physiological_charge":"-2","pka_strongest_basic":"5.258678507329495","pka_strongest_acidic":"1.118556859786445","bioavailability":"1","number_of_rings":"1","rule_of_five":"0","ghose_filter":"0","veber_rule":"0","mddr_like_rule":"0","synonyms":["[(2R,3S,4R,5R)-5-[(1-amino-2-formamido-ethylidene)amino]-3,4-dihydroxy-oxolan-2-yl]methoxyphosphonic acid","1-(5'-Phosphoribosyl)-N-formylglycinamidine","1-deoxy-1-[2-(formamido)acetimidamido]-D-ribofuranose 5-(dihydrogen phosphate)","2-(formamido)-N(1)-(5-phospho-D-ribosyl)acetamidine","2-(formamido)-N(1)-(5'-phosphoribosyl)acetamidine","2-(Formamido)-N1-(5'-phosphoribosyl)acetamidine","5'-Phosphoribosyl-N-formylglycinamidine","5'-Phosphoribosylformylglycinamidine","FGAM","N-[2-(formamido)ethanimidoyl]-5-O-phosphono-D-ribofuranosylamine"],"pathways":[{"name":"Purine metabolism","kegg_map_id":"00230"},{"name":"purine nucleotides de novo biosynthesis","kegg_map_id":null}],"growth_conditions":[],"references":[{"pubmed_id":21051339,"citation":"UniProt Consortium (2011). \"Ongoing and future developments at the Universal Protein Resource.\" Nucleic Acids Res 39:D214-D219."}],"proteins":[{"created_at":"2011-05-24T19:21:34.000Z","updated_at":"2011-05-27T14:55:57.000Z","name":"Phosphoribosylformylglycinamidine synthase","uniprot_id":"P38972","uniprot_name":"PUR4_YEAST","enzyme":true,"transporter":false,"gene_name":"ADE6","num_residues":1358,"molecular_weight":"148904.0","theoretical_pi":"4.92","general_function":"Involved in catalytic activity","specific_function":"ATP + N(2)-formyl-N(1)-(5-phospho-D- ribosyl)glycinamide + L-glutamine + H(2)O = ADP + phosphate + 2- (formamido)-N(1)-(5-phospho-D-ribosyl)acetamidine + L-glutamate","reactions":[{"id":1889,"direction":"\u003e","locations":"cytoplasm","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":2248,"direction":"\u003e","locations":"Cytoplasm","altext":"ATP + N(2)-formyl-N(1)-(5-phospho-D-ribosyl)glycinamide + L-glutamine + H(2)O = ADP + phosphate + 2-(formamido)-N(1)-(5-phospho-D-ribosyl)acetamidine + L-glutamate.","export":false,"pw_reaction_id":null,"source":null}],"signal_regions":"None","transmembrane_regions":"None","pdb_id":null,"cellular_location":"Cytoplasm","genbank_gene_id":"Z72846","genbank_protein_id":"1323079","gene_card_id":"ADE6","chromosome_location":"chromosome 7","locus":"YGR061C","synonyms":["FGAM synthase","FGAMS","Formylglycinamide ribotide amidotransferase","FGARAT","Formylglycinamide ribotide synthetase"],"enzyme_classes":["6.3.5.3"],"go_classes":[{"category":"Component","description":" Not Available"},{"category":"Function","description":" catalytic activity"},{"category":"Function","description":" ligase activity"},{"category":"Function","description":" ligase activity, forming carbon-nitrogen bonds"},{"category":"Function","description":" carbon-nitrogen ligase activity, with glutamine as amido-N-donor"},{"category":"Function","description":" phosphoribosylformylglycinamidine synthase activity"},{"category":"Process","description":" nitrogen compound metabolic process"},{"category":"Process","description":" cellular nitrogen compound metabolic process"},{"category":"Process","description":" nucleobase, nucleoside, nucleotide and nucleic acid metabolic process"},{"category":"Process","description":" nucleobase, nucleoside and nucleotide metabolic process"},{"category":"Process","description":" nucleoside phosphate metabolic process"},{"category":"Process","description":" nucleotide metabolic process"},{"category":"Process","description":" purine nucleotide metabolic process"},{"category":"Process","description":" purine nucleotide biosynthetic process"},{"category":"Process","description":" purine nucleoside monophosphate biosynthetic process"},{"category":"Process","description":" purine ribonucleoside monophosphate biosynthetic process"},{"category":"Process","description":" IMP biosynthetic process"},{"category":"Process","description":" 'de novo' IMP biosynthetic process"},{"category":"Process","description":" metabolic process"}],"pfams":[{"name":"AIRS","identifier":"PF00586"},{"name":"AIRS_C","identifier":"PF02769"}],"pathways":[{"name":"Purine metabolism","kegg_map_id":"00230"}],"gene_sequence":"ATGACTGATTATATTTTGCCGGGTCCCAAGGCCTTATCTCAGTTCAGAGTCGATAATCTAATTAAAGATATAAACTCCTATACAAACAGTACTTCTGTCATCAATGAATTGCGTTCGTGTTACATTCACTATGTCAACGGCATCGCTCAAAATTTGTCTGAACAGGACACTAAATTGCTAGAAGTTTTGTTGACTTACGATTCTGCTTTAGATATTGCTAACGATCCATTAGCAAGACAATTAAACGATGCTGTCGCTAATAATTTACCCAGTTCAGCTCTTGGCGAAGACACATATTTGATTAGAGTTGTTCCTAGATCAGGCACTATCTCTCCTTGGTCTTCCAAGGCTACTAATATTGCTCATGTATGCGGGCTACAAGACAAAGTTCAACGTATTGAAAGAGGTTTAGCCTTACTCATAAAGACTGTTCCAGGTTTCCCTCTTTTGGAAAATCTAAATGATATTTCATTGAAGTGTGTCTACGATAGGATGACACAACAATTATATCTGACCGAACCACCAAATACGATGAGTATTTTCACACATGAAGAGCCAAAGCCATTAGTTCACGTTCCTTTAACTCCTAAGGACACTAAACAGTCTCCAAAGGATATTTTATCCAAAGCTAATACGGAATTGGGTTTAGCTCTAGATAGTGGAGAAATGGAATATTTGATTCATGCATTCGTCGAAACTATGAAAAGAGATCCTACTGATGTTGAGTTATTTATGTTCGCTCAAGTTAATTCTGAACATTGTCGTCACAAGATCTTCAATGCTGATTGGACCATTGATGGAATAAAACAACAATTCACCTTGTTTCAAATGATTAGAAATACCCATAAATTAAACCCAGAATATACTATTAGCGCCTATTCTGATAATGCAGCCGTTTTGGATAGTGAAAATGATGCCTTTTTCTTTGCACCAAATTCAACTACAAAGGAATGGACCTCTACAAAGGAAAGAATTCCATTACTTATCAAAGTCGAAACTCACAACCATCCAACAGCCGTGTCTCCTTTCCCAGGTGCTGCTACAGGTTCTGGTGGTGAAATCAGAGACGAGGGTGCTACAGGCAGAGGTTCCAAGACTAAGTGTGGTTTGAGTGGATTCTCTGTCAGCGACCTTTTGATACCAGGTAATGAACAACCTTGGGAGTTGAATATTGGTAAGCCTTACCATATTGCATCTGCATTAGATATTATGATTGAGGCTCCTTTGGGTTCAGCTGCATTTAACAATGAGTTTGGTAGACCTTGTATAAACGGTTACTTCAGAACTTTAACTACAAAGGTTTTGAATCACCAAGGGAAGGAGGAAATCAGAGGGTTCCACAAGCCAATTATGATTGCGGGTGGTTTCGGTACTGTTAGACCTCAATTTGCTTTGAAGAACACCCCAATAACTCCAGGCTCTTGTTTAATTGTACTTGGTGGTCAATCTATGCTGATTGGTTTAGGTGGTGGTGCTGCTTCTTCTGTAGCTTCCGGTGAAGGTTCCGCCGATTTGGATTTTGCTTCTGTACAAAGAGGGAACCCCGAAATGGAACGTCGTTGCCAACAAGTGATTGACGCTTGTGTTGCCTTAGGTAACAATAATCCTATCCAATCTATTCACGATGTTGGTGCTGGTGGGTTATCCAACGCTTTGCCAGAATTGGTTCATGACAATGACTTGGGTGCTAAATTCGATATTAGAAAGGTCCTCTCCTTAGAACCTGGTATGTCACCAATGGAAATTTGGTGTAATGAATCACAAGAACGTTATGTTCTTGGTGTTTCTCCTCAAGACTTATCCATTTTCGAGGAAATCTGTAAGAGAGAAAGAGCACCATTTGCTGTCGTCGGTCACGCAACCGCTGAACAAAAATTGATTGTAGAAGATCCTCTTTTGAAAACAACTCCAATTGATTTAGAAATGCCAATTTTATTTGGTAAGCCTCCAAAGATGTCAAGAGAAACCATAACTGAAGCACTAAATCTACCAGAGGCAAATTTGAGCGAAATTCCTTCCCTACAAGATGCTATTCAAAGAGTTCTAAACTTACCATCTGTTGGCTCAAAGTCATTTTTGATTACTATTGGTGACAGATCCGTCACAGGTCTAATTGATAGGGATCAATTTGTTGGTCCTTGGCAAGTACCTGTTGCGGATGTCGGTGTTACCGGTACCTCTTTGGGTGAGACAATAATTTCCACAGGTGAAGCCATGGCTATGGGTGAAAAACCAGTTAACGCCCTAATCTCCGCATCTGCTTCCGCTAAATTATCTGTGGCAGAATCTTTATTGAACATATTCGCTGCTGATGTGAAATCTTTAAATCATATCAAGCTATCTGCTAACTGGATGTCTCCAGCCTCTCATCAAGGTGAGGGTTCTAAGTTATATGAAGCCGTTCAAGCATTAGGTCTGGATTTATGTCCTGCATTAGGTGTTGCTATCCCTGTTGGTAAGGATTCCATGTCCATGAAGATGAAATGGGATGATAAGGAAGTTACTGCACCATTGTCATTGAATATCACAGCATTTGCACCAGTTTTCAACACTAGTAAAACGTGGACTCCATTGCTAAATAGAAACACAGATGATTCTGTCCTTGTCTTGGTTGATCTATCAGCTAAACAAGAGACTAAGTCACTAGGTGCCTCTGCTTTGTTGCAAGTTTACAACCAAGTTGGTAACAAGTCGCCTACTGTGTATGATAACGCCATTTTGAAGGGTTTCTTGGAAAGTTTAATCCAATTGCATCAACAAAAGGAGGATATAGTGCTTGCCTACCATGATAGGTCTGATGGTGGTCTACTAATCACTTTACTGGAAATGGCATTTGCTTCCAGATGCGGCTTAGAAATCAACATTGACGGTGGAGACCTAGAAAGTCAATTAACAAACCTATTCAATGAAGAATTAGGTGCAGTATTCCAGATTTCTGCTAAGAACTTGAGCAAGTTCGAAAAAATCTTGAACGAGAACGGAGTTGCTAAAGAATATATTTCTATTGTTGGTAAGCCATCCTTCCAAAGCCAGGAAATCAAGATTATTAACTCCACAACAAATGATGTAATTTACGCCAATTCGAGATCTGAATTGGAGCAAACTTGGAGTAAGACATCTTACGAAATGCAGAAATTGAGAGACAATCCAAAAACAGCCGAAGAAGAGTTTGCCAGTATCACGGACGATAGAGATCCCGGTTTGCAGTATGCCCTAACATACAACCCAGCCGATGATATGAAGATCGGATTAGAATTATCCAGTCAAAGGCCAAAGGTTGCTATCTTAAGAGAGCAAGGTGTGAACGGTCAAATGGAAATGGCATGGTGCTTCCAACAAGCTGGATTCAACTCAGTGGATGTCACTATGACAGATTTGCTAGAAGGTAGGTTCCATTTGGATGACTTCATCGGTCTTGCCGCATGTGGTGGTTTCTCTTATGGTGATGTCTTAGGTGCAGGTGCGGGTTGGGCTAAATCCGTATTGTATCACGAAGGTGTGCGCTCGCAATTTTCTAAGTTCTTCAATGAAAGACAAGATACATTTGCTTTTGGTGCTTGTAATGGTTGTCAATTCTTGAGTAGATTAAAAGATATCATACCCGGGTGTGAAAACTGGCCAAGTTTCGAAAGAAATGTTAGTGAACAATATGAAGCCCGTGTATGTATGGTGCAAATATCTCAAGAAAAGGACAATTCTAGCGAGGAATCTGTTTTCTTGAATGGCATGGCAGGATCCAAATTGCCAATTGCTGTCGCACATGGTGAAGGTAAAGCAACATTTTCTAAAAGCGCTGAACAACTGGAAAAGTTCGAAAAGGATGGTTTATGTTGTATAAGGTATGTGGACAACTACGGTAACGTCACCGAAAGGTTCCCCTTCAACCCCAATGGGTCGACCAATGGTATTGCCGGTATCAAGTCACCAAATGGTAGAGTGCTTGCCATGATGCCACATCCTGAAAGAGTTTGCAGATTGGAGGCCAATTCCTGGTATCCAGAGGGCAAATACGAAGAGTGGGGTGGATACGGTCCATGGATTAGATTATTCAGATCTGCCAGAAGATGGGTCGGTTGA","protein_sequence":"MTDYILPGPKALSQFRVDNLIKDINSYTNSTSVINELRSCYIHYVNGIAQNLSEQDTKLLEVLLTYDSALDIANDPLARQLNDAVANNLPSSALGEDTYLIRVVPRSGTISPWSSKATNIAHVCGLQDKVQRIERGLALLIKTVPGFPLLENLNDISLKCVYDRMTQQLYLTEPPNTMSIFTHEEPKPLVHVPLTPKDTKQSPKDILSKANTELGLALDSGEMEYLIHAFVETMKRDPTDVELFMFAQVNSEHCRHKIFNADWTIDGIKQQFTLFQMIRNTHKLNPEYTISAYSDNAAVLDSENDAFFFAPNSTTKEWTSTKERIPLLIKVETHNHPTAVSPFPGAATGSGGEIRDEGATGRGSKTKCGLSGFSVSDLLIPGNEQPWELNIGKPYHIASALDIMIEAPLGSAAFNNEFGRPCINGYFRTLTTKVLNHQGKEEIRGFHKPIMIAGGFGTVRPQFALKNTPITPGSCLIVLGGQSMLIGLGGGAASSVASGEGSADLDFASVQRGNPEMERRCQQVIDACVALGNNNPIQSIHDVGAGGLSNALPELVHDNDLGAKFDIRKVLSLEPGMSPMEIWCNESQERYVLGVSPQDLSIFEEICKRERAPFAVVGHATAEQKLIVEDPLLKTTPIDLEMPILFGKPPKMSRETITEALNLPEANLSEIPSLQDAIQRVLNLPSVGSKSFLITIGDRSVTGLIDRDQFVGPWQVPVADVGVTGTSLGETIISTGEAMAMGEKPVNALISASASAKLSVAESLLNIFAADVKSLNHIKLSANWMSPASHQGEGSKLYEAVQALGLDLCPALGVAIPVGKDSMSMKMKWDDKEVTAPLSLNITAFAPVFNTSKTWTPLLNRNTDDSVLVLVDLSAKQETKSLGASALLQVYNQVGNKSPTVYDNAILKGFLESLIQLHQQKEDIVLAYHDRSDGGLLITLLEMAFASRCGLEINIDGGDLESQLTNLFNEELGAVFQISAKNLSKFEKILNENGVAKEYISIVGKPSFQSQEIKIINSTTNDVIYANSRSELEQTWSKTSYEMQKLRDNPKTAEEEFASITDDRDPGLQYALTYNPADDMKIGLELSSQRPKVAILREQGVNGQMEMAWCFQQAGFNSVDVTMTDLLEGRFHLDDFIGLAACGGFSYGDVLGAGAGWAKSVLYHEGVRSQFSKFFNERQDTFAFGACNGCQFLSRLKDIIPGCENWPSFERNVSEQYEARVCMVQISQEKDNSSEESVFLNGMAGSKLPIAVAHGEGKATFSKSAEQLEKFEKDGLCCIRYVDNYGNVTERFPFNPNGSTNGIAGIKSPNGRVLAMMPHPERVCRLEANSWYPEGKYEEWGGYGPWIRLFRSARRWVG"},{"created_at":"2011-05-24T20:12:36.000Z","updated_at":"2011-05-24T20:12:36.000Z","name":"Bifunctional purine biosynthetic protein ADE5,7","uniprot_id":"P07244","uniprot_name":"PUR2_YEAST","enzyme":true,"transporter":false,"gene_name":"ADE5","num_residues":802,"molecular_weight":"86067.39844","theoretical_pi":"4.89","general_function":"Involved in catalytic activity","specific_function":"ATP + 5-phospho-D-ribosylamine + glycine = ADP + phosphate + N(1)-(5-phospho-D-ribosyl)glycinamide","reactions":[{"id":1885,"direction":"\u003e","locations":"cytoplasm","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":1891,"direction":"\u003c\u003e","locations":"cytoplasm","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":2304,"direction":"\u003e","locations":null,"altext":"ATP + 5-phospho-D-ribosylamine + glycine = ADP + phosphate + N(1)-(5-phospho-D-ribosyl)glycinamide.","export":false,"pw_reaction_id":null,"source":null},{"id":2305,"direction":"\u003e","locations":null,"altext":"ATP + 2-(formamido)-N(1)-(5-phospho-D-ribosyl)acetamidine = ADP + phosphate + 5-amino-1-(5-phospho-D-ribosyl)imidazole.","export":false,"pw_reaction_id":null,"source":null}],"signal_regions":"None","transmembrane_regions":"None","pdb_id":null,"cellular_location":null,"genbank_gene_id":"X04337","genbank_protein_id":"3335","gene_card_id":"ADE5","chromosome_location":null,"locus":"YGL234W","synonyms":["Phosphoribosylamine--glycine ligase","Glycinamide ribonucleotide synthetase","GARS","Phosphoribosylglycinamide synthetase","Phosphoribosylformylglycinamidine cyclo-ligase","AIR synthase","AIRS","Phosphoribosyl-aminoimidazole synthetase"],"enzyme_classes":["6.3.4.13","6.3.3.1"],"go_classes":[{"category":"Component","description":" cell part"},{"category":"Component","description":" intracellular part"},{"category":"Component","description":" cytoplasm"},{"category":"Function","description":" purine nucleoside binding"},{"category":"Function","description":" adenyl nucleotide binding"},{"category":"Function","description":" adenyl ribonucleotide binding"},{"category":"Function","description":" ATP binding"},{"category":"Function","description":" cyclo-ligase activity"},{"category":"Function","description":" phosphoribosylformylglycinamidine cyclo-ligase activity"},{"category":"Function","description":" phosphoribosylamine-glycine ligase activity"},{"category":"Function","description":" catalytic activity"},{"category":"Function","description":" binding"},{"category":"Function","description":" ligase activity"},{"category":"Function","description":" ligase activity, forming carbon-nitrogen bonds"},{"category":"Function","description":" nucleoside binding"},{"category":"Process","description":" cellular nitrogen compound metabolic process"},{"category":"Process","description":" nucleobase, nucleoside, nucleotide and nucleic acid metabolic process"},{"category":"Process","description":" nucleobase, nucleoside and nucleotide metabolic process"},{"category":"Process","description":" nucleoside phosphate metabolic process"},{"category":"Process","description":" nucleotide metabolic process"},{"category":"Process","description":" cellular aromatic compound metabolic process"},{"category":"Process","description":" purine nucleotide metabolic process"},{"category":"Process","description":" nucleobase metabolic process"},{"category":"Process","description":" purine nucleotide biosynthetic process"},{"category":"Process","description":" purine base metabolic process"},{"category":"Process","description":" purine nucleoside monophosphate biosynthetic process"},{"category":"Process","description":" purine base biosynthetic process"},{"category":"Process","description":" purine ribonucleoside monophosphate biosynthetic process"},{"category":"Process","description":" IMP biosynthetic process"},{"category":"Process","description":" 'de novo' IMP biosynthetic process"},{"category":"Process","description":" cellular metabolic process"},{"category":"Process","description":" metabolic process"},{"category":"Process","description":" nitrogen compound metabolic process"}],"pfams":[{"name":"AIRS","identifier":"PF00586"},{"name":"AIRS_C","identifier":"PF02769"},{"name":"GARS_A","identifier":"PF01071"},{"name":"GARS_C","identifier":"PF02843"},{"name":"GARS_N","identifier":"PF02844"}],"pathways":[{"name":"Purine metabolism","kegg_map_id":"00230"}],"gene_sequence":"ATGCTCAACATTCTCGTTTTAGGAAACGGTGCAAGAGAACACGTTCTTGTCACCAAGCTGGCTCAGTCACCCACCGTGGGTAAGATCTATGTCGCTCCAGGTAATGGAGGGACCGCAACCATGGATCCTTCGCGTGTGATAAACTGGGATATTACGCCAGATGTCGCCAATTTTGCTCGTTTGCAGTCGATGGCTGTGGAACATAAGATCAACTTGGTCGTTCCTGGTCCAGAATTACCTCTAGTCAACGGCATCACCTCCGTGTTCCATAGCGTTGGTATTCCCGTTTTTGGACCTTCCGTCAAAGCCGCTCAGTTGGAAGCTTCCAAGGCTTTCTCCAAGAGATTTATGTCAAAACACAATATTCCAACCGCGTCTTATGATGTCTTCACTAATCCAGAAGAAGCCATTTCATTCTTGCAAGCTCATACTGACAAAGCTTTTGTCATCAAGGCCGACGGGATCGCTGCTGGGAAAGGTGTTATTATCCCATCTAGCATCGACGAGTCCGTCCAAGCTATCAAGGACATAATGGTCACCAAGCAATTCGGTGAAGAAGCGGGCAAGCAGGTTGTGATAGAACAATTCTTGGAAGGTGATGAAATCTCTCTACTCACCATTGTTGACGGGTACTCTCACTTCAATCTCCCCGTCGCACAAGATCACAAGAGGATCTTTGATGGCGACAAGGGCTTGAACACCGGTGGGATGGGTGCCTATGCCCCCGCTCCTGTGGCCACACCATCTTTGTTGAAGACCATAGATTCACAGATTGTGAAGCCTACGATTGATGGGATGAGACGTGATGGTATGCCCTTTGTTGGTGTGCTGTTCACCGGGATGATTTTGGTGAAGGATTCTAAGACAAATCAACTTGTTCCCGAAGTGTTAGAATATAATGTCAGATTCGGTGACCCAGAGACACAGGCTGTTTTGAGTTTACTTGATGATCAAACCGATTTGGCGCAAGTGTTTTTGGCTGCTGCTGAACATCGTTTGGATTCCGTAAACATAGGAATCGATGACACAAGATCTGCCGTTACTGTCGTAGTGGCTGCAGGTGGTTATCCTGAATCATACGCCAAAGGTGACAAAATTACCTTGGATACCGATAAATTACCTCCACATACACAAATCTTCCAAGCAGGTACCAAATACGATTCCGCCACCGATTCTTTATTGACCAATGGTGGTAGAGTTCTTTCTGTGACCTCCACTGCTCAGGACTTGAGAACAGCAGTAGATACAGTATATGAAGCCGTCAAATGCGTCCATTTCCAAAATTCTTACTACAGAAAGGACATCGCATACCGTGCGTTCCAAAACTCAGAATCATCAAAAGTTGCCATCACATACGCAGACTCAGGTGTCTCTGTTGATAATGGTAACAATCTCGTACAAACTATCAAAGAAATGGTCAGATCCACAAGAAGGCCAGGTGCAGACTCTGATATTGGTGGTTTTGGTGGTTTATTCGATTTGGCTCAAGCAGGTTTCCGTCAAAACGAAGATACCTTACTAGTAGGTGCTACAGATGGTGTCGGTACTAAATTAATCATTGCCCAAGAGACCGGGATTCATAATACTGTCGGTATTGACCTGGTGGCCATGAATGTTAACGATTTGGTGGTACAAGGTGCTGAGCCTCTATTCTTTTTGGACTACTTTGCCACTGGTGCTCTTGACATTCAAGTTGCCTCTGATTTTGTGTCCGGTGTTGCTAATGGTTGTATTCAAAGTGGTTGTGCTCTTGTGGGTGGTGAAACTTCGGAAATGCCCGGTATGTATCCACCCGGCCACTACGATACTAATGGTACCGCTGTTGGTGCTGTATTAAGACAAGATATCTTGCCCAAGATAAATGAAATGGCCGCAGGAGATGTTCTTCTGGGTCTCGCCTCTAGCGGTGTTCATTCTAATGGTTTCTCTTTGGTTAGAAAAATTATTCAACATGTAGCATTACCATGGGACGCTCCATGTCCATGGGATGAATCTAAGACGTTAGGTGAAGGTATTCTTGAACCAACAAAAATTTACGTCAAGCAATTATTGCCATCAATTAGACAAAGACTACTACTAGGTTTAGCTCATATAACAGGTGGTGGTTTAGTAGAGAATATCCCAAGAGCTATTCCAGACCACCTACAGGCCCGCGTTGATATGTCAACCTGGGAAGTACCCCGTGTCTTCAAATGGTTTGGTCAAGCAGGTAATGTTCCACACGATGACATTTTAAGAACCTTCAACATGGGTGTTGGTATGGTTTTGATTGTCAAGAGAGAAAACGTCAAGGCTGTTTGTGATTCATTGACTGAAGAAGGTGAAATTATTTGGGAGCTTGGTTCTTTGCAAGAAAGACCAAAGGATGCTCCCGGTTGTGTGATTGAAAACGGAACTAAGCTTTACTAA","protein_sequence":"MLNILVLGNGAREHVLVTKLAQSPTVGKIYVAPGNGGTATMDPSRVINWDITPDVANFARLQSMAVEHKINLVVPGPELPLVNGITSVFHSVGIPVFGPSVKAAQLEASKAFSKRFMSKHNIPTASYDVFTNPEEAISFLQAHTDKAFVIKADGIAAGKGVIIPSSIDESVQAIKDIMVTKQFGEEAGKQVVIEQFLEGDEISLLTIVDGYSHFNLPVAQDHKRIFDGDKGLNTGGMGAYAPAPVATPSLLKTIDSQIVKPTIDGMRRDGMPFVGVLFTGMILVKDSKTNQLVPEVLEYNVRFGDPETQAVLSLLDDQTDLAQVFLAAAEHRLDSVNIGIDDTRSAVTVVVAAGGYPESYAKGDKITLDTDKLPPHTQIFQAGTKYDSATDSLLTNGGRVLSVTSTAQDLRTAVDTVYEAVKCVHFQNSYYRKDIAYRAFQNSESSKVAITYADSGVSVDNGNNLVQTIKEMVRSTRRPGADSDIGGFGGLFDLAQAGFRQNEDTLLVGATDGVGTKLIIAQETGIHNTVGIDLVAMNVNDLVVQGAEPLFFLDYFATGALDIQVASDFVSGVANGCIQSGCALVGGETSEMPGMYPPGHYDTNGTAVGAVLRQDILPKINEMAAGDVLLGLASSGVHSNGFSLVRKIIQHVALPWDAPCPWDESKTLGEGILEPTKIYVKQLLPSIRQRLLLGLAHITGGGLVENIPRAIPDHLQARVDMSTWEVPRVFKWFGQAGNVPHDDILRTFNMGVGMVLIVKRENVKAVCDSLTEEGEIIWELGSLQERPKDAPGCVIENGTKLY"}]}