{"ymdb_id":"YMDB00066","created_at":"2011-05-29T15:53:30.000Z","updated_at":"2016-09-08T18:34:58.000Z","name":"N-acetyl-L-glutamic acid","cas":"1188-37-0","state":"Solid","melting_point":"199 - 201 oC","description":"N-Acetylglutamic acid (NAcGlu) activates carbamoyl phosphate synthase in the urea cycle. It is biosynthesized from glutamic acid and acetyl-CoA by the enzyme N-acetylglutamate synthase. Arginine is the activator for this reaction.","experimental_water_solubility":"52 mg/mL [HMP experimental]","experimental_logp_hydrophobicity":"","location":"Mitochondrion","synthesis_reference":"hang, Xiaolin; Yang, Qiyong; Sun, Yuesheng.  Preparation of N-acetyl-L-glutamic acid.    Huaxue Shijie  (2002),  43(7),  363-365.","chebi_id":"17533","hmdb_id":"HMDB01138","kegg_id":"C00624","pubchem_id":"185","cs_id":"64077","foodb_id":null,"wikipedia_link":"N-Acetylglutamic_acid","biocyc_id":"ACETYL-GLU","iupac":"(2S)-2-acetamidopentanedioic acid","traditional_iupac":"N-acetyl-L-glutamate","logp":"-1.1130154166666664","pka":"4.288470351468538","alogps_solubility":"1.86e+01 g/l","alogps_logp":"-0.67","alogps_logs":"-1.01","acceptor_count":"5","donor_count":"3","rotatable_bond_count":"5","polar_surface_area":"103.70000000000002","refractivity":"40.731500000000004","polarizability":"17.440299050547086","formal_charge":"0","physiological_charge":"-2","pka_strongest_basic":"-1.800828671223234","pka_strongest_acidic":"3.432014588807448","bioavailability":"1","number_of_rings":"0","rule_of_five":"1","ghose_filter":"0","veber_rule":"0","mddr_like_rule":"0","synonyms":["(S)-2-(acetylamino)pentanedioate","2-acetamido-L-Glutaraldehydate","2-acetamido-L-Glutaraldehydic acid","Ac-Glu-OH","acetyl-glutamate","Acetyl-L-glutamate","Acetyl-L-glutamic acid","acetylglutamate","acetylglutamic acid","DL-Acetylglutamate","DL-Acetylglutamic acid","L-Glutamic acid, N-acetyl-","N-Ac-Glu-OH","N-acetyl L-glutamate","N-acetyl L-glutamic acid","N-Acetyl-DL-glutamate","N-Acetyl-DL-glutamic acid","N-acetyl-Glutamate","N-acetyl-Glutamic acid","N-Acetyl-L-glutamate","N-Acetyl-L-glutamic acid","N-Acetyl-L-glutamic acid-gamma-semialdehyde","N-Acetylglutamate","N-Acetylglutamic acid","N-Acetylglutamic gamma-semialdehyde","NAG"],"pathways":[{"name":"Arginine and proline metabolism","kegg_map_id":"00330"}],"growth_conditions":[],"references":[{"pubmed_id":21051339,"citation":"UniProt Consortium (2011). \"Ongoing and future developments at the Universal Protein Resource.\" Nucleic Acids Res 39:D214-D219."},{"pubmed_id":21062828,"citation":"Scheer, M., Grote, A., Chang, A., Schomburg, I., Munaretto, C., Rother, M., Sohngen, C., Stelzer, M., Thiele, J., Schomburg, D. (2011). \"BRENDA, the enzyme information system in 2011.\" Nucleic Acids Res 39:D670-D676."},{"pubmed_id":18846089,"citation":"Herrgard, M. J., Swainston, N., Dobson, P., Dunn, W. B., Arga, K. Y., Arvas, M., Bluthgen, N., Borger, S., Costenoble, R., Heinemann, M., Hucka, M., Le Novere, N., Li, P., Liebermeister, W., Mo, M. L., Oliveira, A. P., Petranovic, D., Pettifer, S., Simeonidis, E., Smallbone, K., Spasic, I., Weichart, D., Brent, R., Broomhead, D. S., Westerhoff, H. V., Kirdar, B., Penttila, M., Klipp, E., Palsson, B. O., Sauer, U., Oliver, S. G., Mendes, P., Nielsen, J., Kell, D. B. (2008). \"A consensus yeast metabolic network reconstruction obtained from a community approach to systems biology.\" Nat Biotechnol 26:1155-1160."},{"pubmed_id":11553611,"citation":"Abadjieva, A., Pauwels, K., Hilven, P., Crabeel, M. (2001). \"A new yeast metabolon involving at least the two first enzymes of arginine biosynthesis: acetylglutamate synthase activity requires complex formation with acetylglutamate kinase.\" J Biol Chem 276:42869-42880."},{"pubmed_id":17439666,"citation":"Castrillo, J. I., Zeef, L. A., Hoyle, D. C., Zhang, N., Hayes, A., Gardner, D. C., Cornell, M. J., Petty, J., Hakes, L., Wardleworth, L., Rash, B., Brown, M., Dunn, W. B., Broadhurst, D., O'Donoghue, K., Hester, S. S., Dunkley, T. P., Hart, S. R., Swainston, N., Li, P., Gaskell, S. J., Paton, N. W., Lilley, K. S., Kell, D. B., Oliver, S. G. (2007). \"Growth control of the eukaryote cell: a systems biology study in yeast.\" J Biol 6:4."}],"proteins":[{"created_at":"2011-05-24T20:44:57.000Z","updated_at":"2011-05-27T14:56:01.000Z","name":"Arginine biosynthesis bifunctional protein ArgJ, mitochondrial","uniprot_id":"Q04728","uniprot_name":"ARGJ_YEAST","enzyme":true,"transporter":false,"gene_name":"ARG7","num_residues":441,"molecular_weight":"47848.30078","theoretical_pi":"7.18","general_function":"Involved in glutamate N-acetyltransferase activity","specific_function":"Catalyzes two activities which are involved in the cyclic version of arginine biosynthesis:the synthesis of acetylglutamate from glutamate and acetyl-CoA, and of ornithine by transacetylation between acetylornithine and glutamate","reactions":[{"id":1755,"direction":"\u003e","locations":"mitochondrion","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":1806,"direction":"\u003e","locations":"mitochondrion","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":2339,"direction":"\u003e","locations":"Mitochondrion matrix","altext":"N(2)-acetyl-L-ornithine + L-glutamate = L-ornithine + N-acetyl-L-glutamate.","export":false,"pw_reaction_id":null,"source":null},{"id":2340,"direction":"\u003e","locations":"Mitochondrion matrix;Mitochondrion","altext":"Acetyl-CoA + L-glutamate = CoA + N-acetyl-L-glutamate.","export":false,"pw_reaction_id":null,"source":null}],"signal_regions":"None","transmembrane_regions":"None","pdb_id":null,"cellular_location":"Mitochondrion matrix","genbank_gene_id":"U90438","genbank_protein_id":"1895091","gene_card_id":"ARG7","chromosome_location":"chromosome 13","locus":"YMR062C","synonyms":["Extracellular mutant protein 40","Glutamate N-acetyltransferase","GAT","Ornithine acetyltransferase","OATase","Ornithine transacetylase","Amino-acid acetyltransferase","N-acetylglutamate synthase","AGS","Arginine biosynthesis bifunctional protein ArgJ alpha chain","Arginine biosynthesis bifunctional protein ArgJ beta chain"],"enzyme_classes":["2.3.1.35","2.3.1.1"],"go_classes":[{"category":"Component","description":" Not Available"},{"category":"Function","description":" transferase activity, transferring acyl groups"},{"category":"Function","description":" transferase activity, transferring acyl groups other than amino-acyl groups"},{"category":"Function","description":" acyltransferase activity"},{"category":"Function","description":" acetyl-CoA:L-glutamate N-acetyltransferase activity"},{"category":"Function","description":" acetyltransferase activity"},{"category":"Function","description":" N-acetyltransferase activity"},{"category":"Function","description":" glutamate N-acetyltransferase activity"},{"category":"Function","description":" catalytic activity"},{"category":"Function","description":" transferase activity"},{"category":"Process","description":" glutamine family amino acid metabolic process"},{"category":"Process","description":" arginine metabolic process"},{"category":"Process","description":" arginine biosynthetic process"},{"category":"Process","description":" metabolic process"},{"category":"Process","description":" cellular metabolic process"},{"category":"Process","description":" cellular amino acid and derivative metabolic process"},{"category":"Process","description":" cellular amino acid metabolic process"}],"pfams":[{"name":"ArgJ","identifier":"PF01960"}],"pathways":[{"name":"Arginine and proline metabolism","kegg_map_id":"00330"}],"gene_sequence":"ATGAGAATATCATCAACATTGCTTCAACGCTCGAAGCAGCTTATAGATAAGTATGCATTATACGTGCCCAAGACGGGCTCTTTTCCTAAAGGATTTGAAGTAGGCTACACTGCATCTGGAGTCAAAAAAAACGGGAGCCTGGACCTGGGTGTAATCTTGAATACCAATAAATCTCGTCCTTCAACCGCAGCAGCTGTTTTCACGACCAATAAATTCAAAGCTGCGCCAGTTTTGACATCGAAAAAAGTCCTTGAAACTGCTCGTGGTAAAAACATCAACGCTATTGTAGTCAATTCCGGTTGTGCTAACTCGGTCACAGGTGATCTTGGTATGAAAGATGCCCAAGTAATGATTGATTTGGTTAACGATAAAATTGGTCAAAAAAATTCTACCCTAGTCATGTCTACAGGCGTTATTGGACAACGACTACAGATGGACAAGATCAGCACTGGTATCAATAAAATTTTTGGAGAAGAAAAGTTCGGCAGTGATTTTAACTCTTGGTTGAACGTAGCCAAATCAATCTGTACTACTGATACTTTCCCAAAATTAGTTACATCTAGATTCAAATTACCTAGTGGTACTGAGTATACTTTGACAGGTATGGCAAAGGGCGCGGGTATGATTTGTCCGAATATGGCTACCTTATTAGGTTTCATAGTTACAGATCTTCCTATTGAAAGCAAGGCGTTGCAGAAGATGCTGACTTTCGCTACTACCCGTTCATTTAATTGTATATCGGTGGACGGTGATATGAGCACCAATGACACAATTTGCATGTTGGCCAACGGTGCTATTGACACCAAAGAAATTAACGAAGACTCTAAAGATTTTGAACAAGTAAAATTGCAGGTCACAGAATTTGCTCAGCGCTTGGCCCAGTTAGTCGTTCGCGATGGTGAAGGTTCGACAAAGTTTGTTACTGTTAACGTTAAAAATGCTTTGCATTTTGAAGACGCCAAAATAATTGCTGAATCAATCTCAAACTCTATGTTGGTCAAAACCGCACTATATGGGCAAGATGCCAATTGGGGAAGAATATTGTGCGCGATCGGGTATGCAAAGCTGAATGACTTAAAATCTCTAGATGTCAACAAAATTAATGTTAGCTTTATTGCTACCGACAATTCAGAACCTCGTGAGCTGAAGCTTGTCGCTAATGGTGTGCCACAATTGGAGATCGATGAAACAAGGGCTTCTGAAATATTGGCTTTGAATGATTTGGAAGTGTCTGTCGACTTGGGAACCGGTGATCAGGCAGCACAATTTTGGACTTGTGATTTATCACATGAATATGTAACAATTAACGGTGATTACCGTTCATAA","protein_sequence":"MRISSTLLQRSKQLIDKYALYVPKTGSFPKGFEVGYTASGVKKNGSLDLGVILNTNKSRPSTAAAVFTTNKFKAAPVLTSKKVLETARGKNINAIVVNSGCANSVTGDLGMKDAQVMIDLVNDKIGQKNSTLVMSTGVIGQRLQMDKISTGINKIFGEEKFGSDFNSWLNVAKSICTTDTFPKLVTSRFKLPSGTEYTLTGMAKGAGMICPNMATLLGFIVTDLPIESKALQKMLTFATTRSFNCISVDGDMSTNDTICMLANGAIDTKEINEDSKDFEQVKLQVTEFAQRLAQLVVRDGEGSTKFVTVNVKNALHFEDAKIIAESISNSMLVKTALYGQDANWGRILCAIGYAKLNDLKSLDVNKINVSFIATDNSEPRELKLVANGVPQLEIDETRASEILALNDLEVSVDLGTGDQAAQFWTCDLSHEYVTINGDYRS"},{"created_at":"2011-05-26T16:22:54.000Z","updated_at":"2011-05-29T05:06:18.000Z","name":"Amino-acid acetyltransferase, mitochondrial","uniprot_id":"P40360","uniprot_name":"NAGS_YEAST","enzyme":true,"transporter":false,"gene_name":"ARG2","num_residues":574,"molecular_weight":"65609.5","theoretical_pi":"9.26","general_function":"Involved in arginine biosynthetic process","specific_function":"N-acetylglutamate synthase involved in arginine biosynthesis","reactions":[{"id":1755,"direction":"\u003e","locations":"mitochondrion","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":2340,"direction":"\u003e","locations":"Mitochondrion matrix;Mitochondrion","altext":"Acetyl-CoA + L-glutamate = CoA + N-acetyl-L-glutamate.","export":false,"pw_reaction_id":null,"source":null}],"signal_regions":"None","transmembrane_regions":"None","pdb_id":null,"cellular_location":"Mitochondrion","genbank_gene_id":"X88851","genbank_protein_id":"895896","gene_card_id":"ARG2","chromosome_location":"chromosome 10","locus":"YJL071W","synonyms":["Arginine-requiring protein 2","Glutamate N-acetyltransferase","N-acetylglutamate synthase","AGS","NAGS"],"enzyme_classes":["2.3.1.1"],"go_classes":[{"category":"Component","description":" Not Available"},{"category":"Function","description":" Not Available"},{"category":"Process","description":" arginine metabolic process"},{"category":"Process","description":" arginine biosynthetic process"},{"category":"Process","description":" metabolic process"},{"category":"Process","description":" cellular metabolic process"},{"category":"Process","description":" cellular amino acid and derivative metabolic process"},{"category":"Process","description":" cellular amino acid metabolic process"},{"category":"Process","description":" glutamine family amino acid metabolic process"}],"pfams":[{"name":"DUF619","identifier":"PF04768"}],"pathways":[{"name":"Arginine and proline metabolism","kegg_map_id":"00330"}],"gene_sequence":"ATGTGGAGGAGAATATTCGCGCATGAACTCAAGTATGATCAACCCAATGCATCTTCAAAAAACTTGATCCTTTCAGTTCTGAATACAACCGCTACAAAACGAGAGGCTAAGGATTATCTCTCAAAATATACAAATGATAGTGGGCAGCATAATCATTGTTTGTTTTTTATCAGGGACCTGCATAAAGTCGCACCAGCGATTTTGTCCCAGTTTTCAAGTGTCATAAAGAGACTAGGAATGCTAGGTTTGCGACCTATGTTTGTAATTCCGCCGTCGCCAACTCATGTAAATATACAGGCAGAGTTACTTGACAGTATCGTTACAGAAGCAGATTTAAAGCCACTTCACCTTAAGGAGGGTCTTACTAAATCCCGCACTGGGTTATATCATTCTGTTTTTTCGCAAGAGAGTCGTTTCTTTGATATTGGAAATTCCAATTTTATACCAATTGTGAAACCTTATGTGTATAATGAAGAGACTGCTTCAGAATTCATGACAAAGGATGTTGTAAAATTTATGGATTGCCTGTGCCAAGGGAATATTCCACACATTGACAAATTCTTCATTCTAAATAATGCCGGAGGTATACCTTCGGGAGAGAGAAATGATAACGCTCATGTATTCATCAATCTTTCTCAGGAACTCGAGCATTTGTCCTCGTCATTATCTCACAATATAAGCACTCTAACCAAACGAGAGCCACGCTCCCAAAACCTGTTACACAGAATGGAGGTGTACGTTAAAAAAGATGAGATATCTTCCTTAGAATGTGAATACCATGATCATTTAGAAAACCTGTTATTGATGGACAAAGTTTTATCAAATCTAGCGGCTACAGCAACGGGACTGATTACAACTGTCAAAGCTGCCGCACTATCATCAGATAGGAAAAATCCTTTAGTATATAATTTATTGACAGACCGATCGCTAATTTCTTCTTCTTTACCAAGGTTTAAAAAAAAGGACGGCGAGATAGACTCACCAGCCAACATGTTTGATGATCACGCATGGTATGAATTGCCTTCCCAACAGGTAAATGCAGCTCCTTCTAACTCAGATGCAGTTTTAGTGACAACTGTTCTCAAAAAGGGCGTCCATATCAAAACTTATGACTATAAGACGCTGACTCAATTCAACTCAATTGGGCTTCCAAAGAAGTTTCACGTACCTGAGAAAGGAGCAAAACCCTCGAGCAATAGTCCAAAACTAGATATCAACAAATTTAAATCCATCATCGATCAGAGCTTTAAAAGATCTTTGGATTTGCATGACTACATAAAAAGGATTAATGGAAAAATAGCTACAATTATTGTGATAGGTGATTATGAAGGCATTGCAATTCTTACCTATGAAGGCTCGGAGGAAAATTCCTTTGTTTATCTCGATAAGTTCGCCGTTCTACCACACTTGAAAGGCTCGCTGGGTATATCTGATATAATCTTCAATTTGATGTTCAAAAAATTTCCTAATGAGATACTTTGGAGAAGCAGAAAAGACAATGTGGTGAACAAGTGGTATTTTCAACGTAGCGTTGCTGTGCTAGATTTGTCGATTGACTTAGACCCCGAACACTGTGATGAAAAGCAAAGCCAATTTAAACTATTTTACTACGGTAACCCTCAATACGCTAAGAGGGCACTACGTGACAAGAAACGTTTAAGAGAATTCATGAGGTCTGTCAGGGACATCAAGCCAAGTTGGGAAAATGAAAAAAATATTTCATGA","protein_sequence":"MWRRIFAHELKYDQPNASSKNLILSVLNTTATKREAKDYLSKYTNDSGQHNHCLFFIRDLHKVAPAILSQFSSVIKRLGMLGLRPMFVIPPSPTHVNIQAELLDSIVTEADLKPLHLKEGLTKSRTGLYHSVFSQESRFFDIGNSNFIPIVKPYVYNEETASEFMTKDVVKFMDCLCQGNIPHIDKFFILNNAGGIPSGERNDNAHVFINLSQELEHLSSSLSHNISTLTKREPRSQNLLHRMEVYVKKDEISSLECEYHDHLENLLLMDKVLSNLAATATGLITTVKAAALSSDRKNPLVYNLLTDRSLISSSLPRFKKKDGEIDSPANMFDDHAWYELPSQQVNAAPSNSDAVLVTTVLKKGVHIKTYDYKTLTQFNSIGLPKKFHVPEKGAKPSSNSPKLDINKFKSIIDQSFKRSLDLHDYIKRINGKIATIIVIGDYEGIAILTYEGSEENSFVYLDKFAVLPHLKGSLGISDIIFNLMFKKFPNEILWRSRKDNVVNKWYFQRSVAVLDLSIDLDPEHCDEKQSQFKLFYYGNPQYAKRALRDKKRLREFMRSVRDIKPSWENEKNIS"},{"created_at":"2011-05-26T16:34:45.000Z","updated_at":"2011-07-22T17:53:55.000Z","name":"Protein ARG5,6, mitochondrial","uniprot_id":"Q01217","uniprot_name":"ARG56_YEAST","enzyme":true,"transporter":false,"gene_name":"ARG5","num_residues":863,"molecular_weight":"94868.39844","theoretical_pi":"8.45","general_function":"Involved in acetylglutamate kinase activity","specific_function":"N-acetyl-L-glutamate 5-semialdehyde + NADP(+) + phosphate = N-acetyl-5-glutamyl phosphate + NADPH","reactions":[{"id":1244,"direction":"\u003e","locations":"mitochondrion","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":1752,"direction":"\u003e","locations":"mitochondrion","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":2427,"direction":"\u003e","locations":"Mitochondrion","altext":"N-acetyl-L-glutamate 5-semialdehyde + NADP(+) + phosphate = N-acetyl-5-glutamyl phosphate + NADPH.","export":false,"pw_reaction_id":null,"source":null},{"id":2428,"direction":"\u003e","locations":"Mitochondrion","altext":"ATP + N-acetyl-L-glutamate = ADP + N-acetyl-L-glutamate 5-phosphate.","export":false,"pw_reaction_id":null,"source":null}],"signal_regions":"None","transmembrane_regions":"None","pdb_id":null,"cellular_location":"Mitochondrion","genbank_gene_id":"U18813","genbank_protein_id":"603305","gene_card_id":"ARG5","chromosome_location":null,"locus":"YER069W","synonyms":["N-acetyl-gamma-glutamyl-phosphate reductase","N-acetyl-glutamate semialdehyde dehydrogenase","NAGSA dehydrogenase","Acetylglutamate kinase","N-acetyl-L-glutamate 5-phosphotransferase","NAG kinase","AGK"],"enzyme_classes":["1.2.1.38","2.7.2.8"],"go_classes":[{"category":"Component","description":" mitochondrion"},{"category":"Component","description":" intracellular part"},{"category":"Component","description":" cytoplasm"},{"category":"Component","description":" cell part"},{"category":"Component","description":" organelle"},{"category":"Component","description":" membrane-bounded organelle"},{"category":"Component","description":" intracellular membrane-bounded organelle"},{"category":"Function","description":" protein dimerization activity"},{"category":"Function","description":" binding"},{"category":"Function","description":" oxidoreductase activity, acting on the aldehyde or oxo group of donors"},{"category":"Function","description":" oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor"},{"category":"Function","description":" N-acetyl-gamma-glutamyl-phosphate reductase activity"},{"category":"Function","description":" nucleotide binding"},{"category":"Function","description":" acetylglutamate kinase activity"},{"category":"Function","description":" oxidoreductase activity"},{"category":"Function","description":" kinase activity"},{"category":"Function","description":" catalytic activity"},{"category":"Function","description":" transferase activity"},{"category":"Function","description":" protein binding"},{"category":"Function","description":" NAD or NADH binding"},{"category":"Function","description":" transferase activity, transferring phosphorus-containing groups"},{"category":"Process","description":" cellular amino acid metabolic process"},{"category":"Process","description":" arginine biosynthetic process"},{"category":"Process","description":" glutamine family amino acid metabolic process"},{"category":"Process","description":" oxidation reduction"},{"category":"Process","description":" arginine metabolic process"},{"category":"Process","description":" metabolic process"},{"category":"Process","description":" cellular amino acid biosynthetic process"},{"category":"Process","description":" cellular metabolic process"},{"category":"Process","description":" cellular amino acid and derivative metabolic process"}],"pfams":[{"name":"Semialdhyde_dh","identifier":"PF01118"},{"name":"Semialdhyde_dhC","identifier":"PF02774"},{"name":"DUF619","identifier":"PF04768"},{"name":"AA_kinase","identifier":"PF00696"}],"pathways":[{"name":"Arginine and proline metabolism","kegg_map_id":"00330"}],"gene_sequence":"ATGCCATCTGCTAGCTTACTCGTCTCGACAAAGAGACTTAACGCTTCCAAATTCCAAAAATTTGTGTCTTCATTAAACAAATCCACCATAGCAGGATTTGCATCTGTACCCTTGAGAGCTCCACCATCCGTTGCATTTACGAGAAAGAAAGTCGGATACTCAAAGAGGTATGTTTCATCTACTAACGGCTTTTCAGCTACTAGATCCACTGTGATCCAACTGTTGAACAATATCAGCACAAAAAGAGAGGTTGAACAATATTTGAAATATTTCACTTCCGTCTCACAACAACAATTTGCTGTGATCAAGGTGGGTGGTGCCATTATCAGCGACAATCTACACGAACTCGCTTCCTGCTTGGCATTTTTGTATCATGTTGGTCTATATCCAATAGTTTTACATGGTACCGGTCCTCAGGTTAATGGAAGGCTAGAAGCGCAGGGAATTGAGCCAGACTATATTGATGGTATTAGAATCACGGATGAGCACACAATGGCCGTAGTTAGAAAATGTTTTTTGGAACAAAATCTTAAGCTAGTTACTGCATTAGAACAGCTAGGGGTCCGTGCAAGACCCATTACTTCTGGTGTTTTTACTGCTGACTATTTGGATAAGGACAAATACAAGCTAGTGGGCAATATTAAAAGTGTCACAAAAGAGCCAATTGAAGCATCTATTAAGGCAGGTGCCCTACCAATCTTGACCTCTTTAGCCGAAACTGCTTCTGGTCAAATGTTGAACGTCAACGCCGACGTAGCTGCTGGTGAATTAGCCCGTGTTTTTGAGCCTTTGAAGATCGTTTACCTGAATGAGAAAGGGGGTATTATCAATGGCTCCACGGGAGAAAAAATTTCGATGATCAATTTGGATGAAGAGTATGACGATTTAATGAAGCAAAGTTGGGTGAAGTATGGTACCAAATTAAAAATTAGAGAAATTAAAGAGCTTTTGGACTATCTTCCTCGTTCTTCTTCAGTTGCAATCATTAACGTTCAAGATCTACAAAAAGAACTGTTCACTGATTCTGGTGCGGGTACTATGATCAGGAGAGGTTACAAATTAGTGAAGAGATCCTCCATTGGCGAATTTCCATCCGCTGATGCTCTAAGAAAAGCTCTTCAAAGGGACGCTGGCATTAGTTCCGGTAAAGAATCTGTTGCTTCTTATTTAAGATATTTGGAAAACTCTGATTTTGTCTCTTATGCTGATGAACCTCTTGAAGCAGTGGCCATTGTAAAGAAAGATACGAACGTTCCCACACTAGACAAATTTGTCTGTTCTGACGCAGCCTGGTTGAATAACGTCACAGATAATGTATTCAATGTTTTGCGCCGTGATTTTCCTGCTTTACAATGGGTAGTCAGTGAAAATGATGCTAACATTGCATGGCATTTTGATAAGTCTCAAGGTTCATATCTAAAAGGCGGAAAAGTTTTGTTCTGGTATGGTATCGATGATATAAATACAATATCCGAGCTCGTTGAAAATTTTGTGAAGTCGTGTGACACTGCTTCTACCCTCAACTCATCAGCAAGTAGTGGAGTATTTGCTAACAAAAAATCAGCTAGGTCGTACTCAACTAGATCCACTCCTCGTCCCGAGGGAGTTAACACCAACCCTGGTCGTGTCGCGCTTATTGGTGCTAGAGGTTACACAGGTAAAAATTTGGTATCTTTGATCAACGGCCACCCATATTTAGAAGTGGCCCATGTTTCTTCTCGTGAATTGAAAGGTCAAAAGTTGCAAGATTATACAAAATCCGAAATTATATATGAAAGTTTGCAAATACAGGATATTAGGAAACTGGAAGAACAAAATGCTGTGGACTTTTGGGTTATGGCATTACCCAACAAAGTCTGTGAACCTTTCGTTGAGACAATCCAAAGTGTTCATGGTAAGTCTAAAATTATTGATCTGTCCGCTGATCACAGGTTTGTATCAGAATCAGACTGGGCTTACGGTTTGCCAGAATTGAATGATAGAGCAAAAATTGCAAACGCTGCCAAAATTGCTAATCCCGGTTGTTATGCTACTGGTTCGCAATTAACTATTTCTCCGTTAACAAAGTATATCAATGGTCTTCCAACTGTGTTTGGTGTTTCAGGGTATTCAGGCGCGGGGACGAAGCCTTCTCCAAAAAACGATCCCAAATTCTTGAACAATAACTTAATTCCTTACGCTTTAAGTGATCATATACACGAACGCGAAATCTCAGCTCGCATTGGGCACAATGTTGCATTCATGCCCCATGTTGGGCAGTGGTTTCAAGGTATCTCTTTGACCGTCTCTATTCCAATAAAAAAAGGTTCCTTGTCTATTGATGAGATCAGGAAATTATACAGAAATTTTTACGAAGACGAAAAGCTAGTACATGTCATCGATGATATCCCACTGGTTAAAGATATTGAGGGCACCCATGGTGTAGTTATTGGTGGTTTCAAGCTGAATGATGCTGAAGATCGTGTAGTTGTTTGCGCAACCATCGATAACTTACTTAAAGGCGCCGCTACTCAATGTCTGCAAAATATTAATCTTGCTATGGGTTATGGAGAGTATGCTGGTATCCCTGAAAATAAAATTATTGGTGTCTGA","protein_sequence":"MPSASLLVSTKRLNASKFQKFVSSLNKSTIAGFASVPLRAPPSVAFTRKKVGYSKRYVSSTNGFSATRSTVIQLLNNISTKREVEQYLKYFTSVSQQQFAVIKVGGAIISDNLHELASCLAFLYHVGLYPIVLHGTGPQVNGRLEAQGIEPDYIDGIRITDEHTMAVVRKCFLEQNLKLVTALEQLGVRARPITSGVFTADYLDKDKYKLVGNIKSVTKEPIEASIKAGALPILTSLAETASGQMLNVNADVAAGELARVFEPLKIVYLNEKGGIINGSTGEKISMINLDEEYDDLMKQSWVKYGTKLKIREIKELLDYLPRSSSVAIINVQDLQKELFTDSGAGTMIRRGYKLVKRSSIGEFPSADALRKALQRDAGISSGKESVASYLRYLENSDFVSYADEPLEAVAIVKKDTNVPTLDKFVCSDAAWLNNVTDNVFNVLRRDFPALQWVVSENDANIAWHFDKSQGSYLKGGKVLFWYGIDDINTISELVENFVKSCDTASTLNSSASSGVFANKKSARSYSTRSTPRPEGVNTNPGRVALIGARGYTGKNLVSLINGHPYLEVAHVSSRELKGQKLQDYTKSEIIYESLQIQDIRKLEEQNAVDFWVMALPNKVCEPFVETIQSVHGKSKIIDLSADHRFVSESDWAYGLPELNDRAKIANAAKIANPGCYATGSQLTISPLTKYINGLPTVFGVSGYSGAGTKPSPKNDPKFLNNNLIPYALSDHIHEREISARIGHNVAFMPHVGQWFQGISLTVSIPIKKGSLSIDEIRKLYRNFYEDEKLVHVIDDIPLVKDIEGTHGVVIGGFKLNDAEDRVVVCATIDNLLKGAATQCLQNINLAMGYGEYAGIPENKIIGV"}]}