{"ymdb_id":"YMDB16197","created_at":"2016-09-12T18:22:21.000Z","updated_at":"2016-10-17T22:34:42.000Z","name":"PIP(18:0/20:4(5Z,8Z,11Z,14Z))","cas":null,"state":"Solid","melting_point":null,"description":null,"experimental_water_solubility":null,"experimental_logp_hydrophobicity":null,"location":null,"synthesis_reference":null,"chebi_id":null,"hmdb_id":null,"kegg_id":null,"pubchem_id":null,"cs_id":null,"foodb_id":null,"wikipedia_link":null,"biocyc_id":null,"iupac":"{[(1R,5S)-2,3,4,6-tetrahydroxy-5-({hydroxy[(2R)-2-[(5Z,8Z,11Z,14Z)-icosa-5,8,11,14-tetraenoyloxy]-3-(octadecanoyloxy)propoxy]phosphoryl}oxy)cyclohexyl]oxy}phosphonic acid","traditional_iupac":"[(1R,5S)-2,3,4,6-tetrahydroxy-5-{[hydroxy(2R)-2-[(5Z,8Z,11Z,14Z)-icosa-5,8,11,14-tetraenoyloxy]-3-(octadecanoyloxy)propoxyphosphoryl]oxy}cyclohexyl]oxyphosphonic acid","logp":"10.090215222000001","pka":"1.9166523982542731","alogps_solubility":"4.31e-03 g/l","alogps_logp":"6.83","alogps_logs":"-5.35","acceptor_count":"11","donor_count":"7","rotatable_bond_count":"42","polar_surface_area":"256.03999999999996","refractivity":"253.5131","polarizability":"107.98000204383341","formal_charge":"0","physiological_charge":"-3","pka_strongest_basic":"-3.6477611462617663","pka_strongest_acidic":"1.0756803498311571","bioavailability":"0","number_of_rings":"1","rule_of_five":"0","ghose_filter":"0","veber_rule":"0","mddr_like_rule":"0","synonyms":[],"pathways":[{"name":"Stress-activated signalling pathways: cell wall stress test 1","kegg_map_id":null}],"growth_conditions":[],"references":[],"proteins":[{"created_at":"2011-05-26T21:20:36.000Z","updated_at":"2011-05-27T15:01:10.000Z","name":"Phosphatidylinositol 4-kinase STT4","uniprot_id":"P37297","uniprot_name":"STT4_YEAST","enzyme":true,"transporter":false,"gene_name":"STT4","num_residues":1900,"molecular_weight":"214605.0","theoretical_pi":"7.51","general_function":"Involved in binding","specific_function":"Acts on phosphatidylinositol (PI) in the first committed step in the production of the second messenger inositol-1,4,5,- trisphosphate. STT4 functions in PKC1 protein kinase pathway","reactions":[{"id":2054,"direction":"\u003e","locations":"cytoplasm;cell envelope","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":2590,"direction":"\u003e","locations":" Peripheral membrane protein.; Cytoplasmic side.; Peripheral membrane protein;Cell membrane; Cytoplasmic side. Vacuole membrane; Peripheral membrane protein. Vacuole membrane;Nucleus","altext":"ATP + 1-phosphatidyl-1D-myo-inositol = ADP + 1-phosphatidyl-1D-myo-inositol 4-phosphate.","export":false,"pw_reaction_id":null,"source":null},{"id":34400,"direction":"\u003e","locations":null,"altext":null,"export":true,"pw_reaction_id":"PW_R026916","source":"Smpdb"}],"signal_regions":"None","transmembrane_regions":"None","pdb_id":null,"cellular_location":null,"genbank_gene_id":"D13717","genbank_protein_id":"454207","gene_card_id":"STT4","chromosome_location":"chromosome 12","locus":"YLR305C","synonyms":["PI4-kinase","PtdIns-4-kinase"],"enzyme_classes":["2.7.1.67"],"go_classes":[{"category":"Component","description":" Not Available"},{"category":"Function","description":" transferase activity, transferring phosphorus-containing groups"},{"category":"Function","description":" binding"},{"category":"Function","description":" kinase activity"},{"category":"Function","description":" phosphotransferase activity, alcohol group as acceptor"},{"category":"Function","description":" inositol or phosphatidylinositol kinase activity"},{"category":"Function","description":" catalytic activity"},{"category":"Function","description":" transferase activity"},{"category":"Process","description":" metabolic process"},{"category":"Process","description":" glycerophospholipid metabolic process"},{"category":"Process","description":" phosphoinositide metabolic process"},{"category":"Process","description":" phosphoinositide phosphorylation"},{"category":"Process","description":" biological regulation"},{"category":"Process","description":" regulation of biological process"},{"category":"Process","description":" regulation of cellular process"},{"category":"Process","description":" signal transduction"},{"category":"Process","description":" organophosphate metabolic process"},{"category":"Process","description":" phospholipid metabolic process"},{"category":"Process","description":" intracellular signal transduction"},{"category":"Process","description":" second-messenger-mediated signaling"},{"category":"Process","description":" phosphoinositide-mediated signaling"}],"pfams":[{"name":"PI3_PI4_kinase","identifier":"PF00454"},{"name":"PI3Ka","identifier":"PF00613"}],"pathways":[{"name":"Inositol phosphate metabolism","kegg_map_id":"00562"},{"name":"Stress-activated signalling pathways: cell wall stress test 1","kegg_map_id":null}],"gene_sequence":"ATGAGATTTACCAGAGGATTGAAAGCCTCTTCATCTTTAAGAGCGAAAGCTATTGGAAGACTAACTAAACTTTCGACGGGTGCACCAAATGATCAAAACAGTAATGGTACAACATTAGATTTGATAACTCATACATTGCCTATTTTTTATTCCACAAATACGTCAAAGATATATACTATACCACTTACATTGAGTGAATGGGAAGTACTAACTTCGCTTTGTGTTGCAATCCCTACCACGCTCGATCTGGTAGAGACAATGCTTAAAGAAATAATTGCGCCTTATTTTCTGGAAACTCCAAGACAAAGAATTTCAGATGTTCTGTCCTCGAAATTCAAGTTGGAACAGATGAGAAATCCGATTGAATTGCTAACTTTCCAGCTGACTAAGTTTATGATTCAAGCTTGTGAACAGTATCCCGTTCTTTATGAAAATATAGGCGGCATCATTTCAACTTATTTTGAGCGTGTATTAAAGATTTTTACTATTAAACAATCTGGTCTACTATCATTAGTGGGCTTTATTAATGCATTTATCCAGTTTCCCAACTCTACAGAGCTTACTAAATTCACCTGGAAAAAGTTAGCGAAACTTGTGCTTCGTGGATCATTTCTAAACGAAGTTGATAAGATTTTGAATTCCTCAGCAACGTTTACCAATGATTCAATCGTCCAGTATTACGATGCTGGAAATGAGCTATCCAGTGCCTACTTATTGGAATTAATATCTCGCTTGCAAGTTTCGCTAATATCTCATCTGTTAAACACTTCACATGTCGGCGCCAACTTGAGTGAGTTCTTACTAAATCAACAATACCAATTTTACAAATTTGATCAAGAAGTAGCCGATGAAAACGATGATACGAAGTGTATTGATGACTTTTTTTTCAACGTAAGAAGCAATAAACAGTTTTTTACAGACATGTGTAAAATTTCCCTGCAATTTTGTTCTGAGTCACATATTCTCGACCTATCTACAGACAACCGTGCAAGATTTTCTTTTGATACTCGAGCCCACTACCTACAAACGCTATGTTTAATTCCGTTTATAGAAGATACAGAAAGCGAGCTTTTTGAAAGTTTTACAAACGTTGTTTCTGAATCTATTGATAAATTTTTCTTATCTGATGTCGTTACACCATCTTTGATAAAAGCAATTGTAGCTTCTGCCTCATTACTGAATTTTTTCACGGAAAAATTGTCATTAACTCTGATTCGTATGTTTCCTTTATTGGTAGCGTCTCCACATATCACGACTGAAACAGTTAATGATGTTGCAAAAATATTCACCACAGGGTTATACCCTCTAAATGAAGATGCAATTGTGAGTACAATTTATTCCATGAACAACTTATTAGCTGTTTCCGAGGACGGATCGCCCGTCCCGGTGCTCCGTGAGCGCCAATTAACAATAACATCTGGAAAAAATATTGAGAAAGACTACTTTCCCTTGCGAAATTCATCTGCCAGTTTGGATGGCACTGGTGCTCTGCTCGGAAATACAACTGTGGGCCAACTTTCCTCCCATGATGTCAATAGTGGAGCTACTATGACATACCATGCCTCTTTGATATCTAATTGTGTGGCTGCAACGACTACAATCGCCTCATACTATAACACCCAAAGCATAACAGCTTTGACTATCTCTATTCTTACGCAAAAAGTTAATTCTATGTCAAAGGAATTAGACGGTGTTATTTTGAATTCTTTGGCAAGACTGGCTCCGAATACTTCACTGACAGAATTTTCGCTTTTACTAAAGTTCTTCAAATCGAGAACTGTTATTGCAACAAAAATTGATGATAGCGCACTATTGAAAAATATTATTAAAGCGAAATGTGTGATATCAAAAGAATTGTTGGCCAGGCATTTTTCAAGTGACTTATATTTCATGTATCTTCATGATCTATTGGACTCAATTATTGCCAGTGGGGAAGTAGAACGATTGGAACATCACAGACCTCAAACGGAGATTTCCCGTGTTGCTGATCAAATCGCTACCTACTTAGAGCCCCTCGCCGCTCTACTACCTGTTCCAGGCGATACGCCATTGGACATCAATAAGGATGAAGTCACTACAAACAAGTTTAGGAACGCCTGGTTTAATTTTGTCATCCATGGGTACCACTTGGGCGGGCCTATTGTCAAACGAAATTTTTCTTTCTTATTAACTATTGCCTATAATTCACCACCGCTGGCTTCTGAATTTCCTGCTAATAACAAGGAGCTTTCATTGGAAATGAATACAATTTTGCGCCGCGGTTCATCTAACGAAAATATCAAACAACAAAAACAGCAGATAACCGAATATTTCAATACCAATATTGTTCAGTACAGAACAACTTCATCGTCTAAAATCATGTTTTTAGCGGCTGCCGTTCTTCTAGAAACGATAAGATGCGAGGCAGGTGATTGTTCGAAGACTCTACTGTACTTCTCAGACCCCTCCATTCTTTCCGGTTCAATTGAGAAGTGTATTGCAGTTTTGTCGGTTTCCATGATTAGAAAATATGCTAGGTTGATTCAAAAGGGTAACGATGCCATATTCAACTCCAAAATGATTGCTCAACAACTTAATAACTTGCTACTTTGCCTTTCTCATAGGGAACCAACTTTGCAGGATGCCGCTTTTCATGCCTGTGAAATATTTATCAGATCTATTCCATCATCGTTATGTCATCATTTGTCTCTCTACACCTTATTAGATATGTTAACAGCCCTATTTGATAGTATCTTGGACTCAGAGGCGCATAAATTTGAACCACGGTATGAATTTAAATTAAAACATTCTAAGACAACAATATTGGTTCCAAGCTCATCGTCCTGGCGCGCTACGACACTATCAAGATTGCACAAGTCAGCTAAAGAATGGGTAAGAATTCTATTGAATAGAAGCAACCAGGATACTAAGATTTTGTTGCAATCATACATATCAGATCTCGGTGAGTACAGCAGGCTAAATTCTGTTGAATTTGGTGTCTCATTTGCCATGGATATGGCTGGTTTGATTTTACCTGCAGATAAGGAATTATCAAGGCTTACTTATTATGGTCCAGAGAAACCTAACACGATTTCTGGATTTATATCTTTACATTCTTGGAGGTCAAAATATCTTTTCGATACCGCTATTACATCATCACCAGAGGATATCAAAAGGCAAATAGGTATTTCCACTCAAAACATAAGAAAAAATTTGACCTTAGGAAATAAGATTATAACTAAAGACGTAACTGATTTTCTTGATATGGCTACCGCGTTGTTAATTCTTGGCAATGGTGCACCAGCGTCATTGATATATGATATTGTGCACATCCCGTTCGAAGTCTTCACCTCTGCATCTTTAAAGATTGCTACAAACGTATGGTTAACAATCATAACTGAGAAGCCCGAAGTTGCACATTTGCTTTTAGTTGAAGTATGCTATTGCTGGATGCGCTCCATTGATGACAATATTGGTCTCTATTCTCGCGATCATGACTTAAAGGGCGAAGAATACCAAAAAATGGAATATTCCCCTTACGACAAAGCAGGTATCAACAGAGATGCAAAAAATGCATCCCAAGCTATGCAACCACATCTTCATGTTATTAAATTTTTTGCTTCCCATTTCGAGGGCACACTATTTCAAAGTGACTTTTTATTGAAAATTTTCACGAAATGTGCGCTTTATGGTATCAAGAATCTGTATAAGGCTTCACTACACCCGTTCGCTAGAATGATTCGCCACGAATTATTGTTATTCGCGACTCTCGTACTAAACGCAAGTTATAAGCAGGGATCCAAGTATATGGGCCGTTTGTCGCAAGAAATCACAAACGGTGCCCTAAGTTGGTTTAAAAGACCAGTAGCGTGGCCATTCGGCTCAAATGAGCTGAAAATCAAAGCAGATTTATCTGTTACTAGGGACCTTTTCCTTCAGCTCAACAAATTAAGCTCGCTAATGTCACGTCATTGCGGGAAAGATTACAAGATCCTGAACTATTTCTTGGCAAGCGAAATCCAGCAGATTCAAACCTGGCTTACTCCGACTGAGAAGATTGAAGGAGCCGACAGCAACGAGCTTACAAGCGATATCGTTGAAGCTACCTTTGCTAAAGATCCAACATTAGCAATAAATCTCTTACAACGATGTTATAGCAAGAAAGCTGAGGATGTTTTAGTGGGCTTAGTTGCAAAACATGCTTTAATGTGTGTGGGGTCCCCAAGTGCTCTTGACCTGTTCATAAAAGGAAGCCACCTGAGCAGTAAGAAAGACCTACACGCAACCTTATACTGGGCGCCAGTGAGCCCGTTAAAATCTATCAACCTTTTCCTTCCCGAATGGCAAGGTAATTCTTTTATCTTACAATTCAGCATATATTCGTTGGAATCACAAGATGTGAACTTGGCATTCTTCTATGTTCCTCAAATCGTACAATGTTTGAGATACGATAAAACCGGATATGTCGAAAGATTGATTTTGGATACTGCGAAAATTAGTGTGTTATTTTCTCATCAAATAATCTGGAATATGCTTGCAAACTGCTACAAGGATGATGAAGGTATACAAGAAGATGAAATCAAACCAACTCTAGATCGTATTAGGGAGCGTATGGTTTCAAGTTTCAGCCAATCTCATCGCGATTTTTACGAACGTGAATTTGAATTCTTCGACGAAGTAACTGGCATATCTGGTAAGTTGAAACCATACATAAAAAAAAGTAAGGCTGAAAAGAAACATAAGATCGATGAAGAAATGAGCAAAATTGAGGTGAAACCTGATGTTTATTTACCTTCTAATCCTGACGGTGTAGTTATTGATATTGATCGGAAGAGTGGTAAGCCACTTCAATCTCACGCAAAGGCGCCTTTTATGGCGACCTTTAAAATAAAGAAAGACGTAAAAGATCCTTTGACAGGTAAAAACAAGGAAGTTGAAAAATGGCAAGCTGCTATCTTCAAAGTCGGTGATGACTGTAGGCAAGATGTTCTAGCGTTACAATTGATCTCGCTATTTAGAACCATTTGGTCTAGTATTGGCTTGGATGTCTACGTTTTTCCCTACAGAGTTACTGCGACGGCACCGGGTTGTGGTGTCATCGATGTGCTACCCAATTCGGTATCCCGTGATATGTTAGGACGTGAAGCTGTTAATGGATTATATGAATATTTCACTAGTAAATTTGGTAATGAATCTACTATCGAATTTCAAAACGCACGAAACAACTTTGTTAAATCCTTAGCGGGATATAGCGTAATTTCGTATTTGTTGCAATTCAAGGATAGACATAATGGTAACATTATGTACGATGATCAAGGACATTGTCTACATATCGATTTTGGGTTTATTTTTGATATTGTCCCAGGTGGTATCAAGTTTGAAGCAGTACCATTCAAGCTGACGAAAGAAATGGTTAAAGTGATGGGAGGTTCGCCCCAGACCCCAGCGTATCTGGACTTTGAAGAACTTTGTATCAAGGCATATCTAGCCGCCCGTCCGCACGTGGAGGCCATAATTGAGTGTGTAAATCCTATGTTAGGAAGCGGTCTCCCCTGCTTTAAGGGTCACAAGACAATTAGGAATCTAAGAGCAAGATTTCAACCTCAAAAAACCGATCACGAAGCTGCACTATATATGAAGGCGCTAATCCGTAAAAGTTATGAAAGTATATTCACTAAAGGTTATGATGAATTCCAAAGGCTCACAAATGGCATTCCGTACTGA","protein_sequence":"MRFTRGLKASSSLRAKAIGRLTKLSTGAPNDQNSNGTTLDLITHTLPIFYSTNTSKIYTIPLTLSEWEVLTSLCVAIPTTLDLVETMLKEIIAPYFLETPRQRISDVLSSKFKLEQMRNPIELLTFQLTKFMIQACEQYPVLYENIGGIISTYFERVLKIFTIKQSGLLSLVGFINAFIQFPNSTELTKFTWKKLAKLVLRGSFLNEVDKILNSSATFTNDSIVQYYDAGNELSSAYLLELISRLQVSLISHLLNTSHVGANLSEFLLNQQYQFYKFDQEVADENDDTKCIDDFFFNVRSNKQFFTDMCKISLQFCSESHILDLSTDNRARFSFDTRAHYLQTLCLIPFIEDTESELFESFTNVVSESIDKFFLSDVVTPSLIKAIVASASLLNFFTEKLSLTLIRMFPLLVASPHITTETVNDVAKIFTTGLYPLNEDAIVSTIYSMNNLLAVSEDGSPVPVLRERQLTITSGKNIEKDYFPLRNSSASLDGTGALLGNTTVGQLSSHDVNSGATMTYHASLISNCVAATTTIASYYNTQSITALTISILTQKVNSMSKELDGVILNSLARLAPNTSLTEFSLLLKFFKSRTVIATKIDDSALLKNIIKAKCVISKELLARHFSSDLYFMYLHDLLDSIIASGEVERLEHHRPQTEISRVADQIATYLEPLAALLPVPGDTPLDINKDEVTTNKFRNAWFNFVIHGYHLGGPIVKRNFSFLLTIAYNSPPLASEFPANNKELSLEMNTILRRGSSNENIKQQKQQITEYFNTNIVQYRTTSSSKIMFLAAAVLLETIRCEAGDCSKTLLYFSDPSILSGSIEKCIAVLSVSMIRKYARLIQKGNDAIFNSKMIAQQLNNLLLCLSHREPTLQDAAFHACEIFIRSIPSSLCHHLSLYTLLDMLTALFDSILDSEAHKFEPRYEFKLKHSKTTILVPSSSSWRATTLSRLHKSAKEWVRILLNRSNQDTKILLQSYISDLGEYSRLNSVEFGVSFAMDMAGLILPADKELSRLTYYGPEKPNTISGFISLHSWRSKYLFDTAITSSPEDIKRQIGISTQNIRKNLTLGNKIITKDVTDFLDMATALLILGNGAPASLIYDIVHIPFEVFTSASLKIATNVWLTIITEKPEVAHLLLVEVCYCWMRSIDDNIGLYSRDHDLKGEEYQKMEYSPYDKAGINRDAKNASQAMQPHLHVIKFFASHFEGTLFQSDFLLKIFTKCALYGIKNLYKASLHPFARMIRHELLLFATLVLNASYKQGSKYMGRLSQEITNGALSWFKRPVAWPFGSNELKIKADLSVTRDLFLQLNKLSSLMSRHCGKDYKILNYFLASEIQQIQTWLTPTEKIEGADSNELTSDIVEATFAKDPTLAINLLQRCYSKKAEDVLVGLVAKHALMCVGSPSALDLFIKGSHLSSKKDLHATLYWAPVSPLKSINLFLPEWQGNSFILQFSIYSLESQDVNLAFFYVPQIVQCLRYDKTGYVERLILDTAKISVLFSHQIIWNMLANCYKDDEGIQEDEIKPTLDRIRERMVSSFSQSHRDFYEREFEFFDEVTGISGKLKPYIKKSKAEKKHKIDEEMSKIEVKPDVYLPSNPDGVVIDIDRKSGKPLQSHAKAPFMATFKIKKDVKDPLTGKNKEVEKWQAAIFKVGDDCRQDVLALQLISLFRTIWSSIGLDVYVFPYRVTATAPGCGVIDVLPNSVSRDMLGREAVNGLYEYFTSKFGNESTIEFQNARNNFVKSLAGYSVISYLLQFKDRHNGNIMYDDQGHCLHIDFGFIFDIVPGGIKFEAVPFKLTKEMVKVMGGSPQTPAYLDFEELCIKAYLAARPHVEAIIECVNPMLGSGLPCFKGHKTIRNLRARFQPQKTDHEAALYMKALIRKSYESIFTKGYDEFQRLTNGIPY"},{"created_at":"2011-05-27T01:34:44.000Z","updated_at":"2011-05-27T15:01:15.000Z","name":"Probable phosphatidylinositol-4-phosphate 5-kinase MSS4","uniprot_id":"P38994","uniprot_name":"MSS4_YEAST","enzyme":true,"transporter":false,"gene_name":"MSS4","num_residues":779,"molecular_weight":"89319.60156","theoretical_pi":"9.77","general_function":"Involved in phosphatidylinositol phosphate kinase activity","specific_function":"Catalyzes the phosphorylation of phosphatidylinositol-4- phosphate on the fifth hydroxyl of the myo-inositol ring, to form phosphatidylinositol-4,5-biphosphate. Acts downstream of STT4, but in a pathway that does not involve PKC1. May be involved in the organization of the actin cytoskeleton","reactions":[{"id":1854,"direction":"\u003c\u003e","locations":"nucleus","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":2055,"direction":"\u003e","locations":"cytoplasm;cell envelope","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":2629,"direction":"\u003e","locations":null,"altext":"ATP + 1-phosphatidyl-1D-myo-inositol 4-phosphate = ADP + 1-phosphatidyl-1D-myo-inositol 4,5-bisphosphate.","export":false,"pw_reaction_id":null,"source":null},{"id":34399,"direction":"\u003e","locations":null,"altext":null,"export":true,"pw_reaction_id":"PW_R026915","source":"Smpdb"}],"signal_regions":"None","transmembrane_regions":"None","pdb_id":null,"cellular_location":null,"genbank_gene_id":"D13716","genbank_protein_id":"493719","gene_card_id":"MSS4","chromosome_location":"chromosome 4","locus":"YDR208W","synonyms":["1-phosphatidylinositol-4-phosphate kinase","Diphosphoinositide kinase","PIP5K"],"enzyme_classes":["2.7.1.68"],"go_classes":[{"category":"Component","description":" Not Available"},{"category":"Function","description":" transferase activity, transferring phosphorus-containing groups"},{"category":"Function","description":" kinase activity"},{"category":"Function","description":" lipid kinase activity"},{"category":"Function","description":" phosphatidylinositol phosphate kinase activity"},{"category":"Function","description":" catalytic activity"},{"category":"Function","description":" transferase activity"},{"category":"Process","description":" metabolic process"},{"category":"Process","description":" organophosphate metabolic process"},{"category":"Process","description":" phospholipid metabolic process"},{"category":"Process","description":" glycerophospholipid metabolic process"},{"category":"Process","description":" phosphoinositide metabolic process"},{"category":"Process","description":" phosphatidylinositol metabolic process"}],"pfams":[{"name":"PIP5K","identifier":"PF01504"}],"pathways":[{"name":"Inositol phosphate metabolism","kegg_map_id":"00562"},{"name":"Stress-activated signalling pathways: cell wall stress test 1","kegg_map_id":null}],"gene_sequence":"ATGTCAGTCTTGCGATCACAACCTCCTTCAGTTGTACCGCTACATCTGACAACATCCACCAGTCGCAAAACAGAACAGGAACCATCGCTATTGCACTCTGCAATAATTGAGCGACATCAAGACCGTTCGGTGCCGAATTCGAATTCGAATCCGGATTCGAACCATCGAATAAAAAAGGATCGCAATAATCACACTAGCTACCATTCTTCATCGAATTCGGAGTCAAACATGGAAAGTCCTCGCTTGTCAGATGGTGAGTCTTCCACTCCGACCTCTATTGAAGAGTTAAACCCAACAATAAATAATTCGAGGCTGGTGAAGAGAAACTACTCAATATCAATTGATCCTTTGCATGACAACAGCAATAATAATACGGATGATGATCATCCAAACACCATAACTTCTCCCCGACCAAATAGCACTAGTAACAAGGAAATGCAGAAATATAGTTTTCCCGAAGGTAAGGAGTCGAAGAAAATAACAACACCCTCGTTAAACTCGAATAATTGTTTGGATTTGGACAATTCCAGTCTTGTTCACACAGATTCGTATATACAAGACTTAAATGACGATCATATTTTACTAAACAAGCGCGTTTCGAGGCGATCTTCCAGAATATCGGCAGTAACAGCAACTTCCACGACAATTAAGCAAAGAAGAAATACTCAGGATTCAAACCTGCCCAATATTCCCTTTCACGCTTCCAAGCATTCTCAAATTCTACCTATGGATGATTCAGATGTAATAAAATTGGCCAATGGAGATACCAGCATGAAACCAAATTCTGCTACTAAAATTAGTCACTCAATGACTTCTTTGCCTCTTCACCCACTTCCACAACCTTCTCAAAAGTCAAAGCAATATCATATGATATCCAAATCAACCACTTCTTTGCCGCCAGAGAATGACCACTACTATCAACACAGTCGTGGCACCAACCATAATCACGCTGCTAATGCTGCTGCTGTTAATAATAATACTACTACTACTACCGCTGCTACGGGTCTGAAAAGATCAGAGTCTGCAACGGCAGAAATTAAGAAGATGAGACAATCTTTGCTGCATAAAAGAGAAATGAAAAGGAAAAGAAAAACATTTTTGGTGGATGACGATCGTGTTTTGATCGGTAATAAAGTCAGTGAAGGCCATGTCAACTTTATTATAGCTTATAACATGCTAACTGGTATTCGTGTCGCAGTGTCACGTTGTTCTGGCATAATGAAACCTTTAACTCCGGCAGACTTTAGATTTACGAAGAAGCTTGCCTTTGATTATCACGGAAATGAATTGACCCCTTCTTCTCAGTATGCATTTAAGTTTAAGGACTACTGTCCCGAAGTCTTTAGGGAACTTCGTGCATTATTCGGCTTAGACCCTGCTGATTATTTGGTTTCGTTAACTTCCAAGTACATTTTGAGTGAGTTGAACTCGCCAGGTAAAAGTGGTTCATTTTTTTATTATTCGAGAGATTACAAATATATTATCAAGACCATACATCATTCTGAGCATATTCATCTGAGAAAGCACATACAAGAATACTATAACCATGTAAGAGACAATCCAAACACTTTGATTTGTCAATTTTATGGTTTGCATAGAGTGAAAATGCCTATATCGTTTCAAAATAAAATTAAGCATCGGAAAATTTACTTCCTAGTCATGAATAACTTATTTCCACCACACTTAGACATTCACATTACTTATGATTTAAAAGGTTCCACATGGGGCCGTTTTACCAATTTGGATAAAGAAAGGTTGGCGAAAGATAGATCATATAGGCCTGTGATGAAAGATTTAAATTGGCTTGAAGAAGGTCAGAAAATTAAATTGGGTCCATTGAAGAAGAAGACTTTTTTGACACAGCTGAAAAAAGATGTGGAATTGCTTGCTAAATTGAATACAATGGACTATTCCTTGTTAATTGGCATTCATGACATCAATAAAGCTAAAGAAGACGACTTACAATTGGCGGATACGGCATCTATTGAGGAGCAACCACAGACTCAAGGACCCATAAGAACCGGTACCGGAACGGTAGTACGACATTTTTTTAGAGAGTTTGAAGGTGGAATTCGAGCATCTGATCAATTCAACAATGATGTGGATTTGATTTATTACGTGGGGATAATCGATTTCTTGACTAATTACTCAGTTATGAAGAAATTAGAAACGTTTTGGAGAAGTCTACGCCATGATACCAAGTTGGTGAGTGCCATACCTCCAAGAGACTACGCCAATAGGTTTTACGAATTCATAGAAGATTCTGTAGATCCTCTGCCCCAGAAAAAAACTCAATCGTCGTATAGAGACGACCCTAACCAGAAAAATTATAAAGACTGA","protein_sequence":"MSVLRSQPPSVVPLHLTTSTSRKTEQEPSLLHSAIIERHQDRSVPNSNSNPDSNHRIKKDRNNHTSYHSSSNSESNMESPRLSDGESSTPTSIEELNPTINNSRLVKRNYSISIDPLHDNSNNNTDDDHPNTITSPRPNSTSNKEMQKYSFPEGKESKKITTPSLNSNNCLDLDNSSLVHTDSYIQDLNDDHILLNKRVSRRSSRISAVTATSTTIKQRRNTQDSNLPNIPFHASKHSQILPMDDSDVIKLANGDTSMKPNSATKISHSMTSLPLHPLPQPSQKSKQYHMISKSTTSLPPENDHYYQHSRGTNHNHAANAAAVNNNTTTTTAATGLKRSESATAEIKKMRQSLLHKREMKRKRKTFLVDDDRVLIGNKVSEGHVNFIIAYNMLTGIRVAVSRCSGIMKPLTPADFRFTKKLAFDYHGNELTPSSQYAFKFKDYCPEVFRELRALFGLDPADYLVSLTSKYILSELNSPGKSGSFFYYSRDYKYIIKTIHHSEHIHLRKHIQEYYNHVRDNPNTLICQFYGLHRVKMPISFQNKIKHRKIYFLVMNNLFPPHLDIHITYDLKGSTWGRFTNLDKERLAKDRSYRPVMKDLNWLEEGQKIKFGPLKKKTFLTQLKKDVELLAKLNTMDYSLLIGIHDINKAKEDDLQLADTASIEEQPQTQGPIRTGTGTVVRHFFREFEGGIRASDQFNNDVDLIYYVGIIDFLTNYSVMKKLETFWRSLRHDTKLVSAIPPRDYANRFYEFIEDSVDPLPQKKTQSSYRDDPNQKNYKD"}]}