Reviewed
Homo sapiens (Human) [TaxID: 9606]
ORF1 [Gene ID: 1491970]
♦ Genome polyprotein [Cleaved into: Protein p48
♦ NTPase (EC 3.6.1.15) (p41)
♦ Protein p22
♦ Viral genome-linked protein (VPG)
♦ 3C-like protease (3CLpro) (EC 3.4.22.66) (Calicivirin)
♦ RNA-directed RNA polymerase (RdRp) (EC 2.7.7.48)]
Norwalk Virus (strain GI/Human/United States/Norwalk/1968) (Hu/NV/NV/1968/US)
Viruses > ssRNA Viruses > ssRNA Positive-strand Viruses > No DNA Stage > Caliciviridae > Norovirus > Norwalk Virus > Norovirus Isolates > Norwalk Virus (strain GI/Human/United States/Norwalk/1968) (Hu/NV/NV/1968/US)
Various pathway(s) in which protein is involved
Not Available
MMMASKDVVPTAASSENANNNSSIKSRLLARLKGSGGATSPPNSIKITNQDMALGLIGQVPAPKATSVDVPKQQRDRPPRTVAEVQQNLRWTERPQDQNV
KTWDELDHTTKQQILDEHAEWFDAGGLGPSTLPTSHERYTHENDEGHQVKWSAREGVDLGISGLTTVSGPEWNMCPLPPVDQRSTTPATEPTIGDMIEFY
EGHIYHYAIYIGQGKTVGVHSPQAAFSITRITIQPISAWWRVCYVPQPKQRLTYDQLKELENEPWPYAAVTNNCFEFCCQVMCLEDTWLQRKLISSGRFY
HPTQDWSRDTPEFQQDSKLEMVRDAVLAAINGLVSRPFKDLLGKLKPLNVLNLLSNCDWTFMGVVEMVVLLLELFGIFWNPPDVSNFIASLLPDFHLQGP
EDLARDLVPIVLGGIGLAIGFTRDKVSKMMKNAVDGLRAATQLGQYGLEIFSLLKKYFFGGDQTEKTLKDIESAVIDMEVLSSTSVTQLVRDKQSARAYM
AILDNEEEKARKLSVRNADPHVVSSTNALISRISMARAALAKAQAEMTSRMRPVVIMMCGPPGIGKTKAAEHLAKRLANEIRPGGKVGLVPREAVDHWDG
YHGEEVMLWDDYGMTKIQEDCNKLQAIADSAPLTLNCDRIENKGMQFVSDAIVITTNAPGPAPVDFVNLGPVCRRVDFLVYCTAPEVEHTRKVSPGDTTA
LKDCFKPDFSHLKMELAPQGGFDNQGNTPFGKGVMKPTTINRLLIQAVALTMERQDEFQLQGPTYDFDTDRVAAFTRMARANGLGLISMASLGKKLRSVT
TIEGLKNALSGYKISKCSIQWQSRVYIIESDGASVQIKEDKQALTPLQQTINTASLAITRLKAARAVAYASCFQSAITTILQMAGSALVINRAVKRMFGT
RTAAMALEGPGKEHNCRVHKAKEAGKGPIGHDDMVERFGLCETEEEESEDQIQMVPSDAVPEGKNKGKTKKGRGRKNNYNAFSRRGLSDEEYEEYKKIRE
EKNGNYSIQEYLEDRQRYEEELAEVQAGGDGGIGETEMEIRHRVFYKSKSKKHQQEQRRQLGLVTGSDIRKRKPIDWTPPKNEWADDDREVDYNEKINFE
APPTLWSRVTKFGSGWGFWVSPTVFITTTHVVPTGVKEFFGEPLSSIAIHQAGEFTQFRFSKKMRPDLTGMVLEEGCPEGTVCSVLIKRDSGELLPLAVR
MGAIASMRIQGRLVHGQSGMLLTGANAKGMDLGTIPGDCGAPYVHKRGNDWVVCGVHAAATKSGNTVVCAVQAGEGETALEGGDKGHYAGHEIVRYGSGP
ALSTKTKFWRSSPEPLPPGVYEPAYLGGKDPRVQNGPSLQQVLRDQLKPFADPRGRMPEPGLLEAAVETVTSMLEQTMDTPSPWSYADACQSLDKTTSSG
YPHHKRKNDDWNGTTFVGELGEQAAHANNMYENAKHMKPIYTAALKDELVKPEKIYQKVKKRLLWGADLGTVVRAARAFGPFCDAIKSHVIKLPIKVGMN
TIEDGPLIYAEHAKYKNHFDADYTAWDSTQNRQIMTESFSIMSRLTASPELAEVVAQDLLAPSEMDVGDYVIRVKEGLPSGFPCTSQVNSINHWIITLCA
LSEATGLSPDVVQSMSYFSFYGDDEIVSTDIDFDPARLTQILKEYGLKPTRPDKTEGPIQVRKNVDGLVFLRRTISRDAAGFQGRLDRASIERQIFWTRG
PNHSDPSETLVPHTQRKIQLISLLGEASLHGEKFYRKISSKVIHEIKTGGLEMYVPGWQAMFRWMRFHDLGLWTGDRDLLPEFVNDDGV
1789
Not Available
Not Available
01-11-1996
Evidence at protein level
Amino Acid | Count | % Frequency | Amino Acid | Count | % Frequency |
---|---|---|---|---|---|
Alanine (A) | | | Leucine (L) | | |
Arginine (R) | | | Lysine (K) | | |
Asparagine (N) | | | Methionine (M) | | |
Aspartic Acid (D) | | | Phenylalanine (F) | | |
Cysteine (C) | | | Proline (P) | | |
Glutamine (Q) | | | Serine (S) | | |
Glutamic Acid (E) | | | Threonine (T) | | |
Glycine (G) | | | Tryptophan (W) | | |
Histidine (H) | | | Tyrosine (Y) | | |
Isoleucine (I) | | | Valine (V) | | |
% Number of Residues in Helices | % Number of Residues in Strands | % Number of Residues in Coils |
---|---|---|
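The count and % frequency columns of the amino acid composition table can be derived directly from the sequence. The following is a minimal sketch, not the database's own method: the function name `amino_acid_composition` and the toy input are illustrative assumptions, and the percentages are simply counts divided by total length.

```python
from collections import Counter

# One-letter codes for the 20 standard amino acids, in the table's order
# (Alanine ... Isoleucine, then Leucine ... Valine).
STANDARD_AMINO_ACIDS = "ARNDCQEGHILKMFPSTWYV"


def amino_acid_composition(sequence):
    """Return {residue: (count, percent)} for the 20 standard amino acids.

    Whitespace (for example line breaks in a wrapped sequence block)
    is ignored. This helper is hypothetical, written for illustration.
    """
    seq = "".join(sequence.split()).upper()
    counts = Counter(seq)
    total = len(seq)
    result = {}
    for aa in STANDARD_AMINO_ACIDS:
        percent = round(100.0 * counts[aa] / total, 2) if total else 0.0
        result[aa] = (counts[aa], percent)
    return result


# Toy input only; paste the full 1789-residue sequence to fill the table.
toy = "MKAAL\nLKV"
composition = amino_acid_composition(toy)
```

Applied to the full polyprotein sequence, the counts should sum to the stated length if the sequence contains only standard residues.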
♦ Protein p48 may play a role in viral replication by interacting with host VAPA, a vesicle-associated membrane protein that plays a role in SNARE-mediated vesicle fusion. This interaction may target the replication complex to intracellular membranes.
♦ NTPase presumably plays a role in replication. Despite its similarities to helicases, it does not seem to display any helicase activity.
♦ Protein p22 may play a role in targeting the replication complex to intracellular membranes.
♦ Viral genome-linked protein is covalently linked to the 5'-end of the positive-strand and negative-strand genomic RNAs and of the subgenomic RNA. Acts as a genome-linked replication primer. May recruit ribosomes to viral RNA, thereby promoting translation of viral proteins.
♦ 3C-like protease processes the polyprotein: 3CLpro-RdRp is first released by autocleavage, then all other proteins are cleaved. May cleave host polyadenylate-binding protein, thereby inhibiting cellular translation (By similarity).
♦ RNA-directed RNA polymerase replicates genomic and antigenomic RNA by recognizing replication-specific signals. Also transcribes a subgenomic mRNA by initiating RNA synthesis internally on antigenomic RNA. This sgRNA codes for structural proteins. Catalyzes the covalent attachment of VPg to viral RNAs (By similarity).
3.6.1.15, 3.4.22.66, 2.7.7.48
GO:0003723; GO:0003724; GO:0003968; GO:0004197; GO:0005524;
GO:0006351; GO:0016021; GO:0018144; GO:0033644; GO:0039694
♦ Protein p48: Host membrane; single-pass membrane protein.
♦ NTPase: Host membrane; single-pass membrane protein.
♦ Protein p22: Host membrane; single-pass membrane protein.
♦DOMAIN 532 697 SF3 helicase.
♦ DOMAIN 1101 1281 Peptidase C37.
♦ DOMAIN 1516 1637 RdRp catalytic.
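The DOMAIN coordinates above are 1-based and inclusive, which is easy to get wrong when slicing a 0-based string. A small sketch of the conversion, assuming that convention holds for this record; `feature_length` and `slice_feature` are hypothetical helper names, not part of any database API.

```python
# Domain annotations copied from the record: (name, start, end),
# with 1-based inclusive coordinates on the polyprotein.
DOMAINS = [
    ("SF3 helicase", 532, 697),
    ("Peptidase C37", 1101, 1281),
    ("RdRp catalytic", 1516, 1637),
]


def feature_length(start, end):
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def slice_feature(sequence, start, end):
    """Extract a feature from a sequence string (0-based in Python)."""
    return sequence[start - 1:end]


domain_lengths = {name: feature_length(s, e) for name, s, e in DOMAINS}
# domain_lengths == {"SF3 helicase": 166, "Peptidase C37": 181,
#                    "RdRp catalytic": 122}
```

Passing the full 1789-residue sequence to `slice_feature` with any row of `DOMAINS` would return that domain's residues.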
Not Available
X-ray crystallography (23); NMR spectroscopy (1)
♦ACT_SITE 1130 1130 For 3CLpro activity.
♦ ACT_SITE 1154 1154 For 3CLpro activity.
♦ ACT_SITE 1239 1239 For 3CLpro activity.
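The three ACT_SITE positions can be checked against the sequence itself. The sketch below assumes the sequence block in this record is wrapped at 100 residues per line, so that the two lines copied here cover residues 1101 to 1300; the variable and function names are illustrative.

```python
# Residues 1101-1300 of the polyprotein, copied from the sequence block
# (assumption: the block is wrapped at exactly 100 residues per line).
SEGMENT_START = 1101
SEGMENT = (
    "APPTLWSRVTKFGSGWGFWVSPTVFITTTHVVPTGVKEFFGEPLSSIAIHQAGEFTQFRFSKKMRPDLTGMVLEEGCPEGTVCSVLIKRDSGELLPLAVR"
    "MGAIASMRIQGRLVHGQSGMLLTGANAKGMDLGTIPGDCGAPYVHKRGNDWVVCGVHAAATKSGNTVVCAVQAGEGETALEGGDKGHYAGHEIVRYGSGP"
)

# ACT_SITE positions from the record (1-based, on the full polyprotein).
ACTIVE_SITES = [1130, 1154, 1239]


def residue_at(position):
    """Return the one-letter residue at a 1-based polyprotein position."""
    return SEGMENT[position - SEGMENT_START]


active_site_residues = {pos: residue_at(pos) for pos in ACTIVE_SITES}
# active_site_residues == {1130: "H", 1154: "E", 1239: "C"}
```

Under that line-wrapping assumption, the annotated positions fall on a histidine, a glutamate, and a cysteine, all inside the Peptidase C37 domain (1101 to 1281).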
The protein could not be modeled with I-TASSER or RaptorX because it exceeds the sequence length limits of both tools.
Not Available
- Million Molecules
Best 20 Hit molecules
Not Available