viHumans
Reviewed
Homo Sapiens (Human) [TaxID: 9606]
Rep 1a-1b[Gene ID: 2943501 ]
♦Replicase polyprotein 1ab (pp1ab) (ORF1ab polyprotein) [Cleaved into: Non-structural protein 1 (nsp1) (p9)
♦ Non-structural protein 2 (nsp2) (p87)
♦ Non-structural protein 3 (nsp3) (EC 3.4.19.12) (EC 3.4.22.-) (PL1-PRO/PL2-PRO) (PLP1/PLP2) (Papain-like proteinases 1/2) (p195)
♦ Non-structural protein 4 (nsp4) (Peptide HD2)
♦ 3C-like proteinase (3CL-PRO) (3CLp) (EC 3.4.22.-) (M-PRO) (nsp5) (p34)
♦ Non-structural protein 6 (nsp6)
♦ Non-structural protein 7 (nsp7) (p5)
♦ Non-structural protein 8 (nsp8) (p23)
♦ Non-structural protein 9 (nsp9) (p12)
♦ Non-structural protein 10 (nsp10) (Growth factor-like peptide) (GFL) (p14)
♦ RNA-directed RNA polymerase (Pol) (RdRp) (EC 2.7.7.48) (nsp12) (p100)
♦ Helicase (Hel) (EC 3.6.4.12) (EC 3.6.4.13) (nsp13) (p66) (p66-HEL)
♦ Exoribonuclease (ExoN) (EC 3.1.13.-) (nsp14)
♦ Uridylate-specific endoribonuclease (EC 3.1.-.-) (NendoU) (nsp15) (p41)
♦ Putative 2'-O-methyl transferase (EC 2.1.1.-) (nsp16)]
Human Coronavirus NL63 (HCoV-NL63)
Viruses> SsRNA Viruses> SsRNA Positive-strand Viruses> No DNA Stage> Nidovirales> Coronaviridae> Coronavirinae> Alphacoronavirus> Human Coronavirus NL63 (HCoV-NL63)
Various pathway(s) in which protein is involved
Not Available
MFYNQVTLAVASDSEISGFGFAIPSVAVRTYSEAAAQGFQACRFVAFGLQDCVTGINDDDYVIALTGTNQLCAKILPFSDRPLNLRGWLIFSNSNYVLQD
FDVVFGHGAGSVVFVDKYMCGFDGKPVLPKNMWEFRDYFNNNTDSIVIGGVTYQLAWDVIRKDLSYEQQNVLAIESIHYLGTTGHTLKSGCKLTNAKPPK
YSSKVVLSGEWNAVYRAFGSPFITNGMSLLDIIVKPVFFNAFVKCNCGSESWSVGAWDGYLSSCCGTPAKKLCVVPGNVVPGDVIITSTSAGCGVKYYAG
LVVKHITNITGVSLWRVTAVHSDGMFVASSSYDALLHRNSLDPFCFDVNTLLSNQLRLAFLGASVTEDVKFAASTGVIDISAGMFGLYDDILTNNKPWFV
RKASGLFDAIWDAFVAAIKLVPTTTGVLVRFVKSIASTVLTVSNGVIIMCADVPDAFQSVYRTFTQAICAAFDFSLDVFKIGDVKFKRLGDYVLTENALV
RLTTEVVRGVRDARIKKAMFTKVVVGPTTEVKFSVIELATVNLRLVDCAPVVCPKGKIVVIAGQAFFYSGGFYRFMVDPTTVLNDPVFTGDLFYTIKFSG
FKLDGFNHQFVTASSATDAIIAVELLLLDFKTAVFVYTCVVDGCSVIVRRDATFATHVCFKDCYNVWEQFCIDNCGEPWFLTDYNAILQSNNPQCAIVQA
SESKVLLERFLPKCPEILLSIDDGHLWNLFVEKFNFVTDWLKTLKLTLTSNGLLGNCAKRFRRVLVKLLDVYNGFLETVCSVAYTAGVCIKYYAVNVPYV
VISGFVSRVIRRERCDMTFPCVSCVTFFYEFLDTCFGVSKPNAIDVEHLELKETVFVEPKDGGQFFVSGDYLWYVVDDIYYPASCNGVLPVAFTKLAGGK
ISFSDDVIVHDVEPTHKVKLIFEFEDDVVTSLCKKSFGKSIIYTGDWEGLHEVLTSAMNVIGQHIKLPQFYIYDEEGGYDVSKPVMISQWPISNDSNGCV
VEASTDFHQLECIVDDSVREEVDIIEQPFEEVEHVLSIKQPFSFSFRDELGVRVLDQSDNNCWISTTLVQLQLTKLLDDSIEMQLFKVGKVDSIVQKCYE
LSHLISGSLGDSGKLLSELLKEKYTCSITFEMSCDCGKKFDDQVGCLFWIMPYTKLFQKGECCICHKMQTYKLVSMKGTGVFVQDPAPIDIDAFPVKPIC
SSVYLGVKGSGHYQTNLYSFNKAIDGFGVFDIKNSSVNTVCFVDVDFHSVEIEAGEVKPFAVYKNVKFYLGDISHLVNCVSFDFVVNAANENLLHGGGVA
RAIDILTEGQLQSLSKDYISSNGPLKVGAGVMLECEKFNVFNVVGPRTGKHEHSLLVEAYNSILFENGIPLMPLLSCGIFGVRIENSLKALFSCDINKPL
QVFVYSSNEEQAVLKFLDGLDLTPVIDDVDVVKPFRVEGNFSFFDCGVNALDGDIYLLFTNSILMLDKQGQLLDTKLNGILQQAALDYLATVKTVPAGNL
VKLFVESCTIYMCVVPSINDLSFDKNLGRCVRKLNRLKTCVIANVPAIDVLKKLLSSLTLTVKFVVESNVMDVNDCFKNDNVVLKITEDGINVKDVVVES
SKSLGKQLGVVSDGVDSFEGVLPINTDTVLSVAPEVDWVAFYGFEKAALFASLDVKPYGYPNDFVGGFRVLGTTDNNCWVNATCIILQYLKPTFKSKGLN
VLWNKFVTGDVGPFVSFIYFITMSSKGQKGDAEEALSKLSEYLISDSIVTLEQYSTCDICKSTVVEVKSAIVCASVLKDGCDVGFCPHRHKLRSRVKFVN
GRVVITNVGEPIISQPSKLLNGIAYTTFSGSFDNGHYVVYDAANNAVYDGARLFSSDLSTLAVTAIVVVGGCVTSNVPTIVSEKISVMDKLDTGAQKFFQ
FGDFVMNNIVLFLTWLLSMFSLLRTSIMKHDIKVIAKAPKRTGVILTRSFKYNIRSALFVIKQKWCVIVTLFKFLLLLYAIYALVFMIVQFSPFNSLLCG
DIVSGYEKSTFNKDIYCGNSMVCKMCLFSYQEFNDLDHTSLVWKHIRDPILISLQPFVILVILLIFGNMYLRFGLLYFVAQFISTFGSFLGFHQKQWFLH
FVPFDVLCNEFLATFIVCKIVLFVRHIIVGCNNADCVACSKSARLKRVPLQTIINGMHKSFYVNANGGTCFCNKHNFFCVNCDSFGPGNTFINGDIAREL
GNVVKTAVQPTAPAYVIIDKVDFVNGFYRLYSGDTFWRYDFDITESKYSCKEVLKNCNVLENFIVYNNSGSNITQIKNACVYFSQLLCEPIKLVNSELLS
TLSVDFNGVLHKAYVDVLCNSFFKELTANMSMAECKATLGLTVSDDDFVSAVANAHRYDVLLSDLSFNNFFISYAKPEDKLSVYDIACCMRAGSKVVNHN
VLIKESIPIVWGVKDFNTLSQEGKKYLVKTTKAKGLTFLLTFNDNQAITQVPATSIVAKQGAGFKRTYNFLWYVCLFVVALFIGVSFIDYTTTVTSFHGY
DFKYIENGQLKVFEAPLHCVRNVFDNFNQWHEAKFGVVTTNSDKCPIVVGVSERINVVPGVPTNVYLVGKTLVFTLQAAFGNTGVCYDFDGVTTSDKCIF
NSACTRLEGLGGDNVYCYNTDLIEGSKPYSTLQPNAYYKYDAKNYVRFPEILARGFGLRTIRTLATRYCRVGECRDSHKGVCFGFDKWYVNDGRVDDGYI
CGDGLIDLLVNVLSIFSSSFSVVAMSGHMLFNFLFAAFITFLCFLVTKFKRVFGDLSYGVFTVVCATLINNISYVVTQNLFFMLLYAILYFVFTRTVRYA
WIWHIAYIVAYFLLIPWWLLTWFSFAAFLELLPNVFKLKISTQLFEGDKFIGTFESAAAGTFVLDMRSYERLINTISPEKLKNYAASYNKYKYYSGSASE
ADYRCACYAHLAKAMLDYAKDHNDMLYSPPTISYNSTLQSGLKKMAQPSGCVERCVVRVCYGSTVLNGVWLGDTVTCPRHVIAPSTTVLIDYDHAYSTMR
LHNFSVSHNGVFLGVVGVTMHGSVLRIKVSQSNVHTPKHVFKTLKPGDSFNILACYEGIASGVFGVNLRTNFTIKGSFINGACGSPGYNVRNDGTVEFCY
LHQIELGSGAHVGSDFTGSVYGNFDDQPSLQVESANLMLSDNVVAFLYAALLNGCRWWLCSTRVNVDGFNEWAMANGYTSVSSVECYSILAAKTGVSVEQ
LLASIQHLHEGFGGKNILGYSSLCDEFTLAEVVKQMYGVNLQSGKVIFGLKTMFLFSVFFTMFWAELFIYTNTIWINPVILTPIFCLLLFLSLVLTMFLK
HKFLFLQVFLLPTVIATALYNCVLDYYIVKFLADHFNYNVSVLQMDVQGLVNVLVCLFVVFLHTWRFSKERFTHWFTYVCSLIAVAYTYFYSGDFLSLLV
MFLCAISSDWYIGAIVFRLSRLIVFFSPESVFSVFGDVKLTLVVYLICGYLVCTYWGILYWFNRFFKCTMGVYDFKVSAAEFKYMVANGLHAPHGPFDAL
WLSFKLLGIGGDRCIKISTVQSKLTDLKCTNVVLLGCLSSMNIAANSSEWAYCVDLHNKINLCDDPEKAQSMLLALLAFFLSKHSDFGLDGLIDSYFDNS
STLQSVASSFVSMPSYIAYENARQAYEDAIANGSSSQLIKQLKRAMNIAKSEFDHEISVQKKINRMAEQAATQMYKEARSVNRKSKVISAMHSLLFGMLR
RLDMSSVETVLNLARDGVVPLSVIPATSASKLTIVSPDLESYSKIVCDGSVHYAGVVWTLNDVKDNDGRPVHVKEITKENVETLTWPLILNCERVVKLQN
NEIMPGKLKQKPMKAEGDGGVLGDGNALYNTEGGKTFMYAYISNKADLKFVKWEYEGGCNTIELDSPCRFMVETPNGPQVKYLYFVKNLNTLRRGAVLGF
IGATIRLQAGKQTELAVNSGLLTACAFSVDPATTYLEAVKHGAKPVSNCIKMLSNGAGNGQAITTSVDANTNQDSYGGASICLYCRAHVPHPSMDGYCKF
KGKCVQVPIGCLDPIRFCLENNVCNVCGCWLGHGCACDRTTIQSVDISYLNRARGSSAARLEPCNGTDIDKCVRAFDIYNKNVSFLGKCLKMNCVRFKNA
DLKDGYFVIKRCTKSVMEHEQSMYNLLNFSGALAEHDFFTWKDGRVIYGNVSRHNLTKYTMMDLVYAMRNFDEQNCDVLKEVLVLTGCCDNSYFDSKGWY
DPVENEDIHRVYASLGKIVARAMLKCVALCDAMVAKGVVGVLTLDNQDLNGNFYDFGDFVVSLPNMGVPCCTSYYSYMMPIMGLTNCLASECFVKSDIFG
SDFKTFDLLKYDFTEHKENLFNKYFKHWSFDYHPNCSDCYDDMCVIHCANFNTLFATTIPGTAFGPLCRKVFIDGVPLVTTAGYHFKQLGLVWNKDVNTH
SVRLTITELLQFVTDPSLIIASSPALVDQRTICFSVAALSTGLTNQVVKPGHFNEEFYNFLRLRGFFDEGSELTLKHFFFAQNGDAAVKDFDFYRYNKPT
ILDICQARVTYKIVSRYFDIYEGGCIKACEVVVTNLNKSAGWPLNKFGKASLYYESISYEEQDALFALTKRNVLPTMTQLNLKYAISGKERARTVGGVSL
LSTMTTRQYHQKHLKSIVNTRNATVVIGTTKFYGGWNNMLRTLIDGVENPMLMGWDYPKCDRALPNMIRMISAMVLGSKHVNCCTATDRFYRLGNELAQV
LTEVVYSNGGFYFKPGGTTSGDASTAYANSIFNIFQAVSSNINRLLSVPSDSCNNVNVRDLQRRLYDNCYRLTSVEESFIDDYYGYLRKHFSMMILSDDG
VVCYNKDYAELGYIADISAFKATLYYQNNVFMSTSKCWVEEDLTKGPHEFCSQHTMQIVDKDGTYYLPYPDPSRILSAGVFVDDVVKTDAVVLLERYVSL
AIDAYPLSKHPNSEYRKVFYVLLDWVKHLNKNLNEGVLESFSVTLLDNQEDKFWCEDFYASMYENSTILQAAGLCVVCGSQTVLRCGDCLRKPMLCTKCA
YDHVFGTDHKFILAITPYVCNASGCGVSDVKKLYLGGLNYYCTNHKPQLSFPLCSAGNIFGLYKNSATGSLDVEVFNRLATSDWTDVRDYKLANDVKDTL
RLFAAETIKAKEESVKSSYAFATLKEVVGPKELLLSWESGKVKPPLNRNSVFTCFQISKDSKFQIGEFIFEKVEYGSDTVTYKSTVTTKLVPGMIFVLTS
HNVQPLRAPTIANQEKYSSIYKLHPAFNVSDAYANLVPYYQLIGKQKITTIQGPPGSGKSHCSIGLGLYYPGARIVFVACAHAAVDSLCAKAMTVYSIDK
CTRIIPARARVECYSGFKPNNTSAQYIFSTVNALPECNADIVVVDEVSMCTNYDLSVINQRLSYKHIVYVGDPQQLPAPRVMITKGVMEPVDYNVVTQRM
CAIGPDVFLHKCYRCPAEIVNTVSELVYENKFVPVKPASKQCFKVFFKGNVQVDNGSSINRKQLEIVKLFLVKNPSWSKAVFISPYNSQNYVASRFLGLQ
IQTVDSSQGSEYDYVIYAQTSDTAHACNVNRFNVAITRAKKGIFCVMCDKTLFDSLKFFEIKHADLHSSQVCGLFKNCTRTPLNLPPTHAHTFLSLSDQF
KTTGDLAVQIGSNNVCTYEHVISFMGFRFDISIPGSHSLFCTRDFAIRNVRGWLGMDVESAHVCGDNIGTNVPLQVGFSNGVNFVVQTEGCVSTNFGDVI
KPVCAKSPPGEQFRHLIPLLRKGQPWLIVRRRIVQMISDYLSNLSDILVFVLWAGSLELTTMRYFVKIGPIKYCYCGNSATCYNSVSNEYCCFKHALGCD
YVYNPYAFDIQQWGYVGSLSQNHHTFCNIHRNEHDASGDAVMTRCLAVHDCFVKNVDWTVTYPFIANEKFINGCGRNVQGHVVRAALKLYKPSVIHDIGN
PKGVRCAVTDAKWYCYDKQPVNSNVKLLDYDYATHGQLDGLCLFWNCNVDMYPEFSIVCRFDTRTRSVFNLEGVNGGSLYVNKHAFHTPAYDKRAFVKLK
PMPFFYFDDSDCDVVQEQVNYVPLRASSCVTRCNIGGAVCSKHANLYQKYVEAYNTFTQAGFNIWVPHSFDVYNLWQIFIETNLQSLENIAFNVVKKGCF
TGVDGELPVAVVNDKVFVRYGDVDNLVFTNKTTLPTNVAFELFAKRKMGLTPPLSILKNLGVVATYKFVLWDYEAERPFTSYTKSVCKYTDFNEDVCVCF
DNSIQGSYERFTLTTNAVLFSTVVIKNLTPIKLNFGMLNGMPVSSIKGDKGVEKLVNWYIYVRKNGQFQDHYDGFYTQGRNLSDFTPRSDMEYDFLNMDM
GVFINKYGLEDFNFEHVVYGDVSKTTLGGLHLLISQFRLSKMGVLKADDFVTASDTTLRCCTVTYLNELSSKVVCTYMDLLLDDFVTILKSLDLGVISKV
HEVIIDNKPYRWMLWCKDNHLSTFYPQLQSAEWKCGYAMPQIYKLQRMCLEPCNLYNYGAGIKLPSGIMLNVVKYTQLCQYLNSTTMCVPHNMRVLHYGA
GSDKGVAPGTTVLKRWLPPDAIIIDNDINDYVSDADFSITGDCATVYLEDKFDLLISDMYDGRIKFCDGENVSKDGFFTYLNGVIREKLAIGGSVAIKIT
EYSWNKYLYELIQRFAFWTLFCTSVNTSSSEAFLIGINYLGDFIQGPFIAGNTVHANYIFWRNSTIMSLSYNSVLDLSKFECKHKATVVVTLKDSDVNDM
VLSLIKSGRLLLRNNGRFGGFSNHLVSTK
6729
Not Available
Not Available
10-06-2008
Inferred from homology
Amino Acid Count % Frequency Amino Acid Count % Frequency
Alanine (A) Leucine (L)
Arginine (R) Lysine (K)
Asparagine (N) Methionine (M)
Aspartic Acid (D) Phenylalanine (F)
Cysteine (C) Proline (P)
Glutamine (Q) Serine (S)
Glutamic Acid (E) Threonine (T)
Glycine (G) Tryptophan (W)
Histidine (H) Tyrosine (Y)
Isoleucine (I) Valine (V)
% Number of Residues in Helices % Number of Residues in Strands % Number of Residues in Coils
♦The replicase polyprotein of coronaviruses is a multifunctional protein: it contains the activities necessary for the transcription of negative stranded RNA, leader RNA, subgenomic mRNAs and progeny virion RNA as well as proteinases responsible for the cleavage of the polyprotein into functional products.
♦ The papain-like proteinase 1 (PLP1) and papain-like proteinase 2 (PLP2) are responsible for the cleavages located at the N-terminus of the replicase polyprotein. In addition, PLP2 possesses a deubiquitinating/deISGylating activity and processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. PLP2 also antagonizes innate immune induction of type I interferon by blocking the nuclear translocation of host IRF-3 (By similarity).
♦ The main proteinase 3CL-PRO is responsible for the majority of cleavages as it cleaves the C-terminus of replicase polyprotein at 11 sites. Recognizes substrates containing the core sequence [ILMVF]-Q-|-[SGACN]. Inhibited by the substrate-analog Cbz-Val-Asn-Ser-Thr-Leu-Gln-CMK. Also contains an ADP-ribose-1''-phosphate (ADRP)-binding function (By similarity).
♦ The helicase which contains a zinc finger structure displays RNA and DNA duplex-unwinding activities with 5' to 3' polarity. Its ATPase activity is strongly stimulated by poly(U), poly(dT), poly(C), poly(dA), but not by poly(G).
♦ The exoribonuclease acts on both ssRNA and dsRNA in a 3' to 5' direction.
♦ Nsp7-nsp8 hexadecamer may possibly confer processivity to the polymerase, maybe by binding to dsRNA or by producing primers utilized by the latter.
♦ Nsp9 is a ssRNA-binding protein.
♦ NendoU is a Mn(2+)-dependent, uridylate-specific enzyme, which leaves 2'-3'-cyclic phosphates 5' to the cleaved bond.
3.4.19.12  ,   3.4.22.-  ,   3.4.22.-  ,   2.7.7.48  ,   3.6.4.12  ,   3.6.4.13  ,   3.1.13.-  ,   3.1.-.-  ,   2.1.1.-  
GO:0003723  ;   GO:0003968  ;   GO:0004197  ;   GO:0004386  ;   GO:0004519  ;  
GO:0005524  ;   GO:0006351  ;   GO:0008168  ;   GO:0008242  ;   GO:0008270  ;  
GO:0016021  ;   GO:0016896  ;   GO:0019082  ;   GO:0033644  ;   GO:0036459  ;  
GO:0039520  ;   GO:0039548  ;   GO:0039648  ;   GO:0039694  ;   GO:0044172  ;  
GO:0044220  
♦ Non-structural protein 3: Host membrane
♦ Multi-pass membrane protein .
♦ Non-structural protein 4: Host membrane
♦ Multi-pass membrane protein .
♦ Non-structural protein 6: Host membrane
♦ Multi-pass membrane protein .
♦ Non-structural protein 7: Host cytoplasm, host perinuclear region . Note=nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes (By similarity). .
♦ Non-structural protein 8: Host cytoplasm, host perinuclear region . Note=nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes (By similarity). .
♦ Non-structural protein 9: Host cytoplasm, host perinuclear region . Note=nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes (By similarity). .
♦ Non-structural protein 10: Host cytoplasm, host perinuclear region . Note=nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes (By similarity). .
♦ Helicase: Host endoplasmic reticulum-Golgi intermediate compartment . Note=The helicase interacts with the N protein in membranous complexes and colocalizes with sites of synthesis of new viral RNA. .
♦ Uridylate-specific endoribonuclease: Host cytoplasm, host perinuclear region .
♦DOMAIN 1021 1262 Peptidase C16 1.
♦ DOMAIN 1263 1421 Macro.
♦ DOMAIN 1640 1886 Peptidase C16 2.
♦ DOMAIN 2940 3242 Peptidase C30.
♦ DOMAIN 4650 4812 RdRp catalytic.
♦ DOMAIN 4971 5054 CV ZBD.
♦ DOMAIN 5228 5409 (+)RNA virus helicase ATP-binding.
♦ DOMAIN 5410 5579 (+)RNA virus helicase C-terminal.
Not Available
Predicted/Modelled
Not Available
♦ACT_SITE 1062 1062 For PL1-PRO activity.
♦ ACT_SITE 1212 1212 For PL1-PRO activity.
♦ ACT_SITE 1678 1678 For PL2-PRO activity.
♦ ACT_SITE 1836 1836 For PL2-PRO activity.
♦ ACT_SITE 2980 2980 For 3CL-PRO activity.
♦ ACT_SITE 3083 3083 For 3CL-PRO activity.
Protein couldn't be modeled using I-Tasser and Raptor X because of length constraints of the software.
Not Available
Virtual screening has been performed using RASPD
  • Million Molecules

Best 20 Hit molecules

    Not Available