1. Background
2. Objective
3. Methods
3.1. Preparation of Anti-SARS-CoV-2 Peptide Dataset and Selection of Lead Peptides
3.2. Determination of Binding Hotspots
3.3. Analogue Design and Structural Modeling
3.4. Assessment of Physicochemical Properties and Toxicity of Designed Analogues
3.5. Energy Minimization and Model Validation
3.6. Molecular Docking
3.7. Molecular Dynamics Simulations
4. Results
4.1. Dataset Preparation and Peptide Selection
| Peptide | Target Protein | Sequence | Number of Amino Acids | GRAVY | Theoretical pI (PH) | Charge at PH7 | Hydrophobic (%) | Neutral (%) | Basic (%) | Acidic (%) |
|---|---|---|---|---|---|---|---|---|---|---|
| EK1 | S | SLDQINVTFLDLEYEMKKLEEAIKKLEESYIDLKEL | 36 | -0.43 | 4.14 | -5 | 44.44 | 13.89 | 13.89 | 27.78 |
| Peptide 5 | RdRp | ARKD | 4 | -2.53 | 10.09 | 1 | 25 | 50 | 0 | 25 |
| Plectasin | N | GFGCNGPWDEDDMQCHNHCKSIKGYKGGYCAKGGFVCKCY | 40 | -0.7 | 7.64 | 0.9 | 10 | 17.5 | 32.5 | 40 |
4.2. Analysis of Key Amino Acid Interactions Between Lead Peptides and Target Proteins
The crystallographic structure of receptors. A, the crystallographic structure of the EK1 peptide and the S protein receptor; the sequence of the EK1 peptide is marked with a red box; B, the RdRp receptor's crystallographic structure and the binding region of the lead peptide to the amino acid Asp623 of the receptor are marked with a red line; C, the crystallographic structure of the N protein.
| Type of interaction | Amino Acids | Refrences | |
|---|---|---|---|
| Values | In EK1 | In S receptor | |
| Hydrogen bond | Y30 | G932 | Structural and functional basis for pan-CoV fusion inhibitors against SARS-CoV-2 and its variants with preclinical evaluation (25) |
| Hydrogen bond | E27 | Q935 | |
| Hydrogen bond | S29 | K933 | |
| Hydrogen bond | E28 | Q926 | |
| Hydrogen bond | K18 | S943 | |
| Hydrogen bond | E15 | S943 | |
| Ionic interaction | K947 | ||
| In peptide 5 | In RdRp | ||
| Ionic interaction/ water-mediated hydrogen bond | Arg2 | Asp623 | Inhibition of the RNA-dependent RNA polymerase of the SARS-CoV-2 by short peptide inhibitors (15) |
| Hydrogen bond | Thr687 | ||
| Ionic interaction | Asp4 | Mg+2 | |
| Water-mediated hydrogen bonds | Glu811 | ||
| Asp618 | |||
| In plectacin | In N | ||
| Conventional Hydrogen Bond | LYS30 | ASN322 | In-silico study of peptide-protein interaction of antimicrobial peptides potentially targeting SARS and SARS-CoV-2 nucleocapsid protein (18) |
| Hydrophobic | ALA324 | ||
| PRO323 | |||
| Hydrophobic | PHE32 | PRO323 | |
| Hydrogen bond; Electrostatic | ARG321 | ||
| Hydrophobic | VAL36 | PRO323 | |
| Pi-Donor Hydrogen Bond, Hydrophobic | TRP8 | ALA235 | |
| Conventional Hydrogen Bond | TYR283 | ||
| ARG50 | |||
| Conventional Hydrogen Bond | ASN5 | ARG281 | |
| Pi-Donor Hydrogen Bond | CYS28 | TYR110 | |
| Hydrophobic | CYS4 | TYR110 | |
| Hydrogen bond; Electrostatic | ASP9 | ARG41 | |
Abbreviation: SARS-CoV-2, syndrome coronavirus 2.
| Amino Acids in EK1/in S Receptor | Type of Interaction |
|---|---|
| Leu1010 | |
| Asn953 | Strong hydrogen bonds |
| Gln 949 | Non-bonded contacts |
| Phe1009 | |
| Gln957 | Non-bonded contacts |
| Asn 953 | |
| Ala 956 | |
| Val1007 | |
| Ala 956 | Non-bonded contacts |
| Thr1008 | |
| Ala 956 | Non-bonded contacts |
| Leu1012 | |
| Gln949 | Hydrogen bond |
| Asp1011 | |
| Gln949 | Non-bonded contacts |
| Leu1036 | |
| Tyr917 | Hydrogen bond |
| Leu1019 | |
| Tyr941 | Non-bonded contacts |
| Ala 942 | |
| Leu945 | |
| Leu1033 | |
| Asn925 | Non-bonded contacts |
| Ser1029 | |
| Phe927 | Non-bonded contacts |
| Ile1031 | |
| Asn928 | Strong hydrogen bonds |
| Phe927 | Non-bonded contacts |
| Tyr1030 | |
| Asn928 | Non-bonded contacts |
| Gly932 | |
| Ile931 | |
| Gln935 | Hydrogen bond |
| Glu1027 | |
| Gln935 | Hydrogen bond |
| Leu1026 | |
| Ile931 | Non-bonded contacts |
| Leu 938 | |
| Ile1023 | |
| Leu938 | Non-bonded contacts |
| Ser 939 | |
| Gln935 | Hydrogen bond |
| Met1016 | |
| Gly946 | Non-bonded contacts |
Interactions between amino acids of receptors and lead peptides. A, interactions between the lead peptide (EK1) and the S protein receptor by the PDBsum tool, Chain A represents the receptor, and Chain B represents the lead peptide; B, type and number of bonds between the lead peptide EK1 and the S protein; C, diagrammatic sketch illustrating the interactions between the S receptor (S2 subunit) and EK1; D, interactions between the RdRp receptor and peptide 5; E, interactions between the N receptor and Plectasin. Ligand is shown in purple, and green dashed lines indicate hydrogen bonds with distance in angstroms (Å), spoked red arcs indicate hydrophobic contacts, atoms are shown in black for carbon, blue for nitrogen, red for oxygen, green for fluorine, and yellow for sulfur.
4.3. Analogue Design and Energy Minimization
| No. | Name | Result of Secondary Structure Prediction by Chou-Fasman, GOR and Neural Network | Composition Model% | Physicochemical Model% | Relative Docking Score of ncovDock2 a.u. |
|---|---|---|---|---|---|
| S protein | |||||
| 1 | EK1 | CF: CCCEEEEEHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: CETTCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH; Joint:CCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH | 52.22 | 63.86 | -284.12 |
| 2 | E | CF: CCCEEEEEHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: THHHHEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: CCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHH; Joint:CCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH | 52.47 | 64.04 | -288.63 |
| 3 | F | CF: CCEEEEEEHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: CCCHHEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: CCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH; Joint: CCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH | 53.01 | 63.98 | -289.75 |
| 4 | G | CF: CCCEEEEEHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: THHHHEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: CCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHH; Joint :CCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH | 50.15 | 63.91 | -288.4 |
| 5 | H | CF: CEEEEEEEHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: CTCHEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: CCCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHH; Joint: CCCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH | 56.94 | 63.8 | -268.3 |
| 6 | I | CF: CCCEEEEEHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: THHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: CCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHH; Joint: CCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH | 54.01 | 64.03 | -277.95 |
| 7 | L | CF: CCCEEEEEHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH; Joint: CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH | 51.98 | 64.1 | -275.15 |
| 8 | CF: CCCEEEEEHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: HHHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH; Joint: CCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH | 54.01 | 64.66 | -268.6 | |
| 9 | P | CF: CCCEEEEEHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: HHHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: CCCHHHHHHHHHHHHHHHHHHHHHHHCHCHHHHCCC; Joint: CCCCCCEEHHHHHHHHHHHHHHHHHHHHCHHHHCCC | 54.4 | 64.78 | -261.99 |
| 10 | S | CF: CCCEEEEEHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: THHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: CCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH; Joint: CCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH | 54.37 | 64.53 | -261.54 |
| 11 | T | CF: CCCEEEEEHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: THHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: CCCCEEHHHHHHHHHHHHHHHHHHHHCHCHHHHCCC; Joint: CCCCEEEEHHHHHHHHHHHHHHHHHHHHCHHHHCCC | 55.76 | 64.76 | -277.41 |
| 12 | U | CF: CCCEEEEEHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: THHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: CCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC; Joint: CCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHCCC | 50.31 | 64.17 | -245.55 |
| 13 | V | CF: CCCEEEEEHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: THHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: CCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC; Joint: CCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHCCC | 50.31 | 64.17 | -245.55 |
| 14 | W | CF: CCCEEEEEHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: HHHHHCECHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH; Joint: HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH | 54.06 | 63.92 | -303.41 |
| 15 | Y | CF: CCCEEEEEHHHHHHHHHHHHHHHHHHHEEECCCCCC GOR: HHHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH NN: HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Joint: HHHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH | 52.33 | 64.06 | -290.33 |
| 16 | Z | CF: CCCEEEEEHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: HHHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Joint: HHHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH | 57.15 | 64.29 | -275.39 |
| 17 | 1 | CF: CCCEEEEEHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: HHHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: HHHHHHHHHHHHHHHHHHHHHHHHHHCHCHHHHCCC; Joint: HHHHHHEEHHHHHHHHHHHHHHHHHHHHCHHHHCCC | 59.26 | 64.41 | -300.87 |
| 18 | 2 | CF: CCCEEEEEHHHHHHHHHHHHHHHHHHHHCHHHHCCC; GOR: HHHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC; Joint: HHHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHHHHC | 55.24 | 64.7 | -286.91 |
| 19 | 3 | CF: CCCEEEEHHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH; Joint: HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH | 52 | 64.13 | -275.17 |
| 20 | 4 | CF: CCCEEEEHHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH; Joint: HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH | 53.6 | 64.08 | -283.11 |
| 21 | 5 | CF: CCCEEEEHHHHHHHHHHHHHHHHHHHHEEECCCCCC; GOR: HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH; NN: HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH; Joint: HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH | 60.44 | 64.06 | -292.36 |
| RdRp receptor | |||||
| 1 | Peptide 5 | ARKD | 40.74 | 19.15 | -121.3 |
| 2 | A1 | ARKE | 42.01 | 29.36 | -156.64 |
| 3 | A2 | ARHE | 41.67 | 21.44 | -117.56 |
| A3 | 4 | HRKE | 41.68 | 40.96 | -161.78 |
| A4 | 5 | DRKE | 42.15 | 39.34 | -134.93 |
| 6 | A5 | DRRE | 42.07 | 24.44 | -187.36 |
| 7 | A6 | ARRE | 42.06 | 18.03 | -181.11 |
| 8 | A7 | HRDE | 41.87 | 25.86 | -154.6 |
| N receptor | |||||
| 1 | Plectasin | CF: CCCCCCHHHHHHHCCCCCCCEECCCCEECCCCEEEEECCC; GOR: EECCCCCCCHHHHHHHTTTTTCTTCTTCCCTTTCCEEETT; NN: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCEEECC; Joint: CCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCEEECC | 48.45 | 64.09 | -255.48 |
| 2 | A1 | CF: CCCCCCHHHHHHHCCCCCCCEECCCCEECCCCEEEEECCC; GOR: EECCTCCCCHHHHHHHTTTTTCTTCTTCCCTTTCCEEETT; NN: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCEEECC; Joint: CCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCEEECC | 50.28 | 64.13 | -250.67 |
| 3 | A2 | CF:CCCCCCHHHHHHHCCCCCCCEECCCCEECCCCEEEEECCC; GOR:ETTCTCCCCHHHHHHHTTTTTCTTCTTCCCTTTCCEEETT; NN:CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCEEECC; Joint:CCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCEEECC | 50.36 | 64.11 | -284.75 |
| 4 | A3 | CF: CCCCCCHHHHHHHCCCCCCCEECCCCEECCCCCCCCCCCC; GOR: EECCCCCCCHHHHHHHTTTTTCTTCTTCCCTTTCCHHTTT; NN: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCC; Joint: CCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCC | 46.49 | 64.07 | -275.13 |
| 5 | A4 | CF: CCCCCCCHHHHHHCCCCCCCEECCCCEECCCCEEEEECCC; GOR: ETTCCCCCCCTHHHHHTTTTTCTTCTTCCCTTTCCEEETT; NN: CCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCEEECC; Joint: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCEEECC | 48.53 | 64.09 | -300.87 |
| 6 | A5 | CF: CCCCCCHHHHHHHCCCCCCCEECCCCEECCCCCCCCCCCC; GOR: ETTCTCCCCHHHHHHHTTTTTCTTCTTCCCTTTCCHHTTT; NN: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCC; Joint: CCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCC | 48.43 | 64.09 | -307.87 |
| 7 | A6 | CF: CCCCCCHHHHHHHCCCCCCCEECCCCEECCCCCCCCCCCC; GOR: ETTCTCCCCHHHHHHHTTTTTCTTCTTCEEETTCCHHTTT; NN: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCC; Joint: CCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCC | 47.75 | 64.10 | -276.29 |
| 8 | A7 | CF: CCCCCCHHHHHHHCCCCCCCEECCCCEEECCCCCCCCCCC; GOR: ETTCTCCCCHHHHHHHTTTTTCTTCTTTCEETTTCHHTTT; NN: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCECCCCCCCCCC; Joint: CCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCC | 47.51 | 64.10 | -317.69 |
| 9 | A8 | CF: CCCCCCHHHHHHHCCCCCCCEECCCCEECCCCEECEECCC; GOR: ETTCTCCCCHHHHHHHTTTTTCTTCTTCEEETTCCEEEET; NN: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCEEEEC; Joint: CCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEC | 50.25 | 64.12 | -280.84 |
| 10 | A9 | CF: CCCCCCHHHHHHHCCCCCCCEECCCCEEECCCEECEECCC; GOR: ETTCTCCCCHHHHHHHTTTTTCTTCTTCEEETTTCEEEET; NN: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCECCCCCEEECC; Joint: CCCCCCCCCHHHHHHHCCCCCCCCCCCCEECCCCCEEECC | 49.94 | 64.14 | -280.24 |
| 11 | A10 | CF: CCCCCCHHHHHHHCCCCCCCEECCCCEEECCCCCCCCCCC; GOR: ETTCTCCCCHHHHHHHTTTTTCTTCTTCEEETTTCHTTTT; NN: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCECCCCCCCCCC; Joint: CCCCCCCCCHHHHHHHCCCCCCCCCCCCEECCCCCCCCCC | 47.01 | 64.11 | -281.52 |
| 12 | A11 | CF: EECCCCHHHHHHHCCCCCCCEECCCCEEECCCCCCCCCCC; GOR: TTTCTCCCCHHHHHHHTTTTTCTTCTTTCEETTTCHHTTT; NN: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCECCCCCCCCCC; Joint: CCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCC | 48.04 | 64.12 | -256.88 |
| 13 | A12 | CF: CCCCCCHHHHHHHHCCCCCCEECCCCEEECCCCCCCCCCC; GOR: ETTCTCCCCTTHHHHHHTTTTCTTCTTTCEETTTCHHTTT; NN: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCECCCCCCCCCC; Joint: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCC | 47.18 | 64.10 | -312.53 |
| 14 | A13 | CF: CCCCCCCHHHHHHCCCCCCCEECCCCEEECCCCCCCCCCC; GOR: ETTCCCCCCCTHHHHHTTTTTCTTCTTTCEETTTCHHTTT; NN: CCCCCCCCCCCCHHHHCCCCCCCCCCCCCECCCCCCCCCC; Joint: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCC | 46.12 | 64.08 | -305.09 |
| 15 | A14 | CF: CCCCCCCHHHHHHCCCCCCCEECCCCEECCCCCCCCCCCC; GOR: ETTCCCCCCCTHHHHHTTTTTCTTCTTCCCTTTCCHHTTT; NN: CCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCC; Joint: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCC | 46.74 | 64.08 | -260.52 |
| 16 | A15 | CF: CCCCCCCHHHHHHCCCCCCCEECCCCEEECCCEEEEECCC; GOR: ETTCCCCCCCTHHHHHTTTTTCTTCTTTCEETTTCEEETT; NN: CCCCCCCCCCCCHHHHCCCCCCCCCCCCCECCCCCEEECC; Joint: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCEEECC | 48.00 | 64.10 | -260.6 |
| 17 | A16 | CF: CCCCCCHHHHHHHCCCCCCCEECCCCEECCCCEEEEECCC; GOR: EECCCCCCCHTHHHHHTTTTTCTTCTTCCCTTTCCEEETT; NN: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCEEECC; Joint: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCEEECC | 48.72 | 64.11 | -293.04 |
| 18 | A17 | CF:CCCCCCHHHHHHHCCCCCCCEECCCCEECCCCEEEEECCC; GOR:EECCCCCCCHHHHHHHTTTTTCTTCTTCCCTTTCCEEETT; NN:CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCEEECC; Joint:CCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCEEECC | 47.67 | 64.11 | -253.5 |
| 19 | A18 | CF:CCCCCCHHHHHHHCCCCCCCEECCCCEECCCCEEEEECCC; GOR:TTCCCCCCCHHHHHHHTTTTTCTTCTTCCCTTTCCEEETT; NN:CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCEEECC; Joint:CCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCEEECC | 48.49 | 64.09 | -306.04 |
| 20 | A19 | CF: CCCCCCCHHHHHHCCCCCCCEECCCCEECCCCEEEEECCC; GOR: EECCCCCCCCTHHHHHTTTTTCTTCTTCCCTTTCCEEETT; NN: CCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCEEECC; Joint: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCEEECC | 48.40 | 64.10 | -267.38 |
| 21 | A20 | CF: CCCCCCHHHHHHHCCCCCCCEECCCCEECCCCEEEEECCC; GOR: EECCCCCCCHHHHHHHTTTTTCTTCTTCCCTTTCCEEETT; NN: CCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCEEECC; Joint: CCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCEEECC | 48.43 | 64.10 | -307.01 |
Predicted local distance difference test (pLDDT) plots for lead peptides and designed analogues. These plots illustrate the confidence level of the predicted 3D structure (the peptide backbone) at each amino acid position. Higher lDDT scores (closer to 100) indicate higher quality and reliability of the modeling. Each colored line (rank) represents a different predicted model. A, plot for the EK1 lead peptide; B, plot for the EK1 analogue W; C, plot for the peptide 5 lead peptide; D, plot for the peptide 5 analogue A7; E, plot for the Plectasin lead peptide; F, plot for the Plectasin analogue A5.
| No | Name | Toxicity Status | SVM Score | Sequence |
|---|---|---|---|---|
| Inhibitory peptides of S protein | ||||
| 1 | EK1 | Non-Toxic | -1.10 | SLDQINVTFLDLEYEMKKLEEAIKKLEESYIDLKEL |
| 2 | E | Non-Toxic | -1.00 | SLDEIIVTFLDLEYEMKKLEEAIKKLEESYIDLKEL |
| 3 | F | Non-Toxic | -1.05 | SLDDIIVTFLDLEYEMKKLEEAIKKLEESYIDLKEL |
| 4 | G | Non-Toxic | -0.86 | SADEIIVTFLDLEYEMKKLEEAIKKLEESYIDLKEL |
| 5 | H | Non-Toxic | -0.91 | SADYIIVTFLDLEAEMKKLEEAIKKLEESYIDLKEL |
| 6 | I | Non-Toxic | -0.99 | SADEIIVTFLDLEAEMKKLEEAIKKLEESYIDLKEL |
| 7 | L | Non-Toxic | -1.01 | EADEIIVTFLDLELEMKKLEEAIKKLEESYIDLKEL |
| 8 | O | Non-Toxic | -0.83 | QADEIIVTFLDLELEMKKLEEAIKKLEESYIDLREL |
| 9 | P | Non-Toxic | -0.96 | QADEIIVTFLDLELEMKKLEEAIKKLEESYIDLRDL |
| 10 | S | Non-Toxic | -0.86 | SADEIIVTFLDLELEMKKLEEAIKKLEESYIDLREL |
| 11 | T | Non-Toxic | -0.98 | SADEIIVTFLDLELEMKKLEEAIKKLEESYIDLRDL |
| 12 | U | Non-Toxic | -1.05 | SADEIIVTFLDLELEMKKLEEAIKKLEESYIDLKDL |
| 13 | V | Non-Toxic | -1.05 | SADEIIVTFLDLELEMKKLEEAIKKLEESYIDLKDL |
| 14 | W | Non-Toxic | -1.01 | ALAQINVTFLDLEYEMKKLEEAIKKLEESYIDLKEL |
| 15 | Y | Non-Toxic | -0.93 | ALAEIIVTFLDLEYEMKKLEEAIKKLEESYIDLKEL |
| 16 | Z | Non-Toxic | -0.73 | ALAEIIVTFLDLEYEMKKLEEAIKKLEESYIDLREL |
| 17 | 1 | Non-Toxic | -0.88 | ALAEIIVTFLDLEYEMKKLEEAIKKLEESYIDLRDL |
| 18 | 2 | Non-Toxic | -1.10 | ALAEIIVTFLDLEYEMKKLEEAIKKLEESYLDLRDL |
| 19 | 3 | Non-Toxic | -0.89 | ALAEINVQFLDLEYEMKKLEEAIKKLEESYIDLKEL |
| 20 | 4 | Non-Toxic | -0.78 | ALAEIIVQFLDLEYEMKKLEEAIKKLEESYIDLKEL |
| 21 | 5 | Non-Toxic | -0.75 | ALAEIIVQFLDLEYEMKKLEEAIKKLEESYIDLKAL |
| 22 | A | Non-Toxic | -1.11 | SLDEINVTFLDLEYEMKKLEEAIKKLEESYIDLKEL |
| 23 | B | Non-Toxic | -1.15 | SLDDINVTFLDLEYEMKKLEEAIKKLEESYIDLKEL |
| 24 | C | Non-Toxic | -1.02 | SLDYINVTFLDLEYEMKKLEEAIKKLEESYIDLKEL |
| 25 | D | Non-Toxic | -1.00 | SLDQIIVTFLDLEYEMKKLEEAIKKLEESYIDLKEL |
| 26 | J | Non-Toxic | -1.03 | SADEIIVTFLDLELEMKKLEEAIKKLEESYIDLKEL |
| 27 | K | Non-Toxic | -1.03 | SADEIIVTFLDLELEMKKLEEAIKKLEESYIDLKEL |
| 28 | M | Non-Toxic | -1.00 | QADEIIVTFLDLELEMKKLEEAIKKLEESYIDLKEL |
| 29 | N | Non-Toxic | -1.00 | QADEIIVTFLDLELEMKKLEEAIKKLEESYIDLKEL |
| 30 | Q | Non-Toxic | -1.02 | QADEIIVTFLDLELEMKKLEEAIKKLEESYIDLKDL |
| 31 | R | Non-Toxic | -1.02 | QADEIIVTFLDLELEMKKLEEAIKKLEESYIDLKDL |
| 32 | X | Non-Toxic | -1.04 | ALAEINVTFLDLEYEMKKLEEAIKKLEESYIDLKEL |
| 33 | 6 | Non-Toxic | -0.93 | SLDYIIVTFLDLEYEMKKLEEAIKKLEESYIDLKEL |
| Inhibitory peptides of RdRp protein | ||||
| 1 | peptide 5 | Non-Toxic | -0.89 | ARKD |
| 2 | A1 | Non-Toxic | -0.83 | ARKE |
| 3 | A2 | Non-Toxic | -0.83 | ARHE |
| 4 | A3 | Non-Toxic | -0.70 | HRKE |
| 5 | A4 | Non-Toxic | -0.82 | DRKE |
| 6 | A5 | Non-Toxic | -0.85 | DRRE |
| 7 | A6 | Non-Toxic | -0.87 | ARRE |
| 8 | A7 | Non-Toxic | -0.71 | HRDE |
| Inhibitory peptides of N protein | ||||
| 1 | Plectasin | Toxic | 0.85 | GFGCNGPWDEDDMQCHNHCKSIKGYKGGYCAKGGFVCKCY |
| 2 | A1 | Toxic | 0.81 | GFGCNGPWEEDDMQCHNHCKSIKGYKGGYCAKGGFVCKCY |
| 3 | A2 | Toxic | 0.68 | GFGCNGPYEEDDMQCHNHCKSIKGYKGGYCAKGGFVCKCY |
| 4 | A3 | Toxic | 0.89 | GFGCNGPWDEDDMQCHNHCKSIKGYKGGYCAKGGFACKCY |
| 5 | A4 | Toxic | 0.74 | GFGCNGPYDEDDMQCHNHCKSIKGYKGGYCAKGGFVCKCY |
| 6 | A5 | Toxic | 0.72 | GFGCNGPYEEDDMQCHNHCKSIKGYKGGYCAKGGFACKCY |
| 7 | A6 | Toxic | 0.76 | GFGCNGPYEEDDMQCHNHCKSIKGYKGGYCAKGGWACKCY |
| 8 | A7 | Toxic | 0.56 | GFGCNGPYEEDDMQCHNHCKSIKGYKGGYCARGGFACKCY |
| 9 | A8 | Toxic | 0.66 | GFGCNGPYEEDDMQCHNHCKSIKGYKGGYCAKGGAVCKCY |
| 10 | A9 | Toxic | 0.49 | GFGCNGPYEEDDMQCHNHCKSIKGYKGGYCARGGAVCKCY |
| 11 | A10 | Toxic | 0.60 | GFGCNGPYEEDDMQCHNHCKSIKGYKGGYCARGGWACKCY |
| 12 | A11 | Toxic | 0.42 | GFGCRGPYEEDDMQCHNHCKSIKGYKGGYCARGGFACKCY |
| 13 | A12 | Toxic | 0.84 | GFGCNGPYEEDDMECHNHCKSIKGYKGGYCARGGFACKCY |
| 14 | A13 | Toxic | 0.63 | GFGCNGPYDEDDMQCHNHCKSIKGYKGGYCARGGFACKCY |
| 15 | A14 | Toxic | 0.78 | GFGCNGPYDEDDMQCHNHCKSIKGYKGGYCAKGGFACKCY |
| 16 | A15 | Toxic | 0.57 | GFGCNGPYDEDDMQCHNHCKSIKGYKGGYCARGGFVCKCY |
| 17 | A16 | Toxic | 0.55 | GFGCNGPRDEDDMQCHNHCKSIKGYKGGYCAKGGFVCKCY |
| 18 | A17 | Toxic | 0.67 | GFGCNGPEDEDDMQCHNHCKSIKGYKGGYCAKGGFVCKCY |
| 19 | A18 | Toxic | 0.61 | GFGCNGPFDEDDMQCHNHCKSIKGYKGGYCAKGGFVCKCY |
| 20 | A19 | Toxic | 0.81 | GFGCNGPPDEDDMQCHNHCKSIKGYKGGYCAKGGFVCKCY |
| 21 | A20 | Toxic | 0.60 | GFGCNGPLDEDDMQCHNHCKSIKGYKGGYCAKGGFVCKCY |
4.4. Molecular Docking and Interaction Study of Designed Analogues
The docking results of the nCoVDock2 tool for lead peptides and the designed analogues with their respective receptors. A, EK1-S receptor complex. EK1 binds to the second HR1 in the S2 subunit of the S protein; B, W analogue-S receptor complex; C, plectasin-N receptor complex. This ligand interacts with chains A and C of the SARS-CoV-2 nucleocapsid via disulfide bonds; D, A7 analogue-N receptor complex; E, peptide 5-RdRp complex. Peptide 5 binds to the nucleotide triphosphate (NTP) insertion sites within the RdRp protein; F, A5 analogue-RdRp complex. Inhibitory peptides are shown in purple.
Two-dimensional LigPlot analysis of key protein-analogue peptide interactions. The figure presents the non-covalent interactions for: A, S2 (7COT) and W analogue complex; B, A5 analogue and RdRp (PDB:7BV2) complex; and C, A7 analogue and N protein (PDB:6YI3). These diagrams highlight the crucial role of both hydrogen bonding and hydrophobic interactions in stabilizing the complexes. In the W analogue-S receptor complex, hydrogen bonds between the Lys34 and Asp27, Lys24, and between Lys17 and Ser99 residues are essential for stability. The A5 analogue forms seven hydrogen bonds with RdRp, primarily through its Asp1, Arg3, and Glu4 residues. Similarly, multiple hydrogen bonds also contribute significantly to the stability of the A7 analogue-N protein complex. The hydrophobic interactions are prominent, particularly in the W analogue and A7 analogue complexes, reflecting the design goal to enhance this type of non-covalent binding. (The diagrams follow the standard LigPlot conventions: dotted lines represent hydrogen bonds (with numbers indicating distance in Å), half-circles with spokes indicate hydrophobic contacts, black dots represent the corresponding atoms involved. Chain A is the receptor protein, and chain B is the analogue peptide.)
4.5. Molecular Dynamics
Root mean square deviation (RMSD) trajectories of ligand and receptor backbones for peptide-protein complexes over 100 ns molecular dynamics (MD) simulations. A, W strand ligand (S protein) and receptor backbones; B, EK1-S2 ligand (S protein) and receptor backbones; C, N-A7 ligand (N protein) and receptor backbones; D, N-Plectasin ligand (N protein) and receptor backbones; E, RdRp-A5 ligand (RdRp) and receptor backbones; F, RdRp-Peptide 5 ligand (RdRp) and receptor backbones. RMSD (nm) is plotted against simulation time (ps). Ligand backbones are shown in solid lines, and receptor backbones are shown in dashed lines. All simulations were performed in triplicate.






