Molecular Tracing of Hepatitis C Virus Genotype 1 Isolates in Iran: A NS5B Phylogenetic Analysis with Systematic Review

authors:

Khashayar Hesamizadeh 1, 2, Seyed Moayed Alavian 1, 2, Azar Najafi Tireh Shabankareh 3, Heidar Sharafi 1, 2, *

1 Baqiyatallah Research Center for Gastroenterology and Liver Diseases (BRCGL), Baqiyatallah University of Medical Sciences, Tehran, IR Iran
2 Middle East Liver Diseases (MELD) Center, Tehran, IR Iran
3 Department of Medical Nanotechnology, School of Advanced Technologies in Medicine, Tehran University of Medical Sciences, Tehran, IR Iran

How to Cite: Hesamizadeh K, Alavian S M, Najafi Tireh Shabankareh A, Sharafi H. Molecular Tracing of Hepatitis C Virus Genotype 1 Isolates in Iran: A NS5B Phylogenetic Analysis with Systematic Review. Hepat Mon. 2016;16(12):e42938. https://doi.org/10.5812/hepatmon.42938.

Abstract

Context:

Hepatitis C virus (HCV) is characterized by a high degree of genetic heterogeneity and is classified into 7 genotypes and numerous subtypes. It is heterogeneously distributed across risk groups and geographical regions. A well-established phylogenetic relationship can simplify the tracing of HCV hierarchical strata across geographical regions. The current study aimed to determine the phylogeny of HCV subtype 1a and 1b isolates based on NS5B nucleotide sequences from Iran, other member countries of the Eastern Mediterranean Regional Office (EMRO) of the World Health Organization (WHO), and other Middle Eastern countries, together with a systematic review of available published and unpublished studies.

Evidence Acquisition:

The phylogenetic analyses were based on the nucleotide sequences of the NS5B gene of HCV genotype 1 (HCV-1) registered in the GenBank database. The literature review was performed in two steps: 1) searching PubMed, Scopus, and Web of Science for studies evaluating the NS5B sequences of HCV-1, and 2) searching the GenBank database for sequences registered by unpublished studies.

Results:

In this study, 442 sequences from HCV-1a and 232 from HCV-1b underwent phylogenetic analysis. Phylogenetic analysis of all sequences revealed different clusters in the phylogenetic trees. The results showed that a proportion of the HCV-1a and -1b isolates from Iranian patients probably originated from domestic sources. Moreover, the HCV-1b isolates from Iranian patients may be similar to the European ones.

Conclusions:

In this study, phylogenetic reconstruction of HCV-1 sequences proved useful for molecular tracing and for inferring ancestral relationships of HCV genotypes in Iran, and indicated a likely domestic origin for HCV-1a and various origins for HCV-1b.

1. Context

Hepatitis C virus (HCV) infection is still one of the major causes of mortality and morbidity worldwide (1, 2). The World Health Organization (WHO) has estimated a 3% prevalence of HCV, equating to approximately 180 million individuals globally (3-5). The prevalence of HCV infection in Iran is estimated to be less than 0.5% (6, 7). It is well established that hepatitis C increases the risk of life-threatening diseases including cirrhosis and hepatocellular carcinoma (HCC) (8, 9). In spite of numerous advances in hepatitis C treatment, the high prevalence of hepatitis C in developing countries is still a major concern (10).

The single open reading frame (ORF) of HCV comprises about 9024 nucleotides and encodes a polyprotein of about 3000 amino acids. To date, HCV has been classified into seven genotypes and more than 67 subtypes according to genetic variability and viral sequences (11). The geographic distribution of HCV genotypes across populations and specific risk groups is clearly linked to the genetic diversity of HCV (12). The HCV genotype has clinical and epidemiological importance because it determines the rate of response to HCV therapy and may help trace the source of infection and clarify the possible modes of transmission (13). In terms of response to treatment with Pegylated-Interferon and Ribavirin (PegIFN/RBV), HCV genotype 1 (HCV-1) and HCV-4 infections are more difficult to treat than HCV-2 and HCV-3 infections (10, 14, 15). However, the development of new pangenotypic HCV treatments in recent years has advanced efforts toward global HCV elimination (13).

The Eastern Mediterranean Regional Office (EMRO) is one of the six regional divisions of the WHO; it serves 22 countries and territories in the Middle East, North Africa, the Horn of Africa, and Central Asia, with a total population of 605 million people. The WHO estimates that at least 23 million individuals in the EMRO countries are infected with HCV (16). The genotype distribution of HCV across EMRO countries is notably heterogeneous and follows two main patterns: Arab countries (except Jordan), where HCV-4 is the predominant genotype, and non-Arab countries, where genotypes other than HCV-4 predominate, such as HCV-1 in Iran and HCV-3 in Afghanistan and Pakistan (17, 18). The distribution of HCV genotypes in Iran differs from that of most neighboring and Middle Eastern countries: the most frequent subtype in Iran is HCV-1a, followed by HCV-3a and HCV-1b (19). In Turkey, Azerbaijan, and Russia, neighboring countries to the north of Iran, the most prevalent genotype is HCV-1b (20); in Afghanistan and Pakistan, to the east, it is HCV-3 (21); and in most of the neighboring Arab countries to the west and south, including Iraq and Saudi Arabia, it is HCV-4 (18, 22, 23). The genetic diversity pattern of HCV in Iran is thought to resemble the pattern observed in North America and, to some extent, Western Europe (24). The open question of the source of HCV in Iran, in contrast to surrounding countries, may be elucidated by phylogenetic analysis.

Different regions of the HCV genome have been sequenced across studies, such as Core, NS5B, HVR-1, E2, and a segment of the NS5A gene associated with interferon sensitivity. Guidelines propose using either the full genome, Core/E1, or NS5B sequences of HCV for classification of genotypes/subtypes (25). Genotyping of HCV by nucleotide sequence analysis of NS5B is an effective procedure that reliably discriminates HCV subtypes. Moreover, NS5B is an appropriate gene region for studying the molecular epidemiology of HCV (25).

Phylogenetics is the study of evolutionary relationships among individuals or groups of organisms, e.g. species or populations (26). In virology, phylogenetic trees carry a great deal of information about the inferred evolutionary relationships among a set of viruses and are a widely applicable molecular tool for studying rapidly evolving RNA viruses such as HCV.

The main objective of the current study was to investigate the genetic relationship among all HCV-1a and -1b sequences derived from Iran, EMRO, Middle Eastern, and some European and North American countries by applying phylogenetic analysis, and to understand the source of spread of HCV-1a and -1b in Iran.

2. Evidence Acquisition

2.1. Search Strategy

An electronic systematic search of the literature was conducted to find relevant studies reporting the molecular prevalence and evaluation of HCV-1a and -1b in different Iranian patient groups, as well as all studies on HCV-1 molecular epidemiology in EMRO and Middle Eastern countries. For comparison, the search was also extended to some European and North American countries.

The search was performed on all peer-reviewed journals indexed in the PubMed, Scopus, and Web of Science databases. The literature review was carried out using the following key terms: “hepatitis C virus”, “HCV”, “genotype”, “genotype 1a and 1b”, “molecular sequence data”, “sequence analysis”, “phylogeography”, “phylogenetic analysis”, and “Iran”. In addition to the aforementioned search terms, the names of the twenty-two EMRO countries were added to the search: Afghanistan, Bahrain, Djibouti, Egypt, Iran, Iraq, Jordan, Kuwait, Lebanon, Libya, Morocco, Oman, Pakistan, Palestine, Qatar, Saudi Arabia, Somalia, Sudan, Syria, Tunisia, United Arab Emirates, and Yemen. Middle Eastern countries outside EMRO, namely Cyprus and Turkey, were also added. Azerbaijan was added as a neighboring country of Iran with extensive travel between the two countries. Finally, some European and North American countries, including France, Germany, Italy, the Netherlands, Spain, the UK, the USA, and Canada, were added to the search strategy.

In addition, to find appropriate sequences from unpublished studies registered in the GenBank database, we searched GenBank using the names of the aforementioned EMRO and Middle Eastern countries together with “Hepatitis C virus” and “NS5B”. After all searches were completed, the NS5B sequences of all HCV-1a and -1b isolates were selected and extracted from the GenBank database in FASTA format.
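The FASTA extraction step can be illustrated with a minimal sketch. It is not the tooling used in the study; the function name `parse_fasta` and the accession labels in the example are hypothetical, and the sketch assumes the record identifier is the first token of each header line.

```python
from typing import Dict, Iterable


def parse_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA text into a {record_id: sequence} dictionary.

    Assumption: the record id is the first whitespace-delimited token of the
    header line (for GenBank downloads this is usually the accession number).
    """
    records: Dict[str, str] = {}
    current_id = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # ignore blank lines between records
        if line.startswith(">"):
            tokens = line[1:].split()
            current_id = tokens[0] if tokens else ""
            records[current_id] = ""
        elif current_id is not None:
            # Sequence lines of one record are concatenated and upper-cased.
            records[current_id] += line.upper()
    return records


# Hypothetical records; the labels are placeholders, not real accessions.
example = """>ACC0001 hypothetical HCV-1a NS5B fragment
acgtacgt
ACGT
>ACC0002 hypothetical HCV-1b NS5B fragment
ttgacc
"""
sequences = parse_fasta(example.splitlines())
```

Keying the dictionary by identifier also makes it straightforward to detect sequences that were retrieved twice, once from a published study and once from a direct GenBank search.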

2.2. Selection of Studies

All published and unpublished studies with suitable data were surveyed according to the following criteria: 1) molecular studies in English among different patient groups with HCV-1a and -1b enrolled from Iran, EMRO, Middle Eastern, and the aforementioned European and North American countries; 2) molecular studies that reported GenBank accession numbers for NS5B gene sequences; and 3) gene sequences with 243 base pair (bp) coverage between nucleotides 8319 - 8561 for HCV-1a (the coverage obtained after full alignment with the HCV-1a reference, accession number M62321), or 232 bp coverage between nucleotides 8315 - 8546 for HCV-1b (the coverage obtained after full alignment with the HCV-1b reference, accession number U84014). These frames were obtained after alignment and trimming of the included nucleotide sequences and exclusion of sequences shorter than 200 bp. In general, short nucleotide sequences and sequences outside the required coverage were removed from the MEGA file.
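The coverage criterion can be sketched as a simple interval check. The window coordinates and the 200 bp threshold come from the text above; the `AlignedFragment` type, the function names, and the rule "overlap with the window of at least 200 bp" are assumptions made for illustration, since the exact filtering rule applied in MEGA is not spelled out.

```python
from dataclasses import dataclass

# Reference windows from the selection criteria (1-based, inclusive).
WINDOWS = {"1a": (8319, 8561), "1b": (8315, 8546)}
MIN_LENGTH = 200  # sequences shorter than this were excluded


@dataclass
class AlignedFragment:
    """Hypothetical record: a sequence placed on reference coordinates."""
    accession: str
    subtype: str    # "1a" or "1b"
    ref_start: int  # first reference position covered (1-based)
    ref_end: int    # last reference position covered (inclusive)


def window_length(subtype: str) -> int:
    """Length in bp of the analysis window for a subtype."""
    start, end = WINDOWS[subtype]
    return end - start + 1


def overlap_with_window(frag: AlignedFragment) -> int:
    """Number of window positions covered by the fragment."""
    start, end = WINDOWS[frag.subtype]
    return max(0, min(frag.ref_end, end) - max(frag.ref_start, start) + 1)


def is_eligible(frag: AlignedFragment) -> bool:
    """Assumed rule: keep fragments covering at least MIN_LENGTH window positions."""
    return overlap_with_window(frag) >= MIN_LENGTH


# Placeholder fragments, not real GenBank entries.
full = AlignedFragment("ACC0001", "1a", 8300, 8600)
short = AlignedFragment("ACC0002", "1a", 8400, 8500)
```

With these coordinates, `window_length("1a")` is 243 and `window_length("1b")` is 232, matching the coverage figures quoted in the criteria.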

The exclusion criteria were as follows: 1) studies with possible errors and confusing data, 2) studies that used HCV genomic regions other than NS5B, and 3) HCV-2 to -6 isolates, as well as HCV-1 isolates other than HCV-1a and -1b.

To confirm the genotype/subtype of the selected isolates, we used the NCBI Viral Genotyping tool (http://www.ncbi.nlm.nih.gov/projects/genotyping/formpage.cgi) and HCV geno2pheno (http://www.geno2pheno.org). To assess eligibility, all candidate articles obtained through the search strategy were independently reviewed by three authors (KH-H, H-SH and A-NJTSH). Any discrepancy between authors was resolved by consulting the supervisor of the study (SMA).

2.3. Data Extraction and Quality Assessment

The reviewing and screening processes in this study were based on the PRISMA guidelines for reporting systematic reviews (27). We independently screened the title, abstract, and full-text of papers identified through the database searches. After full-text screening, the following data were extracted from each study: first author’s name, publication year, country of origin, date of study, type of patient groups, and GenBank accession numbers. All extracted data were systematically double checked by authors independently to avoid any errors. The quality of the included studies was assessed using a modified STROBE checklist (28).

2.4. Sequence Collection and Phylogenetic Analyses

In this study, the evolutionary relationships of isolates were inferred using the Neighbor-Joining method and the Kimura 2-Parameter model. The percentage of replicate trees in which sequences clustered together was estimated using the bootstrap test (500 replicates). The results were based on the clusters identified in the neighbor-joining phylogeny, with a bootstrap cut-off value of 50% for defining clusters (values > 50% are shown next to the branches). The trees were drawn to scale, with branch lengths in the same units as those of the evolutionary distances used to infer the phylogenetic tree (29).
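For readers unfamiliar with the distance model, the Kimura 2-parameter distance between two aligned sequences is d = -(1/2) ln(1 - 2P - Q) - (1/4) ln(1 - 2Q), where P and Q are the proportions of sites showing transitions and transversions. The sketch below is an illustration only and not the MEGA implementation used in the study; the function names are hypothetical, and skipping sites with gaps or ambiguous bases is an assumption about site handling.

```python
import math
import random
from typing import Dict

PURINES = {"A", "G"}


def k2p_distance(seq1: str, seq2: str) -> float:
    """Kimura 2-parameter distance between two aligned sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # assumption: sites with gaps or ambiguity codes are skipped
        compared += 1
        if a == b:
            continue
        if (a in PURINES) == (b in PURINES):
            transitions += 1    # A<->G or C<->T
        else:
            transversions += 1  # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    term1, term2 = 1 - 2 * p - q, 1 - 2 * q
    if term1 <= 0 or term2 <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(term1) - 0.25 * math.log(term2)


def bootstrap_columns(alignment: Dict[str, str], rng: random.Random) -> Dict[str, str]:
    """One bootstrap pseudo-replicate: resample alignment columns with replacement."""
    length = len(next(iter(alignment.values())))
    columns = [rng.randrange(length) for _ in range(length)]
    return {name: "".join(seq[i] for i in columns) for name, seq in alignment.items()}
```

In a bootstrap test, a tree is rebuilt from each pseudo-replicate and the support for a cluster is the percentage of replicate trees that contain it; here that percentage would be taken over 500 replicates.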

Pair-wise and multiple alignments of the HCV NS5B sequences were performed using MUSCLE (Multiple Sequence Comparison by Log-Expectation) in Molecular Evolutionary Genetics Analysis software version 7.0 (MEGA 7.0) (30-32). To determine the coverage and to edit the nucleotide sequences, the downloaded sequences were transferred to CLC software (CLC Main Workbench 5). Following full alignment and manual trimming of the sequences, the phylogenetic trees of suitable sequences were constructed with MEGA 7.0. Initially, the trees were drawn in the traditional rectangular branch style. Because of the large number of sequences, the long rectangular trees were difficult to read, so the trees were converted to a circular style for clarity.

3. Results

3.1. Study Screening and Selection

In the present study, HCV NS5B nucleotide sequences were collected from 30 studies conducted on a total of more than 3000 subjects. We studied all available nucleotide sequences reported from Iran and other EMRO and Middle Eastern countries. Of seven studies from Iran, four were published in 2004 - 2014 (33-36) and three were unpublished. Furthermore, 15 published and unpublished studies from EMRO and Middle Eastern countries including Afghanistan (37), Azerbaijan, Cyprus (38, 39), Egypt, Morocco (40), Pakistan (41), and Tunisia (42, 43) were collected. In addition, randomly selected studies from the US and European countries were included (44-49).

As shown in Figure 1, 683 published papers were identified via database searching. After removal of 187 duplicates, 382 irrelevant titles, and 81 papers with irrelevant abstracts, 14 studies were finally eligible for inclusion in the phylogenetic analyses (33-43, 50-52). In total, 505 sequences of HCV-1a and -1b were obtained from these 14 studies. Moreover, searching the GenBank database for unpublished studies with registered sequences yielded 1831 sequences. After removal of 1283 sequences (47 that were not NS5B gene sequences and 1236 that belonged to HCV genotypes other than HCV-1a and -1b), a total of 548 sequences remained. Therefore, a total of 1053 sequences from both published and unpublished studies were collected. After removal of 413 duplicated sequences, 640 sequences remained. Furthermore, 195 different sequences from Italy, France, Netherlands, Spain, the United Kingdom, and the United States were randomly included (44-49). All of the 835 collected sequences were then transferred into MEGA software for full alignment. Finally, 161 sequences were removed because they fell outside the coverage window; thus, a total of 674 sequences were used in the phylogenetic analyses, including 442 sequences for HCV-1a and 232 sequences for HCV-1b.
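The sequence counts in this paragraph can be checked with a few lines of arithmetic. All numbers are taken from the text; the variable names are illustrative only. Note that the three paper-screening removals listed in the text leave 33 records, so the step from 33 to the 14 included studies is not itemized in the paragraph.

```python
# Paper screening (numbers from the text).
papers_identified = 683
papers_after_listed_removals = papers_identified - 187 - 382 - 81  # 33 records
included_studies = 14  # the remaining exclusions are not itemized in the text

# Sequence collection.
published_sequences = 505
genbank_hits = 1831
removed_non_ns5b = 47
removed_other_genotypes = 1236
unpublished_sequences = genbank_hits - (removed_non_ns5b + removed_other_genotypes)

combined = published_sequences + unpublished_sequences  # 1053
deduplicated = combined - 413                           # 640
with_comparison_set = deduplicated + 195                # 835
analysed = with_comparison_set - 161                    # 674

hcv_1a, hcv_1b = 442, 232
assert unpublished_sequences == 548
assert analysed == hcv_1a + hcv_1b
```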

Figure 1. Flowchart Diagram of Searching Databases

3.2. Characteristics of the Included Studies

The characteristics of published studies are presented in Table 1. Although 8 included studies were unpublished (from Iran, Azerbaijan, Egypt, Pakistan, and Tunisia), the nucleotide sequences of these studies had been registered in the GenBank database. The characteristics of unpublished studies with registered sequences in the GenBank database are shown in Table 2.

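Table 1. Characteristics of the Included Published Studies

| No. | Publication Year | Country | Sample Size, n | Age, Min - Max | Male, % | Subtype 1a, No. (%) | Subtype 1b, No. (%) | Patient Group | Ref. |
|---|---|---|---|---|---|---|---|---|---|
| 1 | 2013 | Afghanistan | 71 | 23 - 39 | 100 | 25 (35.2) | 2 (2.8) | IDU | (37) |
| 2 | 2009 | Cyprus | 104 | 18 to > 60 | 50 | 9 (8.6) | 38 (36.5) | NA | (38) |
| 3 | 2010 | Cyprus | 40 | 25 - 47 | 85 | 0 | 4 (10) | IDU | (39) |
| 4 | 2004 | Iran | 158 | 5 - 76 | 76 | 59 (37) | 10 (6.3) | IDU, blood or blood product recipient, hemodialysis, NA | (34) |
| 5 | 2012 | Iran | 83 | 19 - 65 | 98 | 35 (42) | 0 | IDU | (35) |
| 6 | 2013 | Iran | 130 | 11 - 63 | 51 | 69 (53) | 19 (14.6) | Thalassemia | (33) |
| 7 | 2014 | Iran | 142 | 22 - 82 | 85 | 71 (50) | 20 (14) | NA | (36) |
| 8 | 2012 | Morocco | 141 | 39 - 80 | 45 | 1 (0.7) | 106 (75) | NA | (40) |
| 9 | 2009 | Pakistan | 189 | 46 - 66 | 66 | 3 (1.5) | 2 (0.8) | NA | (51) |
| 10 | 2013 | Pakistan | 1537 | 31 - 53 | 43 | 53 (3.5) | 12 (0.8) | NA | (41) |
| 11 | 2004 | Tunisia | 32 | 14 - 76 | 81 | 10 (31) | 14 (43.7) | NA | (52) |
| 12 | 2007 | Tunisia | 395 | 18 - 88 | 60 | 4 (1) | 10 (2.5) | Hemodialysis | (43) |
| 13 | 2008 | Tunisia | 38 | 1 - 56 | 100 | 20 (52.6) | 17 (44.7) | Hemophilia | (42) |
| 14 | 2013 | Tunisia | 33 | 34 - 56 | 67 | 0 | 4 (12) | Hemodialysis | (50) |
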
Table 2. Characteristics of the Unpublished Studies With Direct Gene Submission in the GenBank Database

| No. | Year of Registry in GenBank | Country | Subtype 1a, n | Subtype 1b, n | Patient Group | Title in GenBank |
|---|---|---|---|---|---|---|
| 1 | 2008 | Azerbaijan | 0 | 25 | IDU | Hepatitis C recombinant form 1 - 2k/1b prevalent in IDU networks in Azerbaijan |
| 2 | 2010 | Egypt | 1 | 1 | NA | HCV intrafamilial transmission in Greater Cairo, Egypt |
| 3 | 2012 | Iran | 108 | 24 | Inherited bleeding disorder | Molecular epidemiology of hepatitis C virus among patients with inherited bleeding disorders in Iran |
| 4 | 2012 | Iran | 23 | 3 | Blood donor | Genotype distribution of hepatitis C virus among Iranian blood donors, 2006 |
| 5 | 2013 | Iran | 22 | 1 | Blood donor | Genotype distribution of hepatitis C virus among Iranian blood donors, 2006 - 2008 |
| 6 | 2010 | Pakistan | 2 | 0 | NA | Hepatitis C virus subtype 1a isolate Pk-NS5B 1a non-structural protein 5B gene |
| 7 | 2011 | Pakistan | 5 | 3 | NA | NS5B genome based HCV genotyping and evolutionary analysis |
| 8 | 2005 | Tunisia | 12 | 0 | Hemophilia | Genetic variability of genotype 1 HCV strains obtained from Tunisian haemophiliacs and assessed by phylogenetic analyses in the NS5b region database |

3.3. Phylogenetic Analysis of HCV Subtype 1a

Out of 442 extracted sequences for HCV-1a, 325 (73.5%) sequences were obtained from Iranian studies. These sequences were derived from different groups including blood donors, inherited bleeding disorders, thalassemia, intravenous drug users, patients on hemodialysis, and patients without known risk factors. The phylogenetic analysis of HCV-1a demonstrated various clusters (Figure 2).

Most of the Afghan isolates clustered among Iranian isolates within the same clades. Sequences from UK intravenous drug users clustered with those from some French blood donors and some Iranian patients. Specific sequences from Iranian patients were also observed clustering with those from UK and Cypriot intravenous drug users (Figure 2).

Figure 2. Phylogenetic Analysis of NS5B Sequences of HCV Subtype 1a. The analysis involved 442 nucleotide sequences, and codon positions included 1st+2nd+3rd+Noncoding. All positions containing gaps and missing data were eliminated. Phylogenetic clusters were defined by bootstrap analysis (cut-off 50%). Values for these clusters are indicated next to the branches (values > 50% are shown). The accession number, the country of origin, and patient group are listed for all isolates. Solid blue circles indicate sequences attributed to the Iranian strains. Abbreviations of country names are as follows: Afgh: Afghanistan; Azer: Azerbaijan; Cyp: Cyprus; Egy: Egypt; Fra: France; IRI: Iran; Ita: Italy; Mor: Morocco; Ned: Netherlands; Pak: Pakistan; Spa: Spain; Tun: Tunisia; UK: the United Kingdom; USA: the United States of America. Abbreviations of patient groups are as follows: BDs: blood donors; Dial: hemodialysis; IDU: intravenous drug users; IBD: inherited bleeding disorders; Hemo: hemophilia; Thal: thalassemia; LT: liver transplants; Und: undetermined. The optimal tree with the sum of branch length of 1.380 is shown, and there were a total of 187 positions in the final dataset.

It can be concluded from the tree that a proportion of the Iranian isolates clustered along with each other.

3.4. Phylogenetic Analysis of HCV Subtype 1b

A comparison of the HCV-1b sequences is shown in the phylogenetic tree in Figure 3. Seventy-one (30.6%) of the 232 sequences were from Iranian patients. These sequences were isolated from blood donors, patients with inherited bleeding disorders, patients with thalassemia, and patients without known risk factors. In the phylogenetic analysis, the Iranian HCV-1b sequences were heterogeneously dispersed (Figure 3). Some of the Iranian sequences clustered with each other and some clustered with European sequences, particularly those from France, Spain, and Italy. The HCV-1b isolates from Iranian patients may therefore be similar to the European ones. The phylogenetic tree also contains isolates from different geographical regions that clustered together. It is likely that subtype 1b has different origins.

Figure 3. Phylogenetic Analysis of NS5B Sequences of HCV Subtype 1b. The analysis involved 232 nucleotide sequences. The codon positions included 1st+2nd+3rd+Noncoding. All positions containing gaps and missing data were eliminated. Phylogenetic clusters were defined by bootstrap analysis (cut-off 50%). Values for these clusters are indicated next to the branches (values > 50% are shown). The accession number, the country of origin, and patient groups are listed for all isolates. Solid green circles indicate sequences attributed to the Iranian strains. Abbreviations of country names are: Afgh: Afghanistan; Azer: Azerbaijan; Cyp: Cyprus; Egy: Egypt; Fra: France; IRI: Iran; Ita: Italy; Mor: Morocco; Ned: Netherlands; Pak: Pakistan; Spa: Spain; Tun: Tunisia; UK: the United Kingdom; USA: the United States of America. Abbreviations of patient groups are: BDs: blood donors; Dial: hemodialysis; IDU: intravenous drug users; IBD: inherited bleeding disorders; Hemo: hemophilia; Thal: thalassemia; LT: liver transplants; Und: undetermined. The optimal tree with the sum of branch length of 3.468 is shown, and there were a total of 203 positions in the final dataset.

4. Conclusions

Currently, HCV is considered one of the most important viruses threatening human life. HCV has 7 major genotypes and more than 67 different subtypes. The distribution pattern of HCV genotypes varies among HCV-infected individuals, depending on public health practices and social risk factors. The predominant risk factor for HCV transmission has changed over time, from blood transfusion to intravenous drug use (24). The distribution of HCV genotypes and subtypes in Iran and other Middle Eastern and EMRO countries is very diverse. Overall, HCV-1 (-1a and -1b) is the predominant genotype in Iran, accounting for more than half (54%) of HCV-infected patients (12, 18).

The genetic diversity of HCV stems from the characteristics of its RNA genome, in particular the error-prone NS5B polymerase. As a result, different populations of viruses called “quasispecies” are produced, with approximately one mutation introduced in each cycle of replication. Through this dynamic replication process, highly diverse viruses are produced at a rate of about 10 trillion viruses per day (53).

The phylogenetic pattern gives valuable information about hierarchical relationships and genetic evolution. The detailed phylogenetic analysis of HCV-1a NS5B sequences indicated that a proportion of the Iranian HCV-1a isolates fell within common clades. Therefore, it can be concluded that a proportion of Iranian HCV-1a isolates most probably has a domestic origin. In this study, most of the Afghan strains fell among the Iranian strains, which indicates that the HCV-1a sequences from the two countries are genetically closely related. This may be because Afghanistan was historically part of Iranian territory and because, since the outbreak of war in Afghanistan in 1978, Iran's borders have been open to Afghan refugees, allowing large numbers of Afghans to immigrate to Iran. This could be one of the main explanations for the similarity of HCV genotypes in Afghanistan and Iran.

In Pakistan, HCV-1a was the third most prevalent genotype (with a rate of 4.82%). It is plausible that most Pakistani patients infected with HCV-1a acquired the infection through unsafe medical practices during surgery (54). In a previous phylogenetic analysis of HCV-1a in Pakistan, the virus was found to have a polyphyletic origin, and the sequences were closely related to European strains (55).

The discussion surrounding HCV-1b is more complex, as it has different geographical distribution patterns. HCV-1b is the third most prevalent genotype in Iran. Cyprus, Morocco, Tunisia, and Turkey are countries in which HCV-1b is predominant. The EMRO and Middle Eastern countries show diverse patterns of HCV genotype distribution, ranging from predominantly HCV-4 in Arab countries to HCV-3 and HCV-1 (-1a and -1b) in non-Arab countries (18).

As mentioned earlier, some of the HCV-1b sequences from Iranian isolates were similar to sequences from European isolates, including those from France and Spain. This similarity appears more likely among Iranian patients with inherited bleeding disorders such as hemophilia. The prevalence of HCV among Iranian hemophilia patients is high: the overall prevalence in this patient group is reported to be 40.8%, with a range from 13.3% to 80.5% (8, 56). Notably, in the 1980s, a large number of patients, particularly those with hemophilia, were infected with HIV and HCV through blood and blood products imported from France to Iran. The distinct pattern of HCV infection in Iranian hemophilia patients has been clearly defined: HCV-1b has been shown to be more frequent in Iranian hemophilia patients than in other Iranian HCV-infected groups (57). This may explain the similarity between Iranian and European HCV-1b isolates and suggests infection through blood products, such as clotting factors, imported to Iran.

Phylogenetic analysis of HCV genotypes and subtypes is a useful molecular method that helps scientists in every geographic region contribute to monitoring the virus, including HCV molecular tracing, ancestral studies, genetic assessments, and guidance for treatment decisions. This study has some limitations: 1) short (< 300 bp) nucleotide sequences were used in the phylogenetic analyses, 2) HCV NS5B sequences were available from a limited number of EMRO and Middle Eastern countries, and 3) we could not properly assess the relationship between HCV risk factors and the phylogeny of HCV NS5B sequences. Further studies should therefore be conducted to find more genetic relationships among sequences from different regions and patient groups, using genetic distances to measure genetic divergence, phylodynamic inference and evolutionary methods to define circulating strains, and molecular clock analysis to understand ancestral relationships.

In conclusion, the NS5B sequences of different infected patients were analyzed phylogenetically for molecular tracing of HCV-1 in Iran. The phylogenetic trees of HCV-1a and -1b, based on 500 pseudo-replicates, revealed many clades reflecting the ancestral relationships among the sequences. Phylogenetic reconstruction of all HCV-1 sequences shows that most Iranian HCV-1b isolates are dispersed among European isolates with considerable diversity, whereas most Iranian HCV-1a isolates probably have a domestic origin.

References

  • 1.

    Alavian S, Fallahian F. Epidemiology of Hepatitis C in Iran and the World. Shiraz E Med J. 2009;10(4):162-72.

  • 2.

    Alavian SM, Tabatabaei SV, Mahboobi N. Epidemiology and risk factors of HCV infection among hemodialysis patients in countries of the Eastern Mediterranean Regional Office of WHO (EMRO): a quantitative review of literature. J Public Health. 2011;19(2):191-203. https://doi.org/10.1007/s10389-010-0366-2.

  • 3.

    Mohd Hanafiah K, Groeger J, Flaxman AD, Wiersma ST. Global epidemiology of hepatitis C virus infection: new estimates of age-specific antibody to HCV seroprevalence. Hepatology. 2013;57(4):1333-42. [PubMed ID: 23172780]. https://doi.org/10.1002/hep.26141.

  • 4.

    Lavanchy D. Global surveillance and control of hepatitis C. Report of a WHO Consultation organized in collaboration with the Viral Hepatitis Prevention Board, Antwerp, Belgium. J Viral Hepatitis. 1999;6(1):35-47. https://doi.org/10.1046/j.1365-2893.1999.6120139.x.

  • 5.

    Lavanchy D. Evolving epidemiology of hepatitis C virus. Clin Microbiol Infect. 2011;17(2):107-15. [PubMed ID: 21091831]. https://doi.org/10.1111/j.1469-0691.2010.03432.x.

  • 6.

    Alavian SM, Adibi P, Zali MR. Hepatitis C virus in Iran: Epidemiology of an emerging infection. Arch Iranian Med. 2005;8(2):84-90.

  • 7.

    Alavian SM, Ahmadzad-Asl M, Lankarani KB, Shahbabaie MA, Bahrami Ahmadi A, Kabir A. Hepatitis C infection in the general population of Iran: a systematic review. Hepat Mon. 2009;9(3):211-23.

  • 8.

    Alavian SM, Aalaei-Andabili SH. Lack of Knowledge About Hepatitis C Infection Rates Among Patients With Inherited Coagulation Disorders in Countries Under the Eastern Mediterranean Region Office of WHO (EMRO): A Meta-Analysis. Hepat Mon. 2012;12(4):244-52. [PubMed ID: 22690231]. https://doi.org/10.5812/hepatmon.844.

  • 9.

    Hajarizadeh B, Grebely J, Dore GJ. Epidemiology and natural history of HCV infection. Nat Rev Gastroenterol Hepatol. 2013;10(9):553-62. [PubMed ID: 23817321]. https://doi.org/10.1038/nrgastro.2013.107.

  • 10.

    Alavian SM, Hajarizadeh B, Bagheri Lankarani K, Sharafi H, Ebrahimi Daryani N, Merat S. Recommendations for the Clinical Management of Hepatitis C in Iran: A Consensus-Based National Guideline. Hepat Mon. https://doi.org/10.5812/hepatmon.guideline.

  • 11.

    Smith DB, Bukh J, Kuiken C, Muerhoff AS, Rice CM, Stapleton JT, et al. Expanded classification of hepatitis C virus into 7 genotypes and 67 subtypes: updated criteria and genotype assignment web resource. Hepatology. 2014;59(1):318-27. [PubMed ID: 24115039]. https://doi.org/10.1002/hep.26744.

  • 12.

    Sadeghi F, Salehi-Vaziri M, Almasi-Hashiani A, Gholami-Fesharaki M, Pakzad R, Alavian SM. Prevalence of Hepatitis C Virus Genotypes Among Patients in Countries of the Eastern Mediterranean Regional Office of WHO (EMRO): A Systematic Review and Meta-Analysis. Hepat Mon. 2016;16(4):e35558. [PubMed ID: 27274353]. https://doi.org/10.5812/hepatmon.35558.

  • 13.

    Hesamizadeh K, Sharafi H, Rezaee-Zavareh MS, Behnava B, Alavian SM. Next Steps Toward Eradication of Hepatitis C in the Era of Direct Acting Antivirals. Hepat Mon. 2016;16(4):e37089. [PubMed ID: 27275164]. https://doi.org/10.5812/hepatmon.37089.

  • 14.

    Manns MP, McHutchison JG, Gordon SC, Rustgi VK, Shiffman M, Reindollar R, et al. Peginterferon alfa-2b plus ribavirin compared with interferon alfa-2b plus ribavirin for initial treatment of chronic hepatitis C: a randomised trial. Lancet. 2001;358(9286):958-65. [PubMed ID: 11583749]. https://doi.org/10.1016/S0140-6736(01)06102-5.

  • 15.

    Behnava B, Sharafi H, Keshvari M, Pouryasin A, Mehrnoush L, Salimi S, et al. The Role of Polymorphisms Near the IL28B Gene on Response to Peg-Interferon and Ribavirin in Thalassemic Patients With Hepatitis C. Hepat Mon. 2016;16(1):e32703. [PubMed ID: 27110259]. https://doi.org/10.5812/hepatmon.32703.

  • 16.

    The growing threats of hepatitis B and C in the Eastern Mediterranean region: a call for action. WHO; 2009.

  • 17.

    Ramia S, Eid-Fares J. Distribution of hepatitis C virus genotypes in the Middle East. Int J Infect Dis. 2006;10(4):272-7. [PubMed ID: 16564719]. https://doi.org/10.1016/j.ijid.2005.07.008.

  • 18.

    Ghaderi-Zefrehi H, Gholami-Fesharaki M, Sharafi H, Sadeghi F, Alavian SM. The Distribution of Hepatitis C Virus Genotypes in Middle Eastern Countries: A Systematic Review and Meta-Analysis. Hepat Mon. 2016;16(9):e40357. [PubMed ID: 27826320]. https://doi.org/10.5812/hepatmon.40357.

  • 19.

    Khodabandehloo M, Roshani D. Prevalence of hepatitis C virus genotypes in Iranian patients: a systematic review and meta-analysis. Hepat Mon. 2014;14(12):e22915. [PubMed ID: 25685164]. https://doi.org/10.5812/hepatmon.22915.

  • 20.

    Kabakci Alagoz G, Karatayli SC, Karatayli E, Celik E, Keskin O, Dinc B, et al. Hepatitis C virus genotype distribution in Turkey remains unchanged after a decade: performance of phylogenetic analysis of the NS5B, E1, and 5'UTR regions in genotyping efficiency. Turk J Gastroenterol. 2014;25(4):405-10. [PubMed ID: 25254523]. https://doi.org/10.5152/tjg.2014.7083.

  • 21.

    Khan N, Akmal M, Hayat M, Umar M, Ullah A, Ahmed I, et al. Geographic distribution of hepatitis C virus genotypes in Pakistan. Hepat Mon. 2014;14(10):e20299. [PubMed ID: 25477975]. https://doi.org/10.5812/hepatmon.20299.

  • 22.

    Osoba AO. Hepatitis C virus genotypes in Saudi Arabia. Saudi Med J. 2002;23(1):7-12. [PubMed ID: 11938356].

  • 23.

    Bokharaei Salim F, Keyvani H, Amiri A, Jahanbakhsh Sefidi F, Shakeri R, Zamani F. Distribution of different hepatitis C virus genotypes in patients with hepatitis C virus infection. World J Gastroenterol. 2010;16(16):2005-9. [PubMed ID: 20419838]. https://doi.org/10.3748/wjg.v16.i16.2005.

  • 24.

    Taherkhani R, Farshadpour F. Epidemiology of hepatitis C virus in Iran. World J Gastroenterol. 2015;21(38):10790-810. [PubMed ID: 26478671]. https://doi.org/10.3748/wjg.v21.i38.10790.

  • 25.

    Murphy DG, Willems B, Deschenes M, Hilzenrat N, Mousseau R, Sabbah S. Use of sequence analysis of the NS5B region for routine genotyping of hepatitis C virus with reference to C/E1 and 5' untranslated region sequences. J Clin Microbiol. 2007;45(4):1102-12. [PubMed ID: 17287328]. https://doi.org/10.1128/JCM.02366-06.

  • 26.

    Ciccozzi M, Lo Presti A, Ciccaglione AR, Zehender G, Ciotti M. Phylogeny and phylodinamic of Hepatitis C in Italy. BMC Infect Dis. 2012;12 Suppl 2:S5. [PubMed ID: 23173700]. https://doi.org/10.1186/1471-2334-12-S2-S5.

  • 27.

    Moher D, Liberati A, Tetzlaff J, Altman DG, PRISMA Group. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. Int J Surg. 2010;8(5):336-41. [PubMed ID: 20171303]. https://doi.org/10.1016/j.ijsu.2010.02.007.

  • 28.

    Vandenbroucke JP, von Elm E, Altman DG, Gotzsche PC, Mulrow CD, Pocock SJ. Strengthening the Reporting of Observational Studies in Epidemiology (STROBE): Explanation and Elaboration. Ann Intern Med. 2007;147(8):163. https://doi.org/10.7326/0003-4819-147-8-200710160-00010-w1.

  • 29.

    Alfaro ME, Zoller S, Lutzoni F. Bayes or bootstrap? A simulation study comparing the performance of Bayesian Markov chain Monte Carlo sampling and bootstrapping in assessing phylogenetic confidence. Mol Biol Evol. 2003;20(2):255-66. [PubMed ID: 12598693]. https://doi.org/10.1093/molbev/msg028.

  • 30.

    Kumar S, Stecher G, Tamura K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Mol Biol Evol. 2016;33(7):1870-4. [PubMed ID: 27004904]. https://doi.org/10.1093/molbev/msw054.

  • 31.

    Aiyar A. The use of CLUSTAL W and CLUSTAL X for multiple sequence alignment. Methods Mol Biol. 2000;132:221-41. [PubMed ID: 10547838].

  • 32.

    Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32(5):1792-7. [PubMed ID: 15034147]. https://doi.org/10.1093/nar/gkh340.

  • 33.

    Samimi-Rad K, Asgari F, Nasiritoosi M, Esteghamati A, Azarkeyvan A, Eslami SM, et al. Patient-to-Patient Transmission of Hepatitis C at Iranian Thalassemia Centers Shown by Genetic Characterization of Viral Strains. Hepat Mon. 2013;13(1):e7699. [PubMed ID: 23585766]. https://doi.org/10.5812/hepatmon.7699.

  • 34.

    Samimi-Rad K, Nategh R, Malekzadeh R, Norder H, Magnius L. Molecular epidemiology of hepatitis C virus in Iran as reflected by phylogenetic analysis of the NS5B region. J Med Virol. 2004;74(2):246-52. [PubMed ID: 15332273]. https://doi.org/10.1002/jmv.20170.

  • 35.

    Samimi-Rad K, Nasiri Toosi M, Masoudi-Nejad A, Najafi A, Rahimnia R, Asgari F, et al. Molecular epidemiology of hepatitis C virus among injection drug users in Iran: a slight change in prevalence of HCV genotypes over time. Arch Virol. 2012;157(10):1959-65. [PubMed ID: 22695769]. https://doi.org/10.1007/s00705-012-1369-9.

  • 36.

    Salehi Moghadam F, Mohebbi SR, Hosseini SM, Romani S, Mirtalebi H, Azimzadeh P, et al. Phylogenetic analysis of hepatitis C virus strains and risk factors associated with infection and viral subtypes among Iranian patients. J Med Virol. 2014;86(8):1342-9. [PubMed ID: 24838700]. https://doi.org/10.1002/jmv.23947.

  • 37.

    Sanders-Buell E, Rutvisuttinunt W, Todd CS, Nasir A, Bradfield A, Lei E, et al. Hepatitis C genotype distribution and homology among geographically disparate injecting drug users in Afghanistan. J Med Virol. 2013;85(7):1170-9. [PubMed ID: 23918535]. https://doi.org/10.1002/jmv.23575.

  • 38.

    Demetriou VL, van de Vijver DA, Cyprus HCV Network, Kostrikis LG. Molecular epidemiology of hepatitis C infection in Cyprus: evidence of polyphyletic infection. J Med Virol. 2009;81(2):238-48. [PubMed ID: 19107977]. https://doi.org/10.1002/jmv.21370.

  • 39.

    Demetriou VL, van de Vijver DA, Hezka J, Kostrikis LG, Cyprus IVDU Network. Hepatitis C infection among intravenous drug users attending therapy programs in Cyprus. J Med Virol. 2010;82(2):263-70. [PubMed ID: 20029809]. https://doi.org/10.1002/jmv.21690.

  • 40.

    Brahim I, Akil A, Mtairag el M, Pouillot R, Malki AE, Nadir S, et al. Morocco underwent a drift of circulating hepatitis C virus subtypes in recent decades. Arch Virol. 2012;157(3):515-20. [PubMed ID: 22160625]. https://doi.org/10.1007/s00705-011-1193-7.

  • 41.

    Aziz H, Raza A, Murtaza S, Waheed Y, Khalid A, Irfan J, et al. Molecular epidemiology of hepatitis C virus genotypes in different geographical regions of Punjab Province in Pakistan and a phylogenetic analysis. Int J Infect Dis. 2013;17(4):247-53. [PubMed ID: 23183233]. https://doi.org/10.1016/j.ijid.2012.09.017.

  • 42.

    Djebbi A, Bahri O, Langar H, Sadraoui A, Mejri S, Triki H. Genetic variability of genotype 1 hepatitis C virus isolates from Tunisian haemophiliacs. New Microbiol. 2008;31(4):473-80. [PubMed ID: 19123302].

  • 43.

    Hmaied F, Ben Mamou M, Dubois M, Pasquier C, Sandres-Saune K, Rostaing L, et al. Determining the source of nosocomial transmission in hemodialysis units in Tunisia by sequencing NS5B and E2 sequences of HCV. J Med Virol. 2007;79(8):1089-94. [PubMed ID: 17597483]. https://doi.org/10.1002/jmv.20877.

  • 44.

    Cochrane A, Searle B, Hardie A, Robertson R, Delahooke T, Cameron S, et al. A genetic analysis of hepatitis C virus transmission between injection drug users. J Infect Dis. 2002;186(9):1212-21. [PubMed ID: 12402190]. https://doi.org/10.1086/344314.

  • 45.

    Harris KA, Teo CG. Diversity of Hepatitis C Virus Quasispecies Evaluated by Denaturing Gradient Gel Electrophoresis. Clin Diagn Lab Immunol. 2001;8(1):62-73. https://doi.org/10.1128/cdli.8.1.62-73.2001.

  • 46.

    Cantaloube JF, Biagini P, Attoui H, Gallian P, de Micco P, de Lamballerie X. Evolution of hepatitis C virus in blood donors and their respective recipients. J Gen Virol. 2003;84(Pt 2):441-6. [PubMed ID: 12560577]. https://doi.org/10.1099/vir.0.18642-0.

  • 47.

    Bracho MA, Gosalbes MJ, Blasco D, Moya A, Gonzalez-Candelas F. Molecular epidemiology of a hepatitis C virus outbreak in a hemodialysis unit. J Clin Microbiol. 2005;43(6):2750-5. [PubMed ID: 15956393]. https://doi.org/10.1128/JCM.43.6.2750-2755.2005.

  • 48.

    Lopez-Labrador FX, Bracho MA, Berenguer M, Coscolla M, Rayon JM, Prieto M, et al. Genetic similarity of hepatitis C virus and fibrosis progression in chronic and recurrent infection after liver transplantation. J Viral Hepat. 2006;13(2):104-15. [PubMed ID: 16436128]. https://doi.org/10.1111/j.1365-2893.2005.00670.x.

  • 49.

    Ansaldi F, Bruzzone B, Salmaso S, Rota MC, Durando P, Gasparini R, et al. Different seroprevalence and molecular epidemiology patterns of hepatitis C virus infection in Italy. J Med Virol. 2005;76(3):327-32. [PubMed ID: 15902713]. https://doi.org/10.1002/jmv.20376.

  • 50.

    Kchouk FH, Gorgi Y, Bouslama L, Sfar I, Ayari R, Khiri H, et al. Phylogenetic analysis of isolated HCV strains from Tunisian hemodialysis patients. Viral Immunol. 2013;26(1):40-8. [PubMed ID: 23374151]. https://doi.org/10.1089/vim.2012.0043.

  • 51.

    Khan A, Tanaka Y, Azam Z, Abbas Z, Kurbanov F, Saleem U, et al. Epidemic spread of hepatitis C virus genotype 3a and relation to high incidence of hepatocellular carcinoma in Pakistan. J Med Virol. 2009;81(7):1189-97. [PubMed ID: 19475617]. https://doi.org/10.1002/jmv.21466.

  • 52.

    Djebbi A, Mejri S, Thiers V, Triki H. Phylogenetic analysis of hepatitis C virus isolates from Tunisian patients. Eur J Epidemiol. 2004;19(6):555-62. [PubMed ID: 15330128]. https://doi.org/10.1023/B:EJEP.0000032348.83087.01.

  • 53.

    Blackard JT, Sherman KE. Hepatitis C virus coinfection and superinfection. J Infect Dis. 2007;195(4):519-24. [PubMed ID: 17230411]. https://doi.org/10.1086/510858.

  • 54.

    Idrees M, Riazuddin S. Frequency distribution of hepatitis C virus genotypes in different geographical regions of Pakistan and their possible routes of transmission. BMC Infect Dis. 2008;8:69. [PubMed ID: 18498666]. https://doi.org/10.1186/1471-2334-8-69.

  • 55.

    Hussain A, Idrees M. The first complete genome sequence of HCV-1a from Pakistan and a phylogenetic analysis with complete genomes from the rest of the world. Virol J. 2013;10:211. [PubMed ID: 23805872]. https://doi.org/10.1186/1743-422X-10-211.

  • 56.

    Alavian SM. Hepatitis C infection in Iran; A review article. Arch Clin Infect Dis. 2009;4(1):47-59.

  • 57.

    Kadjbaf D, Keshvari M, Alavian SM, Pouryasin A, Behnava B, Salimi S, et al. The Prevalence of Hepatitis C Virus Core Amino Acid 70 Substitution and Genotypes of Polymorphisms Near the IFNL3 Gene in Iranian Patients With Chronic Hepatitis C. Hepat Mon. 2016;16(6):e37011. [PubMed ID: 27630727]. https://doi.org/10.5812/hepatmon.37011.