Since its discovery, CRISPR systems have been primarily used for genome editing, stemming from the finding that Cas proteins can make precise DNA cuts. The type II Cas9 protein from
S. pyogenes was the first protein demonstrated to achieve this in a straightforward manner (
12). Efforts are underway to explore whether other CRISPR Cas9 protein types can be employed for gene editing, particularly with the emergence of Cas9 orthologs and SpyCas9 variants. Although type I CRISPR systems were the first to be thoroughly studied, their requirement for several proteins (at different stoichiometries) made their use in eukaryotic systems less appealing. The implementation of type I systems for editing in human cells is relatively recent, although type I systems are widely used in microbial engineering (
13). Another popular CRISPR system is Cas12a (formerly known as Cpf1), a component of type V systems, which offers several advantages over Cas9 proteins. Three Cas12a proteins have been described, including those from
Francisellanovicida (FnCas12a),
Acidaminococcus sp. (AsCas12a), and
Lachnospiraceae bacterium (LbCas12a). These proteins use T-rich PAM sequences such as TTN and TTTN, distinguishing them from Cas9 proteins that utilize G-rich PAM sequences. Unlike Cas9 proteins, which require tracrRNA, Cas12a proteins can process crRNA from pre-crRNA independently (
14). This feature has been utilized to enhance simultaneous genome editing at multiple loci by expressing several Cas12a-compatible crRNAs as a single pre-crRNA transcript, instead of multiple expression modules. However, the main advantage of Cas12a is its cleavage mechanism, producing staggered endpoints outside the critical seed region, unlike the blunt ends produced by most Cas9 proteins (
15). The research discussed involves conducting extensive CRISPR functional genomics studies that simultaneously alter the function of multiple genes to systematically identify genes that influence biological processes related to disease characteristics. Epigenetic editing, which can be highly specific and easily adjusted, allows for the exploration of functional genomics across the entire genome with high accuracy, helping to identify true positives and minimize false hits (
16).
The initial loss-of-function at the genome scale using dCas9 KRAB was the focus of CRISPR interference (CRISPRi) functional genomics investigations aimed at inhibiting the expression of protein-coding genes. These studies yielded expected results, including the identification of genes encoding ribosomal components, spliceosomes, and DNA replication machinery proteins, which are often essential for survival or cell proliferation (
17). The first genome-scale gain-of-function study using CRISPRa, in a functional genomics context, revealed key findings in a screen for cell survival and proliferation, such as the identification of tumor suppressor genes, factors that regulate gene expression and influence cell status, and mechanisms that control the cell cycle apparatus. Tumor suppressor genes must serve as predicted positive control genes for CRISPR-based screens conducted in proliferating cells. During CRISPR-based functional genomic screening, increased expression of tumor suppressor genes leads to cell death or inhibition of cell cycle progression, resulting in the deletion of sgRNAs targeting tumor suppressor genes (
18). Fusion genes are identified in solid tumor types, and due to their tumor-specific expression, they hold significant potential as therapeutic targets (
19). Despite the importance of fusion genes in cancer, it has been challenging to model chromosomal rearrangements. The echinoderm microtubule-associated protein-like 4 (EML4) and anaplastic lymphoma receptor tyrosine kinase (ALK) genes were connected by an intrachromosomal inversion, which was the first fusion gene identified in lung adenocarcinoma (
20). Two groups developed a CRISPR/Cas9-based technique to induce specific chromosomal rearrangements and generate an Eml4-Alk gene fusion in vivo. This was achieved by using Cas9 and two sgRNAs to target the intronic regions of Eml4 and Alk, enabling a concomitant DSB. The resulting tumors responded to crizotinib treatment, an approved ALK kinase inhibitor, making those models clinically relevant (
21). Another team explored the use of the same method to induce various chromosomal rearrangements in HEK293 cells, leading to the creation of novel fusion genes that play a key role in lung cancer. They demonstrated the generation of intrachromosomal inversions, both paracentric and pericentric, as well as the unexpected promotion of interchromosomal translocations. However, further research is required to determine how effectively such techniques can induce translocations in vivo (
22). Clustered regularly interspaced short palindromic repeat /Cas9 components have been delivered in vivo using recombinant adeno-associated viruses (AAV) and recombinant adenoviral viruses (AdV). Pooled CRISPR screening techniques enable large-scale, concurrent interrogation of many genotype-phenotype relationships within a single cell population. Briefly, titrated lentiviruses are used to introduce CRISPR/Cas9 libraries into a pool of cells. These cells can then be enriched for a specific phenotype, and NGS is used to identify the genes involved in that phenotype (
23).
The CRISPR reagents are delivered by lentiviruses, which integrate stably into the genome, allowing the phenotype to be linked to a specific disruption within a cell. As a result, sgRNAs or gene knockouts that influence the desired phenotype can be identified through sequencing of sgRNAs in downstream cells within a population that has been specifically targeted (
24). Modified CRISPR systems can precisely and reliably modify the transcriptome and epigenome. These technologies have significantly improved our ability to conduct experimental searches for genes that influence disease phenotypes (
25).