Earlier this year, MIT researchers developed a way to easily and efficiently edit the genomes of living cells. Now, the researchers have discovered key factors that influence the accuracy of the system, an important step toward making it safer for potential use in humans, says Feng Zhang, leader of the research team.
With this technology, scientists can deliver or disrupt multiple genes at once, raising the possibility of treating human disease by targeting malfunctioning genes. To help with that process, Zhang’s team, led by graduate students Patrick Hsu and David Scott, has now created a computer model that can identify the best genetic sequences to target a given gene.
“Using this, you will be able to identify ways to target almost every gene. Within every gene, there are hundreds of locations that can be edited, and this will help researchers narrow down which ones are better than others,” says Zhang, the W.M. Keck Assistant Professor in Biomedical Engineering at MIT and senior author of a paper describing the new model, appearing in the July 21 online edition of Nature Biotechnology.
The genome-editing system, known as CRISPR, exploits a protein-RNA complex that bacteria use to defend themselves from infection. The complex includes short RNA sequences bound to an enzyme called Cas9, which slices DNA. These RNA sequences are designed to target specific locations in the genome; when they encounter a match, Cas9 cuts the DNA.
This approach can be used either to disrupt the function of a gene or to replace it with a new one. To replace the gene, the researchers must also add a DNA template for the new gene, which would be copied into the genome after the DNA is cut.
This technique offers a much faster and more efficient way to create transgenic mice, which are often used to study human disease. Current methods for creating such mice require adding small pieces of DNA to mouse embryonic cells. However, the process is inefficient and time-consuming.
With CRISPR, many genes are edited at once, and the entire process can be done in three weeks, says Zhang, who is a core member of the Broad Institute and MIT’s McGovern Institute for Brain Research. The system can also be used to create genetically modified cell lines for lab experiments much more efficiently.
Since Zhang and his colleagues first described the original system in January, more than 2,000 labs around the world have started using the system to generate their own genetically modified cell lines or animals. In the new paper, the researchers describe improvements in both the efficiency and accuracy of gene editing.
To modify genes using this system, an RNA “guide strand” complementary to a 20-base-pair sequence of targeted DNA is delivered to cells. After the RNA strand binds to the target DNA, it recruits the Cas9 enzyme, which snips the DNA in the correct location.
The researchers discovered they could minimize the chances of the Cas9-RNA complex accidentally cleaving the wrong site by making sure the target sequence is not too similar to other sequences found in the genome. They found that if an off-target sequence differs from the target sequence by three or fewer base pairs, the editing complex will likely also cleave that sequence, which could have deleterious effects for the cell.
The team’s new computer model can search any sequence within the mouse or human genome and identify 20-base-pair sequences within that region that have the least overlap with sequences elsewhere in the genome.
Another way to improve targeting specificity is by adjusting the dosage of the guide RNA, the researchers found. In general, decreasing the amount of RNA delivered minimizes damage to off-target sites but has a much smaller effect on cleavage of the target sequence. For each sequence, the “sweet spot” with the best balance of high on-target effects and low off-target effects can be calculated, Zhang says.
“The real value of this paper is that it does a very comprehensive and systematic analysis to understand the causes of off-target effects. That analysis suggests a lot of possible ways to eliminate or reduce off-target effects,” says Michael Terns, a professor of biochemistry and molecular biology at the University of Georgia who was not part of the research team.
Zhang and his colleagues also optimized the structure of the RNA guide needed for efficient activation of Cas9. In the January paper describing the original system, the researchers found that two separate RNA strands working together — one that binds to the target DNA and another that recruits Cas9 — produced better results than when those two strands were fused together before delivery. However, in experiments reported in the new paper, the researchers found that they could boost the efficiency of the fused RNA strand by making the strand longer. These longer RNA guide strands include a hairpin structure that may stabilize the molecules and help them interact with Cas9, Zhang says.
Zhang’s team is now working on further improving the specificity of the system, and plans to start generating cell lines and animals that could be used to study how the brain develops and builds neural circuits. By disrupting genes known to be involved in those processes, they can learn more about how they work and how they are impaired in neurological disease.
The research was funded by a National Institutes of Health Director’s Pioneer Award; an NIH Transformative R01 grant; the Keck, McKnight, Damon Runyan, Searle Scholars, Klingenstein and Simons foundations; Bob Metcalfe; and Jane Pauley.