Molecular Biology 19: 'Gene regulation V - nucleosomes and enhancers'
These are my notes from lecture 19 in Harvard’s BCMP 200: Molecular Biology course, delivered by Timur Yusufzai on October 27, 2014.
Take home messages for today
- “Transcription occurs in the context of chromatin.”
- The default state of your genome is OFF.†
†This is in contrast to bacteria, where the default state is ON. In eukaryotes, turning on a gene requries overcoming the nucleosomal barriers.
Facts about nucleosomes, enhancers and gene regulation
When there are nucleosomes at a promter, the gene is OFF. A nucleosome-free promoter is ON, and is not necessarily protein-free - other proteins may be bound there.
How do nucleosomes along the gene affect transcription elongation? Elongation is facilitated by chromatin remodeling factors, and histone modifications.
ChIP-seq on RNA pol II and histones on an inducible galactose-response gene has demonstrated that sometimes nucleosomes go away when a gene becomes actively transcribed [Schwabish & Struhl 2004]. (This turns out not to be true of all genes). Another experiment used heat shock to induce Hsp70. RNA pol II takes two minutes after heat shock to reach the end of the gene. Yet with MNase digestion + qPCR you can see nucleosomes late in the gene disappearing even before RNA pol II would reach there, thus, chromatin remodeling runs ahead of RNA pol II [Petesch & Lis 2008]. This turns out to involve ADP ribosylation and histone acetylation [Petesch & Lis 2012].
H3K27me3 is a repressive mark. H3K27ac, H3K36me3 and H3K4me3 are associated with active genes.
The readers of H3K36me3 appear to be ISWI and CHD1, both of which are histone remodelers.
Many genes have some basal level of constitutive transcription, but need additional factors to become activated and achieve higher transcription.
CREB, Sp1 and C/EBP which bind in the promoter recruit chromatin remodelers to work downstream.
Some transcription factors recruit “co-activators” which are usually histone acetyltransferases (HATs) such as p300 HAT.
Enhancers are segments of DNA. It was shown that enhancers are usually H3K4me1 but not H3K4me3, in contrast to promoters which are usually H3K4me3 and not H3K4me1 [Creyghton 2010]. Though this association is probably true and is still widely accepted, there is also a bit of circular reasoning in there, as H3K4me1 is used to identify enhancers, and then enhancers are shown to be enriched for H3K4me1.
Sometimes there are super enhancers which greatly increase transcription. These super enhancers are just bigger than regular enhancers and have more sites to bind factors that will help activate transcription.
It is not known how signals get from the enhancer to the promoter, though models abound. In yeast, enhancers and promoters are close, so people have proposed that a bunch of proteins just bind until they reach all the way to the promoter. But in mammals, sometimes enhancers can be megabases away, so there has to be some other mechanism. One idea is “tracking” - a protein binds the enhancer and then searches downstream until it finds the promoter. Another proposed mechanism is “looping”, where the enhancer and promoter are close together in three-dimensional space even if separated by megabases of sequence. You can test the “looping” model by chromosome conformation capture [Dekker 2002] in which you cross-link proximate DNA pieces to each other and then ligate, then perform sequencing to see what was next to what. This method has indeed given evidence for enhancers and promoters being physically together for some genes [Kagey 2010].
An enormous (22 subunits, total 1.0 MDa) complex called Mediator mediates all the signals incoming to the promoter from various activators and stuff. All RNA pol II-dependent genes require Mediator for their expression. Surprisingly, although Mediator is essential for life, there are deletions and missense mutations of various subunits which cause human diseases [reviewed in Yin & Wang 2014].
Ways of detecting changes in RNA expression
Northern blots are pretty easy to set up and easy to quantify results. They work for long and short RNAs, and can detect splice variants. But it requires radioactivity (at least for best results), is not as sensitive as other methods, and takes a long time.
Primer extension reveals the start site of the transcript. But it also requires radioactivity, plus a sequencing gel, and has low sensitivity.
qPCR is highly automatable, quick and labor-saving if you are doing a lot of samples. Downsides are you must carefully select primers to be specific for the transcript of interest (or else rule out other PCR products by other means later on), you have to run a standard curve every time in order to quantify results, and you need to buy a qPCR machine.
More facts about enhancers and gene regulation
The H19 locus is imprinted - only the maternal copy is expressed. Only mammals have imprinting. It is accomplished by DNA methylation. Imprinting is reviewed in [Reik & Walter 2001]. The current model is that imprinting arose as a battle for limited resources. In some mammals, one litter will contain half-siblings from multiple fathers. So a theory is that each father wants to help its offspring outcompete other fathers’ offspring, while the mother (because all of the offspring are hers) wants to silence the various fathers’ genes.
An example is that a male tiger and female lion yield a tigon (smaller than either a lion or tiger) while a male lion and female tiger yield a liger (larger than either a lion or tiger). This is believed to be because lions have strong imprinting signals and tigers have weak imprinting signals. So the thought is that the male tiger’s imprinting gets overruled by the female lion in the tigon, whereas the female tiger’s imprints get overruled by the male lion in the liger.
In the maternal chromosome, CTCF binds unmethylated promoters of H19, while in the paternal chromosome CTCF cannot bind the methylated promoter. This signals downstream through FoxA and Igf2 to control gene expression.
There are also things called insulators. Sometimes an enhancer sits between two genes that each point away from the center, and if you put an insulator in between the enhancer and one gene, it will turn off that gene. An example of this has been studied with a two-enhancer reporter gene in Drosophila embryos [Cai & Shen 2001].
Transgenes sometimes get integrated into a random piece of heterochromatin, which gradually spreads its heterochromatin character over the transgene, turning it off. Thus you only get transient expression. To make sure a transgene is expressed you can put insulators before it to prevent the encroachment of heterochromatin marks. This isn’t a big deal for transgenetic studies in the lab because you can just use selection (say, with antibiotic resistance genes) to pick clones that have the transgene expressed. But for gene therapy, this idea of using an insulator to ensure gene expression is really important, and there are a bunch of patents on it.