Cell Biology 03: Targeting proteins to organelles
These are notes from Harvard Extension’s Cell Biology course. This lecture and the next one will focus on the organelles (nucleus and mitochondria today, and the ‘secretory pathway‘ of ER and Golgi next week). There exist stains specific to each organelle, and you can use these to see where each organelle is within the cell. A major focus of today will be how proteins (and other molecules) move between the cytosol and organelles.
The Nucleus
DNA replication and DNA -> RNA transcription occur in the nucleus. The nucleus holds the DNA and keeps it organized throughout the cell’s life cycle. Its organization has some analogues to the whole cell’s organization (a membrane, skeleton(s), pores). Here are some components of the nucleus:
- Nuclear envelope. This is a double membrane – two lipid bilayers – surrounding the nucleus. It is relatively impermeable – all movement in and out of the nucleus is tightly regulated. The nuclear pores (see below) that regulate this movement must span the inner and outer nuclear memrbanes. The outer membrane is continuous with the ER membrane. The inner membrane has specialized surface proteins that allow attachment of lamina and matrix proteins.
- Nuclear lamina. A supportive ‘mesh’ inside the nucleus made of fibrillar proteins. It has roles in organizing chromatin, regulating DNA replication and cell division, and anchoring the nuclear pore complexes (see below).
- Nuclear matrix. This is another supportive fibrous network found inside the nucleus, distinct from the lamina (above). Its function is less well understood.
- Chromosomal domains (chromatin). DNA comes in the form of chromatin: condensed DNA wrapped tightly around histones. The most actively transcribed chromosomes tend to be located towards the center of the nucleus, with less active chromosomes near the periphery.
- Nuclear pore complex (NPC). This is a huge complex of many proteins that make up a pore spanning the inner and outer nuclear membranes. Bottom (on the nuclear side) is called the nuclear basket, and it actually looks like a basket on electron micrographs.
Let’s learn more about the NPC (image thanks to Wikimedia Commons user Mike Jones):
The average vertebrate cell has 2000 NPCs. NPCs weigh in among the cell’s largest protein complexes, at 60-80 MDa – 16x larger than a ribosome. It is made of 30 different proteins called nucleoporins (nups). The central aperture is aqueous and large enough for ions, metabolites and proteins up to 40 kDa to simply pass through by passive diffusion. Proteins too large to diffuse through passively must move through via a Brownian gate model, which is like facilitated diffusion plus gated transport dependent on ATP.
The NPC’s structure includes the ‘basket’ portion on the nucleoplasm side, which is joined by filaments and attached to the lamina. There are also cytoplasmic filaments on the outside. Some of the nucleoporins have hydrophobic parts rich in phenylalanine (F) and glycine (G), accordingly called FG nups. The FG nups help to regulate import and export as we’ll see shortly. There are also structural (aka ‘symmetric’) nups embedded in the nuclear envelope which form a scaffold for the whole complex. Y-complexes (16 of them per pore) form the basic structure scaffold
Nuclear import and export
Obviously, many proteins are needed in the nucleus – polymerases, helicases (used in DNA replication), histones, transcription factors, the list goes on. All of these are translated outside of the nucleus and therefore have to get back in somehow. And of course, mRNAs and tRNAs and some ribosomal subunits are transcribed in the nucleus and must exit into the cytosol. Still other proteins go back and forth in both directions.
Nuclear import and export depend on a host of different actors: importins or exportins respectively; Ran; Ran helper proteins GEF and GAP; GTP as an energy source; and the FG nups in the nuclear pore complex. Let’s learn how it works.
The entire import and export processes are simple and elegant as long as you keep your eye on the energy and not on the horrid acronyms. Nuclear import/export is powered by GTP, the “other” energy currency of the cell. If ATP is $1 of value on a prepaid debit card, GTP is sort of like a $1 Starbucks gift card: same nominal value, but only usable in certain transactions. Like ATP to ADP, GTP is hydrolyzed to GDP to release energy.
Nuclear import relies on an importin protein. Importin floats around in the cytosol and binds to a nuclear localization signal (NLSs) on a ‘cargo’ protein to be imported. There are a whole bunch of different importins that recognize a whole bunch of different NLSs. NLS tend to be rich in K & R, and one popular NLS motif looks like PKKKRKV. This step of importin binding the NLS is energetically downhill – think of importin as a set mouse trap, ready to spring shut. Importins are (usually) actually heterodimeric complexes of two different proteins – an alpha subunit and a beta subunit, with alpha being the one that binds the NLS and beta being the one that interacts with the FG nups to get through the nuclear pore. (It is believed that beta can also act alone, though we don’t understand how.)
Once importin has bound to its cargo, it interacts with the FG nups in the pore in order to get through into the nucleus. Once there it encounters Ran bound to a molecule of GTP (often called Ran-GTP). When, and only when, Ran is ‘loaded’ with GTP, it has an affinity for binding to importin. This, too, is energetically downhill – like a mouse trap snapping shut. When Ran-GTP binds to importin, it causes a conformational change that makes importin release its cargo. Voila: the cargo has been imported. But we’ve only engaged in downhill reactions thus far, so clearly for Ran and importin to be able to do this again, energy has to be expended somewhere.
Ran-GTP bound to importin has a tendency to diffuse back out of the nucleus, where it encounters GAP (GTPase-activating protein). Ran is a GTPase – it is capable of hydrolyzing GTP – but it doesn’t have the inclination to do so on its own; it needs GAP to stimulate it. GAP causes Ran to ‘spend’ its GTP, using the energy to induce the energetically uphill conformational change that releases importin. Now importin is free in the cytosol again and ready to import another cargo molecule. Having spent its GTP, Ran is now Ran-GDP, which moves back into the nucleus where it encounters GEF (guanine nucleotide exchange factor), which removes the GDP and re-loads Ran with a new GTP. Now Ran is ready to bind to the next importin/cargo complex that wanders in.
Note that this whole process keeps concentrations of importin high in the cytosol and low in the nucleus, and concentrations of Ran-GTP (as opposed to Ran-GDP) high in the nucleus and low in the cytosol. This gradient allows passive diffusion to power the largely unidirectional travel of the import cycle. (Although that may be a slight simplification: Wikipedia says there are some other proteins involved in the transport process).
Export is incredibly similar to import. Exportin binds proteins inside the nucleus that have a nuclear export signal (NES) – apparently a popular sequence motif for an NES is LXXXLXXLXL. Exportin has an affinity for Ran-GTP and forms a trimolecular complex of exportin/Ran-GTP/cargo. The trimolecular complex has a higher affinity for FG nups than exportin alone does, thus driving export. The complex moves through the pore and Ran-GTP encounters GAP which promotes burning GTP which provides the energy to dissociate the whole trimolecular complex. Now the Ran-GDP wanders back into the nucleus, where GEF re-loads it with GTP making it Ran-GTP.
Notice that the energetics of export are identical to those of import. Binding to cargo is downhill, association with Ran-GTP is downhill, and the dissociation from Ran-GTP is uphill, requiring the GTP to be spent. The difference that makes importin import and exportin export is that importin releases its cargo when bound by Ran-GTP, and exportin releases its cargo when released by Ran-GTP.
Aside: the above explanation of exportin is only for export of proteins. mRNA export does not use Ran-GTP, instead it uses NXF1 and NXT1 and is powered by ATP using an ATPase called Dbn5.
This video shows a protein NFAT whose import/export is regulated by calcium concentrations:
Above we talked about nuclear localization signals and nuclear export signals. These are just two examples of signal peptides. Proteins can also have localization signals specific to mitochondria or to the secretory pathway (ER + Golgi). This signal is likened to a ‘zip code’. Signal sequences are stretches of amino acids usually found in the beginning (i.e. N terminus) of proteins. Signal peptides work by having chemical properties determined simply by their amino acid sequence which offer binding specificity for transport receptors such as importin. For NLS, the signals stay as part of the protein, whereas, say, secretory pathway signals are cleaved off in post-translational modification. (The first 22 amino acids of PrP: MANLGCWMLVLFVATWSDLGLC are a secretory pathway localization sequence which is cleaved off in post-translational modification).
Proteins found in both the nucleus and cytoplasm may need both an NLS and an NES – maybe. The proteins are translated in the cytoplasm so depending on the relative concentrations needed, it may spend enough time in the cytoplasm before nuclear import and could meet its needs without an NES.
Digitonin (which, randomly, was a hit in UCSF’s antiprion compound screen) is a detergent that permabilizes the cytoplasmic membrane but leaves the nuclear envelope intact. This makes it possible to isolate the nucleus in the lab. Once you’ve done that you can test to see if a protein of interest gets imported into the nucleus. (Though, obviously, the import machinery such as importin still need to be available).
Mitochondria
(Image thanks to Wikimedia Commons user Wikigraphists)
Mitochondria also have a double bilayer membrane. The outer membrane (2) is a simple bilayer permeable to ions and small molecules. The inner membrane (1) is totally impermeable to all molecules of any size, except through carrier channels. The inner membrane is convoluted – it has a complex shape kind of like the surface of the cerebral cortex, which gives it extra surface area. That extra surface area allows more capacity for the electron transport chain. The space in between the two membranes is called the intermembrane space (the deep grooves are cristae (3)) and is where ATP is produced. The space inside the inner membrane is the mitochondrial matrix (4).
Mitochondria harness energy via oxidation of glucose to pyruvate to produce ATP using the citric acid cycle (Krebs cycle) and electron transport chain. This process consumes oxygen and releases carbon dioxide, thus it’s often called cellular respiration. Here is a video of it:
Inside the inner membrane, in the mitochondrial matirx, is where the mitochondrial chromosome lives, along with proteins for mitochondrial division and a few other things. Although the mitochondria have their own ribosomes and tRNAs, they aren’t totally independent. In fact, the majority of proteins required for mitochondrial replication are encoded in the nucleus. These nuclear-encoded proteins are translated in the cytoplasm and shipped to the mitochondria. This requires an import/export system.
The signal for mitochondrial localization is part sequence, part structure. It’s an amphipathic alpha helix, with R & K (charged amino acids) on one side and hydrophobic amino acids on the other. This general characteristic is more important than any specific sequence. The signal sequence is cleaved off after entry into the mitochondrial matrix – until this happens, the proteins are considered ‘precursor proteins’.
This requires membrane receptors in the mitochondria, called translocons. There are translocons of the outer membrane (Toms) and translocons of the inner membrane (Tims). Transport into the mitochondrial matrix occurs only in places where the inner and outer membranes are close together. Chaperones like Hsc70 (gene: HSPA8) keep the precursor proteins in an unfolded state – i.e. just a linear chain of amino acids – which is energetically unfavorable and requires expenditure of ATP – but makes them small enough, radius-wise, to fit through the import pores. Hsc70 is often depicted as a Pac-man with the mouth as a generic binding pocket for polypeptides. ATP closes the mouth, which clinches the precursor protein to keep it from folding. Of course, it can still form some simple secondary structures – such as the signal alpha helix required for mitochondrial localization. But Hsc70 keeps it from forming large bulky tertiary or quaternary structures.
Tom20/22 or 70/22 receptors in the outer membrane receive Hsc70, and Tom40 comprises the actual pore itself. Tim 23/17 and 44 in the inner membrane. These stand for translocon of inner/outer membrane. These co-located at ‘contact sites’ where the inner and outer membrane are close together, and the transport through both layers occurs simultaneously, i.e. in one swoop. The matrix has its own Hsc70 proteins which help pull the incoming precursor protein in and keep it unfolded until import is complete, which again requires ATP. Once import is complete, a matrix protease (‘signal peptidase’) cleaves off the signal sequence and then the finished protein folds. We watched this video in class:
But I’m actually more fond of this video:
That description (and both of those videos) depicted the life of a protein destined to exist inside the mitochondrial matrix. But there are three other places a protein might need to end up: embedded in the inner membrane, floating in the intermembrane space, or embedded in the outer membrane. (For instance, the Toms themselves need to follow a path to get embedded in the outer membrane).
To get embedded in the inner membrane, there are three paths available.
Path A: Some proteins have a matrix targeting sequence and an internal stop-transfer sequence – a hydrophobic sequence recognized by the Tim23/17 channel – which causes Tim23/17 to move that sequence out sideways into the membrane (mechanism unknown), where it becomes a transmembrane protein lodged in the inner membrane.
Path B: Other proteins have a matrix targeting sequence plus a different hydrophobic domain recognized by Oxa1. These proteins make it all the way into the matrix but then find their way back into the inner membrane to become transmembrane proteins.
Path C: Still other proteins do not have a matrix targeting sequence, yet are somehow recognized by a bunch of other Tims, and end up transmembrane. (tl;dr: basically we know nothing about Path C).
There are also 2 different paths to the intermembrane space.
Path A requires a matrix targeting sequence and an intermembrane-space-targeting sequence; the latter is recognized by Tim23/17 which move it laterally into the membrane just like in the trans-inner-membrane Path A, but then the protein is cleaved, leaving only the part upstream of the intermembrane-space-targeting sequence floating in the intermembrane space.
Path B’ is simple – some proteins just have a sequence which is recognized by the Toms but not the Tims.
We don’t know very much about the path to becoming an outer membrane transmembrane protein such as Tom40 itself, and its interacting proteins. We know that proteins bound for the outer membrane interact with Tom40 which transfers them to the SAM (sorting and assembling machinery) complex comprised of 3 proteins which (by an unknown mechanism) insert them into the membrane. The outermembrane proteins are usually quite hydrophobic, which probably drives the transfer from Tom40 into the membrane, but the exact mechanism is not known.
Nuclear Pore Complex Deterioriation
In section we discussed D’Angelo 2009. This paper showed that the scaffold proteins – a crucial part of the nuclear pore complex discussed above – are produced only when cells are dividing and then have to last the entire lifetime of a cell – indeed, the entire lifetime of an organism, in post-mitotic cells. So in your neurons, these critical components of the nuclear pore complex never get replaced or refreshed as long as you live. They’re pretty durable, but they do undergo oxidative damage over time, and eventually this leads to the deterioration of the whole pore complex, to the point where the pore loses its selectivity and all sorts of large molecules in the cytosol can just diffuse into the nucleus.
This might have some pretty bad implications. Cytosolic proteins getting into the nucleus through damaged pores might then contribute to more oxidative damage, creating a sort of vicious cycle of aging. The authors speculate that evolution simply never figured out a way to disassemble and replace the pore complexes – or just replace their scaffold constituents – without opening up the nuclear envelope.
I finally got straight on one distinction of methods in molecular biology. When people put GFP under the promoter of a gene of interest, that’s a proxy for whether the gene of interest gets transcribed. When people put the original gene + GFP (a GFP fusion protein) under that promoter, they’re checking whether the gene of interest gets translated.
There are some crazy methods in this paper too – new to me at least. They use RNAi against RNAi. They put GFP under the promoter of their gene of interest and then used RNAi against GFP until the GFP was gone to wipe out the existing GFP. Then to see if GFP was still being actively expressed they had to stop RNAi, which they did by using RNAi against Dicer, a protein required for RNAi.
Huntingtin
Huntingtin doesn’t have a nuclear localization signal, but it does have an evolutionarily conserved nuclear export signal [Xia 2003 (ft)]. That’s kind of peculiar, right? It’s translated in the cytosol, so why would it need an export signal if it has no way of getting into the nucleus in the first place? Ah, but somehow polyQ proteins do have a way of getting into the nucleus, especially when the polyQ tract is extra long [see for instance Wheeler 2000 (ft)]. We don’t understand it well, and it’s a big subject of research because mutant huntingtin fragments are often found in the nucleus and some people think this nuclear localization might be required for Huntington’s Disease to cause its toxicity.
PrP
PrP goes through the secretory pathway so next week’s lecture should be more relevant to it. When I Googled ‘PrP nuclear localization’ and ‘PrP mitochondrial localization’ I did find a couple of reports of PrPSc being able to get into the nucleus and maybe having a ‘cryptic’ NLS [Gu 2003, Jaegly 1998] or being associated with mitochondrial membrane lipid rafts [Sorice 2012] but I am not sure how well-accepted either of these ideas are.