Henrietta Lacks could never have imagined that tiny, invisible parts of her would enthrall, empower, and perplex scientists for decades. In 1951, she developed a particularly aggressive cervical cancer, which unfortunately could not be treated – Henrietta died just a few months later. But her cells lived on, becoming arguably one of biology’s most valuable tools for decades to come.
Yet until now, our knowledge of the genetics of these so-called HeLa cells was rather hazy. I’m happy to be part of the team that has just published the first genome sequence of a HeLa cell line. (We were a little shocked by the amount of coverage our press release generated, even including a Nature News feature!) Here I’d like to explain why we did this and what we learned – and perhaps also importantly, what we did not learn.
(Part 2 of this post will be an interview with the study’s lead author, a colleague and friend of mine, Jonathan Landry. Stay tuned!)
Say you’re a biologist who knows that a mutation in gene x leads to disease y, but you want to figure out how. How do you study that process in your lab? (Well, you can use model organisms, which is a subject for a whole different post. But what if you really want to know what happens in humans?) As it turns out, you can’t very well line up a bunch of sick people and regularly hack pieces of their limbs off and throw bags of drugs at them to see what happens.
What you need is to find a way to take a sample of them that you can maintain in the lab. Something that you can run tests on, something you can keep in a controlled environment, something that will keep replenishing itself. Something immortal.
It was around the time that Henrietta was admitted to Johns Hopkins Hospital in 1951 that researchers were trying to figure out how to make such a thing – an immortal cell line – possible. Cancer cells make pretty attractive candidates: their defining problem is that they keep replicating themselves, because they’re missing the checkpoints that our normal cells have to prevent unlimited division. Henrietta’s doctors took a sample of her tumour, which was and still is standard practice for assessing a patient’s prognosis. What they didn’t tell her was that they kept aside some of that sample to try and generate a cell line (informed consent was nothing but a twinkle in a lawyer’s eye back then). In this case they weren’t necessarily interested in using Henrietta’s cells to study her cancer – they just wanted to see if they could keep some human cells alive in the lab. And her cells must have been pretty special, because they were the first ones in history that just refused to die.
This event opened up countless new avenues for the way research could be done. The discoveries made using Henrietta’s cells revolutionized biology. They yielded two Nobel prizes and the polio vaccine. HeLa cells were used to study cancer, dissect basic biological processes common to all cells, and develop new technologies to enable further discoveries. Even though countless other cell lines have been created since 1951, HeLa was established as the default and remains the most commonly used – over 60,000 studies using these cells have been published. Their contribution to science is immeasurable.
I would recommend that anyone even slightly interested in any of the above read The Immortal Life of Henrietta Lacks, a brilliantly written, international bestseller that weaves the heart-rending story of the Lacks family with the groundbreaking research that Henrietta’s cells enabled.
More recently, researchers have started using HeLa for studying gene function like the disease example I mentioned, just because it was so easy to work with in the lab. Researchers have known for decades that the HeLa genome is not quite normal, however, for two reasons:
- Cancer. HeLa cells were derived from a tumour. Cancer is a disease of the genome, which means that a cancer genome will be different from that of the patient. Genome instability is also a common feature of cancer, which means that as cancer cells replicate, their genomes keep on mutating.
- Long-term growth. Recipe for aberrance: Take cells with an unstable genome and grow them for decades. (Nowadays people freeze them, but at first no one knew that was possible.)
The level of our knowledge about the HeLa genome was extremely low resolution. For genetic studies, this is like having a map of the world when what you actually need is Google Street View. So people took the only human genome they had: the one sequenced by the Human Genome Project, commonly referred to as the ‘reference’ genome. Which is a bit like taking a map of the Orange County to navigate downtown Mumbai.
We stumbled on this limitation when my labmate Jonathan embarked on a Ph.D. project that used a HeLa cell line and required some pretty detailed genetic information (stay tuned for part 2 of this post to hear from him). Why don’t we take 6 months and just sequence the genome, said our boss. Then we can do this more accurately.
Of course, this turned out to be way more complicated than any of us imagined, which may not be unrelated to the fact that nobody had done it yet. Two years and a Ph.D. later, though, Jonathan and his team had fully sequenced and analyzed a HeLa genome and transcriptome.
Our primary goal was to provide a resource that other researchers using this HeLa cell line could benefit from. What we built, in addition to the DNA and RNA sequences, was a catalog of all of its variations relative to the reference genome. As the Nature News article so succinctly points out, this is one messed up genome. It could not have been more different from the reference. It’s like someone walked into a room of normal human genome with a pile of C-4, a bucket of Polyjuice Potion, and a really crappy photocopier and just went to town.
Before we get into the juicy details, let’s take note of one important point: we do not know where any of these genetic differences came from or what they do. (Calling them ‘errors’, as many news sources and Tweeters have done, isn’t really accurate.) They could have arisen during the decades of growth or during Henrietta’s cancer; they could have led to Henrietta’s cancer or just been part of what made her unique. Without Henrietta and her tumour’s genomes (whose appearance could not be less likely), we will unfortunately never be able to make these distinctions. All we have done here is describe what we found and make educated guesses about what it could mean. So let’s get to it.
Behold, the HeLa genome, in all its chromosomes. Its notable features are as follows:
- The inner oscillations between green and varying shades of red show how many copies there are (copy number, CN) of each segment of the genome. A normal human genome only has two copies and would thus be green throughout. So yup, most of this genome has three copies (often more) because a whole bunch of it has been amplified over time. (From this alone, I think we can safely conclude that no human could ever survive with this genome.)
- All that blue? The thousands of single-letter sequence differences relative to the reference. It’s important to point out that these differences are not necessarily mutations – many of them look like they might be causing some damage to the genes that they are part of, but many others may be harmless. The point of reporting them is that they are part of the Street View detail – designing and interpreting certain experiments could benefit from this information.
- The purple versus the pink? All the areas of the genome with only one version (not copy) of the sequence. Normal people usually have two different versions of every gene, which means there’s a backup in case one is faulty. All that purple means HeLa is lacking a lot of backup. This is also part of a cute trick that cancer does, called loss of heterozygosity (LOH), which allows the mutations that favour cancer to dominate. One of the purple regions on chromosome 11 has shown LOH in other cases of cervical cancer, so it might have had something to do with Henrietta’s.
- The spiderweb in the middle? Think shuffling a deck of cards: those lines connect the ‘normal’ locations of hundreds of genomic segments to where they’ve been relocated to in HeLa. The colours correspond to the sequencing techniques used to detect them, and the differences indicate that combining these techniques is probably a good idea if you want to catch everything.
- Remember the C-4 analogy? It looks like some of the chromosomes, for example 11, were blown apart and stuck back together in a random order. This is actually a signature of a ‘catastrophic’ phenomenon recently discovered in certain cancers called chromothripsis – fortunately for us, Jan Korbel, a chromothriptic (now that’s a word too!) expert, works at our institute and teamed up with us for this study. The question is when chromothripsis occurred in HeLa cells: during the cancer or during the long-term growth that followed? We know it occurs in cancer, but could it also occur in cell lines?
- Not depicted: our identification of human papillomavirus (HPV) insertion sites in the HeLa genome, which agree with previous studies. HPV is a sexually-transmitted virus that is known to cause cervical cancer (incidentally one of the Nobel-awarded discoveries using HeLa). According to the book I mentioned earlier, Henrietta was likely infected with HPV many times over due to her husband’s extramarital activities, which may well have made her cancer especially aggressive. Ugh.
- Also not depicted: our profile of how genes express themselves in HeLa. Studying HeLa gene expression with the reference genome can be problematic as per the OC-Mumbai analogy from earlier, so we also made a catalog of gene expression in the context of the correct genome sequence. For me, the most surprising aspect of this was that the expression of nearly 20,000 genes was not detected. Also, several DNA repair pathways were kicked into high gear – I’m guessing this is in response to the rampant genomic instability, but a lot of the components of these pathways have some pretty severe mutations in them, which begs the question of how well this response is actually working.
So in summary, while we always knew the HeLa genome was different from the reference, now we know the precise nature and extent of these differences.
Two things our study cannot tell you:
- Anything new about cancer (although previous studies using HeLa have). We can only surmise about which elements of this sequence were Henrietta’s, her cancer’s, or the lab’s. (For questions about cancer, it is much more direct and straightforward to sequence so-called ‘primary’ tumour samples – and ‘match’ them to the patient’s ‘normal’ genomes, to see what went wrong during cancer – before they get the chance to evolve during growth as a cell line.)
- What every HeLa cell has in its genome. We sequenced one of MANY HeLa cell lines. When Henrietta’s cells were first cultured, people were distributing them like YouTube links. This means that all of that long-term growth I described was happening in countless labs, worldwide, simultaneously, and they were each producing their own unique cocktail of mutations. The cell line we worked with was named Kyoto, and the only indication we ever found of where that name came from is that some guy from Kyoto sent it to somebody else. That’s about the extent of tracking that was in place at the time. And our idea of how greatly these HeLa cell lines differ genetically is approximately just as murky. But that’s a question for a whole different study.
Our take-home message of choice: researchers now have a genomic resource for HeLa cells. Hopefully people can use it to improve the way they design genetic or genomic studies using HeLa. Those who feel ambitious can also use our resources to reinterpret old data (oh wait, that was actually a right turn on Marine Drive?).
Now that we know just how abnormal this genome really is, might it be time to reconsider how we use HeLa to model human biology? I wonder, for example: does it make sense to study the function of genes in cells where they are mutated to a seemingly crippling extent or not even expressed to begin with? Should we study process x in a cell line that probably really sucks at that process? Food for thought.
It’s 11 o’clock. Do you know what your cell line genome is? Cell lines are absolutely indispensable for studying human biology, but they also usually come with a free dose of genomic instability. I think our study emphasizes just how vastly that can affect what you’re working with. Look at chromothripsis, for example: if chromosomes can just spontaneously shatter in cell lines, then that could complicate things oh-so-slightly. So the moral of the story is: Sequence your cell lines! This might even become a standard way to validate them over time. Our study shows that at least one-off sequencing of aberrant genomes is feasible, and illustrates some of the associated analytical challenges.
And the final important take-home, which I had to frame as a mini-tirade due to some online interpretations of our study:
Neither I nor any of my co-authors intend to detract from the significant discoveries and technological breakthroughs that HeLa cells have enabled. As I mentioned, researchers have been aware of HeLa’s abnormal genome for a long time, but there were good reasons to use it anyway. Even now, there are certainly types of experiments where HeLa might be the best model to use. So please don’t mistake (ahem) our findings to mean that using this cell line is stupid and we should toss out those 60,000 papers and revoke those Nobel prizes, because that’s not what they mean at all. All we hope they do is inform the use of HeLa in future studies.
I want to close by paying tribute to Henrietta Lacks for her endless, albeit unknowing, contributions to science. Her cells inherited 4 letters from her name, and many more from her genome. No matter how we continue to use these cells, their status in biological research is nothing short of immortal.
Source article (open-access): Landry J.*, Pyl P.T.*, et al. The genomic and transcriptomic landscape of a HeLa cell line. Genes | Genomes | Genetics (published ahead of print March 11, 2013)
Source press release: Havoc in biology’s most-used cell line: Genome of HeLa cells sequenced for the first time (hat tip to Adam Gristwood and Isabelle Kling for their help here)