Transgenerational Epigenetic Instability as a Source of Novel Methylation Variants Matthew D. Schultz1,2,3, Robert J. Schmitz1,2, Mathew G. Lewsey1,2, Ronan C. O’Malley2, Mark A. Urich1,2, Ondrej Libiger4, Nicholas J. Schork4, Joseph R. Ecker1,2,5 1Plant Biology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA. 2Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA. 3Bioinformatics Program, University of California at San Diego, La Jolla, CA 92093, USA. 4The Scripps Translational Science Institute and the Department of Molecular and Experimental Medicine, The Scripps Research Institute, La Jolla, CA 92037, USA. 5Howard Hughes Medical Institute, The Salk Institute for Biological Studies, 10010 North Torrey Pines Road, La Jolla, CA 92037, USA.
Motivation and Background
Single Methylation Polymorphisms
Methylome Sequencing
At4g14570
15.87% 1.14% 0.21% 0.05% 0.02% 0.01% 82.70%
1 rep1 1 rep2 19 rep1 19 rep2 12 rep1 12 rep2 29 rep1 29 rep2 49 rep1 49 rep2 59 rep1 59 rep2 69 rep1 69 rep2 119 rep1 119 rep2
0 Discordant descendant line 1 Discordant descendant lines 2 Discordant descendant lines 3 Discordant descendant lines 4 Discordant descendant lines 5 Discordant descendant lines Total unmethylated CG
Bisulfite Treatment
Intergenic n = 21
1
12
19
rep 1 rep 2
rep 1 rep 2
rep 1 rep 2
SMPs Diverge Over Time Ancestor vs Ancestor
Ancestor vs Descendant
Descedant vs Descendant
At5g24250
-1
0
1
2
3
4
5
6
7
8
0
1
1
19
19
12
12
29
29
49
49
59
59
69
69
119
119
2
4
6
8
10
12
14
16
24nt 23nt 22nt 21nt
We expanded our search to look for differentially methylated regions and found 72 regions. Interestingly, 14 of these regions fell within protein-coding genes, and we looked at the correlation between methylation, expression, and small RNAs, which are known to help direct DNA methylation. As can be seen above, the absence of methylation correlates with the expression of this gene and with a loss of 24-nucleotide small RNAs. This result supports the idea that these epigenetic changes can have an effect on gene production and potentially an individual.
1
12
es t
An c
19
9
or
s
1
s nt da en
sc
De
For this study, we took advantage of an existing population of plants that had been highly inbred and is consequently very genetically similar. We sequenced the methylomes of 5 individuals that were 31 generations removed from the original founder of this population and 3 individuals that were only 3 generations removed from the founder. To account for technical variability, we also sequenced a biological replicate of each of these samples which were highly similar.
nt s
en da
De sc
Conclusions
In the design of this experiment, we have tried to account for genetic and environmental variability that could have led to the changes we observed. Arabidopsis thaliana offered us a unique opportunity to show convincing evidence that it is possible for these modifications to vary across generations independent of 0 20000 50000 genetic changes. Furthermore, we were able to find examples of # of dissimilar SMPs these changes that had an effect on the transcriptional output When we compared the SMPs from different individuals we found a striking of some genes. Although no major phenotypic changes have ever been observed in this population of plants, our results pattern. The more generations that separated two individuals the more the methylation patterns at SMP sites differed. Above, dark red indicates more dif- leave open the intriguing possibility that these changes could be acted upon by evolution. ferences. 49
Descendants
11
rep 1 rep 2
69
rep 1 rep 2
59
rep 1 rep 2
29
rep 1 rep 2
49
rep 1 rep 2
12
119
69
69
59
59
29
49
s
29
or
st ce An
The epigenetic modification that we chose to focus on in this study is DNA methylation, which is the addition of a methyl group to cytosines in DNA. To measure this modification, we take advantage of a chemical known as bisulfite, which will convert unmethylated cytosines to thymines and leave methylated cytosines (red Cs above) unconverted. After this conversion, any remaining cytosines detected by a DNA sequencing platform should be methylated. We refer to the sum total of these methylated cytosines as the methylome.
1 rep1 1 rep2 19 rep1 19 rep2 12 rep1 12 rep2 29 rep1 29 rep2 49 rep1 49 rep2 59 rep1 59 rep2 69 rep1 69 rep2 119 rep1 119 rep2
smRNA levels at At5g24240 C-DMR (RPKCMs)
9
High Throughput Sequencing
11
19
Ancestors
Log2 fold change in mRNA levels of At5g24240 (relative to line 1)
At5g24240
AUTGGAGTCGATGCTUUAA ATTGGAGTCGATGCTTTAA
Transposons n = 27
Genes n = 14
Total unmethylated CG dinucleotides in the genome
PCR Amplification
Pseudogene n=1
Promoters n=7
Invariable mCGs in the descendant population
With these methylome data, we were able to ask how different the descendants were from the original founder. We called these differences single methylation polymorphisms (SMPs). Although most of the cytosines in all of our individuals were either completely methylated or unmethylated, a fraction of them varied between individuals. In the left panel above, gold bars represent sites of methylation.
Col-0
ncRNAs n=2
Ancestors
Descendants
Ancestors
SMP SMP
Mutation Accumulation Population
ACTGGAGTCGATGCTTCAA
Total SMPs in the descendant population
Descendants
Every cell that makes up an individual has the exact same genome and yet that individual is made up of a whole host of different cells. To achieve this variety of cell types, organisms utilize a variety of chemical modifications to DNA, which can be thought of as dials that selectively turn up or down how much of that gene’s product is produced. For example, some of the genes that are turned on in heart cells will not be turned on in liver cells and vice versa. Consequently, when a heart cell divides, it needs to send a signal to its daughter cells about which genes should be on and off. The term epigenetics is used to describe the inheritance of these kinds of non-genetic modifications. An open question is whether or not changes in these modifications can vary over time like mutations in DNA. Undoubtedly, if one picked any of these modifications and compared them across two individuals there would be many differences. What makes this problem challenging is disentangling those differences from genetic and environmental effects that could have also caused them. To overcome these challenges in an attempt to answer this fundamental question, we turned to the model plant Arabidopsis thaliana and DNA sequencing technology.
Differentially Methylated Regions
Color Ke y