IGERT Poster 2012

Report 1 Downloads 98 Views
Transgenerational Epigenetic Instability as a Source of Novel Methylation Variants Matthew D. Schultz1,2,3, Robert J. Schmitz1,2, Mathew G. Lewsey1,2, Ronan C. O’Malley2, Mark A. Urich1,2, Ondrej Libiger4, Nicholas J. Schork4, Joseph R. Ecker1,2,5 1Plant Biology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA. 2Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA. 3Bioinformatics Program, University of California at San Diego, La Jolla, CA 92093, USA. 4The Scripps Translational Science Institute and the Department of Molecular and Experimental Medicine, The Scripps Research Institute, La Jolla, CA 92037, USA. 5Howard Hughes Medical Institute, The Salk Institute for Biological Studies, 10010 North Torrey Pines Road, La Jolla, CA 92037, USA.

Motivation and Background

Single Methylation Polymorphisms

Methylome Sequencing

At4g14570

15.87% 1.14% 0.21% 0.05% 0.02% 0.01% 82.70%

1 rep1 1 rep2 19 rep1 19 rep2 12 rep1 12 rep2 29 rep1 29 rep2 49 rep1 49 rep2 59 rep1 59 rep2 69 rep1 69 rep2 119 rep1 119 rep2

0 Discordant descendant line 1 Discordant descendant lines 2 Discordant descendant lines 3 Discordant descendant lines 4 Discordant descendant lines 5 Discordant descendant lines Total unmethylated CG

Bisulfite Treatment

Intergenic n = 21

1

12

19

rep 1 rep 2

rep 1 rep 2

rep 1 rep 2

SMPs Diverge Over Time Ancestor vs Ancestor

Ancestor vs Descendant

Descedant vs Descendant

At5g24250

-1

0

1

2

3

4

5

6

7

8

0

1

1

19

19

12

12

29

29

49

49

59

59

69

69

119

119

2

4

6

8

10

12

14

16

24nt 23nt 22nt 21nt

We expanded our search to look for differentially methylated regions and found 72 regions. Interestingly, 14 of these regions fell within protein-coding genes, and we looked at the correlation between methylation, expression, and small RNAs, which are known to help direct DNA methylation. As can be seen above, the absence of methylation correlates with the expression of this gene and with a loss of 24-nucleotide small RNAs. This result supports the idea that these epigenetic changes can have an effect on gene production and potentially an individual.

1

12

es t

An c

19

9

or

s

1

s nt da en

sc

De

For this study, we took advantage of an existing population of plants that had been highly inbred and is consequently very genetically similar. We sequenced the methylomes of 5 individuals that were 31 generations removed from the original founder of this population and 3 individuals that were only 3 generations removed from the founder. To account for technical variability, we also sequenced a biological replicate of each of these samples which were highly similar.

nt s

en da

De sc

Conclusions

In the design of this experiment, we have tried to account for genetic and environmental variability that could have led to the changes we observed. Arabidopsis thaliana offered us a unique opportunity to show convincing evidence that it is possible for these modifications to vary across generations independent of 0 20000 50000 genetic changes. Furthermore, we were able to find examples of # of dissimilar SMPs these changes that had an effect on the transcriptional output When we compared the SMPs from different individuals we found a striking of some genes. Although no major phenotypic changes have ever been observed in this population of plants, our results pattern. The more generations that separated two individuals the more the methylation patterns at SMP sites differed. Above, dark red indicates more dif- leave open the intriguing possibility that these changes could be acted upon by evolution. ferences. 49

Descendants

11

rep 1 rep 2

69

rep 1 rep 2

59

rep 1 rep 2

29

rep 1 rep 2

49

rep 1 rep 2

12

119

69

69

59

59

29

49

s

29

or

st ce An

The epigenetic modification that we chose to focus on in this study is DNA methylation, which is the addition of a methyl group to cytosines in DNA. To measure this modification, we take advantage of a chemical known as bisulfite, which will convert unmethylated cytosines to thymines and leave methylated cytosines (red Cs above) unconverted. After this conversion, any remaining cytosines detected by a DNA sequencing platform should be methylated. We refer to the sum total of these methylated cytosines as the methylome.

1 rep1 1 rep2 19 rep1 19 rep2 12 rep1 12 rep2 29 rep1 29 rep2 49 rep1 49 rep2 59 rep1 59 rep2 69 rep1 69 rep2 119 rep1 119 rep2

smRNA levels at At5g24240 C-DMR (RPKCMs)

9

High Throughput Sequencing

11

19

Ancestors

Log2 fold change in mRNA levels of At5g24240 (relative to line 1)

At5g24240

AUTGGAGTCGATGCTUUAA ATTGGAGTCGATGCTTTAA

Transposons n = 27

Genes n = 14

Total unmethylated CG dinucleotides in the genome

PCR Amplification

Pseudogene n=1

Promoters n=7

Invariable mCGs in the descendant population

With these methylome data, we were able to ask how different the descendants were from the original founder. We called these differences single methylation polymorphisms (SMPs). Although most of the cytosines in all of our individuals were either completely methylated or unmethylated, a fraction of them varied between individuals. In the left panel above, gold bars represent sites of methylation.

Col-0

ncRNAs n=2

Ancestors

Descendants

Ancestors

SMP SMP

Mutation Accumulation Population

ACTGGAGTCGATGCTTCAA

Total SMPs in the descendant population

Descendants

Every cell that makes up an individual has the exact same genome and yet that individual is made up of a whole host of different cells. To achieve this variety of cell types, organisms utilize a variety of chemical modifications to DNA, which can be thought of as dials that selectively turn up or down how much of that gene’s product is produced. For example, some of the genes that are turned on in heart cells will not be turned on in liver cells and vice versa. Consequently, when a heart cell divides, it needs to send a signal to its daughter cells about which genes should be on and off. The term epigenetics is used to describe the inheritance of these kinds of non-genetic modifications. An open question is whether or not changes in these modifications can vary over time like mutations in DNA. Undoubtedly, if one picked any of these modifications and compared them across two individuals there would be many differences. What makes this problem challenging is disentangling those differences from genetic and environmental effects that could have also caused them. To overcome these challenges in an attempt to answer this fundamental question, we turned to the model plant Arabidopsis thaliana and DNA sequencing technology.

Differentially Methylated Regions

Color Ke y