A Novel MST Method

Report 0 Downloads 310 Views
W. Tang, C. J. Sabatini, E. R. Neal, K. McGaughey, A. Goodman, A. Dekhtyar, M. Black, C. Kitts

http://cplop.cosam.calpoly.edu/

http://cplop.cosam.calpoly.edu/

California Polytechnic State University San Luis Obispo, CA

Swab Technique Distribution of E. coli Isolates among Detected Strains

Temporal Distribution of Detected E. coli Strains

16

14

Immediate

10 8

Fecal

6

Before

4

Number of Isolates

12

Person A

12

Later

10 8 6 4

2

2

0

0 Month 1

Month 2

Month 3

Month 4

Month 5

Month 6

16 *

14 12 * Number of Isolates

Person B

10 9 8 7 6 5 4 3 2 1 0

10 8 6 4 2 0

Example A A C T G T G A G A C A G G AA C T G T G A G A C A G G Strain 1 A A C T G T G A G A C A G G

Graph 2.

4

14

Pyrogram

*

12

Person C

10

Month 4

Month 5

6 *

*

2 0

10 8 6 4 2

Relative Light Photon #

0

5’

5’

G

C AC G G A T T C A C T A C G GCA TT T A

5’

Month 1

3

A TT C

*Strains with a significant p-value less than α. Person A; α= 0.003. Person B; α= 0.002. Person C; α= 0.002.

A

G T C A A G T C GGA

C T A G G A C G T A C G T T T A G G A C T G A A T C C G T A G A G C T A G G A C G T A C G T T T AAG G G A C T G A A T C C G T A G

3’

A G C T A G G A C G T A C G T T T A G G A A T C C G A C T T G A C

3’

5’ 3’

5’

E. coli Sampling & Confirmation. E. coli were sampled from three individuals once a month for six months. The individuals are two females and one male, ranging in age from 20-25. Four swabbing techniques were used to obtain intestinal E. coli. Anal swabs were collected before, immediately after, and several hours later after defecation. Lastly, collected feces were sampled with a sterile swab post homogenization. Inoculated swabs were streaked onto agar plates for single isolated colonies. From these initial plates, 15 single colonies from each swab technique were selected for a series of metabolic tests to confirm E. coli. Molecular Analysis. Confirmed E. coli isolates were used in colony PCR to amplify both ITS 1 and ITS 2 in the ribosomal RNA operon of the E. coli genome. All seven copies of the rRNA operon in E. coli genome were amplified during PCR. Both ITS 1 and ITS 2 were pyroprinted using pyrosequencers. Normally during pyrosequencing, a sequence is determined by the heights and order of peaks of light produced when nucleotides dispensed in a particular order are incorporated to an extending strand of DNA. The amount of light produced is proportional to the number of nucleotides added in a row.

5’

Month 3

Month 4

Month 5

Strain 120

Strain 50

Strain 49

Strain 48

Strain 47

Strain 46

Strain 45

Strain 44

Strain 43

Strain 42

Strain 41

Strain 40

Strain 39

Strain 38

Strain 37

Strain 36

Strain 35

Strain 34

Strain 33

Strain 32

Strain 27

Strain 10

Strain 9

Month 6

Strain 130 Strain 128 Strain 126 Strain 124 Strain 94 Strain 83 Strain 81 Strain 79 Strain 77 Strain 75 Strain 73 Strain 71 Strain 30 Strain 12 Strain 10

Strain 129 Strain 127 Strain 125 Strain 96 Strain 93 Strain 82 Strain 80 Strain 78 Strain 76 Strain 74 Strain 72 Strain 31 Strain 22 Strain 11

Only fecal isolates were used for this temporal analysis.

7

2 1

T

Month 2

Strain 121

Month 6

12

8

4

Month 3

14

Number of Isolates

Example Strain 2

AA C T G T G A G A C A G G AA C T G T G A G A C A G G AA C T G T G A G A C A G G AA C T G T G A G A C A G G AA C T G T G A G A C A G G AA C T G T G A G A C A G G AA C C G T G A G A C A G G

Month 2

16

14

AA C T G T G A G A C A G G AA C T G T G A G A C A G G AA C C G T G A G A C A G G AA C C G T G A G A C A G G

Graph 2. Example pyroprints for two strains with a single nucleotide difference.

Graph 1.

Month 1

Number of Isolates

Graph 1. An example pyrogram from a single DNA sequence.

Strain 110 Strain 99 Strain 13 Strain 12 Strain 11 Strain 10 Strain 9 Strain 8 Strain 7 Strain 6 Strain 5 Strain 4 Strain 3 Strain 2 Strain 1

14

Number of Isolates

Escherichia coli is a well-studied commensal bacterium inhabiting the mammalian intestinal tract and is an indicator of fecal contamination when found in the environment. Thus, E. coli is commonly selected for Microbial Source Tracking (MST) in which a bacterial contaminant is matched to its host species by identifying and differentiating between strains. Current MST methods can be very expensive, in both money and labor, and many have issues in reproducibility when identifying different E. coli strains. In order to address many of the common complaints regarding MST methods, our lab has developed a novel pyrosequencing based method to distinguish between different E. coli strains. The method designates E. coli isolates into strains using the DNA sequences of the seven ribosomal RNA operons. The fingerprints (or pyroprints) that are obtained for comparison inspired the method’s name: Pyroprinting. This study used our new method to investigate the success of different sampling strategies for obtaining E. coli from humans. Success was determined by the variation of strains detected from each sampling type. Ideally, different types of sampling methods should detect similar proportions of strains. The secondary purpose of this study was to track strain distribution of intestinal E. coli populations over a six-month period in select human subjects. We hypothesized that each individual would host one or two dominant strains over the entire six month period, but that minor strains would vary.

Number of Isolates

Microbial Source Tracking (MST) is the science of identifying the source of a bacterial species to its host species. This practice is used frequently in applied science to identify sources of contamination in the environment like beaches or lakes, for example, but MST techniques can also be used to study microbial population dynamics in specific organisms. In this study, a new MST method was developed to investigate the variation of Escherichia coli strains in humans. The new approach uses pyrosequencing to generate DNA fingerprints (or pyroprints) based on the sequences from two polymorphic regions within the ribosomal RNA operon of E. coli. Seven copies of the ribosomal RNA operons are present in the E. coli genome and each possesses two highly variable regions called Intergenic Transcribed Spacers (ITS). These regions of DNA are non-coding, and thus are able to accumulate nucleotide changes. The pyroprinting MST method developed by our lab capitalizes on these differences to create patterns of data by simultaneously sequencing all seven copies of each ITS: the 16-23S rRNA region (ITS 1) and 23-5S rRNA region (ITS 2). A “match” between E. coli isolates is determined when the patterns are nearly identical at both ITS 1 and 2. A cluster developed by matching isolates is considered to be a unique strain. Specifically, this study investigated E. coli strain diversity in three humans over a six-month time period and between different sampling methods. E. coli isolates were collected and confirmed from a series of anal and fecal swabs taken once a month for each of the six months. The swab series consisted of four distinct sampling strategies. The results obtained indicated that different sampling techniques may detect different strains. The data obtained from the temporal study confirmd the hypothesis that individuals host one or two dominant strains. Interestingly, the population changed such that different strains were dominant from month to month. Additionally, minor strains and number of minor strains varied monthly for most individuals. Lastly, the amount of diversity of E. coli populations was different between individuals.

G A T C G A T C

G A T C G A T C G A T C

A pattern of peaks is produced based on the 7 different sequences; these are the pyroprints that are used for comparison in this method. Each E. coli isolate pyroprint is based on the results of two pyroprints: one for each ITS region. Unique E. coli pyroprints are considered to represent unique E. coli strains. Statistics. Pearson Correlation was used to analyze differences between each E. coli isolate in peak heights at each nucleotide dispensation. A computer algorithm written by collaborating computer scientists was used to build strains using the correlation matrices for both ITS 1 and ITS 2. Isolates were clustered into a strain if the correlation between them was 99.5 or above. Any isolates with a correlation below 99.0 were never clustered together. Isolates with correlations between these values were clustered into a strain based on its average correlation with all other isolates within the strain. Once the strains were built, differences in swab techniques were analyzed statistically using binary logistic regression which determines the significance between differences in proportions. A p-value below α indicates there is a significantly different proportion of swab types in a strain.

• Sampling technique may influence the strains that are detected by pyroprinting. • Could be influenced by strain diversity (see Person A v. B & C) • These results could pose a question for other MST/detection methods. • Over time individual hosts seem to harbor a diverse E. coli population • Type of strain detected as dominant changed frequently • Months that contain smaller, less represented strains may be months of transition • Less represented strains could be transient • Human hosts are variable in the amount diversity and change within E. coli populations • Results between individuals in both swab type analysis and temporal analysis yields differences in distributions. • Variation seen both in overall diversity and temporal diversity • May be due to both genetic and environmental factors, but we hypothesize environment plays a larger role

• Larger study regarding sampling method • Conduct study including other hosts • Investigate E. coli strains sampled from a larger human populations: • Even though individual hosts are variable, do the strains detected in a large population of human hosts stay the same over time?

Acknowledgments Special thanks to our supporters from the W.M. Keck Foundation and Cal Poly San Luis Obispo College-Based Fees. Thank you also to the Undergraduate Biotechnology Lab, the Environmental Biotechnology Institute, the entire M. Black lab, J. VanderKelen, and additional supporters at the Cal Poly Biological Sciences Department.