Skip to main content
SearchLogin or Signup

Review 2: "Genome sequencing of sewage detects regionally prevalent SARS-CoV-2 variants"

This preprint offers a successful demonstration of WGS-based detection of emerging SARS-CoV-2 variants in wastewater samples. Reviewers deemed major claims reliable, but experimental methodology and justification should be described in further detail.

Published onJan 24, 2021
Review 2: "Genome sequencing of sewage detects regionally prevalent SARS-CoV-2 variants"
1 of 2
key-enterThis Pub is a Review of
Genome sequencing of sewage detects regionally prevalent SARS-CoV-2 variants

Viral genome sequencing has guided our understanding of the spread and extent of genetic diversity of SARS-CoV-2 during the COVID-19 pandemic. SARS-CoV-2 viral genomes are usually sequenced from nasopharyngeal swabs of individual patients to track viral spread. Recently, RT-qPCR of municipal wastewater has been used to quantify the abundance of SARS-CoV-2 in several regions globally. However, metatranscriptomic sequencing of wastewater can be used to profile the viral genetic diversity across infected communities. Here, we sequenced RNA directly from sewage collected by municipal utility districts in the San Francisco Bay Area to generate complete and near-complete SARS-CoV-2 genomes. The major consensus SARS-CoV-2 genotypes detected in the sewage were identical to clinical genomes from the region. Using a pipeline for single nucleotide variant (SNV) calling in a metagenomic context, we characterized minor SARS-CoV-2 alleles in the wastewater and detected viral genotypes which were also found within clinical genomes throughout California. Observed wastewater variants were more similar to local California patient-derived genotypes than they were to those from other regions within the US or globally. Additional variants detected in wastewater have only been identified in genomes from patients sampled outside of CA, indicating that wastewater sequencing can provide evidence for recent introductions of viral lineages before they are detected by local clinical sequencing. These results demonstrate that epidemiological surveillance through wastewater sequencing can aid in tracking exact viral strains in an epidemic context.

RR:C19 Evidence Scale rating by reviewer:

  • Strong. The main study claims are very well-justified by the data and analytic methods used. There is little room for doubt that the study produced has very similar results and conclusions as compared with the hypothetical ideal study. The study’s main claims should be considered conclusive and actionable without reservation.



Crits-Christoph et al sequence RNA isolated from wastewater collected at the municipal utility districts in the San Francisco Bay Area. Sequencing results in complete or near-complete genomes for SARS-CoV-2, bocavirus 3, PMMoV, and other plant viruses. They focus on the SARS-CoV-2 sequences and show that the genotypes predominantly correspond to those isolated from clinical genomes in the region. Minor genotypic variants are detected, and the authors suggest that this represents a sensitive way to detect recent introductions to viral lineages before they are detected in clinical samples. Overall, I consider the work to be well- suited for publication in RR:C19.


In figure 1a, it is unclear why there are two entries (relative abundances) reported for a single location on some dates, while other locations have a single entry.

Have the authors followed single-nucleotide variants (SNVs) 8083A and 1738T? If these SNVs have appeared in CA clinical samples since July, then it will strengthen author’s claim about early detection of recent introduction events.

Minor recommendations:

I think the understanding by a broad readership may benefit from clear definition of “milk of silica.”

When authors describe Figure 1d in the text “Only samples with RT-qPCR Ct-values <33 (~25 gc/uL) yielded complete consensus genomes” it might be better to talk about total genome copies (as on x-axis on the plot) rather than gc/uL. A dashed vertical line showing the threshold (>1000 genome copies) for achieving complete genome might also help to read the plot.

Consider commenting on why plant viruses predominate RNA samples isolated using ultrafiltration (panel a) as compared to those from total RNA column and milk of silica samples (panel b). Were both samples amplified using oligo-capture approach to enrich for human respiratory viruses?


No comments here