/usr/lib/R/site-library/ShortRead/template/3000-ReadDistribution.html is in r-bioc-shortread 1.36.0-1.
This file is owned by root:root, with mode 0o644.
The actual contents of the file can be viewed below.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 | <h2>Read Distribution</h2>
<a name="Read-Distribution"></a>
<p>
These curves show how coverage is distributed amongst
reads. Ideally, the cumulative proportion of reads will transition
sharply from low to high.
</p>
<p>
Portions to the left of the transition might correspond roughly to
sequencing or sample processing errors, and correspond to reads that
are represented relatively infrequently. 10-15%; of reads in a
typical Genome Analyzer 'control' lane fall in this category.
</p>
<p>
Portions to the right of the transition represent reads that are
over-represented compared to expectation. These might include
inadvertently sequenced primer or adapter sequences, sequencing or
base calling artifacts (e.g., poly-A reads), or features of the
sample DNA (highly repeated regions) not adequately removed during
sample preparation. About 5% of Genome Analyzer 'control' lane
reads fall in this category.
</p>
<p>
Broad transitions from low to high cumulative proportion of reads
may reflect sequencing bias or (perhaps intentional) features of
sample preparation resulting in non-uniform coverage. the transition
is about 5 times as wide as expected from uniform sampling across
the Genome Analyzer 'control' lane.
</p>
<pre>
df <- qa[["sequenceDistribution"]]
ShortRead:::.plotReadOccurrences(df[df$type=="read",], cex=.5)
</pre>
@READ_OCCURRENCES_FIGURE@
<p>
Common duplicate reads might provide clues to the source of
over-represented sequences. Some of these reads are filtered by the
alignment algorithms; other duplicate reads might point to sample
preparation issues.
</p>
<pre>
ShortRead:::.freqSequences(qa, "read")
</pre>
@FREQUENT_SEQUENCES_READ@
<p>
Common duplicate reads after filtering
</p>
<pre>
ShortRead:::.freqSequences(qa, "filtered")
</pre>
@FREQUENT_SEQUENCES_FILTERED@
<p>
Common aligned duplicate reads are
</p>
<pre>
ShortRead:::.freqSequences(qa, "aligned")
</pre>
@FREQUENT_SEQUENCES_ALIGNED@
|