/usr/share/doc/garlic-doc/commands/seq.html is in garlic-doc 1.6-1.
This file is owned by root:root, with mode 0o644.
The actual contents of the file can be viewed below.
<html>
<head>
<title>
SEQ, SEQUENCE
</title>
</head>
<h1 align=center>
SEQ, SEQUENCE
</h1>
<hr size="3">
<font color=#880000>
<b>
NAME
<br>
</b>
</font>
SEQ, SEQUENCE - manipulate the content of the sequence buffer.
<br><br>
<font color=#880000>
<b>
SYNOPSIS
<br>
</b>
</font>
SEQ = three_letters_code
<br>
SEQ LOAD filename
<br>
SEQ READ filename
<br>
SEQ FROM structure_identifier
<br>
SEQ COPY
<br>
SEQ SAVE filename
<br>
SEQ SWN filename
<br>
SEQ RESET
<br><br>
<font color=#880000>
<b>
DESCRIPTION
<br>
</b>
</font>
The command SEQ (long form: SEQUENCE) manipulates the content of the main
sequence buffer. Garlic maintains two sequence buffers: the main buffer and
the reference buffer. The main sequence buffer is used to prepare the average
hydrophobicity plot, the hydrophobic moment plot and the helical wheel plot,
and for other operations which require sequence information. The reference
sequence buffer is used for sequence comparison and other operations which
require two sequences.
<br><br>
Both buffers store the following sequence information:
<br>
(1) The number of residues.
<br>
(2) The sequence, as three-letter codes. Uppercase letters are used.
<br>
(3) Disulfide bond flag, if information about disulfide bonds is available.
<br>
(4) Residue serial numbers.
<br>
(5) Raw hydrophobicity values (replaced by average value for exotic residues).
<br><br>
In addition, the main sequence buffer contains the following information:
<br>
(6) The average hydrophobicity.
<br>
(7) The hydrophobic moment.
<br><br>
As sequence information may be given independently of any structure, atomic
coordinates are not required for most sequence manipulation routines. Thus,
garlic may be used as a sequence analysis tool.
<br><br>
All versions of the command SEQ, except one, manipulate the content of the
main sequence buffer. The only exception is SEQ COPY, which copies the
content of the main sequence buffer to the reference buffer. This is the only
way to store information in the reference buffer.
<br><br>
<font color=#880000>
<b>
SEQ = three_letters_code
<br>
</b>
</font>
The command SEQ may be used with the keyword = (equal sign) to define a
sequence at the garlic command prompt. This is practical for defining a short
sequence fragment. The fragment may be used for a helical wheel plot, or to
locate the given sequence fragment in a structure which is being investigated.
The syntax:
<br><br>
SEQ = three_letters_code
<br><br>
Example:
<br><br>
seq = ala phe tyr trp asn
<br><br>
The sequence fragment will be converted to uppercase. The sequence is not
checked for exotic residues, so non-standard codes may be used. However,
the routine which assigns the hydrophobicity values will fail to recognize
them, and the average hydrophobicity value (calculated for the current scale)
will be assigned to these residues. At present, 23 codes are recognized:
<br>
<img src="aacodes.gif">
<br><br>
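The fallback for unrecognized codes described above can be illustrated with a
short Python sketch. It is not garlic's code: the function name is
hypothetical, and the small table holds a few Kyte-Doolittle values chosen
only for illustration. Garlic's current scale and its 23 recognized codes are
not reproduced here.
<br>
<pre>
```python
# Illustrative subset of a hydrophobicity scale (Kyte-Doolittle values).
# Assumption: garlic's own scales and recognized codes may differ.
SCALE = {"ALA": 1.8, "PHE": 2.8, "TYR": -1.3, "TRP": -0.9, "ASN": -3.5}


def assign_hydrophobicity(fragment, scale=SCALE):
    """Return (codes, values) for a space-separated three-letter fragment."""
    # The fragment is converted to uppercase, as SEQ = does.
    codes = [code.upper() for code in fragment.split()]
    # Unrecognized codes receive the average value of the scale in use.
    average = sum(scale.values()) / len(scale)
    values = [scale.get(code, average) for code in codes]
    return codes, values


codes, values = assign_hydrophobicity("ala phe xyz")
print(codes, values)
```
</pre>
Here the made-up code XYZ is not in the table, so it receives the average of
the five table values.
<br><br>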
<font color=#880000>
<b>
SEQ LOAD filename
<br>
</b>
</font>
The keyword LOAD (or READ; short forms LOA and REA) may be used to read the
sequence from the specified file. Garlic recognizes two input file formats:
FASTA files (one-letter code) and files which contain three-letter codes in
a free format.
<br><br>
If the input file contains the symbol > (greater than) in the first column of
the first useful line, the file is treated as one-letter protein code in
FASTA format. Empty lines are ignored. Lines beginning with the symbol
# (number sign) in the first column are treated as comments and are ignored
too. Thus, the lines which are not empty and do not contain the symbol # in
the first column are treated as useful.
<br><br>
If the input file is not recognized as a FASTA file, it is expected to contain
three-letter codes in a free format. Empty lines and all lines which
contain # in the first column are ignored. All other lines are treated as
useful. Digits (serial numbers, for example) are ignored.
The following characters are treated as separators:
<br>
(1) space
<br>
(2) tab
<br>
(3) comma (,)
<br>
(4) semicolon (;)
<br>
(5) newline (line feed)
<br><br>
If the input file contains at least one bad code (a residue name which
consists of four letters, for example), reading will fail. The hard-coded
maximum number of residues is 20000, but it may be easily changed (see
MAXRESIDUES in the header file defines.h).
<br><br>
Example:
<br>
seq load sample.fasta
<br><br>
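The input rules above can be summarized in a short Python sketch. This is not
garlic's parser. The function names are hypothetical, and several details are
assumptions: whitespace-only lines count as empty, digits are simply dropped,
and only a code longer than three letters is treated as bad. Decoding of the
one-letter FASTA body is not shown.
<br>
<pre>
```python
SEPARATORS = " \t,;\n"  # space, tab, comma, semicolon, newline


def useful_lines(text):
    """Drop empty lines and lines with # in the first column."""
    return [ln for ln in text.splitlines() if ln.strip() and not ln.startswith("#")]


def is_fasta(text):
    """FASTA is assumed when the first useful line starts with the symbol >."""
    lines = useful_lines(text)
    return bool(lines) and lines[0].startswith(">")


def parse_three_letter(text, max_residues=20000):
    """Parse free-format three-letter codes; raise ValueError on a bad code."""
    codes = []
    for line in useful_lines(text):
        # Digits are dropped; every separator is turned into a space.
        cleaned = "".join(
            " " if ch in SEPARATORS else ch for ch in line if not ch.isdigit()
        )
        for token in cleaned.split():
            if len(token) > 3:
                raise ValueError(f"bad residue code: {token!r}")
            codes.append(token.upper())
            if len(codes) > max_residues:
                raise ValueError("too many residues")
    return codes


sample = "# comment\n\n1 ala, 2 phe; tyr\tTRP\nasn\n"
print(is_fasta(sample), parse_three_letter(sample))
```
</pre>
<br><br>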
<font color=#880000>
<b>
SEQ FROM structure_identifier
<br>
</b>
</font>
The keyword FROM (short form: FRO) may be used to copy the sequence from
the specified structure to the main sequence buffer. Only selected residues
are copied. A residue is treated as selected if its first atom is selected.
For proteins, this is typically N (nitrogen). Residue insertion codes are
ignored! Thus, the same residue serial number may appear more than once in
the array of residue serial numbers.
<br><br>
Example:
<br>
seq from 1
<br><br>
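The selection rule and the effect of ignoring insertion codes can be sketched
in Python. The Atom record and the function name are hypothetical and do not
reflect garlic's internal data structures. The sketch also assumes that atoms
are listed residue by residue.
<br>
<pre>
```python
from collections import namedtuple

# Hypothetical minimal atom record, for illustration only.
Atom = namedtuple("Atom", "residue_name serial insertion_code atom_name selected")


def sequence_from_atoms(atoms):
    """Collect (serial, name) for residues whose first atom is selected."""
    residues = []
    previous_key = None
    for atom in atoms:
        key = (atom.serial, atom.insertion_code, atom.residue_name)
        if key != previous_key:
            # This is the first atom of a new residue.
            previous_key = key
            if atom.selected:
                # The insertion code is dropped, so serial numbers may repeat.
                residues.append((atom.serial, atom.residue_name))
    return residues


atoms = [
    Atom("GLY", 10, "", "N", True),
    Atom("GLY", 10, "", "CA", False),
    Atom("ALA", 10, "A", "N", True),
    Atom("SER", 11, "", "N", False),
    Atom("SER", 11, "", "CA", True),
]
print(sequence_from_atoms(atoms))
```
</pre>
In this made-up input, GLY 10 and ALA 10A are both copied with serial number
10, while SER 11 is skipped because its first atom is not selected.
<br><br>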
<font color=#880000>
<b>
SEQ COPY
<br>
</b>
</font>
The command SEQ COPY (short form: SEQ COP) copies the sequence from the main
sequence buffer to the reference buffer. This is the only way to initialize
the reference buffer. This command must be executed (i.e., the keyword COPY
must be used) before executing commands which require two sequences for proper
operation. The main sequence buffer may be initialized prior to SEQ COPY by
using one of the keywords described above (=, LOAD or FROM).
<br><br>
Example:
<br>
seq copy
<br><br>
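The relation between the two buffers can be modelled with a small Python
sketch. The class and function names are hypothetical. Only the sequence and
the serial numbers are modelled, and garlic's actual storage is not shown.
<br>
<pre>
```python
from dataclasses import dataclass, field


@dataclass
class SequenceBuffer:
    """Hypothetical stand-in for one sequence buffer."""
    codes: list = field(default_factory=list)    # three-letter codes
    serials: list = field(default_factory=list)  # residue serial numbers


def seq_copy(main, reference):
    """Mimic SEQ COPY: the reference buffer receives a copy of the main one."""
    reference.codes = list(main.codes)
    reference.serials = list(main.serials)


main_buffer = SequenceBuffer(["ALA", "PHE"], [1, 2])
reference_buffer = SequenceBuffer()

seq_copy(main_buffer, reference_buffer)

# The main buffer can now be redefined; the reference keeps the first sequence.
main_buffer.codes = ["TRP"]
main_buffer.serials = [1]
print(reference_buffer.codes, main_buffer.codes)
```
</pre>
This mirrors the usual order of operations before a two-sequence command:
define the first sequence, copy it, then define the second sequence in the
main buffer.
<br><br>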
<font color=#880000>
<b>
SEQ SAVE filename
<br>
</b>
</font>
The command SEQ SAVE (short form: SEQ SAV) saves the sequence to the
specified file. Ten codes (each consisting of up to three letters) are
written per line, separated by spaces. Serial numbers are not included
(but see the keyword SWN).
<br><br>
Example:
<br>
seq save 9pap.seq
<br><br>
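The output layout described above can be approximated in Python as follows.
This is a sketch, not garlic's writer: the function name is hypothetical and
the exact spacing produced by garlic is an assumption.
<br>
<pre>
```python
def format_sequence(codes, per_line=10):
    """Return the codes as text, ten per line, separated by single spaces."""
    lines = [
        " ".join(codes[i:i + per_line]) for i in range(0, len(codes), per_line)
    ]
    return "".join(line + "\n" for line in lines)


print(format_sequence(["ALA"] * 12), end="")
```
</pre>
<br><br>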
<font color=#880000>
<b>
SEQ SWN filename
<br>
</b>
</font>
The command SEQ SWN saves the sequence to the specified file. Both residue
names and serial numbers are written to the output file. Insertion codes will
be missing! Five serial numbers and residue names are written per line,
separated by spaces.
<br><br>
Example:
<br>
seq swn 9pap.seq
<br><br>
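A Python sketch of the layout with serial numbers is given below. The function
name is hypothetical, and both the order within each pair (serial number
first) and the spacing are assumptions.
<br>
<pre>
```python
def format_with_numbers(serials, codes, per_line=5):
    """Return 'serial name' pairs as text, five pairs per line."""
    pairs = [f"{serial} {code}" for serial, code in zip(serials, codes)]
    lines = [
        " ".join(pairs[i:i + per_line]) for i in range(0, len(pairs), per_line)
    ]
    return "".join(line + "\n" for line in lines)


names = ["ALA", "PHE", "TYR", "TRP", "ASN", "GLY"]
print(format_with_numbers(range(1, 7), names), end="")
```
</pre>
Since SEQ LOAD ignores digits in the free format, a file of this form should
be readable again as a three-letter code file.
<br><br>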
<font color=#880000>
<b>
SEQ RESET
<br>
</b>
</font>
Reset (clear) the main sequence buffer. The command SEQ RESET (short form:
SEQ RES) sets the number of residues in the main sequence buffer to zero.
The storage is not freed, so the buffer may be used again later.
<br><br>
Example:
<br>
seq reset
<br><br>
<font color=#880000>
<b>
RELATED COMMANDS
<br>
</b>
</font>
PLOT prepares the average hydrophobicity and/or hydrophobic moment plot.
COMPARE compares two sequences. VENN draws a Venn diagram. WHEEL draws a
helical wheel plot. SEL SEQ selects portions of the structure which contain
the sequence stored in the main sequence buffer. To use any of these commands,
the main sequence buffer (for COMPARE, both buffers) must be initialized by
using the command SEQ. STR defines the secondary structure and CREATE may be
used to create a new peptide.
<br><br>
<hr size="3">
</html>