/usr/share/EMBOSS/test/swnew/trembl.dat is in emboss-test 6.6.0-1.

This file is owned by root:root, with mode 0o644.

The actual contents of the file can be viewed below.

ID   O42495_TAKRU            Unreviewed;       433 AA.
AC   O42495;
DT   01-JAN-1998, integrated into UniProtKB/TrEMBL.
DT   01-OCT-2002, sequence version 2.
DT   16-MAY-2012, entry version 58.
DE   SubName: Full=SkmBOP;
GN   Name=skmBOP;
OS   Takifugu rubripes (Japanese pufferfish) (Fugu rubripes).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei;
OC   Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes;
OC   Tetradontoidea; Tetraodontidae; Takifugu.
OX   NCBI_TaxID=31033;
RN   [1]
RP   NUCLEOTIDE SEQUENCE.
RX   MEDLINE=98024154; PubMed=9356472; DOI=10.1073/pnas.94.23.12462;
RA   Venkatesh B., Si-Hoe S.L., Murphy D., Brenner S.;
RT   "Transgenic rats reveal functional conservation of regulatory controls
RT   between the Fugu isotocin and rat oxytocin genes.";
RL   Proc. Natl. Acad. Sci. U.S.A. 94:12462-12466(1997).
RN   [2]
RP   NUCLEOTIDE SEQUENCE.
RX   MEDLINE=22220114; PubMed=12234665; DOI=10.1016/S0378-1119(02)00793-X;
RA   Gilligan P., Brenner S., Venkatesh B.;
RT   "Fugu and human sequence comparison identifies novel human genes and
RT   conserved non-coding sequences.";
RL   Gene 294:35-44(2002).
CC   -!- SIMILARITY: Contains 1 SET domain.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; U90880; AAC60295.2; -; Genomic_DNA.
DR   ProteinModelPortal; O42495; -.
DR   HOGENOM; HOG000050244; -.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   InterPro; IPR001214; SET_dom.
DR   InterPro; IPR002893; Znf_MYND.
DR   Pfam; PF00856; SET; 1.
DR   Pfam; PF01753; zf-MYND; 1.
DR   SMART; SM00317; SET; 1.
DR   PROSITE; PS50280; SET; 1.
DR   PROSITE; PS01360; ZF_MYND_1; 1.
DR   PROSITE; PS50865; ZF_MYND_2; 1.
PE   4: Predicted;
KW   Metal-binding.
SQ   SEQUENCE   433 AA;  49500 MW;  7CA7DE79E73EEE26 CRC64;
     MENVAIFDSP GKGRGLKTTK EFWAGDVIFS EPSLAAVVFD SLAERICHSC FRRQEKLQKC
     SQCKFAHYCD RTCQRAGWAE HKQECGAIKA YGKAPNENIR VVSHMQLITV EELEDHVADM
     QEDEIKELKV DIHNFLDYWP RNSKQHTIDD ISHIFGVINC NGFTVSDQRG LQAVGVGLFP
     NLCMVNHNCW PNCTVILNHG KIELRSLGKI AEGEELTVAY VDFLNLSEER RRLLKTQYFF
     DCQCDYCKNG TKDDLKLAGR EVDGVKVVKI CRDVIDRTEP VLADNHIYLL RMWSTLSEVQ
     AYLQYFNDAA EKLYHPNNAA LGMAAMRAGV NHWQAGLIEV GHGMVCKAYA ILLRFCHHIW
     CKTRKCLMSS VVVIVSAFTG DADQTEMELR MFKQNEYVYH SMRDAALQNK PITMLHEPKG
     VEEGIKNLFH RRK
//
ID   Q1KKT3_TAKRU            Unreviewed;       412 AA.
AC   Q1KKT3;
DT   30-MAY-2006, integrated into UniProtKB/TrEMBL.
DT   30-MAY-2006, sequence version 1.
DT   16-MAY-2012, entry version 40.
DE   SubName: Full=Even-skipped homeobox 2;
DE   SubName: Full=Uncharacterized protein;
GN   Name=Evx2; Synonyms=EVX2;
OS   Takifugu rubripes (Japanese pufferfish) (Fugu rubripes).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei;
OC   Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes;
OC   Tetradontoidea; Tetraodontidae; Takifugu.
OX   NCBI_TaxID=31033;
RN   [1]
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=16636282; DOI=10.1073/pnas.0601492103;
RA   Lee A.P., Koh E.G.L., Tay A., Brenner S., Venkatesh B.;
RT   "Highly conserved syntenic blocks at the vertebrate Hox loci and
RT   conserved regulatory elements within and outside Hox gene clusters.";
RL   Proc. Natl. Acad. Sci. U.S.A. 103:6994-6999(2006).
RN   [2]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Hd-rR;
RX   PubMed=17554307; DOI=10.1038/nature05846;
RA   Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B.,
RA   Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D.,
RA   Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A.,
RA   Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K.,
RA   Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F.,
RA   Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H.,
RA   Morishita S., Kohara Y.;
RT   "The medaka draft genome and insights into vertebrate genome
RT   evolution.";
RL   Nature 447:714-719(2007).
RN   [3]
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (FEB-2012) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus (By similarity).
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; DQ481668; ABF22461.1; -; Genomic_DNA.
DR   SMR; Q1KKT3; 165-221.
DR   Ensembl; ENSTRUT00000045103; ENSTRUP00000044951; ENSTRUG00000017536.
DR   eggNOG; NOG306682; -.
DR   GeneTree; ENSGT00640000091339; -.
DR   HOGENOM; HOG000231897; -.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR   GO; GO:0003700; F:sequence-specific DNA binding transcription factor activity; IEA:InterPro.
DR   Gene3D; G3DSA:1.10.10.60; Homeodomain-rel; 1.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR020479; Homeobox_metazoa.
DR   InterPro; IPR001356; Homeodomain.
DR   InterPro; IPR009057; Homeodomain-like.
DR   Pfam; PF00046; Homeobox; 1.
DR   PRINTS; PR00024; HOMEOBOX.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; Homeodomain_like; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   3: Inferred from homology;
KW   Complete proteome; DNA-binding; Homeobox; Nucleus; Reference proteome.
SQ   SEQUENCE   412 AA;  44015 MW;  7E7FEA72B2BC421E CRC64;
     MMERIRKEMI LMERGLHSPV AGKRLTDAPG NSVLEALENS QHSGRLSPRI TSASLHGNLG
     DIPSKGKFEI DSLFGTHHSS DNASSVDINS SENRKKMSIY SEVSPDSDIN SDVEVGCPAH
     RSPSQHKENN KGFSDSNNGS SSSNSGANIN GNSSAGSNNS DQVRRYRTAF TREQIARLEK
     EFYRENYVSR PRRCELAAAL NLPETTIKVW FQNRRMKDKR QRLAMSWPHP ADPSFYTYMM
     THAAATGSLP YPFHSHMPLH YYPHVGVTAA AAAAAATGAA SSPFATSIRP LDTFRALSHP
     YSRPELLCSF RHPGLYQSTA GLNSSAAASA AAAAAAAAAA VSAPSASGPC SCLSCHSSQA
     ASALGSRSAG ADFTCTASGQ RSESGFLPYS AAVLSKTSVP SPDPREETSL TR
//
ID   Q1KL02_TAKRU            Unreviewed;       676 AA.
AC   Q1KL02;
DT   30-MAY-2006, integrated into UniProtKB/TrEMBL.
DT   30-MAY-2006, sequence version 1.
DT   16-MAY-2012, entry version 42.
DE   SubName: Full=Chimerin 2;
GN   Name=Chn2;
OS   Takifugu rubripes (Japanese pufferfish) (Fugu rubripes).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei;
OC   Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes;
OC   Tetradontoidea; Tetraodontidae; Takifugu.
OX   NCBI_TaxID=31033;
RN   [1]
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=16636282; DOI=10.1073/pnas.0601492103;
RA   Lee A.P., Koh E.G.L., Tay A., Brenner S., Venkatesh B.;
RT   "Highly conserved syntenic blocks at the vertebrate Hox loci and
RT   conserved regulatory elements within and outside Hox gene clusters.";
RL   Proc. Natl. Acad. Sci. U.S.A. 103:6994-6999(2006).
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; DQ481664; ABF22392.1; -; Genomic_DNA.
DR   ProteinModelPortal; Q1KL02; -.
DR   SMR; Q1KL02; 3-115, 419-676.
DR   eggNOG; NOG295484; -.
DR   InParanoid; Q1KL02; -.
DR   OrthoDB; EOG4W9J3S; -.
DR   GO; GO:0005622; C:intracellular; IEA:InterPro.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   GO; GO:0035556; P:intracellular signal transduction; IEA:InterPro.
DR   Gene3D; G3DSA:1.10.555.10; RhoGAP; 1.
DR   Gene3D; G3DSA:3.30.505.10; SH2; 1.
DR   InterPro; IPR020454; DAG/PE-bd.
DR   InterPro; IPR002219; Prot_Kinase_C-like_PE/DAG-bd.
DR   InterPro; IPR008936; Rho_GTPase_activation_prot.
DR   InterPro; IPR000198; RhoGAP_dom.
DR   InterPro; IPR000980; SH2.
DR   Pfam; PF00130; C1_1; 1.
DR   Pfam; PF00620; RhoGAP; 1.
DR   Pfam; PF00017; SH2; 1.
DR   PRINTS; PR00008; DAGPEDOMAIN.
DR   SMART; SM00109; C1; 1.
DR   SMART; SM00324; RhoGAP; 1.
DR   SMART; SM00252; SH2; 1.
DR   SUPFAM; SSF48350; Rho_GAP; 1.
DR   PROSITE; PS50238; RHOGAP; 1.
DR   PROSITE; PS50001; SH2; 1.
DR   PROSITE; PS00479; ZF_DAG_PE_1; 1.
DR   PROSITE; PS50081; ZF_DAG_PE_2; 1.
PE   4: Predicted;
KW   Metal-binding; Zinc; Zinc-finger.
SQ   SEQUENCE   676 AA;  77365 MW;  417B610C5ACC1B97 CRC64;
     MLFQKLFHGR ISREYADELL ATAEGAYLIR ESQRQPGSHT LALRFGHQTL NYRLFYDGKH
     FVGEKRFESV HDLVTDALIT LYIETKAAEY IAKMTTNPIY EHLGYTSLLK DKTVHRLNRG
     RTEPRRVTFQ RDERVSLLVI EAVEPSVRGC FVESVCRSVL LSYRVSPHCA VWSVSSRPTE
     SHLPSHKRAQ PLVLPPQQPH QAEISIYKSS LPRLSPQTTH VCIFSRNVSH NGKYLHLPCH
     SGKRQDFPDG REKPLLRGLA VVFYFASKRP ALSFYLGFAV ALKVFSFLLR HCDTLSSLIH
     FAPESDLVRV LQLLPRWAVL SPLCNFPQRR WSVCGNPTGP LWRLKKTEEV WEAELITHKA
     RRHERKRQEL LALALGVKLG SKGTILWKPI KLLASCPQIA SPLVRRTVLK DAPEKQCSYE
     KMHNFKVHTF RGPHWCEYCA NFMWGLIAQG VRCSDCGLNV HKQCSKLVPS DCQPDLRRIK
     KVFSCDLTTL VKAHNSTRPM VVDMCIQEIE LRGMKSEGLY RVSGFSEHIE DVRLAFDRDG
     DKADISASAY ADINIIAGAL KLYLRDLPIP VITFELYSKF IQAARIPNAD TRLEAIHDSL
     LQLPPAHYET LRYLMAHLKR VTLFEKYNLM NAENLGIVFG PTLMQPPEMN ALTTLNDMRL
     QKLVVQLMIE HEDVLF
//
ID   Q2HPN2_TAKRU            Unreviewed;       317 AA.
AC   Q2HPN2;
DT   21-MAR-2006, integrated into UniProtKB/TrEMBL.
DT   21-MAR-2006, sequence version 1.
DT   16-MAY-2012, entry version 28.
DE   SubName: Full=Beta1,4-galactosyltransferase 7;
DE            EC=2.4.1.133;
GN   Name=b4Gal-T7;
OS   Takifugu rubripes (Japanese pufferfish) (Fugu rubripes).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei;
OC   Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes;
OC   Tetradontoidea; Tetraodontidae; Takifugu.
OX   NCBI_TaxID=31033;
RN   [1]
RP   NUCLEOTIDE SEQUENCE.
RA   Talhaoui I., Bui C., Oriol R., Mulliert G., Gulberti S., Netter P.,
RA   Coughtrie M.W.H., Ouzzine M., Fournel-Gigleux S.;
RT   "Identification of Key Functional Residues in the Active Site of Human
RT   E21,4-Galactosyltransferase 7: A MAJOR ENZYME IN THE GLYCOSAMINOGLYCAN
RT   SYNTHESIS PATHWAY.";
RL   J. Biol. Chem. 285:37342-37358(2010).
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; AM231267; CAJ77200.1; -; mRNA.
DR   RefSeq; NP_001072098.1; NM_001078630.1.
DR   UniGene; Tru.3653; -.
DR   CAZy; GT7; Glycosyltransferase Family 7.
DR   GeneID; 778009; -.
DR   CTD; 779002; -.
DR   eggNOG; NOG305756; -.
DR   HOGENOM; HOG000286021; -.
DR   GO; GO:0046525; F:xylosylprotein 4-beta-galactosyltransferase activity; IEA:EC.
DR   GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR   InterPro; IPR003859; Galactosyl_T_2_met.
DR   PANTHER; PTHR19300; Galactosyl_T_2; 1.
DR   Pfam; PF02709; Glyco_transf_7C; 1.
DR   PRINTS; PR02050; B14GALTRFASE.
PE   2: Evidence at transcript level;
KW   Glycosyltransferase; Transferase.
SQ   SEQUENCE   317 AA;  36672 MW;  DA3B946E1934C1F0 CRC64;
     MMYPSRRKPV LYFKEERRCT IYKLFIVCTV LLLVSLLWLQ LSCSGDMTSS AEDRPQLLPQ
     QRPPPCQAEA QASAADDPSW GPHKLAVIVP FRERFEELLV FVPFMHGFLS KKKIRHKILV
     INQVDRYRFN RASLINVGHL ESGNDTDYLA MHDVDLLPLN DALDYGFPEE GPFHVASPEL
     HPLYHYKTYV GGILLLTKKH YDMCNGMSNR FWGWGREDDE FYRRLKKAQL QLFRPSGITT
     GYKTFLHIHD PAWRKRDQKR VAAQKQEQFK VDPEGGLSNL RYEVESRQEV VIGGAPCTVI
     NTRLGCDQNQ TPWCMLG
//
ID   Q2HWR4_TAKRU            Unreviewed;       217 AA.
AC   Q2HWR4;
DT   21-MAR-2006, integrated into UniProtKB/TrEMBL.
DT   21-MAR-2006, sequence version 1.
DT   16-MAY-2012, entry version 32.
DE   SubName: Full=T-cell surface glycoprotein CD8alpha;
GN   Name=CD8alpha;
OS   Takifugu rubripes (Japanese pufferfish) (Fugu rubripes).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei;
OC   Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes;
OC   Tetradontoidea; Tetraodontidae; Takifugu.
OX   NCBI_TaxID=31033;
RN   [1]
RP   NUCLEOTIDE SEQUENCE.
RA   Suetake H., Saha N.R., Araki K., Akatsu K., Kikuchi K., Suzuki Y.;
RT   "Lymphocyte surface marker genes in fugu.";
RL   Comp. Biochem. Physiol. Part D Genomics Proteomics 1:102-108(2006).
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; AB232548; BAE79813.1; -; mRNA.
DR   RefSeq; NP_001072086.1; NM_001078618.1.
DR   UniGene; Tru.3654; -.
DR   ProteinModelPortal; Q2HWR4; -.
DR   STRING; Q2HWR4; -.
DR   GeneID; 777995; -.
DR   CTD; 777995; -.
DR   eggNOG; fiNOG20049; -.
DR   HOGENOM; HOG000126626; -.
DR   InParanoid; Q2HWR4; -.
DR   Gene3D; G3DSA:2.60.40.10; Ig-like_fold; 1.
DR   InterPro; IPR015468; CD8_asu.
DR   InterPro; IPR007110; Ig-like.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR003599; Ig_sub.
DR   InterPro; IPR013106; Ig_V-set.
DR   PANTHER; PTHR10441; CD8_alpha; 1.
DR   Pfam; PF07686; V-set; 1.
DR   SMART; SM00409; IG; 1.
DR   PROSITE; PS50835; IG_LIKE; 1.
PE   2: Evidence at transcript level;
SQ   SEQUENCE   217 AA;  24169 MW;  11B791A87D22A954 CRC64;
     MEQKWRKVLV FLVFCQQTTP GDNQKEDVVK EGAQVDIHCQ PSQAASMTVW FRVRDNSWME
     FIGSFSNGLK KTENNVSSEF THGKINRDIL TLKSFQRQKD SGLYCCASLY KGKELRFGPV
     TQLRGETVEQ KPTVAQTTPR QQPVTTAPAC TCDSSKARGG IGTQLSCAPL ILGPLAGGCG
     LLLLLLIVTA LYCNRVRTRR CPHHYKRKPR GKPPGNK
//
ID   Q2KT90_TAKRU            Unreviewed;       230 AA.
AC   Q2KT90;
DT   07-MAR-2006, integrated into UniProtKB/TrEMBL.
DT   07-MAR-2006, sequence version 1.
DT   16-MAY-2012, entry version 40.
DE   RecName: Full=Cytochrome c oxidase subunit 2;
GN   Name=COII;
OS   Takifugu rubripes (Japanese pufferfish) (Fugu rubripes).
OG   Mitochondrion.
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei;
OC   Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes;
OC   Tetradontoidea; Tetraodontidae; Takifugu.
OX   NCBI_TaxID=31033;
RN   [1]
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=16607039; DOI=10.1266/ggs.81.29;
RA   Yamanoue Y., Miya M., Inoue J.G., Matsuura K., Nishida M.;
RT   "The mitochondrial genome of spotted green pufferfish Tetraodon
RT   nigroviridis (Teleostei: Tetraodontiformes) and divergence time
RT   estimation among model organisms in fishes.";
RL   Genes Genet. Syst. 81:29-39(2006).
CC   -!- FUNCTION: Cytochrome c oxidase is the component of the respiratory
CC       chain that catalyzes the reduction of oxygen to water. Subunits 1-
CC       3 form the functional core of the enzyme complex. Subunit 2
CC       transfers the electrons from cytochrome c via its binuclear copper
CC       A center to the bimetallic center of the catalytic subunit 1 (By
CC       similarity).
CC   -!- COFACTOR: Copper A (By similarity).
CC   -!- SUBCELLULAR LOCATION: Mitochondrion inner membrane; Multi-pass
CC       membrane protein (By similarity).
CC   -!- SIMILARITY: Belongs to the cytochrome c oxidase subunit 2 family.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; AP006045; BAE79207.1; -; Genomic_DNA.
DR   ProteinModelPortal; Q2KT90; -.
DR   SMR; Q2KT90; 1-227.
DR   STRING; Q2KT90; -.
DR   eggNOG; COG1622; -.
DR   HOGENOM; HOG000264988; -.
DR   InParanoid; Q2KT90; -.
DR   OrthoDB; EOG4BZN3N; -.
DR   GO; GO:0016021; C:integral to membrane; IEA:UniProtKB-KW.
DR   GO; GO:0005743; C:mitochondrial inner membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0070469; C:respiratory chain; IEA:UniProtKB-KW.
DR   GO; GO:0005507; F:copper ion binding; IEA:InterPro.
DR   GO; GO:0004129; F:cytochrome-c oxidase activity; IEA:InterPro.
DR   GO; GO:0009055; F:electron carrier activity; IEA:InterPro.
DR   GO; GO:0020037; F:heme binding; IEA:InterPro.
DR   GO; GO:0022904; P:respiratory electron transport chain; IEA:InterPro.
DR   Gene3D; G3DSA:1.10.287.90; COX2_TM; 1.
DR   Gene3D; G3DSA:2.60.40.420; Cupredoxin; 1.
DR   InterPro; IPR001505; Copper_CuA.
DR   InterPro; IPR008972; Cupredoxin.
DR   InterPro; IPR014222; Cyt_c_oxidase_su2.
DR   InterPro; IPR015964; Cyt_c_oxidase_su2-like_TM_dom.
DR   InterPro; IPR002429; Cyt_c_oxidase_su2_C.
DR   InterPro; IPR011759; Cyt_c_oxidase_su2_TM_dom.
DR   Pfam; PF00116; COX2; 1.
DR   Pfam; PF02790; COX2_TM; 1.
DR   PRINTS; PR01166; CYCOXIDASEII.
DR   SUPFAM; SSF49503; Cupredoxin; 1.
DR   SUPFAM; SSF81464; Cyt_c_oxidase_II-like_TM; 1.
DR   TIGRFAMs; TIGR02866; CoxB; 1.
DR   PROSITE; PS00078; COX2; 1.
DR   PROSITE; PS50857; COX2_CUA; 1.
DR   PROSITE; PS50999; COX2_TM; 1.
PE   3: Inferred from homology;
KW   Copper; Electron transport; Membrane; Metal-binding; Mitochondrion;
KW   Mitochondrion inner membrane; Respiratory chain; Transmembrane;
KW   Transmembrane helix; Transport.
SQ   SEQUENCE   230 AA;  26042 MW;  19C6D6E9BDB9D12C CRC64;
     MAHPSQLGFQ DAASPVMEEL LHFHDHALMI VFLISTLVLY IIVAMVSTKL TNKYILDSQE
     IEIIWTILPA IILILIALPS LRILYLMDEI NDPHLTIKAM GHQWYWSYEY TDYSDLAFDS
     YMVPTQDLAP GQFRLLETDH RMVVPVDSPI RILVSAEDVL HSWAVPSLGV KMDAVPGRLN
     QTAFILSRPG VFYGQCSEIC GANHSFMPIV VEAVPLEHFE NWSSLMLEDA
//
ID   Q4JHL5_TAKRU            Unreviewed;       854 AA.
AC   Q4JHL5;
DT   02-AUG-2005, integrated into UniProtKB/TrEMBL.
DT   02-AUG-2005, sequence version 1.
DT   16-MAY-2012, entry version 43.
DE   SubName: Full=Aryl hydrocarbon receptor 1B;
OS   Takifugu rubripes (Japanese pufferfish) (Fugu rubripes).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei;
OC   Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes;
OC   Tetradontoidea; Tetraodontidae; Takifugu.
OX   NCBI_TaxID=31033;
RN   [1]
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=19539383; DOI=10.1016/j.aquatox.2009.05.015;
RA   Merson R.R., Karchner S.I., Hahn M.E.;
RT   "Interaction of fish aryl hydrocarbon receptor paralogs (AHR1 and
RT   AHR2) with the retinoblastoma protein.";
RL   Aquat. Toxicol. 94:47-55(2009).
CC   -!- SIMILARITY: Contains 1 PAC (PAS-associated C-terminal) domain.
CC   -!- SIMILARITY: Contains 1 basic helix-loop-helix (bHLH) domain.
CC   -!- SIMILARITY: Contains 2 PAS (PER-ARNT-SIM) domains.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; DQ088138; AAY96630.1; -; mRNA.
DR   RefSeq; NP_001033048.1; NM_001037959.1.
DR   UniGene; Tru.3516; -.
DR   ProteinModelPortal; Q4JHL5; -.
DR   GeneID; 654292; -.
DR   CTD; 554265; -.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0004872; F:receptor activity; IEA:UniProtKB-KW.
DR   GO; GO:0004871; F:signal transducer activity; IEA:InterPro.
DR   GO; GO:0006355; P:regulation of transcription, DNA-dependent; IEA:UniProtKB-KW.
DR   GO; GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW.
DR   Gene3D; G3DSA:4.10.280.10; HLH_DNA_bd; 1.
DR   InterPro; IPR011598; HLH_DNA-bd.
DR   InterPro; IPR001610; PAC.
DR   InterPro; IPR000014; PAS.
DR   InterPro; IPR013767; PAS_fold.
DR   InterPro; IPR013655; PAS_fold_3.
DR   Pfam; PF00989; PAS; 1.
DR   Pfam; PF08447; PAS_3; 1.
DR   SMART; SM00353; HLH; 1.
DR   SMART; SM00086; PAC; 1.
DR   SMART; SM00091; PAS; 2.
DR   PROSITE; PS50888; HLH; 1.
DR   PROSITE; PS50112; PAS; 1.
PE   2: Evidence at transcript level;
KW   DNA-binding; Nucleus; Receptor; Repeat; Transcription;
KW   Transcription regulation.
SQ   SEQUENCE   854 AA;  93640 MW;  C6B5B41CFE5BCB96 CRC64;
     MYAGRKRRKP LQKGVKPAPT EGAKSNPSKR HRDRLNSELD RLASLLPFSE DVIASLDKLS
     ILRLSVSFLR TKGFFSGVLN NLPSDGINKS SDHGGGGGAA SGAEERRLPE GELLLQALNG
     FVLVVTTEGN IFFCSHTIRD YLGFHQTDVM HQSVFEMIHT EDQQEFRRNL HWGPDTTPTA
     EPETDGESVS TSSLLSCDPD QPPRDNSSFL DRSFICRFRC LLDNTSGFLA LNIQGRLKFL
     HGQHHPQRSS KVSSPPQLAL FAIATPLQPP TILEIRTRNM IFRTKHKLDF TPMACDAKGK
     IVLGYTEAEL RVRGSGYQFI HAADMLYCAE NHVRMIKTGE SGLTVFRLLT KDNRWKWVQA
     NARLVYKNGK PDYIVATQRP LVDEEGGEHL RKRSMHLPFT FATGEAMLYQ TGHPLHSFSE
     SVQGKAKGSK TKKGKQSSSD NLDPKSLLGA LMSQDESVYV CQPDSEPAVS GPSSLLSQQQ
     TDSECSSFLG HNSLHVFSNE TSSYDPLLAT LDSLTLDGED PCSNTEIFNA LENLGLNAED
     LELLLLDERM IQVELGPNHI PTLSDLLTNN EILSYIHNKL ENSPEPADGD AGRYGVNADQ
     AAVPEPPAFV QQSQQMQQHV GSGVPAKAAP PTAEAKGQTR LPNGHWVTNT ANAHQPDNQV
     QPHPVLTPSR LNSELKHLLE SSQQWSQDQL VHYPSHPQVS QDPSLLGFHN QRTINASYIP
     NGHTSFLPSV EVGHTYSITA APCAPAALLN GLTAPDVCHY QSYQQQVSVA QSSTLELEQL
     LGLSQSQHSL PAYAMFNTSA QGSAHSKLEN GCLLNATNAA YIRTCLMPNG NAVVAANVDG
     LSTLQDHQKP GFLL
//
ID   Q50J40_TAKRU            Unreviewed;       372 AA.
AC   Q50J40;
DT   07-JUN-2005, integrated into UniProtKB/TrEMBL.
DT   07-JUN-2005, sequence version 1.
DT   16-MAY-2012, entry version 28.
DE   SubName: Full=Alpha-2,8-sialyltransferase;
DE   Flags: Fragment;
GN   Name=st8Sia V;
OS   Takifugu rubripes (Japanese pufferfish) (Fugu rubripes).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei;
OC   Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes;
OC   Tetradontoidea; Tetraodontidae; Takifugu.
OX   NCBI_TaxID=31033;
RN   [1]
RP   NUCLEOTIDE SEQUENCE.
RA   Lehmann F.;
RT   "Phylogeny of sialyltransferases.";
RL   Submitted (MAY-2004) to the EMBL/GenBank/DDBJ databases.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; AJ705092; CAG29214.1; -; mRNA.
DR   UniGene; Tru.2023; -.
DR   eggNOG; NOG323775; -.
DR   GO; GO:0030173; C:integral to Golgi membrane; IEA:InterPro.
DR   GO; GO:0008373; F:sialyltransferase activity; IEA:InterPro.
DR   GO; GO:0006486; P:protein glycosylation; IEA:InterPro.
DR   Gene3D; G3DSA:3.90.1480.10; A-2_3-sialyltransferase; 1.
DR   InterPro; IPR009251; A-2_3-sialyltransferase.
DR   InterPro; IPR001675; Glyco_trans_29.
DR   InterPro; IPR012163; Sialyl_trans.
DR   Pfam; PF00777; Glyco_transf_29; 1.
DR   PIRSF; PIRSF005557; Sialyl_trans; 1.
PE   2: Evidence at transcript level;
KW   Glycosyltransferase; Golgi apparatus; Membrane; Transferase;
KW   Transmembrane; Transmembrane helix.
FT   NON_TER       1      1
FT   NON_TER     372    372
SQ   SEQUENCE   372 AA;  42962 MW;  74842EC22385F239 CRC64;
     YADTSSGKDI LGNRSLCFIF ICAFGLVTLI QQILSGKNYI KRYLGNYDGP FEYNSTTCRE
     LRQEIMDVKV LTMVKTSDLF ERWRNLQICR WEQNKEETSN FKMSLSRCCN APSFLFTTKR
     NTPAGTKLRY EVDTSGILPI TAEVFKMFPD DMPYSKSQFK KCAVVGNGGI IKNSKCGKEI
     DSADFVFRCN IPPISEKYSA DVGTKTDLVS INPSIITERF QKLEKWRRPF YEVLQNYENS
     SAVLPAFYNT RNTDVSFRVK YMLDDFDSQR GVFFFHPQYL LNVQRFWAVQ GVRAKRLSSG
     LMLVTAALEM CEEVHLYGFW AFPMNPSGIF ITHHYYDNVK PRPGFHAMPH EIFNFIHMHT
     RGIVNVHTGQ CT
//
ID   Q68HA9_TAKRU            Unreviewed;       606 AA.
AC   Q68HA9;
DT   11-OCT-2004, integrated into UniProtKB/TrEMBL.
DT   11-OCT-2004, sequence version 1.
DT   16-MAY-2012, entry version 37.
DE   SubName: Full=E1A-associated protein p300;
DE   Flags: Fragment;
GN   Name=EP300;
OS   Takifugu rubripes (Japanese pufferfish) (Fugu rubripes).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei;
OC   Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes;
OC   Tetradontoidea; Tetraodontidae; Takifugu.
OX   NCBI_TaxID=31033;
RN   [1]
RP   NUCLEOTIDE SEQUENCE.
RA   Fernandes J.M., Mackenzie M.G., Kinghorn J.R., Johnston I.A.;
RT   "Cloning and characterization of the E1A-associated protein p300 gene
RT   in Takifugu rubripes.";
RL   Submitted (JUL-2004) to the EMBL/GenBank/DDBJ databases.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; AY690625; AAT99578.1; -; mRNA.
DR   UniGene; Tru.1841; -.
DR   ProteinModelPortal; Q68HA9; -.
DR   eggNOG; COG5076; -.
DR   HOGENOM; HOG000111353; -.
DR   InParanoid; Q68HA9; -.
DR   OrthoDB; EOG4Z0B4S; -.
DR   GO; GO:0005634; C:nucleus; IEA:InterPro.
DR   GO; GO:0004402; F:histone acetyltransferase activity; IEA:InterPro.
DR   GO; GO:0003712; F:transcription cofactor activity; IEA:InterPro.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   GO; GO:0006355; P:regulation of transcription, DNA-dependent; IEA:InterPro.
DR   Gene3D; G3DSA:1.10.246.20; KIX; 1.
DR   Gene3D; G3DSA:1.20.1020.10; Znf_TAZ; 1.
DR   InterPro; IPR003101; KIX.
DR   InterPro; IPR000197; Znf_TAZ.
DR   Pfam; PF02172; KIX; 1.
DR   Pfam; PF02135; zf-TAZ; 1.
DR   SMART; SM00551; ZnF_TAZ; 1.
DR   SUPFAM; SSF47040; KIX; 1.
DR   SUPFAM; SSF57933; TAZ_finger; 1.
DR   PROSITE; PS50952; KIX; 1.
DR   PROSITE; PS50134; ZF_TAZ; 1.
PE   2: Evidence at transcript level;
FT   NON_TER     606    606
SQ   SEQUENCE   606 AA;  62682 MW;  B60774021C25DA7A CRC64;
     MADNVLDSGA PSAKRPKLSS PALSVSASDG NDFGSPFELE QDLPDELISS ADLGLSNGGD
     LSQLHTSPSG PLGGLGLGGQ DAASKHKQLS ELLRAGAPPQ QGVPASNSTA PGAPMGMMGG
     VGVSPGGPQG MHPHGQPQQP GLMPQVGMVG GVAALSRVAA MMGTQKGNPG QQPHGMMGGQ
     VMNGSPRMGY PGNTGMGNNS NLLADTLQQQ QQQGGQQMVP GAQATMRPQQ PGALNKMNMM
     ANTGPYAGPY SQSAGQALPG AGLGSPLQNK ASMPNMLNQF NVDKKTLPGM LTAATGVGPV
     GLGPVGVGPS AGPPTADPEK RKLIQQQLVL LLHAHKCQRR EQANGEVRQC NLPHCRTMKN
     VLNHMTHCQA GKSCQVAHCA SSRQIISHWK NCTRHDCPVC LPLKNAGDKR NQQSLLNSAG
     VGLVNSLGAG LPGGQSNNPN LNPPNQIDPS SIERAYAALG LTYQGNQMPQ QPSQPNMPTQ
     GLQGQPGMGA LNSMGGNSMG VNGGVGVQPP NQQSAVLSNA MLHSNMNAQS LLNDGVANVG
     SMPTAAPSAA GIRKSWHEDI TQDLRNHLVH KLVQAIFPTP DPAALKDRRM ENLVAYARKV
     EGDMYE
//