/usr/share/EMBOSS/test/data/testseqs.ncbi is in emboss-test 6.6.0-1.
This file is owned by root:root, with mode 0o644.
The actual contents of the file can be viewed below.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 | >gi|2285|emb|CAA00003.1| (A00040) DHFR [Penicillium chrysogenum]gi|491347|emb|C AA01033.1| (A12434) R388-dihydrofolate reductase gene [synthetic construct] **from blast/db/pataa (PID as acc, no id, original id in brackets)
MGQSSDEANAPVAGQFALPLSATFGLGDRVRKKSGAAWQGQVVGWYCTKLTPEGYAVESESHPGSVQIYPVAALERVA
>gi|14406|emb|A00021.1|A00021 Artificial DNA sequence of the Beta-EA2-Block Leu-15 (reverse complement) ** from blast/db/patnt
CCTGCCTTTGCATTGTAGAAGTAACGGATGATACGAGCCAGGCAG
>gi|5835182|ref|NP_007229.1|ATP8_10587 ATP synthase F0 subunit 8 [Rattus norvegicus] ** from refseq/rat.faa
MPQLDTSTWFITIISSMATLFILFQLKISSQTFPAPPSPKTMATEKTNNPWESKWTKIYLPLSLPPQ
>gi|6978439|ref|NP_036622.1| acrosin [Rattus norvegicus] ** from refseq/rat.faa (no locus)
MVEMLPTVVALVLAVSVVAKDGPCGLRFRQNPQAGIRIVGGQTSSRWAWPWMVSLQIFTSHNSRRYHACG
GSLLNSHWVLTAAHCFDNKKRKVYDWRLVFGAHEIEYGRNKPVKEPQQERYVQKIVIHEKYNAVTEGNDI
ALLKVTPPVTCGDFVGPGCLPHFKSGPPRIPHTCYVTGWGYIKDNAPRPSPVLMEARVDLIDLDLCNSTQ
WYNGRVTSTNVCAGYPEGKIDTCQGDSGGPLMCRDTRRQPLCDRGDHELGGRLCRAKRPGVYTATWDYLD
WIASKIGPTALHLIQPATPHPPTTQQPVISFHPPSTPPSLVLPTPVSSAALPTPPRPLLHQPSSVHTSSA
PVIPLLSLLTPVQPVSFTLAAYHTRHHTTLSFASALQHLIEALKMRTYPIKYPSRYSGPVNYQHRFSTFE
PLSNKPSEPLLHS
>gi|5835182|ref|NP_007229.1|ATP8_10587 ATP synthase F0 subunit 8 [Rattus norvegicus] ** from refseq/rat.fsa
MPQLDTSTWFITIISSMATLFILFQLKISSQTFPAPPSPKTMATEKTNNPWESKWTKIYLPLSLPPQ
>gi|8392912|ref|NP_036633.1| apolipoprotein C-III [Rattus norvegicus] ** from refseq/rat.fsa (no locus)
MQPRMLLIVALVALLASARADEGEGSLLLGSMQGYMEQASKTVQDALSSMQESDIAVVASRGWMDNRFKS
LKGYWSKFTDKFTGLWESGPEDQLTTPTLEP
>lcl|JK5 No definition line found ** from blast/db/humdjgene.nc
GGATCACCTTCGGCCAAGGGACACGACTGGAGATTAAAC
>lcl|D1-1 No definition line found ** from blast/db/humdjgene.nc
GGTACAACTGGAACGAC
>gi|229743|pdb|1CGC|B Chain B, DNA (5'-d(CpCpGpGpCpGpCpCpGpG)-3') ** from blast/db/pdbnt
CCGGCGCCGG
>gi|229780|pdb|1D10| DNA (5(Prime)-d(CpGpApTpCpG)-3(Prime)) Complex With Daunomycin ** from blast/db/pdbnt (no chain)
CGATCG
>gi|229672|pdb|1AL1| Alpha - 1 (Amphiphilic Alpha Helix)gi|3891468|pdb|3AL1|A Chain A, Designed Peptide Alpha-1, Racemic P1bar Form ** from blast/db/pdbaa (no chain)
XELLKKLLEELKG
>gi|229659|pdb|1AAP|A Chain A, Protease Inhibitor Domain Of Alzheimer's Amyloid Beta-Protein Precursor (APPI)gi|229660|pdb|1AAP|B Chain B, Protease Inhibitor Domain Of Alzheimer's Amyloid Beta-Protein Precursor (APPI) ** from blast/db/pdbaa
VREVCSEQAETGPCRAMISRWYFDVTEGKCAPFFYGGCGGNRNNFDTEEYCMAVCGSA
>gi|39394|emb|X66730.1|BBPLAS B.bronchiseptica plasmid pBBR1 genes for mobilization and replication ** from blast/db/vector
CTCGGGCCGTCTCTTGGGCTTGATCGGCCTTCTTGCGCATCTCACGCGCTCCTGCGGCGGCCTGTAGGGC
AGGCTCATACCCCTGCCGAACCGCTTTTGTCAGCCGGTCGGCCACGGCTTCCGGCGTCTCAACGCGCTTT
GAGATTCCCAGCTTTTCGGCCAATCCCTGCGGTGCATAGGCGCGTGGCTCGACCGCTTGCGGGCTGATGG
>gi|208958|gb|J01749.1|SYNPBR322 Cloning vector pBR322, complete genome ** from blast/db/vector
TTCTCATGTTTGACAGCTTATCATCGATAAGCTTTAATGCGGTAGTTTATCACAGTTAAATTGCTAACGC
AGTCAGGCACCGTGTATGAAATCTAACAATGCGCTCATCGTCATCCTCGGCACCGTCACCCTGGATGCTG
TAGGCATAGGCTTGGTTATGCCGGTACTGCCGGGCCTCTTGCGGGATATCGTCCATTCCGACAGCATCGC
>gi|1199492|dbj|D45833.1|SYNPKF4 Enforcement (kyosei-) cloning vector pKF4 DNA, complete sequence ** from blast/db/vector
ATGGCAACAGTCAATCAGCTGGTTCGAAAGCCGCGAGCTCGTAAAGTGGCCAAATCTAACGTTCCGGCTC
TCGAGGCATGCCCGTAGAAGCGTGGCATATGCACACGCGTATACACTACTACTCCGAAGAAACCGAATTC
AGCGCTGCGCAAGCTTTGCCGCGTACGCCTGACCAACGGTTTCGAGGTCACCTCATATATAGGTGGTGAA
>gnl|alu|HSU14568_Alu_Sb_consensus_rf1 ** from blast/db/alu.a
grarwltpvipalweaeaggsrgqeietilantvkprlyXkyknXpgvvagacspsysgg
XgrrmaXtreaelavsrdratalqpgrqsetpsqkk
>gnl|alu|M38064_HSAL002939 (Alu-Sx) ** from blast/db/alu.n
GGCTGGGTGCGGTGACTCATCCTGGAATCCAGCACTTTGGGAGGCGAGGCAGGTGGATCA
CTGAGGTCAGCGAGTTTCGATGACCACCCTGGCCAACATAGTAAAACCCTGTCTCTACTA
AAAATTACAAAATTAGCTCAGTGTGGTGGTAGGCGCCTGTAGTCCCAGCTACTCTGGAGG
CTGAGGCAAGAGAATCACTTGAACCTGGGAGGCAGAAGTTTCAGCAAGCTGAGACTGCAC
CACTGCACTTCAGCCTGGGAGGCAGAAGTTTCAGCAAGCTGAGACTGCACCACTGCACTT
CAGCCTGGGAGACAGAGCAAGACTCCATCTCAAAACAAAAAACAAAACAAAAAAAAGAAA
AGAAATAGATGTAGTCAGA
>lcl|DFL16.1 No definition line found ** from blast/db/modjgene
TTTATTACTACGGTAGTAGCTAC
>lcl|45.21.1 No definition line found ** from blast/db/migallaaseq
EVQLQQSGPELVKPGASVKISCKASGYTFTDYYMNWVKQSHGKSLEWIGDINPNNGGTSYNQKFKGKATLTVDKSSSTAY
MGLRSLTSEDSAVYYCAR
>lcl|VMU-3.2 No definition line found ** from blast/db/migallaaseq
QVQLQQSGPELVKPGASVKISCKASGYAFSSSWMNWVKQRPGKGLEWIGRIYPGDGDTNYNGKFKGKATLTADKSSSTAY
MQLSSLTSEDSAVYFCAR
>gnl|dbSNP|rs20323_allelePos=51 total len = 101 |taxid = 10116|snpClass = 1 ** from /snp/rat/rs_fasta/rs_10116.fas
tatgtatatg tagctacatg tgtataaata tattacatat acaagtgtgc
R
catgtataaa cacatacata tgtacacata ggtatatatg catgtatGCA
>gi|14209819|gb|AAK56855.1|AF263993_1 (AF263993) pipe sulfotransferase box 3 isoform [Drosophila melanogaster] ** from blast/db/month.aa (this is a protein)
MSLNAERSYKMKLRDVENAFKYRRIPYPKRSVELIALLAISCTFFLFMHTNKLNSRLKEMEVKLQPSEFSALGLTGNHIS
GHDAGKHDDINTLHGTYQYLKSTGQLWRLNPKFLNNTKFHFRDIIFYNRVPKTGSETLIELMIQLGKKNDFQNERSPFSK
PTGMYWDVKRQKQEATRILELQEEPAFVYVEHMNYMNIRPFHLPQPIYINMIRDPVERVISWFYYKRTPWNSVKMYKVTG
KFQNRTHYTKNFEECVLTHDPECRYDYGLLFKDDSADHKRQSLFFCGHSPICEPFNTPAAIARAKQNVERDFSVVGSWED
TNVTLTVLEHYIPRFFKGTMELYYEPNIGLAFKKANINPWKPKISERIKQIMRANFTQEYEFYYFCKQRLYRQYFAINKQ
MHF
>gi|14285490|sp|Q9RYJ6|HYPA_DEIRA PROBABLE HYDROGENASE NICKEL INCORPORATION PROTEIN HYPA ** from /blast/db/month.aa
MHEASIALALIDVAGDVLREHGAARASALTVRVGQWSSVVPEALAAAFPACAEGTPLAGARLSIERVPGVGECPQHGPVE
LEVWRGLRCPLCGAPTPRLLQGDELELDQLELDQLELENL
>gi|14285506|sp|Q9HE76|KAD_NEUCR PROBABLE ADENYLATE KINASE (ATP-AMP TRANSPHOSPHORYLASE) ** from /blast/db/month.aa
MVLMGPPGAGKGTQAPKIKEKFNCCHLATGDMLRAQVAKGTALGKQAKKIMNEGGLVSDDIVIGMIKDELENNKECQGGF
ILDGFPRTVPQAEGLDAMLRERNLPLQHAVELKIDDSLLVARITGRLVHPASGRSYHRIFNPPKDDMKDDITGEPLVQRS
DDNAEALRKRLETYHKQTAPVVGYYQNTGIWKAIDASQEPAQVWKSLLAIFEGDKAKASSAGSGIMSKIASAAKSS
>gi|14285516|sp|Q9K0D9|KTHY_NEIMB THYMIDYLATE KINASE (DTMP KINASE) ** from /blast/db/month.aa
MKPQFITLDGIDGAGKSTNLAVIKAWFERRGLPVLFTREPGGTPVGEALREILLNPETKAGLRAETLMMFAARMQHIEEV
ILPALSDGIHVVSDRFTDATFAYQGGGRGMPSEDIEILEHWVQGGLKPDLTLLLDVPLEVSMARIGQTREKDRFEQEQAD
FFMRVRGVYLDRAAACPERYAVIDSNRNLDEVRNSIEKVLDGHFGC
>gnl|UG|Rn#S5294 C06734 Rattus norvegicus cDNA /gb=C06734 /gi=1503510 /ug=Rn.2 /len=140 ** from /repository/UniGene/rn.seq.all
GTTGCAGCGGCCGATGCCGGTGAATCAGCACGGGTTTTTTGGACTCGGAGGTCGTGCANA
TCTGCTGGACTTGGGTCCGGGGAGTCCTGGTGATGGGCTGAGCCTANCCGCGCCGAGCTG
GGGTGTCCCGGAGGAGCCAC
>gb|F236785_1|AF236785 Fusarium subglutinans strain MRC3478 beta-tubulin gene, e
xons 4 and 5 and partial cds. 15-FEB-2001 ** from genpept
LRKLAVNMVPFPRLHFFMVGFAPLTSRGAHSFRAVSVPELTQQMFDPKNMMAASDFRNGR
YLTCSAIFRGRVAMKEVEDQMRNVQSKNSSYFVEWI
|