blast数据库含义

最新推荐文章于 2025-01-11 17:07:29 发布

翻译最新推荐文章于 2025-01-11 17:07:29 发布 · 6.3k 阅读

生物信息学专栏收录该内容

10 篇文章

订阅专栏

blast的数据库里面有这几个数据库，每一个的具体含义：

https://ncisf.org/index.php?q=software-databases/blast-databases

A list of the databases available on the cluster, including information about the database, it's source, update method and description.

All databases are located in /sw/db

Name	Type	Update Method	Source	Description
nt	nucleic	Automatic - NCBI formatted.	ftp://ftp.ncbi.nih.gov/blast/db/nt.*	nucleotide sequence database, with entries from all traditional divisions of GenBank, EMBL, and DDBJ excluding bulk divisions (gss, sts, pat, est, and htg divisions. wgs entries are also excluded. Not non-redundant.
nr	protein	Automatic - NCBI formatted.	ftp://ftp.ncbi.nih.gov/blast/db/nr.*	non-redundant protein squence database with entries from GenPept, Swissprot, PIR, PDF, PDB and NCBI RefSeq
swissprot	protein	Automatic - NCBI formatted.	ftp://ftp.ncbi.nih.gov/blast/db/swissprot.tar.gz	swiss-prot sequence databases (last major update), it's parent database is nr.
human_genomic	nucleic	Automatic - NCBI formatted.	ftp://ftp.ncbi.nih.gov/blast/db/human_genomic.*	Human RefSeq (NC_######) chromosome records with gap adjusted concatenated NT_ contigs
est_human	nucleic	Automatic - NCBI formatted.	ftp://ftp.ncbi.nih.gov/blast/db/est_human.*	Alias and mask files for human subset of the est database. These alias and mask files need all volumes of est to function properly.
pataa	protein	Automatic - NCBI formatted.	ftp://ftp.ncbi.nih.gov/blast/db/pataa.*	Patent protein sequence database. Directly from USPTO or from EU/Japan Patent Agencies via EMBL/DDBJ
patnt	nucleic	Automatic - NCBI formatted.	ftp://ftp.ncbi.nih.gov/blast/db/patnt.*	Patent nucleotide sequence database. Directly from USPTO or from EU/Japan Patent Agencies via EMBL/DDBJ
pdbaa	protein	Automatic - NCBI formatted.	ftp://ftp.ncbi.nih.gov/blast/db/pdbaa.*	Protein sequneces from PDB protein structures, it's parent database is nr.
pdbnt	nucleic	Automatic - NCBI formatted.	ftp://ftp.ncbi.nih.gov/blast/db/pdbnt/*	Nucleotide sequences from pdb nucleic acid structures. It's parent database is nt. They are NOT the protein coding sequences for the corresponding pdbaa entries.
sts	nucleic	Automatic - NCBI formatted.	ftp://ftp.ncbi.nih.gov/blast/db/sts.*	Sequences from the STS division of GenBank, EMBL, and DDBJ
vector	nucleic	Automatic - NCBI formatted.	ftp://ftp.ncbi.nih.gov/blast/db/vector.*	Vector sequence database. (Note that for vector screening, NCBI recommend using the UniVec database, please contact support@qfab.org should you require this database).