Bioinfo U2 KD 2
Bioinfo U2 KD 2
Thus, biological databases are organized collections of biological information that are
stored in a digital format. They are essential resources for researchers and scientists to access,
analyze, and share data in areas such as genomics, proteomics, molecular biology, and more.
These databases facilitate the management of large volumes of biological data and are widely
used to store sequences of DNA, RNA, proteins, metabolic pathways, and other biological data
types. Biological databases come in various types, each with its own purpose and scope, and
can be accessed and queried to retrieve relevant biological information for research and
analysis.
Biological databases can be classified based on the type of data they store or the purpose they
serve. The major types include:
GenBank: An NIH database that contains all publicly available DNA sequences
dbGaP: A database that stores data and results from studies that investigate the
interaction between genotype and phenotype in humans
NCBI genome: A collection of genome sequences, assemblies, and mapped
annotations
UCSC Genome Browser: A database that contains genetic information for
vertebrate model organisms
RefSeq: A collection of human gene-specific reference genomic sequences
Ensembl: A genome browser for vertebrate genomes that supports research in
evolution, comparative genomics, and more
3. Protein Structure Databases: These store 3D structural information of proteins. The
Protein Data Bank (PDB) is a widely used example.
4. Pathway and Interaction Databases: It is a collection of information about the
molecular interactions and relationships between different biological components
within a cell, essentially mapping out the complex pathways that govern cellular
processes, often with a focus on specific biological functions. KEGG (Kyoto
Encyclopedia of Genes and Genomes) is a knowledge base for systematic analysis of
gene functions, linking genomic information with higher order functional information.
5. Literature Databases: These focus on storing scientific literature, publications, and
annotations. PubMed is a well-known example for biomedical research.
Popular Biological Database Retrieval Systems include tools like BLAST (Basic Local
Alignment Search Tool), Ensembl, UCSC Genome Browser, and Gene Ontology.
The advantage of these retrieval systems is that they not only return matches to a query but
also provide handy pointers to additional important information in related databases. In using
any of these systems, queries can be as simple as entering the accession number of a newly
published sequence than or as complex as searching multiple database fields for specific terms.
Depending on the type of data at hand, there are two basic ways of searching: Using
descriptive words to search - text databases and using a nucleotide or protein sequence to search
- Sequence databases.