Web Resources
From Bioinformatics Lab
				
								
				
				
																
				
				
								
				Knowledgebase, GATEWAY DBs, Genome/Gene Annotations
- Scitable by Nature Learn Science at Nature Education
 - Plant Physiology web
 - EBI
 - NCBI
 - GOLD Genome Online Database-Genome project statistics and download links
 - UCSC genome browser home
 - ENSEMBL genome browser home
 - CCDS The concensus protein coding regions among NCBI, Ensembl, and Sanger (Havana) annotation
 - GENCODE The Encyclopedia of Genes
 - DBTSS Transcription start site DB (with tissue specific information)
 - EPD The Eukaryotic Promoter Database
 - TiProD The tissue-specific Promoter DB
 
The Encyclopedia of DNA Element (ENCODE) Data links
- ENCODE data summary List all released/approved experiments
 - ENCODE Encyclopedia of DNA Elements project (human)
 - ENCODE explore Access to collected papers exploring ENCODE data
 - UWencode ENCODE (human and mouse) data browser and download site
 - RNA Dashboard DB for raw transcriptome data from ENCODE
 - Mouse ENCODE ENCODE for mouse
 - modENCODE ENCODE for animal models (worm, fly)
 
NGS data analysis tools
- BEDOPSA suite for common genome analysis tasks with high scalability and flexibility
 - BEDTools A suite for BED (Browser Extensible Data) and GFF (General Feature Format) format.
 - SAMtools A suite for SAM (Sequence Alignment/Map) format
 - Homer A suite of tools for Motif Discovery and NGS (ChIP-Seq, RNA-Seq, DNase-Seq, Hi-C). Excellent documentation!
 - F-Seq A Feature Density Estimator for High-Throughput Sequence Tags
 
Pathway Annotation DBs
- Gene Ontology by Gene Ontology Consortium
 - AgBase Curated DB for functional analysis of agriculural animals and plants
 - UniProt-GOA by EBI (support multi-species annotation)
 - UniPathway a fully manually curated resource of metabolic pathways (cross-linked to KEGG, MetaCyc)
 - Biocyc includes Metacyc, Ecocyc, Humancyc, Aracyc, Yeastcyc
 - Pathway Interaction Database (PID)
 - Reactome A manually curated and peer-reviewed pathway DB
 
Genomic Variation DBs
- NCBI Variation Variation DBs (dbSNP, dbVar, dbGaP, ClinVar)
 - 1000 Genome Project Deep catalog of human genetic variation
 - Complete Genomics Very accurate 69 human WGS public data and more
 - Exome Variant Server by NHLBI GO Exome sequencing project (ESP)
 - MutaDATABASE a centralized and standardized DNA variation DB
 - HGMD The human gene mutation database (The professional version of DB is commercial.)
 - AtPolyDB Everything about Arabidopsis natural variants (by Magnus Nordborg, GMI)
 - RegMap panel Reginal Mapping Project for Arabidopsis natural variants (by Joy Bergelson, U on Chicago)
 - 1001 Genome Project Genetic variation] of natural population of Arabidopsis (by Detlef Weigel, MPI)
 
Epigenomics Resources
- Road map Epigenomics NIH Roda map Epigenomics project home
 - Epigenie An informative web community for epigenetics-related research
 - EpGenSys European network to bring together epigenetic and systems biology
 
Genotype-to-Phenotype Resources
- dbGaP The database of Genotypes and Phenotypes (GWAS, WGS, Exome-seq...)
 - EGA European Genome-phenome Archive (GWAS, WGS, Exome-seq...)
 - GWAS catalog contains SNPs with p<10^-5
 - GWASdb contains SNPs with p<10^-3
 - DistiLD Diseases and Traits in Linkage Disequilibrium Blocks
 - Personal Genome Project
 - NCBI GTex(Genotype-Tissue Expression) browser eQTL data download and analysis
 
Mutation Effect Prediction Tools
- SIFT(Sorting Intolerent from Tolerent substitution)
 - PolyPhen-2 (Polymorphism Phenotyping v2)
 - RegulomeDB Exploring DNA functional elements for noncoding variants (by Stanford, Snyder lab)
 - HaplogReg Exploring DNA functional elements for noncoding variants (by MIT, Kellis lab)
 
Gene Expression DBs
- GEO
 - Arrayexpress
 - PLEXdb Gene expression resources for plant and plant pathogens
 - AtGenExpress Arabidopsis gene expression DB by Weigel lab
 - Connectivity Map Expression profiles from cultured human cells treated with bioactive small molecules
 - EBI Gene Expression Atlas Gene expression atlas for many organisms collected from various experiments
 - Human Cell/tissue-specific gene expression map for 369 different cell and tissue types with 5,372 human samples from GEO
 - GXD The mouse Gene Expression Database (by MGI)
 - FlyAtlas fly gene expression in 25-17 adult and 8 larval tissues
 
Protein Expression DBs
- SUBA SUBcellular location DB for Arabidopsis proteins
 
Protein/Gene Interaction DBs
- Protemic Standard Initiative Common QUery InterfaCe (PSICQUIC)
 - International Molecular Exchange Consortium (IMEx)
 - iRefWeb a web interface to PPI consolidated from 10 public DB (BIND, BioGRID, CORUM, DIP,IntAct, HPRD, MINT, MPact, MPPI, OPHID(predicted PPIs))
 - BIND the Biomolecular Interaction Network Database
 - BioGRID
 - CORUM Comprehensive Resource of Mammalian Protein Complexes
 - DIP Database of Interacting Proteins
 - IntAct
 - HPRD Human Protein Reference Database
 - MINT Molecular Interaction DB
 - Mpact Representation of Interaction Data at MIPS
 - MPPI Mammalian PPI DB at MIPS
 - STRING Known and predicted PPI
 
TF Regulation DBs
- HOCOMOCO Homo sapiencs comprehensive (TFBS) model collection (include TRANSFAC, JASPAR, ENCODE, other literatures)
 - TRANSFAC TFBS(PWM) and more
 - JASPAR TFBS(PWM)
 - UNIPROBE Universal protein-binding motif inferred from Protein-binding Array experiments
 - TRED a transcriptional regulatory element database (contains curated TF-target links for 36 TF families)
 - ORegAnno DNA regulatory regions, TFBS, regulatory variants
 - AGRIS Arabidopsis Gene Regulatory Information Server (by OSU)
 - PlnTFDB Plant TF database by University of Potsdam, Germany
 - PlantTFDB Plant TF database by Peking University, China
 
miRNA and target DBs
- miRBase miRNA database by Manchester University
 - microRNA.org download miRNA expression atlas for human, mouse, rat
 - miRGator data for miRNA expression, miRNA-mRNA paired expression profile, miRNA perturbation experiments...
 - TarBase Manually curated microRNA-target links (includes high throughput evidences) download page
 - miRecords Manually curated microRNA-target links + predicted links (by 11 computational algorithms)
 - miRTarBase Manually curated microRNA-target links
 - miR2Disease Manually curated microRNA-target links and microRNA-disease links
 - Carrington Lab Resource Various DBs for plant miRNA
 
Phenotype/Disease Annotation DBs
- Disease Ontology Disease ontology files FUNDO DOLite_term-to-genes map
 - DGA Disease and Gene Annotation, an integrative set of disease-to-gene, gene-to-gene, disease-to-disease relationships
 - Human Phenotype Ontology
 - OMIM Human disease DB
 - UMLS Unified Medical Language Systems
 - ICD International Classification of Disease by WHO
 - GenomeRNAi A Phenotype DB for large-scale RNAi screens (human, mouse, fly, worm...)
 - OGEE Online GEne Essentiality database
 
Drug/Bio-active chemical DBs
- PubChem A DB contains drug structure and function by NCBI
 - ChEMBL A DB contains drug structure and functions by EBI
 - SuperDrug A DB contains 3D-structures of drugs
 
Drug-Target relationship DBs
- KEGG DRUG contains information about only approved drugs
 - STITCH DB for known and predicted chemical-protein interaction
 - Drugbank A major DB of drug/target
 - Therapeutic Target Database (TTD) A major DB of drug/target
 - MATADOR Manually Annotated Targets and Drugs Online Resource
 - PDSP Ki DB data warehouse for published and internally-derived Ki, or affinity of drugs at targets
 
Clinical Trials and Pharmaco/Toxicogenomics DBs
- ClinicalTrials.gov DB for clinical trials conducted around the world
 - CTD The Comparative Toxicogenomics database
 - CEBS Chemical Effects in Biological Systems, an integrated public repository for toxicogenomics data
 - PharmGKB The Parmacogenomics Knowledgebase
 - SIDER Side Effect Resource
 
Organism-centric DBs
- WormBase
 - FlyBase
 - MGI
 - TAIR
 - Gramene
 - RGAP Rice Genome Annotation Project by MSU (Go get the part list here!)
 - MaizeGDB
 - PortEco Portal for E. coli
 - Pseudomonas Genome Database
 - Saccharomyces Genome Database
 - Candida Genome Database
 - IMG Integrated Microbial Genomes (include metagenome data)
 
Metagenome DBs
- CAMERA 2.0 Curated DB and analysis tool for metagenomes
 - NCBI Metagenome DB Categorized with metagenome types, sources and sequencing methods.
 - OMI Open Microbiome Initiative
 - EMP Earh Microbiome Project will sequence 10,000 metagenomes in an year, 200,000 metagenomes in 3 years
 
Cancer Genome/Cell Line Biology DBs
- TCGA The Cancer Genome Atlas
 - CGHub The Cancer Genomics Hub by UCSC (a repository for TCGA data)
 - ICGC cancer genome project International Cancer Genome Consortium Cancer genome project
 - PCGP The Pediatric Cancer Genome Project
 - Genomics of Drug Sensitivity in Cancer (GDSC) The largest public DB for drug sensitivity of cancer cell line and biomarkers
 - Cancer Cell line Encyclopedia (CCLE) by Broad-Novartis, 1000 cancer cell lines, ~1200 compounds and their combinations
 - DTP human tumor cell line screen by NCI-60
 - Developmental Therapeutics Program by NCI (contains NCI-60 human tumor cell-line screen data)
 - NCI60 mutation data
 - GKS Cancer Cell Line Data genomic profiles for 300 cell lines
 - COLT-cancer database shRNA-based essential gene profiles for 70 breast, pancreatic, ovarian cancer cell lines
 
Stem Cell Biology DBs
- SCDE The Stem Cell Discovery Engine
 - ESCAPE Embryonic Stem Cell Atlas of Pluripotency Evidence (Many stem cell related networks)
 
Other Resources
- DREAM Dialogue for Reverse Engineering Assessments and Methods
 - Sage Bionetworks
 - Assay depot Online marketplace for pharmaceutical research service
 - CAGI Critical Assessment of Genome Interpretation
 - Numedii New Indications for Medicines
 - nightscience Collective creativity in scientific discovery games