Web Resources
From Bioinformatics Lab
Knowledgebase, GATEWAY DBs, Genome/Gene Annotations
- Scitable by Nature Learn Science at Nature Education
- Plant Physiology web
- EBI
- NCBI
- GOLD Genome Online Database-Genome project statistics and download links
- UCSC genome browser home
- ENSEMBL genome browser home
- CCDS The concensus protein coding regions among NCBI, Ensembl, and Sanger (Havana) annotation
- GENCODE The Encyclopedia of Genes
- DBTSS Transcription start site DB (with tissue specific information)
- EPD The Eukaryotic Promoter Database
- TiProD The tissue-specific Promoter DB
The Encyclopedia of DNA Element (ENCODE) Data links
- ENCODE data summary List all released/approved experiments
- ENCODE Encyclopedia of DNA Elements project (human)
- ENCODE explore Access to collected papers exploring ENCODE data
- UWencode ENCODE (human and mouse) data browser and download site
- RNA Dashboard DB for raw transcriptome data from ENCODE
- Mouse ENCODE ENCODE for mouse
- modENCODE ENCODE for animal models (worm, fly)
NGS data analysis tools
- BEDOPSA suite for common genome analysis tasks with high scalability and flexibility
- BEDTools A suite for BED (Browser Extensible Data) and GFF (General Feature Format) format.
- SAMtools A suite for SAM (Sequence Alignment/Map) format
- Homer A suite of tools for Motif Discovery and NGS (ChIP-Seq, RNA-Seq, DNase-Seq, Hi-C). Excellent documentation!
- F-Seq A Feature Density Estimator for High-Throughput Sequence Tags
Pathway Annotation DBs
- Gene Ontology by Gene Ontology Consortium
- AgBase Curated DB for functional analysis of agriculural animals and plants
- UniProt-GOA by EBI (support multi-species annotation)
- UniPathway a fully manually curated resource of metabolic pathways (cross-linked to KEGG, MetaCyc)
- Biocyc includes Metacyc, Ecocyc, Humancyc, Aracyc, Yeastcyc
- Pathway Interaction Database (PID)
- Reactome A manually curated and peer-reviewed pathway DB
Genomic Variation DBs
- NCBI Variation Variation DBs (dbSNP, dbVar, dbGaP, ClinVar)
- 1000 Genome Project Deep catalog of human genetic variation
- Complete Genomics Very accurate 69 human WGS public data and more
- Exome Variant Server by NHLBI GO Exome sequencing project (ESP)
- MutaDATABASE a centralized and standardized DNA variation DB
- HGMD The human gene mutation database (The professional version of DB is commercial.)
- AtPolyDB Everything about Arabidopsis natural variants (by Magnus Nordborg, GMI)
- RegMap panel Reginal Mapping Project for Arabidopsis natural variants (by Joy Bergelson, U on Chicago)
- 1001 Genome Project Genetic variation] of natural population of Arabidopsis (by Detlef Weigel, MPI)
Epigenomics Resources
- Road map Epigenomics NIH Roda map Epigenomics project home
- Epigenie An informative web community for epigenetics-related research
- EpGenSys European network to bring together epigenetic and systems biology
Genotype-to-Phenotype Resources
- dbGaP The database of Genotypes and Phenotypes (GWAS, WGS, Exome-seq...)
- EGA European Genome-phenome Archive (GWAS, WGS, Exome-seq...)
- GWAS catalog contains SNPs with p<10^-5
- GWASdb contains SNPs with p<10^-3
- DistiLD Diseases and Traits in Linkage Disequilibrium Blocks
- Personal Genome Project
- NCBI GTex(Genotype-Tissue Expression) browser eQTL data download and analysis
Mutation Effect Prediction Tools
- SIFT(Sorting Intolerent from Tolerent substitution)
- PolyPhen-2 (Polymorphism Phenotyping v2)
- RegulomeDB Exploring DNA functional elements for noncoding variants (by Stanford, Snyder lab)
- HaplogReg Exploring DNA functional elements for noncoding variants (by MIT, Kellis lab)
Gene Expression DBs
- GEO
- Arrayexpress
- PLEXdb Gene expression resources for plant and plant pathogens
- AtGenExpress Arabidopsis gene expression DB by Weigel lab
- Connectivity Map Expression profiles from cultured human cells treated with bioactive small molecules
- EBI Gene Expression Atlas Gene expression atlas for many organisms collected from various experiments
- Human Cell/tissue-specific gene expression map for 369 different cell and tissue types with 5,372 human samples from GEO
- GXD The mouse Gene Expression Database (by MGI)
- FlyAtlas fly gene expression in 25-17 adult and 8 larval tissues
Protein Expression DBs
- SUBA SUBcellular location DB for Arabidopsis proteins
Protein/Gene Interaction DBs
- Protemic Standard Initiative Common QUery InterfaCe (PSICQUIC)
- International Molecular Exchange Consortium (IMEx)
- iRefWeb a web interface to PPI consolidated from 10 public DB (BIND, BioGRID, CORUM, DIP,IntAct, HPRD, MINT, MPact, MPPI, OPHID(predicted PPIs))
- BIND the Biomolecular Interaction Network Database
- BioGRID
- CORUM Comprehensive Resource of Mammalian Protein Complexes
- DIP Database of Interacting Proteins
- IntAct
- HPRD Human Protein Reference Database
- MINT Molecular Interaction DB
- Mpact Representation of Interaction Data at MIPS
- MPPI Mammalian PPI DB at MIPS
- STRING Known and predicted PPI
TF Regulation DBs
- HOCOMOCO Homo sapiencs comprehensive (TFBS) model collection (include TRANSFAC, JASPAR, ENCODE, other literatures)
- TRANSFAC TFBS(PWM) and more
- JASPAR TFBS(PWM)
- UNIPROBE Universal protein-binding motif inferred from Protein-binding Array experiments
- TRED a transcriptional regulatory element database (contains curated TF-target links for 36 TF families)
- ORegAnno DNA regulatory regions, TFBS, regulatory variants
- AGRIS Arabidopsis Gene Regulatory Information Server (by OSU)
- PlnTFDB Plant TF database by University of Potsdam, Germany
- PlantTFDB Plant TF database by Peking University, China
miRNA and target DBs
- miRBase miRNA database by Manchester University
- microRNA.org download miRNA expression atlas for human, mouse, rat
- miRGator data for miRNA expression, miRNA-mRNA paired expression profile, miRNA perturbation experiments...
- TarBase Manually curated microRNA-target links (includes high throughput evidences) download page
- miRecords Manually curated microRNA-target links + predicted links (by 11 computational algorithms)
- miRTarBase Manually curated microRNA-target links
- miR2Disease Manually curated microRNA-target links and microRNA-disease links
- Carrington Lab Resource Various DBs for plant miRNA
Phenotype/Disease Annotation DBs
- Disease Ontology Disease ontology files FUNDO DOLite_term-to-genes map
- DGA Disease and Gene Annotation, an integrative set of disease-to-gene, gene-to-gene, disease-to-disease relationships
- Human Phenotype Ontology
- OMIM Human disease DB
- UMLS Unified Medical Language Systems
- ICD International Classification of Disease by WHO
- GenomeRNAi A Phenotype DB for large-scale RNAi screens (human, mouse, fly, worm...)
- OGEE Online GEne Essentiality database
Drug/Bio-active chemical DBs
- PubChem A DB contains drug structure and function by NCBI
- ChEMBL A DB contains drug structure and functions by EBI
- SuperDrug A DB contains 3D-structures of drugs
Drug-Target relationship DBs
- KEGG DRUG contains information about only approved drugs
- STITCH DB for known and predicted chemical-protein interaction
- Drugbank A major DB of drug/target
- Therapeutic Target Database (TTD) A major DB of drug/target
- MATADOR Manually Annotated Targets and Drugs Online Resource
- PDSP Ki DB data warehouse for published and internally-derived Ki, or affinity of drugs at targets
Clinical Trials and Pharmaco/Toxicogenomics DBs
- ClinicalTrials.gov DB for clinical trials conducted around the world
- CTD The Comparative Toxicogenomics database
- CEBS Chemical Effects in Biological Systems, an integrated public repository for toxicogenomics data
- PharmGKB The Parmacogenomics Knowledgebase
- SIDER Side Effect Resource
Organism-centric DBs
- WormBase
- FlyBase
- MGI
- TAIR
- Gramene
- RGAP Rice Genome Annotation Project by MSU (Go get the part list here!)
- MaizeGDB
- PortEco Portal for E. coli
- Pseudomonas Genome Database
- Saccharomyces Genome Database
- Candida Genome Database
- IMG Integrated Microbial Genomes (include metagenome data)
Metagenome DBs
- CAMERA 2.0 Curated DB and analysis tool for metagenomes
- NCBI Metagenome DB Categorized with metagenome types, sources and sequencing methods.
- OMI Open Microbiome Initiative
- EMP Earh Microbiome Project will sequence 10,000 metagenomes in an year, 200,000 metagenomes in 3 years
Cancer Genome/Cell Line Biology DBs
- TCGA The Cancer Genome Atlas
- CGHub The Cancer Genomics Hub by UCSC (a repository for TCGA data)
- ICGC cancer genome project International Cancer Genome Consortium Cancer genome project
- PCGP The Pediatric Cancer Genome Project
- Genomics of Drug Sensitivity in Cancer (GDSC) The largest public DB for drug sensitivity of cancer cell line and biomarkers
- Cancer Cell line Encyclopedia (CCLE) by Broad-Novartis, 1000 cancer cell lines, ~1200 compounds and their combinations
- DTP human tumor cell line screen by NCI-60
- Developmental Therapeutics Program by NCI (contains NCI-60 human tumor cell-line screen data)
- NCI60 mutation data
- GKS Cancer Cell Line Data genomic profiles for 300 cell lines
- COLT-cancer database shRNA-based essential gene profiles for 70 breast, pancreatic, ovarian cancer cell lines
Stem Cell Biology DBs
- SCDE The Stem Cell Discovery Engine
- ESCAPE Embryonic Stem Cell Atlas of Pluripotency Evidence (Many stem cell related networks)
Other Resources
- DREAM Dialogue for Reverse Engineering Assessments and Methods
- Sage Bionetworks
- Assay depot Online marketplace for pharmaceutical research service
- CAGI Critical Assessment of Genome Interpretation
- Numedii New Indications for Medicines
- nightscience Collective creativity in scientific discovery games