A probabilistic functional gene network of 18,714 validated protein-encoding genes of Homo sapiens (by NCBI March 2007), constructed by a modified Bayesian integration of 21 types of 'omics' data from multiple organisms.
Information about genes and proteins presented as "literature networks" based on instances where gene or protein names appear in articles together, providing a way to visualize possible direct or indirect connections (e.g., biological interactions).
GeneCards is a searchable, integrative database that provides comprehensive, user-friendly information on all annotated and predicted human genes. It automatically integrates gene-centric data from ~125 web sources, including genomic, transcriptomic, proteomic, genetic, clinical and functional information.
Information about genes and proteins hyperlinked via relevant words found in PubMed abstracts, creating a network of protein-related topics from the research literature.
The EIGENSOFT package combines functionality from our population genetics methods (Patterson et al. 2006) and EIGENSTRAT stratification correction method (Price et al. 2006)
Program for quantifying abundances of transcripts from RNA-Seq data: it doesn’t require alignment to a genome, it uses a technique called “pseudoalignment” and is fast.
The Arabidopsis Information Resource (TAIR) maintains a database of genetic and molecular biology data for the model higher plant Arabidopsis thaliana. Data available from TAIR include the complete genome sequence along with gene structure, gene product information, gene expression, DNA and seed stocks, genome maps, genetic and physical markers, and publications. Gene product function data is updated every week from the latest published research literature and community data submissions.
BreakDancer, released under GPLv3, is a Perl/C++ package that provides genome-wide detection of structural variants from next generation paired-end sequencing reads.