Computational Biology and Bioinformatics Services and Consultations

Consulting for the study design, including the selection of appropriate technologies and experimental parameters

Grant proposal and manuscript preparation support

Analysis of next generation sequencing data, including RNA-sequencing, ChIP-sequencing, whole exome-sequencing, ATAC-sequencing, whole genome re-sequencing, and long-read sequencing (e.g., Oxford Nanopore)

Analysis of single-cell sequencing, spatial transcriptomics, TCR/BCR repertoire profiling, and targeted gene panels

Various downstream and systems-level analyses of results from sequencing and microarray data

Analysis of nCounter NanoString data

Analysis of genomic editing (e.g., shRNA and CRISPR/Cas9 screening) data

Single-cell RNA-seq (scRNA-seq) data analysis services

Basic scRNA-seq data analyses, including quality control, alignment, trimming, assembly, differentiation expression, and clustering

Advanced scRNA-seq data analyses, including cell trajectory analysis, cell type prediction, and co-regulated gene module identification

Interpretation of results

Integrative analysis of single-cell multi-omics data

Spatial transcriptomics data analysis services

Basic spatial transcriptomics data analyses, including quality control, tissue architecture identification, spatial variable gene identification, and differential expression analysis

Advanced spatial transcriptomics data analyses, including deconvolution and cell-cell communication analyses

Interpretation of results

Integrative analysis of spatial multi-omics data

Immune profiling

CyToF data analysis

Analysis of immunome flow panels

Metabolomics data analysis

LC-MS data preprocessing, including quality control assessment of raw chromatograms, peak calling, retention time alignment, and preliminary identification using MS2 data, if available

Interpretation of liquid-chromatography untargeted metabolomics datasets, which include relative abundances of a broad spectrum of polar and non-polar small molecules (< 1500 Daltons)

Statistical analysis to identify biologically relevant metabolites

Pathway enrichment analysis of relevant metabolites

Metagenomic and metatranscriptomics services

Metagenomic data analysis, including species/strain composition profiling, taxonomic analysis, abundance analysis, phylogenetic analysis, and whole-genome shotgun analysis

Metatranscriptomic data analysis, including functional profiling, expression activity analysis,16S ribosomal RNA analysis, and whole-transcriptome shotgun analysis

Public domain and commercial data source mining services

Obtaining multiple data sources, including dbGaP, GEO, ENCODE, TCGA, ICGC, GTEx, and ORIEN Avatar

Implementing integrative analyses of these datasets in concert with locally generated data

Data management and sharing services

Big data transfer

Data storage management

Data/code submission to repositories

Data management and sharing policy development, along with relevant cost estimation

Analysis workflow development and standardization

Reviewing best practices

Defining standard operating procedures

Adopting standard data analysis protocols

Novel bioinformatics tool development

Custom software engineering when existing tools do not meet investigators’ needs

Development of novel computational methods for genomic data analysis and integration

Development of R and Python packages

Cloud computing and web development

Development and management of interactive and dynamic web interfaces

Development and management of cloud infrastructures

Training and Workshops

BISR faculty and staff organize short 1-2 day workshops and seminars focused on bioinformatics data analysis. The workshops cover various areas of bioinformatics and contain hands-on activities to help CCC researchers understand the process of data analysis and the interpretation of the results.

Massively Parallel Sequence Analysis

The sequence output from the Illumina HiSeq 4500. The sequencer output is useful to biologists when the BISR provides several standard data transformations. The BISR provides three classes of service for the sequence analysis: 1) running an automated pipeline from the sequencing machine to compile sequence reads and their quality assignments; 2) running best practice workflows for the analysis of RNA-sequencing, ChIP-sequencing, Exome-sequencing, and whole genome re-sequencing. Final output besides alignment results are gene expression quantification for RNA-sequencing, variant analysis for exome-sequencing and whole genome re-sequencing, and peak detection for ChIP-sequencing datasets; 3) performing downstream analysis including sample comparisons, pathway analysis and integration with private and publicly available datasets.

Analysis of Microarray Methods

The analysis of microarray methods includes mRNA (Affymetrix), SNP and microRNA. Microarray methods results are first analyzed by the Biostatistics Shared Resource to identify genes/probes that have significantly changed expression level in the assay. The BISR not only provides downstream analysis of identifying pathways enriched in the results, but it also helps with data presentation and results submission in public databases.

Publicly Available Datasets Analysis

The Gene Expression Omnibus (GEO) is one of the world’s largest databases in which primary data from nearly all published microarray experiments are stored. The BISR staff analyzes this publicly available data with the biologist.

The Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC) are data portals that allows access to results of thousands of deep sequencing projects. BISR staff implement and run custom workflows and integrate these publicly available datasets to generate new hypotheses or draw new conclusions.

Pathway and Network Analysis of Results from Sequencing and Microarray

Typical analysis of sequencing and microarray data results in set of genes that have altered gene expression or have functional mutations. BISR staff are expert users of the Ingenuity Pathway Analysis program, which enables the identification of biological pathways and networks that are enriched in results. The pathway analysis assists the biologist with navigating the large numbers of gene hits, and it can generate new hypotheses for the biologist to test in the laboratory.

Additionally, the BISR can provide advanced network analysis such as co-expression network analysis, gene regulatory and interaction network analysis, and dense network motif discovery.

Computational Biology and Bioinformatics Services and Consultations