SoftwareCoded in the lab
BioDiscML (Biomarker Discovery by Machine Learning) is a tool that automates the analysis of complex biological datasets using machine learning methods. From a collection of samples and their associated characteristics BioDiscML produce a minimal subset of biomarkers and a model that will predict efficiently a specified outcome. It uses a large variety of machine learning algorithms to select the best combination of biomarkers for predicting either categorical or continuous outcome from highly unbalanced datasets. The software has been implemented to automate all machine learning steps, including data pre-processing, feature selection, model selection, and performance evaluation.
Varian/t ExplOreR: Exploratory tool for fine-mapping. VEXOR is a platform-independent browser-based integrative environment for functional annotation in R, based on the Shiny package. This interface provides a comprehensive analytical framework to characterize the role of variants driving susceptibility signals in regions defined by GWAS.
R-Omix: Epigenomics Portal
A portal containing novel bioinformatics tools with highly customizable user interface based on the Shiny framework has been developed by our team. Those interfaces offer the possibility to rapidly generate graphs which are easily integrable in publications. All those bioinformatics tools are closely related to epigenomics fields and are hosted on the Compute Canada‘s infrastructure.
Our groups have developed a C++ library designed for Next Generation Sequencing data manipulation. It is specifically tailored to help develop applications that work with genomic regions and features, such as epigenomics marks, gene features and data that are often associated with BED type files.
Bio2RDF: Toward interlinked life science data
The Bio2RDF project uses a data integration approach based on semantic web rules to provide a service to help biologists to understand the mechanisms of life and to better exploit the vast amount of publicly available data.
A package for ChIP-Seq and motif analysis, rGADEM: rGADEM is an efficient de novo motif discovery tool for large-scale genomic sequence data. It is an open-source R package, which is based on the GADEM software.
A package for ChIP-ChIP and tiling arrays, rMAT: This package is an R version of the package MAT and contains functions to parse and merge Affymetrix BPMAP and CEL tiling array files.
A package for nucleosome positioning, RJMCMC : This package uses informative Multinomial-Dirichlet prior in a t-mixture with reversible jump estimation of nucleosome positions for genome-wide profiling.
A package for single-nucleotide polymorphisms visualization, ShinySNP: This package provide a highly customizable graphical user interface which enable the visualization of single-nucleotide polymorphisms (SNPs).
Permutation analysis, based on Monte Carlo sampling, for testing the hypothesis that the number of conserved differentially methylated elements, between several generations, is associated to an effect inherited from a treatment and that stochastic effect can be dismissed. Available here.