2nd Machine Learning for Personalized Medicine (MLPM), Paris 2014

Methodological aspects in integromics: integrating multiple omics data sets

author: Kristel Van Steen, Department of Electrical Engineering and Computer Science, University of Liège
published: Feb. 17, 2015, recorded: September 2014, views: 2227

Slides

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Lecture popularity: You need to login to cast your vote.

Description

The advent of high-throughput technologies including sequencers and array-based assays (expression, SNP, CpG) have caused the generation of humongous amounts of data often referred to as “Big Data”. The biological datasets are heterogeneous and often include gene expression, genotype, epigenome and other types of data that are referred to as “-omics” data. As a result, there is a strong effort across multi-disciplinary scientific communities to develop robust, computationally efficient and sensible data processing pipelines to effectively analyze “-omics” data in order to extract biologically and clinically relevant information – “useful knowledge”.

The enthusiasm of having access to vast amounts of information resources comes with a caveat. In contrast to single omics studies, integrated omics studies are extremely challenging. These challenges include protocol development for standardizing data generation and pre-processing or cleansing in integrative analysis contexts, development of computationally efficient analytic tools to extract knowledge from dissimilar data types to answer particular research questions, the establishment of validation and replication procedures, and tools to visualize results. However, from a personalized medicine point of view the anticipated advantages are believed to outweigh any difficulty related to “integromics”. The strong interest in the topic has already resulted in the emergence of new integrative cross-disciplinary techniques based on for instance kernel fusion, probabilistic Bayesian networks, correlation networks, statistical data-dimensionality reduction models, and clustering.

In this contribution, we will highlight the key steps involved in omics integration efforts and will summarize main analytic paths. We will then zoom in on a novel integrated analysis framework (based on genomic MB-MDR). This framework will be used as a red thread to discuss main issues, pitfalls and merits of integrated analyses. Unprecedented opportunities lie ahead!

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

Comment:
Name:
Email address:
URL:

make sure you have javascript enabled or clear this field: