X-Meeting, BSB and Genética Drops speakers
Network science is an emerging field of research that analyzes complex networks of biological data (and people). In this seminar I will show how this approach can be used for projects related to human health
programThe field of metagenomics has expanded rapidly over the past decade, both in terms of the number and diversity of different datasets. Furthermore, with application of either deep sequenced short-read or long-read sequencing the depth of the biological insight that can be gleaned from the sequence data has changed dramatically. One area of significant growth has been the assembly of datasets, and the subsequent elucidation of genomes, so called metagenome assembled genomes (MAG). In this presentation I will describe out latest efforts in understanding the microbial biodiversity found in different environmental samples. Having these large contiguous units also allows deeper insights into the metabolic functions encoded, and which microbes are producing them. For example, biosynthetic gene clusters (BGCs) encode the genes necessary for natural products such as antimicrobials and signalling molecules that can play major roles in various ecological processes. Many of these natural products have been exploited for industrial biotechnology or pharmaceutical applications. Thus, accurate identification of BGCs in (meta)genomic data is key to unveiling ecological dynamics and/or the discovery of new commercially important products. Thus, I will also present a new machine learning based tool for BGC-detection in either genomic or metagenomic assemblies, called SanntiS. Compared to other tools, our benchmarks show that our tool outperforms in the ability to detect BGCs across different classes, and notably retains precision in metagenomic datasets. Application to our metagenomic assemblies has revealed millions of potential BGCs, many of which are likely to give rise to new natural products.
program In this talk, I will describe some recent work from our group on the development of deep learning approaches towards predicting the properties and activity of compounds and proteins, based on different representations and model classes from traditional machine learning and deep learning.
I will also address the development of some tools and applications of deep generative models, to create novel compounds with desired activities, and how we use multiobjective Evolutionary Computation to guide the search of these compounds towards different aims. A case study
on computationally designing novel sweeteners will be used to illustrate the approach.
Bioinformatic researchers contribute to science through development of innovative methods, and through application of established methods to understand data they and their collaborators generate. Bioconductor (https://bioconductor.org) plays an essential role, providing a way for researchers to share their scientific contributions with a wide audience while encouraging many ‘best practices’ in software development, dissemination, and use. But science and technology change quickly. This talk addresses how R / Bioconductor adapts to challenges like single-cell and spatial genomics, multi-omics, using R with other software languages, containerization, and cloud computing initiatives.
program