Title: Developing Efficient Scientific Gateways for Bioinformatics in Supercomputer Environments Supported by Artificial Intelligence
Abstract: This project aims to develop green and intelligent scientific gateways for bioinformatics supported by
high-performance computing environments (HPC) and specialized technologies such as scientific workflows, data mining,
machine learning, and deep learning. The efficient analysis and interpretation of Big Data open new challenges to explore
molecular biology, genetics, biomedical, and healthcare to improve personalized diagnostics and therapeutics; then, it
becomes necessary to availability of new avenues to deal with this massive amount of information. New paradigms in
Bioinformatics and Computational Biology drive the storing, managing, and accessing of data. HPC and Big Data advances
in this domain represent a vast new field of opportunities for bioinformatics researchers and a significant challenge.
The BioinfoPortal (https://bioinfo.lncc.br/) science gateway is a multiuser Brazilian infrastructure for bioinformatics
applications, benefiting from the HPC infrastructure. We present several challenges for efficiently executing applications
and discuss the findings on how to improve the use of computational resources. We performed several large-scale
bioinformatics experiments that are considered computationally intensive and time-consuming. We are currently coupling
artificial intelligence to generate models to analyze computational and bioinformatics metadata to understand how
automatic learning can predict computational resources’ efficient use. The computational executions are carried out
at Santos Dumont (SDumont, https://sdumont.lncc.br/), the largest supercomputer in Latin America that has 5.1 Petaflops
and 36,472 computational cores distributed in 1,134 computational nodes.
Bio: Carla Osthoff holds a degree in Electrical Engineering from the Pontifical Catholic University of Rio de Janeiro (1983), a Master's degree in Systems and Computer Engineering from the Federal University of Rio de Janeiro (1989), and a DSc. in Systems and Computer Engineering from the Federal University of Rio de Janeiro (2000). She has been working in the area of high-performance processing since 1985, initially in hardware development projects for distributed parallel multiprocessors and later as a researcher in Computer Architecture. Currently, she is a researcher in the area of High-Performance Computing at the National Laboratory for Scientific Computing (LNCC), is a professor at the Multidisciplinary Postgraduate Program at the National Laboratory for Scientific Computing and coordinates the National Center for High-Performance Processing (CENAPAD) at the LNCC., is a member of the Technical-Scientific body of the Santos Dumont Supercomputer Advisory Committee and coordinates the High-Performance Processing Sector of LNCC, which has several collaborative projects in the area of High-Performance Computing. Topics of interest are high-performance computing, distributed systems, parallel processing, parallel I/O systems, parallel programming models, and scientific computing.