DEI Talks | “Software process modeling and test automation: Introducing the Reliable Software Architectures Research Group” by Prof. Přemek Brada

The talk “Software process modeling and test automation: Introducing the Reliable Software Architectures Research Group” will be presented October the 9th, at 15:30, room B031, and will be moderated by Prof. Ana Paiva (DEI).

Abstract:

“In this talk, I will give an overview of research done by the Reliable Software Architectures Research Group at the University of West Bohemia in Pilsen, Czechia. The focus will be on analysing software process data to detect project management (anti-)patterns, where we’ll discuss the challenges in modeling software process elements in a way that is conducive to mapping onto the information gathered in project management tools. We’ll also touch the topic of analyzing software implementations to perform advanced verification and testing.”

About the Speaker:

Přemek Brada is an Associate Professor in Software Engineering at the Department of Computer Science and Engineering, University of West Bohemia in Pilsen, Czechia.  His research has covered the areas of software architecture consistency, interactive methods of architecture visualization, and software development methodologies including analysis of related process data.  He teaches bachelor and master level courses on object-oriented design and modeling, advanced software engineering practices, and also knowledge management. Currently he serves as the head of department, and is a member of the Board of Informatics Europe, the association of European informatics faculties and departments.

GNU Tools Cauldron 2025 brought together international experts at FEUP

From 26 to 28 September, the Faculty of Engineering of the University of Porto (FEUP) hosted the 14th edition of the GNU Tools Cauldron, a world reference technical conference dedicated to the GNU Toolchain and associated open source development tools.

This international meeting was held for the first time in Portugal and brought together around 140 participants from more than a dozen countries, including Canada, Germany, the Czech Republic, the United Kingdom, Ireland, Portugal, the Netherlands, France, India, the United States, Belgium, China, South Africa and Brazil.

An event with history and global impact

Created in 2012, the GNU Tools Cauldron has been organised annually, passing through some of the world’s most prestigious universities, such as the University of Cambridge (UK), Charles University (Czech Republic) and the University of Manchester (UK), and now arriving at the University of Porto. Throughout its history, the event has taken place in cities such as Mountain View, Prague, Cambridge, Manchester, Hebden Bridge, Montreal and Porto. The aim of organising the conference in partnership with higher education institutions is to strengthen the link between the international open source development community and academia, promoting the direct involvement of students and researchers.

This technical conference focuses on the GNU Toolchain – which includes fundamental tools such as gcc and gdb, and utilities and libraries such as binutils and glibc – and associated projects (ltrace, poke, systemtap, valgrind, among others). It is a critical ecosystem for most of the reference Linux distributions (AlmaLinux, CentOS Stream, Debian, Fedora, Gentoo, RHEL, Rocky Linux, SUSE, Oracle Linux), playing a central role in the global supply chain for secure open source software.

Collaboration between industry and academia

The 2025 edition was supported by FEUP’s Department of Computer Engineering (DEI) as co-organiser, bringing the academic community closer to the people who contribute to the GNU toolchain and other open source software. For three days, software developers, researchers, university professors, engineers and students had the opportunity to attend presentations and debates led by international experts in the field of compilers, toolchains and software language standardisation.

Participants included active contributors to international standard-setting bodies such as ISO C, ISO C++, DWARF, OpenMP, POSIX/IEEE and Rust, contributing directly to the evolution of languages and tools used by millions of programmers around the world.
“It’s a pleasure to be hosting this event for the first time in Portugal and, in particular, in Porto. GNU’s contributions have had a profound impact on teaching, research and technological advancement for the common good,” stressed DEI Director Prof João Paiva Cardoso at the opening session.

Institutional and corporate support

The development of the GNU toolchain is part of the GNU Project and is supported by the FSF and a worldwide community of programmers and corporate sponsors.
The GNU Tools Cauldron 2025 was sponsored and supported by important international companies and institutions: AdaCore, AMD, ARM, BayLibre, Embecosm, NVIDIA, Open Source Security, Synopsys, Pretalx (conference management software), Pretix (ticketing platform) and FEUP, which co-organised and logistically supported the event.

Event website: https://conf.gnu-tools-cauldron.org/opo25/
Videos of all the event sessions: https://www.youtube.com/playlist?list=PL_GiHdX17WtxuKn7QYme8EfbBS-RKSn0w

DEI Talks | “Networks, networks, and more networks: applications in humanities, data science, and machine learning” by Prof. Ana Bazzan

The talk ‘Networks, networks, and more networks: applications in humanities, data science, and machine learning’ will be presented on October 1st, at 14:45, in room B004, moderated by Prof. Rosaldo Rossetti (DEI).

Abstract:

“It is known that networks or graphs can be used in machine learning and data science to represent and analyze data that has complex relationships. Besides these uses, networks are also relevant to the overall AI agenda in at least two aspects. First, it relates to automated data gathering and language models in the semantic web, since the actual data have to be acquired in some manner in order to form the graphs. Second, it can be used to accelerate learning tasks, as in the case of reinforcement learning. In this talk I present examples of how data is acquired and used in applications in the Humanities (history, storytelling) in order to discover patterns and/or to investigate assumptions. Then, I discuss applications on data science and machine learning, as for instance the use of networks in reinforcement learning, with examples from urban mobility and car to infrastructure communication.”

About the Speaker:

Ana Bazzan is a Full Professor of Computer Science at the Institute of Informatics, Universidade Federal do Rio Grande do Sul (UFRGS), in Porto Alegre, Brazil. Her research focuses on multiagent systems, in particular on agent-based modeling and simulation (ABMS), and multiagent learning for the transportation domain. Since 1996, she has collaborated with various researchers in the application of ABMS and game theory to social science domains, such as the emergence of cooperation, the prisoner’s dilemma and public goods games. In recent years, she has contributed to different topics regarding smart cities, focusing on transportation, as well as on the synergies between multiagent systems, machine learning, and complex systems. In 2014, Bazzan was General Co-chair of AAMAS (the premier conference in the area of autonomous agents and multiagent systems).

Free Software Festival 2025

Next week, from October 3rd to 5th, the Faculty of Engineering of the University of Porto (FEUP) will not just host an event, but a practical demonstration of the future of technology. The Free Software Festival 2025, with free admission, goes beyond the concept of a simple conference. It positions itself as an open and essential lesson for students, educators, and entrepreneurs on one of the most important—and often invisible—pillars of the digital world: Free Software.

In an era dominated by expensive licenses and closed ecosystems, FSL serves as a powerful reminder that a more democratic, secure, and flexible alternative exists. But what exactly is the importance of free software, and why is an event like this so crucial for the Portuguese educational and business landscape?

A Lesson in Autonomy and Innovation
At the heart of the free software movement lies a simple yet revolutionary idea: the technology we use should serve us, not the other way around. It is based on four fundamental freedoms: the freedom to use, study, share, and, crucially, modify software. This ability to “look under the hood” transforms a student from a mere consumer of technology into an active creator and problem solver.

For the education system, this represents an immense pedagogical opportunity. Schools and universities can equip their labs with cutting-edge operating systems and programming tools, such as Linux or Blender (for 3D modeling), without spending a single cent on licenses. More importantly, it allows students to explore, deconstruct, and understand the code that powers the digital world, fostering critical thinking and innovation from the ground up. The Free Software Festival embodies this idea, with hands-on workshops where participants can learn to code, protect their online privacy, or take their first steps in Artificial Intelligence, using open tools accessible to all.

The Secret Engine of the Digital Economy
For the business sector, the message is equally clear: free software is not a “second-tier” alternative but the engine that drives technological giants. The internet, as we know it, is largely built on open-source technologies. Adopting free software allows Portuguese companies, from startups to SMEs, to drastically reduce operational costs, but the benefits go far beyond savings. It means technological sovereignty: the ability to adapt software to the exact needs of the business without being dependent on a single vendor and their pricing policies. It also means enhanced security, as the code can be audited by a global community that identifies and fixes vulnerabilities transparently and quickly.

The presence at FSL of entities like ESOP (Association of Portuguese Open Source Software Companies) demonstrates that a vibrant business ecosystem is already thriving in Portugal based on this model. The event thus serves as a bridge, showing future engineers the career opportunities in this sector and entrepreneurs the competitive advantages of a strategic investment in open technology.

An Investment in the Future
In short, the Free Software Festival 2025 is much more than a gathering of enthusiasts. It is an investment in the country’s future. It is living proof that, by embracing the principles of collaboration and open knowledge, Portugal can empower its students to become the innovators of tomorrow and strengthen its companies to compete on a global scale. The class is about to begin, and admission is free.

Check the event program and join us!
https://festa2025.softwarelivre.eu

FSL 2025 is supported by the Department of Informatics of Engineering (DEI).

PhD Defense in Informatics Engineering (ProDEI): ”Generative models for soccer”

Candidate:
Tiago Filipe Mendes Neves

Date, Time and Location:
16 September 2025, 15h30, Sala de Atos, Faculdade de Engenharia da Universidade do Porto

President of the Jury:
Pedro Nuno Ferreira da Rosa da Cruz Diniz (PhD), Full Professor, Department of Informatics Engineering, Faculdade de Engenharia da Universidade do Porto

Members:
Keisuke Fujii (PhD), Associate Professor, Department of Intelligent Systems, Graduate School of Informatics of the Nagoya University, Japan;
Jesse Jon Davis (PhD), Full Professor, Department of Computer Science, Faculty of Engineering Science, Katholieke Universiteit Leuven, Belgium;
Luís Paulo Gonçalves dos Reis (PhD), Associate Professor with Habilitation, Departament of Informatics Engineering, Faculdade de Engenharia da Universidade do Porto;
João Pedro Carvalho Leal Mendes Moreira (PhD), Associate Professor, Departament of Informatics Engineering, Faculdade de Engenharia da Universidade do Porto (Supervisor).

The thesis was co-supervised by Luís Jorge Machado da Cunha Meireles (PhD), Senior Psychologist & Data Scientist, FC Porto.

Abstract:

Self-supervised large models that disrupt domains such as language, vision, and biology are transforming the world. However, these generative models that learn the underlying data distribution do not perform at the same level on all tasks. For example, Large Language Models (LLMs) do not yet have concrete applicability in soccer analytics. The models lack reasoning capabilities to provide concrete and actionable insights that can compete with the wide range of case-specific metrics within soccer analytics. While there have been some studies exploring the applicability of generative models in soccer, no study aimed for the moonshot of building a complete self-supervised learning model for soccer event data. Let’s consider the individual events (each shot, pass, tackle, …) in a soccer match the “words” that describe what is happening. We can consider each possession a “sentence,” each game an “essay,” and event data as a whole a “language.” By working within this framework, we have all the tools to build a self-supervised model in the same image as LLMs. The goal of this thesis is to build a foundation self-supervised model for soccer event data – termed Large Events Model (LEM) – and demonstrate its real-world applicability and generality in solving a wide range of tasks, such as simulation and modeling, that would otherwise require multiple different approaches. We propose three approaches to building LEMs: a chain of classifiers, causal mask modeling, and sequential language modeling with transformers. First, the chain of classifiers provides the first generative model that models all aspects of event data without posing restrictions on event types, reaching a level of performance that allows large-scale simulation of soccer matches. Then, we investigate two alternative approaches to remove some of the constraints of the first approach. The causal mask modeling approach using multilayer perceptrons reaches the state-of-the-art performance of several of our proposed benchmarks, providing a set of application-ready models to solve a wide range of soccer analytics tasks. We explore a wide range of applications, from automated strategy search with reinforcement learning to risk-reward behaviors of soccer players. More than a dozen use cases for LEMs are present in this thesis. The implications of our work are far-reaching. LEMs have the potential to become the operating system for event data in soccer analytics. They will transform the way clubs work, with easier access to machine learning models that would otherwise require tremendous modeling effort. With LEMs, the barrier to entry will lower significantly as any club in the world can access a model capable of solving its most relevant problems.

Keywords: generative models; foundation models; sports analytics; deep learning applications; simulation; soccer.

PhD Defense in Informatics Engineering (ProDEI): “Text Information Retrieval in Tetun”

Candidate:
Gabriel de Jesus

Date, Time and Location:
1 September 2025, 14:30, Sala de Atos, Faculdade de Engenharia da Universidade do Porto

President of the Jury:
Rui Filipe Lima Maranhão de Abreu (PhD), Full Professor, Departament of Informatics Engineering, Faculdade de Engenharia da Universidade do Porto

Members:
Arjen P. de Vries (PhD), Full Professor at the Institute for Computing and Information Sciences of the Radboud Universiteit, Nimega, The Netherlands;
Bruno Emanuel da Graça Martins (PhD), Associate Professor, Departament of Electrical and Computer Engineering, Instituto Superior Técnico da Universidade de Lisboa;
Henrique Daniel de Avelar Lopes Cardoso (PhD), Associate Professor, Departament of Informatics Engineering, Faculdade de Engenharia da Universidade do Porto;
Sérgio Sobral Nunes (PhD), Associate Professor, Departament of Informatics Engineering, Faculdade de Engenharia da Universidade do Porto (Supervisor).

Abstract:

Ensuring access to information in all languages is crucial for bridging disparities in communities’ participation in the digital age and fostering a more inclusive and equitable society, particularly for speakers of low-resource languages. However, enabling such access remains a significant challenge for many of these communities. Tetun, a language that transitioned from a dialect to one of Timor-Leste’s official languages when the country restored its independence in 2002, faces similar challenges. According to the 2015 census, Tetun is spoken by approximately 79% of the country’s 1.18 million population. Despite its official status, Tetun remains underserved in language technology. Specifically, information retrieval-based solutions for the language do not exist, making it challenging to find relevant information on the internet and digital platforms for text-based search in Tetun.
This work tackles these challenges by investigating retrieval strategies for text-based search that can enable the application of information retrieval techniques to develop search solutions for Tetun, with a specific focus on the ad-hoc text retrieval task. Given that language-specific algorithms, tools, and document collections for Tetun were previously unavailable, this work began by creating these foundational resources, which serve as contributions relevant to information retrieval and natural language processing domains. These resources include a tokenizer, a language identification model, a stemmer, a stopword list, a document collection, a test collection, baselines for the ad-hoc text retrieval task, and a search log dataset. The contributions to information retrieval for low-resource languages include: (1) A data collection pipeline tailored for low-resource languages to streamline the construction of textual data from the web; (2) A human-in-the-loop methodology for annotating, processing, and constructing a dataset well-suited for a variety of information retrieval and natural language processing tasks; (3) A novel network-based approach for stopword detection; (4) Methodologies for developing a stemmer, designed for a language heavily influenced by loanwords, and the construction of a ground truth set for evaluating stemmer performance; (5) A detailed approach for constructing a test collection to evaluate the effectiveness of retrieval systems; (6) A methodology for establishing a robust baseline for the ad-hoc text retrieval task; and (7) Document contextualization and dual-parameter tuning strategies for hybrid text retrieval. The results from this work contribute to the development of technologies associated with the computational processing of Tetun, address gaps in its linguistic resources, and achieve impactful outcomes that elevate Tetun’s status. These advancements open new opportunities for future research and innovation. Moreover, this work introduces promising methodologies that can be adapted to other languages facing similar challenges, thereby contributing to the broader advancement of information retrieval for low-resource languages.

“FC Portugal 3D” four-time champion at RoboCUP 2025

The joint team from the Universities of Porto (FCUP/FEUP) and Aveiro was crowned world champion in Simulated Robot Soccer for the fourth consecutive time, winning the final of the RoboCup Soccer (3D Simulation League) league, part of RoboCUP 2025, held from July 15 to 21 in the city of Salvador, Brazil.

As in the 2024 final, the FC Portugal 3D team beat by 4-1 the German team magmaOffenburg, from the University of Offenburg, a remarkable result that sets the Portuguese team apart from the other teams in the competition.

The team that took the podium brings together a highly specialized community from the Faculties of Science and Engineering, including students, graduates, and faculty members. The following individuals contributed decisively to this victory: Luís Paulo Reis (professor at DEI, director of LIACC, and co-director of L:IACD), Pedro Mota (research fellow at FEUP and doctoral student at FCUP), M.IA master’s students Francisco Silva and Tomás Azevedo (IACD graduates), Eduardo Portugal (L.EIC alumnus and current M.EIC master’s student) and Nuno Lau (professor at the University of Aveiro).

The success of this team is proven by the 30 international awards it has won, inspiring the publication of more than 100 scientific articles in indexed journals and conferences throughout its 25 years of existence.

According to Ubbo Visser, Chairman of the Board, a new chapter in RoboCup is being written, in an era where the field of Robotics and AI is advancing at an unprecedented pace. In his words, Ubbo Visser states that “the emergence of a new generation of robots, AI methods, and technologies is a unique historical opportunity. RoboCup is seizing this opportunity to continue to lead. That is why we are introducing changes to the international RoboCup competitions, which are our window to the world. We are bringing together all the advances from all the soccer leagues into a single entity focused on humanoid robot soccer. We are advancing the state of the art in robotics and AI by introducing humanoid robots for general-purpose applications, including the @home, work, and rescue leagues…”.

On this side, we will continue to applaud the successes of the Portuguese team, hoping that they will serve as inspiration for new generations.

PhD Defense in Informatics Engineering: ”Onboard detection and guidance based on side scan sonar images for autonomous underwater vehicles”

Candidate: Martin Joseph Aubard

Date, time and location:
25 July 2025, 14:00, Sala de Atos DEEC – I-105, Faculty of Engineering, University of Porto

President of the Jury:
Pedro Nuno Ferreira da Rosa da Cruz Diniz (PhD), Full Professor, Department of Informatics Engineering, Faculty of Engineering, University of Porto

Members:
Bilal Wehbe (PhD), Senior Researcher at the German Research Center for Artificial Intelligence, Germany;
Catarina Helena Branco Simões da Silva (PhD), Associate Professor, Department of Computer Engineering, Faculty of Science and Technology, University of Coimbra;
Andry Maykol Gomes Pinto (PhD), Associate Professor, Department of Electrical and Computer Engineering, Faculty of Engineering, University of Porto;
Ana Maria Dias Madureira Pereira (PhD), Coordinating Professor with Aggregation, Department of Computer Engineering, Instituto Superior de Engenharia do Porto, Instituto Polítécnico do Porto (Supervisor).

The thesis was co-supervised by Luís Filipe Pinto de Almeida Teixeira (PhD), Associate Professor in the Department of Informatics Engineering at the Faculty of Engineering of the University of Porto.

Abstract:

This thesis addresses the challenge of improving Autonomous Underwater Vehicles (AUVs) onboard detection and interaction capabilities using Side-Scan Sonar (SSS) data. Traditionally, underwater missions relied on pre-defined plans where data are analyzed post-mission by operators or experts. This workflow is time-consuming, often requiring multiple missions to identify and localize underwater targets. The need for repeated missions increases operational costs and complexity, highlighting the inefficiency of current methodologies. Moreover, such approaches do not allow the AUV to interact with detected targets in real time, limiting the scope of mission adaptation and real-time decision-making. To overcome these limitations, this thesis presents a novel framework integrating deep learning models for object detection directly onboard AUVs. This integration enables the vehicle to detect, localize, and interact with underwater targets in real time, offering significant improvements over traditional post-mission analysis. The framework builds upon the LSTS toolchain, which is responsible for AUV motion control and communication, and introduces enhanced real-time data processing capabilities. However, implementing such a model into an embedded system suffers from computational limitations affecting the model’s performance. Thus, the knowledge distillation methods have been implemented, ensuring smaller, more efficient models to perform onboard detection without sacrificing accuracy. Additionally, to improve the model’s robustness against underwater noise, a novel adversarial retraining framework, ROSAR, is introduced, ensuring reliable operation even in noisy sonar environments. Following the onboard detection and localization enhancement, we focused on onboard interaction with the detected object. This is realized by extending the previous onboard framework and validating it through a customized simulator, enhancing interaction with the detected objects, and validating through a pipeline inspection use case, which reduces mission time by combining sonar detection and camera data collection in a single mission, utilizing behavior trees and safety-assessed models. Given the lack of open-source sonar datasets in the field, this thesis contributes to two novel publicly available side-scan sonar datasets, SWDD and Subpipe, which include field-collected data on walls and pipelines and are manually annotated for object detection. By shifting from post-mission analysis to real-time detection and interaction, this thesis significantly improves the operational efficiency of AUV missions. The proposed framework streamlines underwater operations and enhances AUVs’ autonomous behavior, relying on efficient, accurate, and robust object detection model for efficient underwater exploration and monitoring applications.

PhD Defense in Informatics Engineering : ”Uncertainty interpretations for the robustness of object detection in self-driving vehicles”

Candidate:
Filipa Marília Monteiro Ramos Ferreira

Date, time and location:
23 July 2025, 14:30, Sala de Atos, Faculty of Engineering, University of Porto

President of the Jury:
Carlos Miguel Ferraz Baquero-Moreno (PhD), Full Professor, Department of Informatics Engineering, Faculty of Engineering, University of Porto

Members:
Tiago Manuel Lourenço Azevedo (PhD), Associate Researcher, Department of Computer Science and Technology, University of Cambridge, United Kingdom;
Marco António Morais Veloso (PhD), Coordinating Professor, Department of Science and Technology, Oliveira do Hospital School of Technology and Management, Polytechnic Institute of Coimbra;
Luís Filipe Pinto de Almeida Teixeira (PhD), Associate Professor, Department of Informatics Engineering, Faculty of Engineering, University of Porto;
Rosaldo José Fernandes Rossetti (PhD), Full Professor, Department of Informatics Engineering, Faculty of Engineering, University of Porto (Supervisor).

Abstract:

Ensuring the reliability and robustness of deep learning remains a pressing challenge, particularly as neural networks gain traction in safety-critical applications. While extensive research has focused on improving accuracy across datasets, generalisation, interpretability and robustness in the deployment domain remain poorly understood. In fact, in real-world scenarios, models often underperform without clear explanations. Addressing these concerns, uncertainty quantification has emerged as a key research direction, offering deeper insight into neural networks and enhancing confidence, interpretability, and robustness. Among critical applications, self-driving vehicles stand out, where uncertainty-aware object detection can significantly improve perception and decision-making. This thesis explores interpretations of uncertainty tailored to object detection in the context of self-driving vehicles. In this sense, two novel methods to estimate the aleatoric component and one approach to modelling the epistemic uncertainty are proposed. Through the utilisation of anchor distributions readily available in any anchor-based object detector, uncertainty is estimated holistically while avoiding costly sampling procedures. Further, the concept of existence is introduced, a probability measure that indicates whether an object truly exists in the real-world, regardless of classification. Building upon these ideas, three applications of uncertainty and existence are explored, namely the Existence Map, the Uncertainty Map and the Existence Probability. Whilst the aforementioned maps encode the existence measure and the aleatoric uncertainty over the space of input samples, the Existence Probability merges the information provided by the Existence Map with the standard detections, supplementing model outputs. Evaluation showcases the coherence of uncertainty estimates and demonstrates the usefulness of the Existence and Uncertainty Map in supporting the standard model, providing open-set capabilities and giving a degree of confidence to true positives, false positives and false negatives. The merging strategy of the Existence Probability reports a considerable improvement in the performance of the object detector both in validation and perturbation, while detecting all classes of the dataset despite being trained only on cars, pedestrians and cyclists. The second part of this thesis features a study of the underspecification distribution and its connection with the epistemic uncertainty. Underspecification, recently coined, greatly endangers deep learning deployment in safety-critical systems as it depicts the variability of predictors generated by a single architecture with increasingly diverging performance in the application domain. The analysis performed showcases that, if the uncertainty estimates are correctly calibrated, a single predictor is sufficient to predict the spread of the underspecification distribution, avoiding running repeated costly training sessions. All proposed methods are designed to be model-agnostic, real-time compatible, and seamlessly applicable to deployed models without requiring retraining, underscoring their significance for robust and interpretable object detection in autonomous driving.

PhD Defense in Informatics Engineering: ”Aiding researchers making their computational experiments reproducible”

Candidate:
Lázaro Gabriel Barros da Costa

Date, Time and Location:
18th of July 2025, 16:00, Sala de Atos of the Faculty of Engineering of University of Porto.

President of the Jury:
Pedro Nuno Ferreira da Rosa da Cruz Diniz (PhD), Full Professor, Department of Informatics Engineering, Faculty of Engineering, University of Porto.

Members:
Tanu Malik (PhD), Associate Professor, Department of Electrical Engineering and Computer Science, University of Missouri, U.S.A;
Miguel Carlos Pacheco Afonso Goulão (PhD), Associate Professor, Department of Computer Science, Faculty of Science and Technology, New University of Lisbon;
Gabriel de Sousa Torcato David (PhD), Associate Professor, Department of Informatics Engineering, Faculty of Engineering, University of Porto;
Jácome Miguel Costa da Cunha (PhD), Associate Professor, Department of Informatics Engineering, Faculty of Engineering, University of Porto (Supervisor).

The thesis was co-supervised by Susana Alexandra Tavares Meneses Barbosa (PhD), Senior Researcher at INESCTEC Porto.

Abstract:

Scientific reproducibility and replicability are essential pillars of credible research, especially as computational experiments become increasingly prevalent across diverse scientific disciplines such as chemistry, climate science, and biology. Despite strong advocacy for Open Science and adherence to FAIR (Findable, Accessible, Interoperable, and Reusable) principles, achieving true reproducibility remains a formidable challenge for many researchers. Key issues such as complex dependency management, inadequate metadata, and the often cumbersome access to necessary code and data severely hamper reproducibility efforts. Moreover, existing reproducibility tools frequently offer piecemeal solutions that fail to address the multifaceted needs of diverse and complex experimental setups, particularly those that span multiple programming languages and involve intricate data systems. This thesis addresses these challenges by presenting a comprehensive framework designed to enhance computational reproducibility across a variety of scientific fields. Our approach involved a detailed systematic review of existing reproducibility tools to identify prevailing gaps and limitations in their design and functionality. This review underscored the fragmented nature of these tools, each supporting only aspects of the reproducibility process but none providing a holistic solution, particularly for experiments that require robust data handling or support for many programming languages.
To bridge these gaps, we introduced SCIREP, an innovative framework that automates essential aspects of the reproducibility workflow such as dependency management, containerization, and cross platform compatibility. This framework was rigorously evaluated using a curated dataset of computational experiments, achieving a reproducibility success rate of 94%.
Furthering the accessibility and usability of reproducible research, we developed SCICONV, a conversational interface that simplifies the configuration and execution of computational experiments using natural language processing. This interface significantly reduces the technical barriers traditionally associated with setting up reproducible studies, allowing researchers to interact with the system through simple, guided conversations. Evaluation results indicated that SCICONV successfully reproduced 83% of the experiments in our curated dataset with minimal user input, highlighting its potential to make reproducible research more accessible to a broader range of researchers.
Moreover, recognizing the critical role of user studies in evaluating tools, methodologies, and prototypes, particularly in software engineering and behavioral sciences, this thesis also extends into the realm of experimental tool evaluation. We conducted a thorough analysis of existing tools used for software engineering and behavioral science experiments, identifying and proposing specific features designed to enhance their functionality and ease of use for conducting user studies. These proposed features were validated through a survey involving the research community, confirming their relevance and the need for their integration into existing and future tools. The contributions of this thesis are manifold, encompassing the development of a classification framework for reproducibility tools, the creation of a standardized benchmark dataset for assessing tool efficacy, and the formulation of SCIREP and SCICONV to significantly advance the state-of-the-art in computational reproducibility. Looking forward, the research will focus on expanding the capabilities of reproducibility tools to support more complex scientific workflows, further enhancing user interfaces, and integrating additional functionalities to fully support user studies. By doing so, this work aims to pave the way for a more robust, accessible, and efficient computational reproducibility ecosystem that can meet the evolving needs of the global research community.

Keywords: Reproducibility; Replicability; Reusability; Computational Experiments; Conversational User Interface; User Studies.