The doctoral consortium aims to offer early stage researchers from any AI subject area a unique opportunity to present their planned research, and to connect to fellow PhD students as well as senior researchers in AI.
AI-Driven Chest Radiography Report Generation: Integrating LLMs, CLIP, Tree-of-Thoughts, Multimodal Retrieval-Augmented Generation, Classification and Direct Preference Optimization
ABSTRACT. Manual chest radiography report creation is time-consuming and prone to variability, increasing radiologist workload and potentially affecting diagnostic consistency. This research introduces a novel, integrated AI system to automate chest radiology report generation. The proposed system leverages Large Language Models (LLMs) augmented by several key components. A trained classifier provides pathology probabilities from the input image to guide the LLMs, and a Contrastive Language-Image Pre-training (CLIP) model establishes a shared embedding space for efficient multimodal retrieval. A multimodal Retrieval-Augmented Generation (RAG) approach then retrieves relevant prior image-report pairs to improve factual grounding and contextual understanding. The Tree-of-Thoughts (ToT) framework enhances the diversity of report generation while maintaining clinical validity. Finally, Direct Preference Optimization (DPO) refines the LLM using automatically generated preference data based on clinical efficacy, including the CheXbert F1 score and cosine similarity between embeddings. This comprehensive approach aims to improve the clinical accuracy, coherence, and overall quality of generated radiology reports compared to existing methods, addressing limitations such as hallucinations and lack of specificity in current automated systems.
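As a rough sketch of the retrieval component described in this abstract, the following toy example ranks prior image-report pairs by cosine similarity in a shared embedding space. The `retrieve_prior_reports` helper, the three-dimensional embeddings, and the sample report texts are illustrative assumptions, not the actual CLIP encoders or data:

```python
import math

def cosine(u, v):
    # cosine similarity between two embedding vectors
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def retrieve_prior_reports(query_emb, corpus, k=2):
    """Return the k prior (embedding, report) pairs closest to the query
    image embedding in the shared CLIP-style space."""
    ranked = sorted(corpus, key=lambda pair: cosine(query_emb, pair[0]), reverse=True)
    return [report for _, report in ranked[:k]]

# toy shared-space embeddings (in practice these come from a CLIP encoder)
corpus = [
    ([1.0, 0.0, 0.1], "No acute cardiopulmonary findings."),
    ([0.9, 0.1, 0.0], "Mild cardiomegaly, no effusion."),
    ([0.0, 1.0, 0.2], "Right lower lobe consolidation."),
]
print(retrieve_prior_reports([0.95, 0.05, 0.05], corpus, k=2))
```

In the real system, the embeddings would come from the CLIP image and text encoders, and the retrieved pairs would be passed to the LLM as RAG context for grounding.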
Analyzing Deep Generative Models for Steel Microstructures
ABSTRACT. Deep generative models compose synthetic yet realistic images of different visual concepts by applying learned building rules. The set of all visual concepts and their relationships is of great importance for the materials sciences, as it makes it possible to automatically characterize and objectively quantify a material's microstructural images. A material's microstructure is believed to encode all information about the material's chemical composition and processing (i.e., heat treatment) and helps to predict the material's mechanical properties. Currently, microstructure characterization requires expert annotation, mainly done manually on a case-by-case basis. An objective and automatic characterization would thus improve the understanding of how a combination of visual concepts relates to the material's processing and mechanical properties. This research proposal hypothesizes that deep generative models for steel microstructure images learn visual concepts that correspond to the visual signatures of underlying physical processes. First, deep generative models were trained on real steel microstructure images and investigated by domain experts. The results of the expert study indicate that synthetic images generated by a StyleGANv2-Ada look realistic. Next, various approaches will be investigated for extracting the set of visual concepts and building rules from the StyleGANv2. In a final step, the visual concepts and building rules will be correlated with the material's physical processing.
Economically-Driven AI Process for Quality Assurance: Analysis in Optics Manufacturing
ABSTRACT. This research proposes the development of an economically driven, AI-based quality assurance (QA) process for optical manufacturing, with a specific focus on the detection and mitigation of subsurface damage (SSD) during multi-stage grinding. The approach combines optical coherence tomography (OCT) as a non-destructive imaging method, machine learning for defect detection, and explainable AI (XAI) to ensure transparency and trust. A volumetric segmentation model will be developed to enable automated SSD detection and classification from OCT data. Additionally, the project will integrate quality-relevant process parameters and economic evaluation models into a digital platform to support adaptive, cost-efficient manufacturing control. The goal is to improve efficiency, reduce costs, and enhance product quality, while advancing intelligent, explainable, and economically sustainable QA in high-precision manufacturing.
Explainable Artificial Intelligence for Multivariate Sensor Data: Towards Transparency and Correctness in Model Explanations
ABSTRACT. The continuous growth in sensor technologies has led to the generation of massive amounts of time series sensor data across various sectors such as healthcare, industry, transportation, and smart homes. Artificial intelligence (AI), particularly deep learning, provides a powerful tool to analyze this high-dimensional data automatically, thereby revealing the full potential of this valuable data source. However, AI models are often opaque in their decision-making. To enhance the transparency of AI, the research field of eXplainable AI (XAI) has gained significant attention in recent years. The majority of methods have been developed for image classification and do not consider time series properties such as seasonality and trend. In addition, the field lacks a common agreement on the quantitative evaluation of XAI explanations, e.g., regarding their correctness. The research question of this proposal is how XAI methods can enhance the transparency and correctness of model explanations in the analysis of multivariate sensor data. The aim of this research is threefold: First, we review XAI evaluation methods regarding their suitability for multivariate time series classification. Second, we systematically compare existing XAI methods using the resulting evaluation methods. This comparison will reveal their strengths and help to develop or improve XAI methods that are suitable for multivariate sensor data, which is our third step.
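As one concrete example of the kind of correctness evaluation this proposal surveys, a simple occlusion test checks whether removing the inputs an explanation marks as important actually changes the model output. The toy linear "model" and the hand-written attributions below are hypothetical placeholders, not any specific XAI method:

```python
def occlusion_faithfulness(model, x, attribution, k, baseline=0.0):
    """Correctness check for an attribution: occlude the k highest-attributed
    inputs and measure how much the model output drops. A faithful
    attribution should cause a larger drop than a misleading one."""
    top = sorted(range(len(x)), key=lambda i: attribution[i], reverse=True)[:k]
    x_occluded = [baseline if i in top else v for i, v in enumerate(x)]
    return model(x) - model(x_occluded)

# toy "model": weighted sum over a flattened multivariate sensor window
weights = [0.5, 0.1, 0.0, 0.9]
model = lambda x: sum(w * v for w, v in zip(weights, x))

x = [1.0, 1.0, 1.0, 1.0]
good_attr = weights               # attribution aligned with the true weights
bad_attr = [0.0, 0.9, 0.5, 0.1]  # misleading attribution

drop_good = occlusion_faithfulness(model, x, good_attr, k=2)
drop_bad = occlusion_faithfulness(model, x, bad_attr, k=2)
print(drop_good > drop_bad)  # a correct explanation yields the larger drop
```

Real evaluations for time series would additionally need occlusion baselines that respect temporal structure (e.g., seasonality-preserving replacements rather than zeros), which is exactly the gap the proposal points at.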
Learning Interpretable Disentangled Concepts for Neurosymbolic Integration
ABSTRACT. We propose a hierarchical Disentangled Representation Learning (DRL) framework that categorizes concepts into primitive (e.g., color) and higher-order relational types. Our first approach combines a pre-trained backbone with a specialized beta-VAE for concept disentanglement, enabling the separation of statistically independent concepts into interpretable latent factors. Building on this, we integrate Predicate Generation and Inductive Logic Programming (ILP) to map these factors into symbolic, human-understandable semantics. The ultimate goal of our framework is to bridge the gap between disentangled representations and human interpretability, aligning learned concepts with intuitive, semantic meanings to facilitate explainable AI. We validate our initial framework on the dSprites and CLEVR datasets, demonstrating its ability to hierarchically disentangle and symbolically ground concepts while advancing toward interpretable machine learning.
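For reference, the beta-VAE objective underlying the disentanglement step can be written down compactly. The sketch below assumes a diagonal-Gaussian posterior and is a generic textbook formulation, not the authors' exact implementation:

```python
import math

def beta_vae_loss(recon_error, mu, logvar, beta=4.0):
    """beta-VAE objective: reconstruction error plus beta-weighted KL
    divergence between the diagonal-Gaussian posterior N(mu, sigma^2)
    and the standard-normal prior. Setting beta > 1 pressures the latent
    code toward statistically independent (disentangled) factors."""
    kl = 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv for m, lv in zip(mu, logvar))
    return recon_error + beta * kl

# at the prior (mu = 0, logvar = 0) the KL term vanishes entirely
print(beta_vae_loss(1.0, [0.0, 0.0], [0.0, 0.0]))  # 1.0
```

The per-dimension KL term is what makes individual latent factors inspectable: dimensions whose posterior collapses to the prior carry no information and can be pruned before the ILP grounding step.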
Promoting Flatness of Representation Manifolds to Improve Deep Network Training
ABSTRACT. Modern deep learning achieves remarkable results on various tasks. However, it requires large amounts of labeled data and significant electrical energy for training. We aim to enhance training efficiency by leveraging the manifold hypothesis. By explicitly penalizing the curviness of manifolds in neural network representations, we seek to accelerate convergence during training. This approach may also yield better objectives for unsupervised representation learning. By creating well-fitted foundation models and training small networks for downstream tasks, we could reduce the amount of labeled data needed for strong performance. These results could make deep learning more accessible.
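A minimal way to make the "curviness penalty" idea concrete is a discrete second-difference penalty along a path of representation vectors, which vanishes exactly when the sampled points lie equally spaced on a straight line. This is an illustrative sketch under that assumption, not the authors' actual regularizer:

```python
def curviness_penalty(path):
    """Discrete curvature penalty for a sequence of representation
    vectors sampled along a path: the sum of squared second differences.
    It is zero iff consecutive points are equally spaced on a straight
    line, so minimizing it flattens the represented manifold."""
    penalty = 0.0
    for prev, cur, nxt in zip(path, path[1:], path[2:]):
        penalty += sum((p - 2 * c + n) ** 2 for p, c, n in zip(prev, cur, nxt))
    return penalty

straight = [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]]  # flat path: zero penalty
curved = [[0.0, 0.0], [1.0, 1.0], [2.0, 0.0]]    # bent path: positive penalty
print(curviness_penalty(straight), curviness_penalty(curved))
```

In training, such a term would be added to the task loss for paths interpolated between representations of nearby inputs, nudging the network toward flatter representation manifolds.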
Efficient Graph-Based Neural Architectures for Multimodal Learning
ABSTRACT. The growing ubiquity of multimodal data, ranging from time series and textual inputs to images, videos, and event logs, necessitates unified learning frameworks that can reason across heterogeneous modalities without relying on modality-specific architectures. Existing models often struggle to jointly represent structured and unstructured, static and temporal signals due to rigid assumptions about modality geometry, alignment, and connectivity. This research proposes a graph-centric framework for multimodal learning, wherein each data token is abstracted as a node within a dynamically constructed heterogeneous graph. Inter- and intra-modality relationships are encoded through learned graph topologies and adaptive attention mechanisms that operate across both semantic and temporal dimensions. By extending transformer architectures to operate over these sparse or dense graph structures, the approach seeks to retain cross-modal dependencies while scaling to large, asynchronous data streams. The proposed methodology explores core innovations in modality-aware tokenization and embedding strategies, attention mechanisms adapted to heterogeneous graph sparsity, and fusion techniques that unify token representations across diverse modalities. Special emphasis is placed on modeling temporal phenomena, such as dynamic correlations, lag effects, and alignment, within a graph-based transformer backbone, allowing for joint inference over signals with varying structure and temporal granularity. The research aims to answer fundamental questions around graph topology design, efficient cross-modal fusion, and scalable attention over multimodal data, with the ultimate goal of enabling generalizable, interpretable, and computationally efficient neural architectures for multimodal graphs.
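To make the notion of attention restricted by a learned graph topology concrete, the sketch below computes a softmax attention row per token but masks out non-neighbors. The adjacency matrix and score values are hypothetical toy inputs, not part of the proposed architecture:

```python
import math

def masked_attention(scores, adjacency):
    """Attention restricted to a graph: each token attends only to its
    neighbors (including itself), so sparsity in the learned topology
    directly bounds the cost of cross-modal attention."""
    out = []
    for i, row in enumerate(scores):
        # drop edges absent from the (hypothetical) learned topology
        masked = [s if adjacency[i][j] else float("-inf") for j, s in enumerate(row)]
        m = max(masked)
        exps = [math.exp(s - m) for s in masked]
        z = sum(exps)
        out.append([e / z for e in exps])
    return out

# 3 tokens from different modalities; token 0 is not connected to token 2
adj = [[1, 1, 0], [1, 1, 1], [0, 1, 1]]
scores = [[0.0, 0.0, 5.0], [0.0, 0.0, 0.0], [1.0, 1.0, 1.0]]
weights = masked_attention(scores, adj)
print(weights[0])  # the masked edge 0->2 receives zero weight
```

The same masking idea extends to temporal edges (lagged connections between time series tokens), which is where the proposal's emphasis on dynamic correlations and lag effects would enter.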
Model Efficiency Techniques in Multimodal Learning
ABSTRACT. In the increasingly complex domain of deep learning architectures, multimodal models encounter significant efficiency constraints arising from their substantial parameter counts and computational requirements. This ongoing doctoral research investigates a fundamental limitation: the quadratic computational complexity of attention mechanisms in cross-modal contexts. We aim to develop a comprehensive theoretical framework addressing efficiency paradigms through strategic initialization techniques, sparse attention factorization, and progressive capacity scaling methodologies. Our investigative approach will integrate principles from the Lottery Ticket Hypothesis to identify optimal substructures, employ knowledge distillation to transfer capabilities to more compact architectures, and implement model-centric curriculum learning to balance computational efficiency with representational power. By focusing on multimodal applications with image, time series, video-audio, and textual data, this research seeks to contribute both to the theoretical understanding of efficiency-performance tradeoffs and to practical methodologies for deploying sophisticated models in resource-constrained environments. The work intends to establish principled foundations for accessible multimodal learning while maintaining competitive performance benchmarks.
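As a minimal illustration of Lottery-Ticket-style substructure identification, one magnitude-pruning step can be sketched as follows; the weight vector and sparsity level are toy assumptions, and a real pipeline would iterate pruning with rewinding to the original initialization:

```python
def magnitude_prune(weights, sparsity):
    """One magnitude-pruning step: zero out the given fraction of weights
    with the smallest magnitude and return both the pruned weights and
    the binary mask identifying the surviving substructure."""
    n_prune = int(len(weights) * sparsity)
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = set(order[:n_prune])
    mask = [0 if i in pruned else 1 for i in range(len(weights))]
    return [w * m for w, m in zip(weights, mask)], mask

w = [0.9, -0.05, 0.4, 0.01, -0.7, 0.2]
pruned_w, mask = magnitude_prune(w, sparsity=0.5)
print(mask)  # the three largest-magnitude weights survive
```

The surviving mask is the candidate "winning ticket": under the hypothesis, retraining only the masked subnetwork from its original initialization should approach full-model performance at a fraction of the cost.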
ABSTRACT. The training of current large neural networks requires huge amounts of data, which is an immense burden in terms of time, money, and computational resources. In multimodal use cases, the amount and complexity of data are often increased even further. This doctoral consortium proposal outlines the planned research to develop data-efficient multimodal training strategies as part of the research project "Enhancing Data and Model Efficiency in Multimodal Learning". By developing training strategies that reduce the amount of data and computational power required for model training, we aim to overcome the hurdle of limited available resources and make the exceptional capabilities of current large networks more accessible. We will investigate coreset subset selection strategies and data-centric curriculum learning as approaches to reduce the amount of data and computational power required while still maintaining high model quality. We will also examine methods suitable for extracting meaningful representations from multimodal data in scenarios where labeled data is scarce, as labeling large amounts of data is often very costly and complex. The methods we intend to develop will be tested on a variety of public datasets to ensure usability for a broad range of use cases.
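One standard coreset subset selection strategy of the kind this proposal will investigate is k-center greedy selection, sketched below on toy 2-D points; the seed point, data, and `k_center_greedy` helper are illustrative assumptions rather than the planned method:

```python
import math

def k_center_greedy(points, k):
    """k-center greedy coreset selection: repeatedly pick the point
    farthest from the current coreset, so that a small subset covers
    the data distribution well for training."""
    selected = [0]  # seed with the first point (arbitrary choice)
    while len(selected) < k:
        best_i, best_d = None, -1.0
        for i, p in enumerate(points):
            if i in selected:
                continue
            # distance from candidate to its nearest already-selected point
            d = min(math.dist(p, points[j]) for j in selected)
            if d > best_d:
                best_i, best_d = i, d
        selected.append(best_i)
    return selected

pts = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0), (0.0, 5.0)]
print(k_center_greedy(pts, 3))
```

Note how the two near-duplicate pairs each contribute only one representative: this redundancy removal is precisely what makes coreset training cheaper at comparable quality. In practice the distances would be computed in a learned embedding space rather than raw input space.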
Privacy Risk Assessment in Federated Learning: Extracting and Protecting Sensitive Information from Vision Language Models in Manufacturing Applications
ABSTRACT. Federated learning enables collaborative machine learning while preserving data privacy by keeping data local. However, shared model parameters still pose privacy risks, notably through the memorization effect, where models may unintentionally expose sensitive training data. This research evaluates such risks for manufacturing data, particularly technical drawings used in visual manufacturability assessments. It compares the performance of public foundation models with those fine-tuned on federated data and explores attacks targeting model weights to extract private information. Mitigation strategies will be assessed for their effectiveness and impact on model performance. Ultimately, the project aims to develop a decision-making framework that balances model utility and privacy in federated learning.
ABSTRACT. Formal verification of neural networks is essential for their safe deployment in critical domains such as autonomous driving. However, current verification techniques struggle to scale to deep networks and rely on symbolic inputs, which makes their use in computer vision challenging. In parallel, explainable AI (XAI) aims to understand the decision-making of neural networks, especially in image recognition. This research proposal describes how explainability techniques can be exploited to formally verify neural networks against symbolic constraints. The core contribution is the formulation of the concept-based verification problem, and several approaches to solve it within the scope of a PhD thesis are proposed. These include local searches for counterexamples using adversarial attacks as well as global verification strategies. Furthermore, realism constraints on images are explored to reduce the verification search space in computer vision contexts.
Research Proposal: Runtime Verification on Spatial Objects
ABSTRACT. Runtime verification with linear temporal logic (LTL) is an established technique for verifying systems against temporal specifications. However, the world often involves physical objects in addition to time. This research proposal outlines my ideas for performing runtime verification on combined temporal and spatial logics in which objects are modeled as continuous sweeps through space. Formulas should allow both LTL semantics and operators for spatial intersection and distance. Although expressive spatio-temporal logics can easily lead to undecidable model checking, potential mitigating approaches could involve discounting or adding growing spatial uncertainty to prevent unlimited knowledge of the future. Further extensions could include timed events, which allow for the processing of unsynchronized measurements of object positions, as well as quantitative or multi-valued semantics. To be useful in practice, it is important to determine which acceleration structures are necessary for efficient query answering and what form a practical implementation of such a system could take. Interesting applications involve moving objects for which there is limited knowledge, yet continuous monitoring is required.
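As a minimal illustration of runtime monitoring over spatial objects, the sketch below checks a globally quantified minimum-distance property, G(dist(a, b) >= d_min), over a finite trace of paired object positions. The trace and the `monitor_globally_min_distance` helper are hypothetical simplifications of the proposed logic, which would handle continuous sweeps rather than sampled points:

```python
import math

def monitor_globally_min_distance(trace, d_min):
    """Runtime monitor for the spatio-temporal property
    G (dist(a, b) >= d_min): scan a trace of paired object positions
    and report the first step at which the separation requirement is
    violated, or None if the trace satisfies the property so far."""
    for step, (pos_a, pos_b) in enumerate(trace):
        if math.dist(pos_a, pos_b) < d_min:
            return step  # earliest violation, as an online monitor would flag it
    return None

trace = [
    ((0.0, 0.0), (5.0, 0.0)),
    ((1.0, 0.0), (4.0, 0.0)),
    ((2.0, 0.0), (2.5, 0.0)),  # the objects come within 0.5 of each other
]
print(monitor_globally_min_distance(trace, d_min=1.0))
```

Modeling objects as continuous sweeps would replace the per-step point distance with a distance between swept volumes over an interval, which is where the acceleration structures mentioned in the proposal become necessary.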
Together with the co-located conferences, the day ends with the welcome reception, starting at 6 pm in the rooms of the Hasso-Plattner-Institute. The evening includes the award ceremony for the GI junior fellows and the Balzert prize.