Interactive Exploration of Model Predictions for Multivariate Data
- Decision-making processes increasingly rely on complex machine learning models, but their predictive power often comes at the expense of transparency, limiting our ability to understand and learn from predictions. Such models achieve their accuracy by exploiting multivariate feature relationships, yet these relationships remain difficult to interpret: high-dimensional spaces prevent direct inspection, dimensionality reduction can distort neighborhoods and decision boundaries, and existing tools rarely connect model behavior to domain knowledge. This dissertation addresses this gap by contributing three model-agnostic visual analytics techniques for multivariate tabular data, organized around complementary perspectives of model interpretation. Input-based analysis: Decision boundaries, i.e., regions where a model’s prediction flips under multivariate input changes, are hard to explore in high dimensions. This dissertation presents an interactive system that systematically probes input perturbations using local linear maps, preserving input distances more accurately and revealing closer boundaries than prior approaches. Relationship-based analysis: Visual feature enrichment in non-linear dimensionality reductions struggles to balance multiple objectives, such as representing clusters, outliers and distortion. This dissertation introduces a topology-based augmentation method that effectively relates feature distributions to projection regions, simultaneously highlighting clusters, outliers, and ambiguities. Knowledge-based analysis: Matrix completion yields prediction matrices whose patterns are challenging to interpret beyond basic heatmap expansion and sorting. This dissertation proposes a hierarchical evaluation framework that aggregates and links these patterns to domain-knowledge not used during model building. Its application yielded insights that substantially advanced thermodynamic modeling. All approaches are integrated in comprehensive visual analytics systems designed for interactive use. Case studies, stakeholder evaluations, and successful applications confirm their effectiveness in promoting the explainability of model predictions and domain insight. Together, these contributions establish structured methods for exploring multivariate feature spaces visually, driving measurable advances in both model and data interpretability.
| Author: | Jan-Tobias SohnsORCiD |
|---|---|
| URN: | urn:nbn:de:hbz:386-kluedo-130687 |
| DOI: | https://doi.org/10.26204/KLUEDO/13068 |
| Advisor: | Heike LeitteORCiD |
| Document Type: | Doctoral Thesis |
| Cumulative document: | No |
| Language of publication: | English |
| Date of Publication (online): | 2026/04/20 |
| Year of first Publication: | 2026 |
| Publishing Institution: | Rheinland-Pfälzische Technische Universität Kaiserslautern-Landau |
| Granting Institution: | Rheinland-Pfälzische Technische Universität Kaiserslautern-Landau |
| Acceptance Date of the Thesis: | 2026/02/12 |
| Date of the Publication (Server): | 2026/04/21 |
| Page Number: | ix, 133 |
| Faculties / Organisational entities: | Kaiserslautern - Fachbereich Informatik |
| DDC-Cassification: | 0 Allgemeines, Informatik, Informationswissenschaft / 004 Informatik |
| Licence (German): |
