Selecting Feature Interactions for Generalized Additive Models by Distilling Foundation Models

Published 14 Apr 2026 in cs.LG | (2604.13332v1)

Abstract: Identifying meaningful feature interactions is a central challenge in building accurate and interpretable models for tabular data. Generalized additive models (GAMs) have shown great success at modeling tabular data, but often rely on heuristic procedures to select interactions, potentially missing higher-order or context-dependent effects. To meet this challenge, we propose TabDistill, a method that leverages tabular foundation models and post-hoc distillation methods. Our key intuition is that tabular foundation models implicitly learn rich, adaptive feature dependencies through large-scale representation learning. Given a dataset, TabDistill first fits a tabular foundation model to the dataset, and then applies a post-hoc interaction attribution method to extract salient feature interactions from it. We evaluate these interactions by then using them as terms in a GAM. Across tasks, we find that interactions identified by TabDistill lead to consistent improvements in downstream GAMs' predictive performance. Our results suggest that tabular foundation models can serve as effective, data-driven guides for interaction discovery, bridging high-capacity models and interpretable additive frameworks.

Abstract PDF Upgrade to Chat

Authors (4)

Summary

The paper introduces TabDistill, a framework that distills complex feature interactions from tabular foundation models into generalized additive models.
It employs a modular pipeline combining TFM fitting, FBII-based attribution, and interaction ranking to systematically enhance model performance.
Empirical results demonstrate that TabDistill outperforms traditional greedy methods, achieving superior accuracy and interpretability on multiple tabular benchmarks.

Selecting Feature Interactions for Additive Models via Foundation Model Distillation

Introduction

The paper "Selecting Feature Interactions for Generalized Additive Models by Distilling Foundation Models" (2604.13332) addresses the persistent challenge of identifying statistically and practically meaningful feature interactions for interpretable models, specifically Generalized Additive Models (GAMs), in tabular domains. Standard GAMs provide transparent, feature-level effect decomposition, yet their predictive power on real-world datasets is limited by their inability to capture important non-additive interactions. Existing interaction selection methods often employ greedy, pairwise heuristics, thereby risking omission of higher-order, context-dependent dependencies critical for both prediction and mechanistic insight.

To overcome these deficiencies, the authors introduce TabDistill, a modular framework which leverages modern tabular foundation models (TFMs) as the substrate for feature interaction discovery. Instead of relying on heuristic or model-constrained searches, TabDistill harnesses the adaptive, large-scale representation learning capabilities of TFMs to uncover a diverse spectrum of salient interaction structures. This distillation process lifts previously opaque, high-capacity black-box model information into the interpretable, additive framework of GAMs, yielding model instances that are both accurate and intelligible.

Figure 1: TabDistill operational pipeline combining TFM-based adaptive dependency learning with post-hoc explanation for interaction feature selection.

Methodological Framework

Model Structure and Objective

GAMs augmented with interaction terms are parametrized as

$g ( \mathbb{E}[y|x] ) = \beta + \sum_{j=1}^p f_j(x_j) + \sum_{I\in\mathcal{I}} f_I(x_I)$

where $f_j$ are univariate (main effect) shape functions, and $f_I$ are functions over feature subsets $I$ representing non-additive interactions. The central objective is to identify a minimal, performance-maximizing subset of interactions $\mathcal{I}$ (potentially higher-order, non-pairwise) that yield optimal expected loss for the downstream GAM.

The TabDistill Pipeline

The key components of TabDistill are:

TFM Fitting: A TFM (e.g. TabPFN-2, TabICL) is fitted to the dataset, learning complex feature dependencies via large-scale or synthetic pretraining.
Post-Hoc Attribution: A model-agnostic, sample-efficient interaction attribution algorithm—specifically the Faith-Banzhaf Interaction Index (FBII) calculated via the SPEX framework—extracts salient feature interactions from the fitted TFM. SPEX constructs an efficient, sparse surrogate in the Boolean Fourier basis, circumventing the exponential enumeration of the interaction space.
Interaction Ranking and Selection: Salient interaction terms are ranked and selected based on their estimated importance across samples.
Interpretable Model Update: The selected interactions are incorporated as additional terms in a GAM (e.g., EBM), thus boosting predictive capacity while retaining interpretability.

This approach preserves black-box model benefits while remaining model-agnostic; attribution is feasible for arbitrary TFM architectures. By design, TabDistill accommodates both pairwise and higher-order interactions.

Empirical Evaluation

Benchmarking on Real-World Data

TabDistill is empirically evaluated across a set of standardized tabular benchmarks (TabArena, TALENT, PMLB), predominantly comprising datasets under 2000 samples and 10 features, in both regression and classification regimes. Key baselines include FAST (greedy, pairwise addition) and RuleFit (interaction mining via tree rules). All methods are evaluated via downstream predictive performance on GAMs, with rankings assessed across multiple metrics (MAE, MSE, $R^2$ , F1, accuracy, AUROC), controlling for the number of included interactions ( $N_{int}$ from 1 to 8).

Figure 2: Regression average rank (MAE) comparison: TabDistill consistently outperforms baselines with lower ranks.

Figure 3: Comparative classification ranks (F1 left, accuracy right) for TabDistill on TabICL/TabPFN, showing robust superiority over alternative methods.

TabDistill consistently achieves superior (lower) ranks compared to baselines for both regression and classification tasks at low interaction budgets, substantiating the claim that the TFM-guided approach is more effective at identifying high-value interactions than data-driven heuristics or decision rule mining. As the interaction budget increases, performance convergence across methods is observed, indicating that initial interaction selection is critical for practical, parsimony-demanding tasks.

Transferability and Stability

Ablation studies demonstrate several salient properties:

Index Robustness: FBII consistently produces the strongest and most stable interaction selection compared to alternative attribution indices (SII, BII, etc.).
Query Efficiency: Interaction selection exhibits stability across varying SPEX query budgets (sample sizes from 100 to 500) with average rank variation typically within 0.5, indicating practical feasibility even when post-hoc evaluation budgets are constrained.
Figure 4: Rank stability for downstream GAM performance versus SPEX query sample size, illustrating minimal sensitivity to sample count.
TFM Backbone Independence: The effectiveness of TabDistill is replicated across both TabPFN and TabICL TFMs, indicating that the technique generalizes across TFM families.

Synthetic Validation of TFM Interaction Capacity

On controlled synthetic domains, the authors demonstrate that TabPFN can recover ground truth non-additive interactions with high precision, outperforming standard machine learning models even in low-data, high-dimensional (Fourier-sparse) and complex interaction (tree-structured) regimes.

Figure 5: TabPFN performance against baselines on Fourier-sparse functions under low-data, noise, and sparsity variations.

Case Study: Interpretable Used-Car Price Prediction

On the TabArena Fiat 500 car pricing task, TabDistill-augmented EBM achieves state-of-the-art MAE and $R^2$ performance (MAE: 536.1, $R^2$ : 0.86). Critically, TabDistill uniquely discovers a higher-order interaction among longitude, latitude, and model type, revealing a fundamentally non-additive dependency not isolated by pairwise-only (FAST) or tree rule-based (RuleFit) methods.

Figure 6: Visualized higher-order interaction: joint effect of seller location and model type on predicted price.

Theoretical and Practical Implications

TabDistill demonstrates that high-capacity TFMs encode complex, context-dependent feature interactions that, when effectively distilled, can be efficiently injected into additive interpretable frameworks such as GAMs. This approach bridges the performance–interpretability tradeoff by utilizing state-of-the-art representations for targeted structure transfer, not blind knowledge distillation. In principle, this pipeline generalizes to any scenario where structured black-box representations must be synthesized into transparent, domain-interpretable hypotheses—spanning scientific discovery, regulatory domains, or high-stakes operational models.

In practice, limitations remain. The procedure inherits failure modes and spurious interaction artifacts from the underlying TFM, and attribution is computationally intensive at large scale. Future work may involve more sample-efficient attribution schemes (e.g., ProxySPEX), direct interaction-aware TFM training objectives, or integration into human-in-the-loop settings for hypothesis or mechanism discovery.

Conclusion

The TabDistill framework empirically and methodologically advances the identification of meaningful, high-impact feature interactions for interpretable additive models by distilling knowledge from tabular foundation models. It substantiates consistent performance improvements over established heuristics, enables detection of complex non-additive effects, and offers a generalizable, model-agnostic paradigm for bridging predictive performance and interpretability in tabular machine learning. The integration of post-hoc attribution, efficient sparse recovery, and modular interfacing with diverse TFMs positions TabDistill as a practical path forward for accurate, transparent modeling across scientific and operational domains.

Markdown Report Issue