Papers
Topics
Authors
Recent
Search
2000 character limit reached

Equivariant Efficient Joint Discrete and Continuous MeanFlow for Molecular Graph Generation

Published 9 Apr 2026 in cs.LG | (2604.08189v1)

Abstract: Graph-structured data jointly contain discrete topology and continuous geometry, which poses fundamental challenges for generative modeling due to heterogeneous distributions, incompatible noise dynamics, and the need for equivariant inductive biases. Existing flow-matching approaches for graph generation typically decouple structure from geometry, lack synchronized cross-domain dynamics, and rely on iterative sampling, often resulting in physically inconsistent molecular conformations and slow sampling. To address these limitations, we propose Equivariant MeanFlow (EQUIMF), a unified SE(3)-equivariant generative framework that jointly models discrete and continuous components through synchronized MeanFlow dynamics. EQUIMF introduces a unified time bridge and average-velocity updates with mutual conditioning between structure and geometry, enabling efficient few-step generation while preserving physical consistency. Moreover, we develop a novel discrete MeanFlow formulation with a simple yet effective parameterization to support efficient generation over discrete graph structures. Extensive experiments demonstrate that EQUIMF consistently outperforms prior diffusion and flow-matching methods in generation quality, physical validity, and sampling efficiency.

Summary

  • The paper presents EQUIMF, an SE(3)-equivariant framework that jointly models discrete graph topology and continuous 3D geometry via synchronized MeanFlow dynamics.
  • It achieves high atom (98.9%) and molecule (93.0%) stability on QM9, surpassing existing methods in fidelity and sampling efficiency with few-step denoising.
  • The approach leverages mutual conditioning to ensure chemically valid and physically consistent molecular graphs, enhancing applications in drug design and materials science.

Equivariant Joint MeanFlow for Efficient Molecular Graph Generation

Introduction

This paper introduces Equivariant MeanFlow (EQUIMF), a generative modeling framework for molecular graphs that enforces SE(3)\mathrm{SE}(3)-equivariance and unifies the synthesis of discrete topological structure and continuous 3D geometry. Standard approaches in graph generative modeling separately treat topology (nodes/edges) and geometry (3D coordinates), typically using iterative, decoupled denoising, resulting in physically inconsistent samples or slow inference. In contrast, EQUIMF simultaneously models both domains through synchronized MeanFlow dynamics and mutual conditioning, leveraging a backbone SE(3)-equivariant graph neural network (EGNN). The framework introduces a new discrete MeanFlow formulation and efficient joint training, enabling few-step, high-fidelity molecular generation with rigorous geometric and physical constraints.

Framework Architecture

The core architecture involves a shared EGNN-based encoder ϕθ1\phi_{\theta_1} for cross-modal fusion, with two generation heads: a continuous MeanFlow head ϕθ2\phi_{\theta_2} for coordinates and a discrete MeanFlow head ϕθ3\phi_{\theta_3} for categorical graph attributes. Both heads evolve synchronously via a shared time bridge and jointly optimized loss, producing mutually conditioned updates where geometry and structure consistently inform one another. Figure 1

Figure 1: Architecture of Equivariant MeanFlow. The framework processes both discrete graph structure and continuous 3D molecular geometry via a shared SE(3)-equivariant encoder. Discrete and continuous MeanFlow heads are optimized for mutually conditioned joint generation.

Mutual conditioning is realized by feeding the latent representation from the shared encoder to both heads, such that discrete topology predictions depend on geometric features and vice versa. This coupling is critical for maintaining chemical validity and realistic conformations, directly addressing the mode collapse and conformation-inconsistency issues of prior approaches.

MeanFlow Dynamics: Discrete and Continuous Domains

Discrete MeanFlow

The discrete component introduces a MeanFlow parameterization for categorical states (nodes and edges) based on a Continuous-Time Markov Chain (CTMC). Unlike previous discrete flow matching (DFM) models, which estimate instantaneous rate matrices and require fine-grained time steps, EQUIMF directly parameterizes the average rate matrix over a finite interval. This allows for efficient few-step sampling and accelerates training convergence without sacrificing fidelity.

Continuous MeanFlow

For the continuous (coordinate) domain, EQUIMF applies a conditional, symmetry-preserving MeanFlow. The model encodes the average velocity field connecting instances along the time bridge, contrasting with standard ODE/velocity-based flows which rely on local, instantaneous information. Utilizing the EGNN backbone ensures strict SE(3)-equivariance in both coordinate updates and neural parameterization, which is theoretically guaranteed by explicit proofs in the paper.

Theoretical Properties

The model's design enforces that (i) all discrete features are SE(3)-invariant; (ii) continuous MeanFlow velocity fields and (iii) discrete transition kernels are SE(3)-equivariant or invariant as appropriate. The universal equivariance ensures physical plausibility: generated molecules have consistent atomic types, bonding, and stable 3D conformations under any rigid transformation. Theoretical results, justified via explicit propositions and proofs, confirm that the coupled generation process is equivariant, a property not simultaneously achieved in previous frameworks.

Experimental Evaluation

Molecular Generation Quality

The framework is evaluated on QM9 (small organics) and GEOM-DRUG (large molecules) benchmarks. Key metrics include atom stability, molecule stability, chemical validity (RDKit), and uniqueness. EQUIMF attains atom stability of 98.9% and molecule stability of 93.0% on QM9, outperforming state-of-the-art equivariant diffusion and flow-matching methods on all axes. Furthermore, validity and uniqueness exhibit substantial gains, showing the model's capacity for both fidelity and diversity.

Property-Controlled Generation

On conditional molecule generation (property matching), EQUIMF achieves lower mean absolute errors than both EDM and EQUIFM baselines on key electronic and thermodynamic properties, indicating enhanced controllability and reduced bias in conditional scenarios.

Sampling Efficiency

EQUIMF enables efficient few-step denoising: at a 0.95 stability threshold, it requires half as many steps as the current SOTA baseline, confirming the practical advantage of MeanFlow-based interval modeling. Figure 2

Figure 2: Comparison of molecular stability versus sampling steps. EQUIMF surpasses the baseline in convergence rate and final stability.

Ablation Analyses

Ablations demonstrate that SE(3)-equivariant inductive bias in the backbone is pivotal—removal leads to significant drops (up to 5% absolute) in molecule stability. Similarly, the mutual conditioning strategy is essential; decoupling geometry and structure produces chemically invalid or unstable samples, with the fully independent generation pipeline collapsing to as low as 28% molecule stability.

Implications and Future Directions

The primary practical implication of EQUIMF is the reliable, efficient synthesis of physically plausible, chemically valid molecular graphs, which is critical for generative design in chemistry and materials science. By bridging discrete and continuous modalities in a single equivariant framework, EQUIMF provides a scalable basis for large-scale molecule libraries, property-conditioned generation, and downstream tasks such as drug design and targeted discovery.

Theoretically, this work demonstrates that explicit mutual conditioning and interval-based MeanFlow dynamics offer concrete advantages over decoupled or instantaneous-velocity paradigms, both in sample quality and efficiency. Remarkably, the discrete MeanFlow parameterization points toward broader applications in generative modeling over general discrete structures, including text or program synthesis, if suitably extended.

Still, the current design balances between few-step efficiency and sample accuracy; future research will likely explore the feasibility of true one-step generation and direct optimal transport couplings in discrete-continuous domains. Extending equivariant MeanFlow to handle multi-modal distributions, large graphs, or active learning paradigms represents an open, impactful direction for generative modeling research.

Conclusion

EQUIMF establishes an efficient, theoretically principled, and empirically validated framework for molecular graph generation. Its main strengths are (i) joint, SE(3)-equivariant synthesis of discrete and continuous domains; (ii) substantial improvements in sample fidelity, validity, and uniqueness; and (iii) order-of-magnitude faster inference through few-step MeanFlow processes. This advances the state of generative modeling for structured, symmetry-rich data and provides a valuable methodology for both scientific and engineering applications.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Collections

Sign up for free to add this paper to one or more collections.