FedCRF: A Federated Cross-domain Recommendation Method with Semantic-driven Deep Knowledge Fusion

Published 20 Apr 2026 in cs.IR | (2604.17681v1)

Abstract: As user behavior data becomes increasingly scattered across different platforms, achieving cross-domain knowledge fusion while preserving privacy has become a critical issue in recommender systems. Existing PPCDR methods usually rely on overlapping users or items as a bridge, making them inapplicable to non-overlapping scenarios. They also suffer from limitations in the collaborative modeling of global and local semantics. To this end, this paper proposes a Federated Cross-domain Recommendation method with deep knowledge Fusion (FedCRF). Using textual semantics as a cross-domain bridge, FedCRF achieves cross-domain knowledge transfer via federated semantic learning under the non-overlapping scenario. Specifically, FedCRF constructs global semantic clusters on the server side to extract shared semantic information, and designs a FGSAT module on the client side to dynamically adapt to local data distributions and alleviate cross-domain distribution shift. Meanwhile, it builds a semantic graph based on textual features to learn representations that integrate both structural and semantic information, and introduces contrastive learning constraints between global and local semantic representations to enhance semantic consistency and promote deep knowledge fusion. In this framework, only item semantic representations are shared, while user interaction data remains locally stored, effectively mitigating privacy leakage risks. Experimental results on multiple real-world datasets show that FedCRF significantly outperforms existing methods in terms of Recall@20 and NDCG@20, validating its effectiveness and superiority in non-overlapping cross-domain recommendation scenarios.

Abstract PDF Upgrade to Chat

Authors (6)

Summary

The paper introduces a federated framework that leverages semantic item embeddings for privacy-preserving cross-domain recommendation.
It employs a two-stage design with global semantic clustering followed by local fine-tuning using contrastive and graph-based fusion.
Experimental results demonstrate up to 26% Recall@10 improvement while ensuring minimal data leakage and faster per-epoch training.

FedCRF: A Federated Cross-domain Recommendation Method with Semantic-driven Deep Knowledge Fusion

Motivation and Problem Setting

FedCRF addresses privacy-preserving cross-domain recommendation (PPCDR) in a non-overlapping scenario, where user and item sets are strictly disjoint across domains, and personal behavioral data cannot be directly exchanged due to legal and commercial compliance (such as GDPR). Existing PPCDR methods often depend on user/item overlaps as bridges or suffer from suboptimal alignment of global and local semantics, limiting their efficacy and realism. Furthermore, most approaches rely heavily on item IDs, overlooking the richer semantic structure embedded in textual content.

FedCRF proposes a federated semantic learning framework, leveraging continuous item textual embeddings as a universal bridge for cross-domain knowledge transfer. By avoiding entity-level alignment and keeping user data strictly local, the method ensures robust privacy guarantees and applicability to real-world scenarios with heterogeneous, non-overlapping data sources.

Framework Design

FedCRF operates in two stages: federated semantic pre-training and local fine-tuning with contrastive fusion, as depicted in Fig. 1 and Fig. 3.

In Stage 1, clients encode item text using Sentence-Transformers and upload semantic representations to a central server. The server performs global K-means clustering, discovering domain-agnostic semantic centroids and distributing them back to clients. Each client then activates the Fine-Grained Semantic Adaptation and Transfer (FGSAT) module, which constructs secondary semantic centers and a semantic graph, adapting global prototypes to local distributions through GNN propagation and an attention-based fusion mechanism.

Figure 1: Federated semantic clustering and client-side semantic adaptation in Stage 1 of FedCRF.

Stage 2 takes the pre-trained semantic representations and refines them locally. Clients build both local and pre-trained semantic graphs, inject textual and structural information into item IDs via GCNs, and enforce bidirectional contrastive loss to align global and local views. Multi-source semantic and ID features are fused for final prediction, optimizing a joint objective that balances ranking, contrastive, and knowledge distillation losses.

Figure 2: Stage 2 workflow of FedCRF, combining graph convolution and pre-trained-local contrastive learning for deep semantic fusion.

This architecture allows adaptive balancing between global consensus and local granularity, overcoming the pitfalls of static prototype-based alignment and single-modality representation.

Component Analysis

Semantic Clustering and Adaptation:

FedCRF's server-side K-means produces coarse global semantic clusters, which act as seeds for local adaptation. The FGSAT module constructs fine-grained local centroids, mitigates distribution shift, and uses GNNs to model inter-item semantic relations. Attention-driven fusion further enhances expressiveness by dynamically weighting global, local, and item-level features.

Contrastive Fusion and Graph-based Representation:

Stage 2 introduces dual-view contrastive learning—aligning local GCN-propagated features with pre-trained semantic priors, pushing semantically similar items closer and dissimilar items apart. This mechanism is empirically shown to provide finer cross-domain alignment than static matching.

Privacy Preservation:

Throughout, only semantic item representations are communicated, keeping user interaction logs strictly on-device. Experiments using Similarity-based Inference Attacks demonstrate negligible privacy leakage (F1 scores < 0.05 across all datasets), confirming the practical robustness of the framework.

Multi-modality Fusion:

FedCRF achieves deep integration of textual semantics and collaborative signals (item IDs), combining both modalities at the representation level through GNNs and attention—unlike prior works relying on simple concatenation or addition.

Experimental Evaluation

FedCRF is evaluated on three cross-domain pairs (Kitchen–Food, Care–Beauty, OnlineRetail–Food), using full ranking and Recall@K, NDCG@K metrics. Comparison spans single-domain GNNs, non-overlapping CDR, and federated baselines (FFMSR, FedDCSR), controlling for architecture and hyperparameters.

Key Findings:

FedCRF consistently outperforms all baselines by up to 26% Recall@10 and 24% NDCG@10 in non-overlapping scenarios (see Table 1).
Ablation studies reveal that the removal of FGSAT, contrastive learning, or federated pre-training each significantly degrades accuracy, confirming their synergistic necessity.
Hyperparameter sensitivity analysis establishes optimal settings for contrastive loss, distillation weights, and cluster count. Overly strong contrastive or distillation terms hurt personalization.
In cross-platform setups, FedCRF demonstrates robust transfer and generalization, outperforming federated and non-overlapping CDR approaches.
Time complexity analysis indicates FedCRF's graph-based structure offers faster per-epoch training than sequential alignment or prototype-driven methods.
Visualization via t-SNE illustrates improved cross-domain semantic overlap and reduced distribution gap after FGSAT adaptation.
Figure 3: Detailed sensitivity analysis of FedCRF performance versus $\alpha$ , $\lambda_{\text{KD}$, and cluster count $K$ .

Figure 4: t-SNE visualization confirming the enhanced overlap and semantic alignment between Kitchen and Food domains via FGSAT adaptation.

Practical and Theoretical Implications

FedCRF demonstrates that federated semantic clustering and adaptive local fusion can enable robust, privacy-preserving, and effective cross-domain recommendation even in non-overlapping environments. The approach circumvents the fundamental limitations of static prototypes, entity-level alignment, and single-modality representations. These results implicate the viability of semantic-driven federated learning as a core paradigm for multi-platform personalized recommendation under real-world privacy constraints.

Future directions include efficient communication compression, asynchronous federated updates, and dynamic adaptation to evolving user/item repositories—moving towards scalable, online, and interactive PPCDR systems.

Conclusion

FedCRF substantially advances the state of privacy-preserving cross-domain recommendation by introducing semantic-driven federated clustering, fine-grained local adaptation, dual-view contrastive fusion, and multi-modal representation learning. Experimental evidence supports its superiority in accuracy, efficiency, and privacy robustness across diverse domains and platforms. Limitations regarding communication overhead and adaptation to non-static environments remain, motivating future enhancements in federated recommendation research.

Markdown Report Issue