MUDAS: Mote-scale Unsupervised Domain Adaptation in Multi-label Sound Classification

Published 12 Jun 2025 in cs.AI, cs.SD, and eess.AS | (2506.11331v1)

Abstract: Unsupervised Domain Adaptation (UDA) is essential for adapting machine learning models to new, unlabeled environments where data distribution shifts can degrade performance. Existing UDA algorithms are designed for single-label tasks and rely on significant computational resources, limiting their use in multi-label scenarios and in resource-constrained IoT devices. Overcoming these limitations is particularly challenging in contexts such as urban sound classification, where overlapping sounds and varying acoustics require robust, adaptive multi-label capabilities on low-power, on-device systems. To address these limitations, we introduce Mote-scale Unsupervised Domain Adaptation for Sounds (MUDAS), a UDA framework developed for multi-label sound classification in resource-constrained IoT settings. MUDAS efficiently adapts models by selectively retraining the classifier in situ using high-confidence data, minimizing computational and memory requirements to suit on-device deployment. Additionally, MUDAS incorporates class-specific adaptive thresholds to generate reliable pseudo-labels and applies diversity regularization to improve multi-label classification accuracy. In evaluations on the SONYC Urban Sound Tagging (SONYC-UST) dataset recorded at various New York City locations, MUDAS demonstrates notable improvements in classification accuracy over existing UDA algorithms, achieving good performance in a resource-constrained IoT setting.

Abstract PDF Upgrade to Chat

Authors (4)

Summary

MUDAS: Mote-scale Unsupervised Domain Adaptation in Multi-label Sound Classification

This paper investigates Unsupervised Domain Adaptation (UDA) strategies explicitly tailored for multi-label sound classification problems within resource-constrained IoT settings, where computational limitations are significant challenges. The introduction of Mote-scale Unsupervised Domain Adaptation for Sounds (MUDAS) marks a noteworthy advancement, designed to surmount the limitations that traditional UDA algorithms face when applied to multi-label tasks.

Framework Overview

MUDAS is a specialized framework that addresses the multi-label classification complexity inherent in urban sound monitoring systems. The framework operates efficiently on low-power IoT devices by employing selective retraining techniques based on high-confidence data. By utilizing class-specific adaptive thresholds and incorporating diversity regularization, MUDAS enhances the generation of pseudo-labels, thereby improving multi-label classification accuracy. These methodological choices enable MUDAS to function robustly in changing acoustic environments, which often feature overlapping sounds and diverse acoustics typical of urban settings.

Methodological Contributions

Among its main contributions are several techniques beyond what current single-label UDA methods offer:

Selective Retraining: MUDAS adapts models in situ, minimizing computational and memory demands by periodically retraining only the classifier using high-confidence embeddings.
Adaptive Thresholding: This process uses class-specific positive and negative thresholds to improve label reliability, a departure from single-threshold approaches common in less complex classification tasks.
Diversity Regularization: A strategic approach to reinforce generalization and robustness by discouraging overfitting and promoting balanced predictions across multiple classes.

The paper also includes robust evaluations of MUDAS using the SONYC Urban Sound Tagging dataset, where the framework demonstrated appreciable improvements in classification accuracy compared to existing benchmarks. It reached performance levels close to upper-bound scenarios while constrained by the typical limited resources of IoT devices.

Experimental Insights

Experiments were conducted across various urban settings in New York City, simulating real-world domain shifts by treating different geographical locations as source and target domains. MUDAS consistently outperformed lower-bound models, showing adeptness at managing domain shift effects. Moreover, factors such as the effective calibration of thresholds and periodic batch size adjustments were explored, providing insights into optimizing UDA for urban noise applications.

Implications and Future Developments

The paper suggests several theoretical and practical implications of MUDAS. The framework's robust design can potentially scale to other IoT-driven applications, particularly in smart city infrastructures for noise monitoring and environmental sound analytics. Moreover, its lightweight computational demands ease deployment on edge devices without compromising performance, enabling continuous data processing and immediate adaptability to domain shifts.

Further research could explore integrating federated learning approaches where MUDAS could benefit from shared insights among distributed IoT networks. This could significantly enhance model accuracy and responsiveness to acoustic changes in large-scale deployments, offering improved domain adaptation capabilities with even fewer resources required per device.

In summary, MUDAS represents a critical evolution in domain adaptation strategies tailored explicitly for complex multi-label settings in constrained environments. This framework paves the way for more efficient and accurate urban sound classification, significantly contributing to the field of UDA while keeping practical deployment at its core.

Markdown Report Issue