- The paper introduces IBA-Net, which combines Mixture-of-Experts feature customization with Neural Collapse-driven classifier calibration to address non-optimal sampling and class imbalance in animal activity recognition.
- The paper demonstrates that dynamic fusion of multi-rate sensor data significantly enhances recall, with improvements up to 16.98 percentage points on imbalanced datasets.
- The paper highlights that integrating ETF-based regularization into classifier design yields robust real-time detection, critical for effective animal health monitoring in precision livestock farming.
Optimal Sampling Strategies and Unbiased Classification Methods for Animal Activity Recognition: A Critical Analysis of IBA-Net
Introduction
The paper "Toward Optimal Sampling Rate Selection and Unbiased Classification for Precise Animal Activity Recognition" (2604.00517) addresses core challenges in wearable sensor-aided animal activity recognition (AAR), focusing on two persistent technical issues: the non-optimality of single sampling rates for heterogeneous behaviors and model bias under class imbalance. The authors propose the Individual-Behavior-Aware Network (IBA-Net), which integrates Mixture-of-Experts (MoE)-based Feature Customization (MFC) and Neural Collapse-driven Classifier Calibration (NC3) modules. The study provides comprehensive validation on multi-species (goat, cattle, horse) public datasets, demonstrating robust improvements in recall, especially for minority classes.
Wearable sensor data for AAR present two fundamental classification bottlenecks. First, distinct animal behaviors (e.g., grazing, running, trotting) distribute discriminative features at behavior-specific temporal granularities. The selection of a unified sensor sampling rate is suboptimal; it leads to inconsistent behavior-level classification accuracy as signal attributes relevant for some classes are lost or overwhelmed by noise at other rates. Second, behavioral class imbalance inherent to natural activity distributions biases deep models toward majority classes, severely impairing minority class recognition. Existing literature has focused on naive resampling or cost-sensitive loss reweighting, which fails to address the geometric degeneration of classifier representations under imbalance.
IBA-Net Architecture
Mixture-of-Experts Feature Customization (MFC)
The MFC module implements an MoE architecture to extract behavior-specialized representations by integrating features from sensor data at multiple sampling rates (e.g., 50 Hz, 25 Hz, 12.5 Hz). Each expert backbone is parameter-shared, with temporal dimensionality resolved via global average pooling. Soft router-based fusion, governed by a learnable MLP and SoftMax controller, adaptively weights expert outputs to maximize the presence of the optimal sampling rate's discriminative content for each behavior in unknown label settings. As highlighted in ablation studies, soft-weighted fusion outperforms concatenation, addition, and hard feature selection in overall metrics, confirming the necessity of dynamic fusion for multi-scale sensor signals.
Neural Collapse-driven Classifier Calibration (NC3)
IBA-Net's NC3 module exploits the geometrical theory of neural collapse, specifically the tendency of well-trained networks under balanced conditions to achieve embedding classifier structures akin to simplex equiangular tight frames (ETF). The NC3 module introduces a fixed ETF classifier branch—its weights synthesized to form a maximally spread simplex—linearly combined with a conventional learnable FC classifier. The ETF branch remains static during training, regularizing the angular distribution of class prototypes in embedding space and maximizing separation between minority and majority vectors. This mitigates the "minority collapse" observed in classical imbalance settings, as confirmed by visualizations (pairwise angles clustered around 90∘ vs. highly variable in the baseline).
Experimental Results
Comprehensive Multi-Species Evaluation
IBA-Net was benchmarked against state-of-the-art resampling (KMeansSMOTE, RandomUnderSampler), reweighting (cost-sensitive cross-entropy, class-balanced focal loss, Adaptive Class Suppression) and strong single-rate baselines. Evaluations on the goat, cattle, and horse datasets show that IBA-Net achieves the highest accuracy, F1-score, precision, and recall across all cases. Specifically, on the severely imbalanced goat dataset (imbalance ratio 98.05), the model achieves 93.17% accuracy, 87.17% F1-score, and 91.71% recall—a recall improvement of 16.98 percentage points over the baseline. Similar patterns are observed for cattle (recall: 93.64%, +15.78 pp) and horse (recall: 84.65%, +4.91 pp).
Ablations, Sensitivity, and Robustness
Ablation confirms criticality of both MFC and NC3. Disabling MFC results in substantial performance collapse; ablating NC3 has pronounced negative impact on minority behavior accuracy. The model is robust to artificially increased imbalance; performance degrades gracefully, far outperforming baselines. Optimal k (ETF mixing coefficient) is dataset-specific (optimum around $0.3$); fully fixing the classifier to the ETF (i.e., k=1) eliminates classification capability, confirming the necessity of hybrid static-learnable classifier composition.
Implications and Theoretical Advances
The incorporation of behavior-aware optimal sampling through MoE and geometric classifier calibration via ETF regularization substantiates a paradigm shift towards fine-grained, per-class optimization in AAR. The results directly challenge the sufficiency of maximizing aggregate accuracy/f1 in real-world multi-behavior AAR, emphasizing equitable per-class performance. The NC3 mechanism demonstrates a practically viable path to enforcing neural collapse geometry in imbalanced regimes, a property previously understood only in theory or under synthetic balance.
Practically, deployment of IBA-Net leads to more reliable detection of rare but welfare-critical animal behaviors (e.g., lameness, abnormal feeding), directly improving animal health monitoring and thus operational outcomes for precision livestock farming. The modular architecture enables straightforward integration of edge-compatible variants for IoT deployments, with the adaptive fusion matching the heterogeneity of bandwidth and battery constraints encountered in practice.
Limitations and Future Directions
Model complexity increases due to multiple expert branches; although inference remains real-time (<1.5 ms per sample on commodity CPUs), future work should integrate parameter-efficient adapters (e.g., LoRA) to decrease resource footprint. Current encoders are CNN-based; future work should extend to architectures with proven superiority in time series domains (InceptionTime, ResNet-1D, Transformers), which may further enhance discriminative capacity for long-range dependencies or fine temporal signals.
Current models are animal-species specific, with generalization across species unresolved due to domain shift. There is a clear pathway towards foundational models for universal AAR, in analogy with recent advances in multimodal foundation models for NLP and vision, with potential for cross-species transfer and few-shot adaptation.
Conclusion
The IBA-Net architecture presents a methodologically rigorous, empirically validated solution for the simultaneous challenges of optimal sampling rate selection and unbiased behavioral classification in wearable sensor-based AAR. Through MoE-driven feature customization and ETF-inspired classifier calibration, IBA-Net consistently improves minority-class sensitivity and overall robustness without sacrificing real-time applicability. The proposed innovations significantly advance both the technical state-of-the-art and practical deployment potential of AAR systems, while the theoretical integration of geometric regularization into classifier design opens new avenues for imbalanced learning both in and beyond livestock monitoring.