Papers
Topics
Authors
Recent
Search
2000 character limit reached

Deep Transfer Learning: Model Framework and Error Analysis

Published 12 Oct 2024 in cs.LG and stat.ML | (2410.09383v3)

Abstract: This paper presents a framework for deep transfer learning, which aims to leverage information from multi-domain upstream data with a large number of samples $n$ to a single-domain downstream task with a considerably smaller number of samples $m$, where $m \ll n$, in order to enhance performance on downstream task. Our framework offers several intriguing features. First, it allows the existence of both shared and domain-specific features across multi-domain data and provides a framework for automatic identification, achieving precise transfer and utilization of information. Second, the framework explicitly identifies upstream features that contribute to downstream tasks, establishing clear relationships between upstream domains and downstream tasks, thereby enhancing interpretability. Error analysis shows that our framework can significantly improve the convergence rate for learning Lipschitz functions in downstream supervised tasks, reducing it from $\tilde{O}(m{-\frac{1}{2(d+2)}}+n{-\frac{1}{2(d+2)}})$ ("no transfer") to $\tilde{O}(m{-\frac{1}{2(d*+3)}} + n{-\frac{1}{2(d+2)}})$ ("partial transfer"), and even to $\tilde{O}(m{-1/2}+n{-\frac{1}{2(d+2)}})$ ("complete transfer"), where $d* \ll d$ and $d$ is the dimension of the observed data. Our theoretical findings are supported by empirical experiments on image classification and regression datasets.

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.