RECOMBINER: Robust and Enhanced Compression with Bayesian Implicit Neural Representations
Abstract: COMpression with Bayesian Implicit NEural Representations (COMBINER) is a recent data compression method that addresses a key inefficiency of previous Implicit Neural Representation (INR)-based approaches: it avoids quantization and enables direct optimization of the rate-distortion performance. However, COMBINER still has significant limitations: 1) it uses factorized priors and posterior approximations that lack flexibility; 2) it cannot effectively adapt to local deviations from global patterns in the data; and 3) its performance can be susceptible to modeling choices and the variational parameters' initializations. Our proposed method, Robust and Enhanced COMBINER (RECOMBINER), addresses these issues by 1) enriching the variational approximation while retaining a low computational cost via a linear reparameterization of the INR weights; 2) augmenting our INRs with learnable positional encodings that enable them to adapt to local details; and 3) splitting high-resolution data into patches to increase robustness and utilizing expressive hierarchical priors to capture dependencies across patches. We conduct extensive experiments across several data modalities, showcasing that RECOMBINER achieves competitive results with the best INR-based methods and even outperforms autoencoder-based codecs on low-resolution images at low bitrates. Our PyTorch implementation is available at https://github.com/cambridge-mlg/RECOMBINER/.
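To illustrate the first point, the following is a minimal sketch, not the authors' implementation, of how a linear reparameterization can enrich a factorized variational approximation. It assumes a factorized Gaussian over a latent vector h that is mapped to the INR weights by w = A h, and it assumes the coding cost is measured by a KL divergence to a factorized Gaussian prior. The matrix `A`, the dimensions, and all numerical values are hypothetical and chosen only for illustration.

```python
import math
import random


def kl_diag_gaussians(mu_q, sigma_q, mu_p, sigma_p):
    """KL(q || p) in nats between two factorized (diagonal) Gaussians."""
    total = 0.0
    for mq, sq, mp, sp in zip(mu_q, sigma_q, mu_p, sigma_p):
        total += math.log(sp / sq) + (sq ** 2 + (mq - mp) ** 2) / (2 * sp ** 2) - 0.5
    return total


def matvec(A, h):
    """Plain matrix-vector product for a list-of-rows matrix."""
    return [sum(a * x for a, x in zip(row, h)) for row in A]


def induced_covariance(A, sigma):
    """Covariance of w = A h when h has independent components: A diag(sigma^2) A^T."""
    n, k = len(A), len(sigma)
    return [
        [sum(A[i][m] * sigma[m] ** 2 * A[j][m] for m in range(k)) for j in range(n)]
        for i in range(n)
    ]


def sample_weights(A, mu, sigma, rng):
    """Draw h from the factorized Gaussian, then map it linearly to weights."""
    h = [rng.gauss(m, s) for m, s in zip(mu, sigma)]
    return matvec(A, h)


# Hypothetical 2x2 example: A mixes the two latent coordinates.
A = [[1.0, 0.5],
     [0.0, 1.0]]
mu_q, sigma_q = [0.2, -0.1], [0.1, 0.3]   # factorized posterior over h (assumed values)
mu_p, sigma_p = [0.0, 0.0], [1.0, 1.0]    # factorized prior over h (assumed values)

rate_nats = kl_diag_gaussians(mu_q, sigma_q, mu_p, sigma_p)
rate_bits = rate_nats / math.log(2)
cov_w = induced_covariance(A, sigma_q)
w = sample_weights(A, mu_q, sigma_q, random.Random(0))
```

In this sketch the KL term is still a cheap sum over independent coordinates of h, while the off-diagonal entries of `cov_w` are nonzero, so the distribution induced over the weights is no longer factorized.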