Accepted papers at TMLR (@TmlrPub) / X

Accepted papers at TMLR

3,657 posts

Accepted papers at TMLR

@TmlrPub

Joined March 2022

Accepted papers at TMLR
@TmlrPub
May 27, 2022
How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers Andreas Peter Steiner, Alexander Kolesnikov, Xiaohua Zhai, Ross Wightman, Jakob Uszkoreit, Lucas Beyer
openreview.net
How to train your ViT? Data, Augmentation, and Regularization in...
Vision Transformers (ViT) have been shown to attain highly competitive performance for a wide range of vision applications, such as image classification, object detection and semantic image...
Accepted papers at TMLR
@TmlrPub
Nov 9, 2022
ZerO Initialization: Initializing Neural Networks with only Zeros and Ones Jiawei Zhao, Florian Tobias Schaefer, Anima Anandkumar
openreview.net
ZerO Initialization: Initializing Neural Networks with only Zeros...
Deep neural networks are usually initialized with random weights, with adequately selected initial variance to ensure stable signal propagation during training. However, selecting the appropriate...
Accepted papers at TMLR
@TmlrPub
Jan 22, 2024
DINOv2: Learning Robust Visual Features without Supervision Maxime Oquab, Timothée Darcet, Théo Moutakanni et al.. Action editor: Abhishek Kumar. openreview.net/forum?id=a68SU… #supervised #visual #features
openreview.net
DINOv2: Learning Robust Visual Features without Supervision
The recent breakthroughs in natural language processing for model pretraining on large quantities of data have opened the way for similar foundation models in computer vision. These models could...
38K
Accepted papers at TMLR
@TmlrPub
Oct 28, 2022
A Simple Convergence Proof of Adam and Adagrad Alexandre Défossez, Leon Bottou, Francis Bach, Nicolas Usunier
openreview.net
A Simple Convergence Proof of Adam and Adagrad
We provide a simple proof of convergence covering both the Adam and Adagrad adaptive optimization algorithms when applied to smooth (possibly non-convex) objective functions with bounded gradients....
Accepted papers at TMLR
@TmlrPub
Jun 3, 2022
Greedy Bayesian Posterior Approximation with Deep Ensembles Aleksei Tiulpin, Matthew B. Blaschko
openreview.net
Greedy Bayesian Posterior Approximation with Deep Ensembles
Ensembles of independently trained neural networks are a state-of-the-art approach to estimate predictive uncertainty in Deep Learning, and can be interpreted as an approximation of the posterior...
Accepted papers at TMLR
@TmlrPub
Aug 31, 2022
Emergent Abilities of Large Language Models Jason Wei, Yi Tay, Rishi Bommasani et al.
openreview.net
Emergent Abilities of Large Language Models
Scaling up language models has been shown to predictably improve performance and sample efficiency on a wide range of downstream tasks. This paper instead discusses an unpredictable phenomenon that...
Accepted papers at TMLR
@TmlrPub
Dec 24, 2023
Modular Deep Learning Jonas Pfeiffer, Sebastian Ruder, Ivan Vulić, Edoardo Ponti. Action editor: Karthik Narasimhan. openreview.net/forum?id=z9EkX… #modular #modularity #hierarchical
openreview.net
Modular Deep Learning
Transfer learning has recently become the dominant paradigm of machine learning. Pre-trained models fine-tuned for downstream tasks achieve better performance with fewer labelled examples....
25K
Accepted papers at TMLR
@TmlrPub
Dec 26, 2022
A geometrical connection between sparse and low-rank matrices and its application to manifold lea... Lawrence K. Saul openreview.net/forum?id=p8gnc… #sparse #manifold #dimensional
openreview.net
A geometrical connection between sparse and low-rank matrices and...
We consider when a sparse nonnegative matrix $\mathbf{S}$ can be recovered, via an elementwise nonlinearity, from a real-valued matrix~$\mathbf{L}$ of significantly lower rank. Of particular...
17K
Accepted papers at TMLR
@TmlrPub
Sep 26, 2022
Representation Alignment in Neural Networks Ehsan Imani, Wei Hu, Martha White
openreview.net
Representation Alignment in Neural Networks
It is now a standard for neural network representations to be trained on large, publicly available datasets, and used for new problems. The reasons for why neural network representations have been...
Accepted papers at TMLR
@TmlrPub
Aug 23, 2023
Understanding convolution on graphs via energies Francesco Di Giovanni, James Rowbottom, Benjamin Paul Chamberlain et al.. Action editor: Guillaume Rabusseau. openreview.net/forum?id=v5ew3… #convolutions #graphs #convolutional
openreview.net
Understanding convolution on graphs via energies
Graph Neural Networks (GNNs) typically operate by message-passing, where the state of a node is updated based on the information received from its neighbours. Most message-passing models act as...
16K
Accepted papers at TMLR
@TmlrPub
Oct 16, 2022
The Evolution of Out-of-Distribution Robustness Throughout Fine-Tuning Anders Johan Andreassen, Yasaman Bahri, Behnam Neyshabur, Rebecca Roelofs
openreview.net
The Evolution of Out-of-Distribution Robustness Throughout Fine-Tuning
Although machine learning models typically experience a drop in performance on out-of-distribution data, accuracies on in- versus out-of-distribution data are widely observed to follow a single...
Accepted papers at TMLR
@TmlrPub
Sep 24, 2024
Deep Generative Models through the Lens of the Manifold Hypothesis: A Survey and New Connections Gabriel Loaiza-Ganem, Brendan Leigh Ross, Rasa Hosseinzadeh, Anthony L. Caterini, Jesse C. Cresswell. Action editor: Serguei Barannikov.
openreview.net
Deep Generative Models through the Lens of the Manifold Hypothesis:...
In recent years there has been increased interest in understanding the interplay between deep generative models (DGMs) and the manifold hypothesis. Research in this area focuses on understanding...
5K
Accepted papers at TMLR
@TmlrPub
Oct 30, 2022
Structured Uncertainty in the Observation Space of Variational Autoencoders James Langley, Miguel Monteiro, Charles Jones, Nick Pawlowski, Ben Glocker
openreview.net
Structured Uncertainty in the Observation Space of Variational...
Variational autoencoders (VAEs) are a popular class of deep generative models with many variants and a wide range of applications. Improvements upon the standard VAE mostly focus on the modelling...
Accepted papers at TMLR
@TmlrPub
Jul 25, 2023
Self-Supervision is All You Need for Solving Rubik’s Cube Kyo Takano. Action editor: Marc Lanctot. openreview.net/forum?id=bnBeN… #rubik #cube #deepcubea
openreview.net
Self-Supervision is All You Need for Solving Rubik’s Cube
Existing combinatorial search methods are often complex and require some level of expertise. This work introduces a simple and efficient deep learning method for solving combinatorial problems with...
199K