Log inSign up
Accepted papers at TMLR
3,657 posts
user avatar
Accepted papers at TMLR
@TmlrPub
Joined March 2022
5
Following
4,378
Followers
  • user avatar
    Accepted papers at TMLR
    @TmlrPub
    May 27, 2022
    How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers Andreas Peter Steiner, Alexander Kolesnikov, Xiaohua Zhai, Ross Wightman, Jakob Uszkoreit, Lucas Beyer
    openreview.net
    How to train your ViT? Data, Augmentation, and Regularization in...
    Vision Transformers (ViT) have been shown to attain highly competitive performance for a wide range of vision applications, such as image classification, object detection and semantic image...
  • user avatar
    Accepted papers at TMLR
    @TmlrPub
    Nov 9, 2022
    ZerO Initialization: Initializing Neural Networks with only Zeros and Ones Jiawei Zhao, Florian Tobias Schaefer, Anima Anandkumar
    openreview.net
    ZerO Initialization: Initializing Neural Networks with only Zeros...
    Deep neural networks are usually initialized with random weights, with adequately selected initial variance to ensure stable signal propagation during training. However, selecting the appropriate...
  • user avatar
    Accepted papers at TMLR
    @TmlrPub
    Jan 22, 2024
    DINOv2: Learning Robust Visual Features without Supervision Maxime Oquab, Timothée Darcet, Théo Moutakanni et al.. Action editor: Abhishek Kumar. openreview.net/forum?id=a68SU… #supervised #visual #features
    openreview.net
    DINOv2: Learning Robust Visual Features without Supervision
    The recent breakthroughs in natural language processing for model pretraining on large quantities of data have opened the way for similar foundation models in computer vision. These models could...
    38K
  • user avatar
    Accepted papers at TMLR
    @TmlrPub
    Oct 28, 2022
    A Simple Convergence Proof of Adam and Adagrad Alexandre Défossez, Leon Bottou, Francis Bach, Nicolas Usunier
    openreview.net
    A Simple Convergence Proof of Adam and Adagrad
    We provide a simple proof of convergence covering both the Adam and Adagrad adaptive optimization algorithms when applied to smooth (possibly non-convex) objective functions with bounded gradients....
  • user avatar
    Accepted papers at TMLR
    @TmlrPub
    Jun 3, 2022
    Greedy Bayesian Posterior Approximation with Deep Ensembles Aleksei Tiulpin, Matthew B. Blaschko
    openreview.net
    Greedy Bayesian Posterior Approximation with Deep Ensembles
    Ensembles of independently trained neural networks are a state-of-the-art approach to estimate predictive uncertainty in Deep Learning, and can be interpreted as an approximation of the posterior...
  • user avatar
    Accepted papers at TMLR
    @TmlrPub
    Aug 31, 2022
    Emergent Abilities of Large Language Models Jason Wei, Yi Tay, Rishi Bommasani et al.
    openreview.net
    Emergent Abilities of Large Language Models
    Scaling up language models has been shown to predictably improve performance and sample efficiency on a wide range of downstream tasks. This paper instead discusses an unpredictable phenomenon that...
  • user avatar
    Accepted papers at TMLR
    @TmlrPub
    Dec 24, 2023
    Modular Deep Learning Jonas Pfeiffer, Sebastian Ruder, Ivan Vulić, Edoardo Ponti. Action editor: Karthik Narasimhan. openreview.net/forum?id=z9EkX… #modular #modularity #hierarchical
    openreview.net
    Modular Deep Learning
    Transfer learning has recently become the dominant paradigm of machine learning. Pre-trained models fine-tuned for downstream tasks achieve better performance with fewer labelled examples....
    25K
  • user avatar
    Accepted papers at TMLR
    @TmlrPub
    Dec 26, 2022
    A geometrical connection between sparse and low-rank matrices and its application to manifold lea... Lawrence K. Saul openreview.net/forum?id=p8gnc… #sparse #manifold #dimensional
    openreview.net
    A geometrical connection between sparse and low-rank matrices and...
    We consider when a sparse nonnegative matrix $\mathbf{S}$ can be recovered, via an elementwise nonlinearity, from a real-valued matrix~$\mathbf{L}$ of significantly lower rank. Of particular...
    17K
  • user avatar
    Accepted papers at TMLR
    @TmlrPub
    Sep 26, 2022
    Representation Alignment in Neural Networks Ehsan Imani, Wei Hu, Martha White
    openreview.net
    Representation Alignment in Neural Networks
    It is now a standard for neural network representations to be trained on large, publicly available datasets, and used for new problems. The reasons for why neural network representations have been...
  • user avatar
    Accepted papers at TMLR
    @TmlrPub
    Aug 23, 2023
    Understanding convolution on graphs via energies Francesco Di Giovanni, James Rowbottom, Benjamin Paul Chamberlain et al.. Action editor: Guillaume Rabusseau. openreview.net/forum?id=v5ew3… #convolutions #graphs #convolutional
    openreview.net
    Understanding convolution on graphs via energies
    Graph Neural Networks (GNNs) typically operate by message-passing, where the state of a node is updated based on the information received from its neighbours. Most message-passing models act as...
    16K
  • user avatar
    Accepted papers at TMLR
    @TmlrPub
    Oct 16, 2022
    The Evolution of Out-of-Distribution Robustness Throughout Fine-Tuning Anders Johan Andreassen, Yasaman Bahri, Behnam Neyshabur, Rebecca Roelofs
    openreview.net
    The Evolution of Out-of-Distribution Robustness Throughout Fine-Tuning
    Although machine learning models typically experience a drop in performance on out-of-distribution data, accuracies on in- versus out-of-distribution data are widely observed to follow a single...
  • user avatar
    Accepted papers at TMLR
    @TmlrPub
    Sep 24, 2024
    Deep Generative Models through the Lens of the Manifold Hypothesis: A Survey and New Connections Gabriel Loaiza-Ganem, Brendan Leigh Ross, Rasa Hosseinzadeh, Anthony L. Caterini, Jesse C. Cresswell. Action editor: Serguei Barannikov.
    openreview.net
    Deep Generative Models through the Lens of the Manifold Hypothesis:...
    In recent years there has been increased interest in understanding the interplay between deep generative models (DGMs) and the manifold hypothesis. Research in this area focuses on understanding...
    5K
  • user avatar
    Accepted papers at TMLR
    @TmlrPub
    Oct 30, 2022
    Structured Uncertainty in the Observation Space of Variational Autoencoders James Langley, Miguel Monteiro, Charles Jones, Nick Pawlowski, Ben Glocker
    openreview.net
    Structured Uncertainty in the Observation Space of Variational...
    Variational autoencoders (VAEs) are a popular class of deep generative models with many variants and a wide range of applications. Improvements upon the standard VAE mostly focus on the modelling...
  • user avatar
    Accepted papers at TMLR
    @TmlrPub
    Jul 25, 2023
    Self-Supervision is All You Need for Solving Rubik’s Cube Kyo Takano. Action editor: Marc Lanctot. openreview.net/forum?id=bnBeN… #rubik #cube #deepcubea
    openreview.net
    Self-Supervision is All You Need for Solving Rubik’s Cube
    Existing combinatorial search methods are often complex and require some level of expertise. This work introduces a simple and efficient deep learning method for solving combinatorial problems with...
    199K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement