How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers
Andreas Peter Steiner, Alexander Kolesnikov, Xiaohua Zhai, Ross Wightman, Jakob Uszkoreit, Lucas Beyer
Accepted papers at TMLR
3,657 posts
- ZerO Initialization: Initializing Neural Networks with only Zeros and Ones Jiawei Zhao, Florian Tobias Schaefer, Anima Anandkumar
- DINOv2: Learning Robust Visual Features without Supervision Maxime Oquab, Timothée Darcet, Théo Moutakanni et al.. Action editor: Abhishek Kumar. openreview.net/forum?id=a68SU… #supervised #visual #features
- A Simple Convergence Proof of Adam and Adagrad Alexandre Défossez, Leon Bottou, Francis Bach, Nicolas Usunier
- Greedy Bayesian Posterior Approximation with Deep Ensembles Aleksei Tiulpin, Matthew B. Blaschko
- Emergent Abilities of Large Language Models Jason Wei, Yi Tay, Rishi Bommasani et al.
- Modular Deep Learning Jonas Pfeiffer, Sebastian Ruder, Ivan Vulić, Edoardo Ponti. Action editor: Karthik Narasimhan. openreview.net/forum?id=z9EkX… #modular #modularity #hierarchical
- A geometrical connection between sparse and low-rank matrices and its application to manifold lea... Lawrence K. Saul openreview.net/forum?id=p8gnc… #sparse #manifold #dimensional
- Representation Alignment in Neural Networks Ehsan Imani, Wei Hu, Martha White
- Understanding convolution on graphs via energies Francesco Di Giovanni, James Rowbottom, Benjamin Paul Chamberlain et al.. Action editor: Guillaume Rabusseau. openreview.net/forum?id=v5ew3… #convolutions #graphs #convolutional
- The Evolution of Out-of-Distribution Robustness Throughout Fine-Tuning Anders Johan Andreassen, Yasaman Bahri, Behnam Neyshabur, Rebecca Roelofs
- Deep Generative Models through the Lens of the Manifold Hypothesis: A Survey and New Connections Gabriel Loaiza-Ganem, Brendan Leigh Ross, Rasa Hosseinzadeh, Anthony L. Caterini, Jesse C. Cresswell. Action editor: Serguei Barannikov.
- Structured Uncertainty in the Observation Space of Variational Autoencoders James Langley, Miguel Monteiro, Charles Jones, Nick Pawlowski, Ben Glocker
- Self-Supervision is All You Need for Solving Rubik’s Cube Kyo Takano. Action editor: Marc Lanctot. openreview.net/forum?id=bnBeN… #rubik #cube #deepcubea

