Log inSign up
Matthew Johnson
2,009 posts
user avatar
Matthew Johnson
@SingularMattrix
Researcher at Google Brain. I work on JAX (github.com/google/jax).
people.csail.mit.edu/mattjj/
Joined July 2010
3,336
Following
13.6K
Followers
  • user avatar
    Matthew Johnson
    @SingularMattrix
    Nov 9, 2018
    Measuring the effects of data parallelism on neural network training. A great example of careful science in machine learning.
    arXiv logo
    arxiv.org
    Measuring the Effects of Data Parallelism on Neural Network Training
    Recent hardware developments have dramatically increased the scale of data parallelism available for neural network training. Among the simplest ways to harness next-generation hardware is to...
  • user avatar
    Matthew Johnson
    @SingularMattrix
    Sep 21, 2022
    JAX+NVIDIA at #GTC22! w/ @mjsMLP nvidia.com/gtc/session-ca… New to JAX? This talk gets you up to speed. Already a JAXpert? Check out the new parallelization features at the end of the demo. And hear about how NVIDIA is making JAX faster and more scalable than ever on GPUs!
    Image
    Discover AI Sessions at NVIDIA GTC 2026 | March 16–19 San Jose
    From nvidia.com
  • user avatar
    Matthew Johnson
    @SingularMattrix
    Jul 21, 2022
    Replying to @cHHillee
    They continued: "If I wanted hard-to-install packages and general amateurism, I'd use JAX."
  • user avatar
    Matthew Johnson
    @SingularMattrix
    Feb 15, 2024
    Replying to @soumithchintala and @JeffDean
    Yeah! That was one of the first things people (I think @sharadvikram) tried with the JAX codebase, as in Fig 2 of the tech report: storage.googleapis.com/deepmind-media…
    Image
    7.5K
  • user avatar
    Matthew Johnson
    @SingularMattrix
    Aug 7, 2021
    Replying to @fchollet
    I enjoyed your tweet, and the discussion! It shone with good faith & curiosity. I am eager to see different perspectives, esp yours. (And let's not rule out the possibility that JAX is deficient here... JAX has many weaknesses. We love to hear about them, and to try to improve!)
  • user avatar
    Matthew Johnson
    @SingularMattrix
    Sep 24, 2021
    Replying to @ankurhandos
    In JAX you can set this to be a warning or error: jax.readthedocs.io/en/latest/rank… It can be done with a context manager too. We weren't able to turn it on by default though.
  • user avatar
    Matthew Johnson
    @SingularMattrix
    Jan 22, 2020
    Replying to @deliprao @RandomlyWalking and @cohenrap
    I recommend watching Skye's talk at NeurIPS (starts @ 44:26) slideslive.com/38922046/progr… If DL performance alone is too ho-hum, since JAX is about all numerical computing (not just DL) maybe you'd find the NumPyro benchmarks from UberAI interesting: openreview.net/forum?id=H1g1n…
    Image
    slideslive.com
    Zachary DeVito · Optimized Execution of Pytorch Programs with Torchscript
    Machine learning researchers often express complex models as a program, relying on program transformations to add functionality. New languages and transformations (e.g., TorchScript and TensorFlow...
  • user avatar
    Matthew Johnson
    @SingularMattrix
    Dec 23, 2019
    Replying to @latentjasper and @carlesgelada
    See Section 1.2.2 for a discussion of exactly this complaint about prior specification. Lots of later work too, like the recent paper “Functional Variational Bayesian Neural Networks” from @RogerGrosse ’s group.
  • user avatar
    Matthew Johnson
    @SingularMattrix
    Jan 7, 2023
    Replying to @jonkhler and @PatrickKidger
    Yes! github.com/jax-ml/jax-tri… In that repo there’s also an experimental new way to write Triton kernels, called pallas. It uses JAX tracing machinery for a more convenient embedding and some transformability. @sharadvikram
    Image
    GitHub - jax-ml/jax-triton: jax-triton contains integrations between JAX and OpenAI Triton
    From github.com
    3.2K
  • user avatar
    Matthew Johnson
    @SingularMattrix
    Jun 7, 2020
    Replying to @gabrielpeyre
    They can be generalized to R[\epsilon] / \epsilon^{k+1} to model higher order autodiff that is faster than nested dual numbers (often exponentially so). See Ch 13 of Griewank and Walther, and jax.experimental.jet in the JAX GitHub repo!
  • user avatar
    Matthew Johnson
    @SingularMattrix
    Oct 25, 2021
    It doesn't have to be either-or! Let's do more JAX _and_ PyTorch, together! For ex, they can do zero-copy handoff of GPU buffers via DLPack, and AD can be integrated, e.g.: gist.github.com/mattjj/e8b5107… There's much more to be done here. With interop, users win!
    Image
    jax2torch.py
    From gist.github.com
  • user avatar
    Matthew Johnson
    @SingularMattrix
    Oct 15, 2019
    "Functional languages *are* unnatural to use; but so are knives and forks, diplomatic protocols, double-entry bookkeeping, and a host of other things modern civilization has found useful." - James H. Morris bitsavers.org/pdf/xerox/parc…
  • user avatar
    Matthew Johnson
    @SingularMattrix
    Feb 24, 2021
    Replying to @jm_alexia
    If you have time to share, we would love to hear feedback on our issue tracker and/or GitHub Discussions!
  • user avatar
    Matthew Johnson
    @SingularMattrix
    Dec 15, 2018
    My sister is a postdoc in biophysics and biochemistry, and she’s writing open-source software for open and reproducible science.
    user avatar
    Stephanie Johnson
    @StephL_Johnson
    Dec 14, 2018
    If you're looking for fast, transparent, customizable software for analyzing large single molecule FRET/fluorescence data sets, my software package Traces is now compatible with the latest version of Matlab: github.com/stephlj/Traces. Check it out!
    Image

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement