Image

Instructors

Image

Georgia Gkioxari

Image

Pietro Perona

Advisors

Image

Yang Song

Image

Giovanni Paolini

TAs

Image

Neehar Kondapaneni

Image

Laure Delisle

Image

Suzanne Stathatos

Image

Zihui Wu

Image

Rogério Guimarães


Guest Speakers

Image

Yejin Choi

Image

Björn Ommer

Image

Jack Hessel

Image

Ross Girshick

Image

Saining Xie


Syllabus

Date

Topic

Lecturers

Materials

04/04/23

  Intro to Large Language & Vision Models

  • What is AI and why pursue it
  • Weak vs Strong AI
  • Intro to Large Models for vision and language
  • AI4Science applications
  • Current LLVMs and failures
  • Class overview, homeworks, grading policy
Image
Pietro
Image
Georgia
  • Slides - Part A by Pietro: pdf
  • Slides - Part B by Georgia: pdf
04/06/23

  Brief Recap on MLPs

  • Definition of MLPs
  • Backpropagation
  • Stochastic Gradient Descent
  • Momentum (paper)
  • Adam (paper)
Image
Pietro
04/11/23

  Brief Recap on CNNs

Image
Pietro
  • Pietro's slides: pdf
04/13/23

  Transformers - Part A

Image
Georgia
  • Georgia's slides: pdf
04/18/23

  Transformers - Part B

Image
Georgia
  • Georgia's slides: pdf
04/20/23

  Self-Supervised Learning

Image
Elijah Cole
04/25/23

  Object Recognition at Scale

Image
Georgia
  • Georgia's slides: pdf
04/27/23

  Large Vision Models for Segmentation

Image
Ross Girshick
05/02/23

  Generative Models I

  • Intro to Generative Models & Autoregressive Models
  • GPT (GPT, GPT2, GPT3)
  • PixelCNN (paper)
  • WaveNet (paper)
Image
Georgia
  • Georgia's slides: pdf
05/04/23

  Generative Models II

  • Variational Autoencoders
  • Tutorial on VAEs (paper)
  • VQ-VAE (paper)
Image
Georgia
  • Georgia's slides: pdf
  • Assignment 3: 🤖 nanoGPT -- main & code.
05/09/23

  Generative Models III

  • Diffusion Models
  • DDPMs (paper)
  • Denoising Score Matching (paper)
Image
Georgia
  • Georgia's slides: pdf
05/11/23

  Generative Models III-1/2

  • More on Diffusion Models
  • Latent Diffusion (paper)
  • Classifier-free Guidance (paper)
Image
Georgia
  • Georgia's slides: pdf
05/16/23

  Scalable Vision Foundation Models

Image
Saining Xie
  • Saining's slides: pdf
05/18/23

  Common Sense: The Dark Matter of Language Intelligence

Image
Yejin
Image
Jack
  • Yejin's slides: pdf
  • Jack's slides: pdf
05/23/23

  GANs & Unified Vision and Language Models

Image
Pietro
Image
Georgia
  • Pietro's slides: pdf
  • Georgia's slides: pdf
  • Assignment 4: Diffusion 🎨 -- main & code
05/25/23

  Stable Diffusion & Image Generation

Image
Björn Ommer
05/30/23

  Ethics in AI

Image
Pietro
06/01/23

  Alignment

  • Intro to RL
  • REINFORCE, PPO
  • InstructGPT (paper)
Image
Giovanni Paolini
  • Giovanni's slides: pdf