InterMix: An Interference-based Data Augmentation And Regularization Technique For Automatic Deep Sound Classification

Implementation of InterMix: An Interference-based Data Augmentation And Regularization Technique For Automatic Deep Sound Classification by Ramit Sawhney, and Atula Tejaswi Neerkaje.

Environment & Installation Steps

Python 3.8 & Chainer 7.7.0

Run

Execute the following steps in the same environment:

python3 main.py --data data --dataset [DATASET] --mixup_type sound --netType envnetv2 --batchSize 32 --BC --eligible 1 2 3 4 --strongAugment

Command Line Arguments

To run different variants of InterMix, perform ablation or tune hyperparameters, the following command-line arguments may be used:

  --dataset DATASET     dataset from ['esc10', 'esc50', 'urbansound8k']
  --mixup-type TYPE     sound (p-weighting) vs. normal
  --bc                  perform mixup
  --eligible L          eligible layer set 
  --strongAugment       perform scale and gain augmentation
  --batchSize BATCH_SIZE
                        batch size
  --nEpochs EPOCHS      number of epochs
  --LR RATE             learning rate
  --weightDecay WD      weight decay
  --momentum MOMENTUM   LR momentum
  --split SPLIT         choice of split

Datasets

Dataset preparation for ESC-50, ESC-10, and UrbanSound8K

FFmpeg should be installed.
First of all, please make a directory to save datasets.
```
  mkdir [path]
```

ESC-50 and ESC-10 setup

python esc_gen.py [path]

Following files will be generated.
- [path]/esc50/wav16.npz # 16kHz, for EnvNet
- [path]/esc50/wav44.npz # 44.1kHz, for EnvNet-v2
- [path]/esc10/wav16.npz
- [path]/esc10/wav44.npz

UrbanSound8K setup

Download UrbanSound8K dataset from this page.

Move UrbanSound8K directory.

 mkdir -p [path]/urbansound8k
 mv UrbanSound8K [path]/urbansound8k/

Run the following command.
```
 python urbansound_gen.py [path]
```

Following files will be generated.
- [path]/urbansound8k/wav16.npz
- [path]/urbansound8k/wav44.npz

Cite

If our work was helpful in your research, please kindly cite this work:

@inproceedings{Sawhney2022InterMix,
  title={InterMix: An Interference-based Data Augmentation And Regularization Technique For Automatic Deep Sound Classification},
  author={Sawhney, Ramit and 
          Neerkaje, Atula Tejaswi},
  booktitle={ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  year={2022},
  organization={IEEE}
}

References

[1] Jindal, A., Ranganatha, N. E., Didolkar, A., Chowdhury, A. G., Jin, D., Sawhney, R., & Shah, R. R. (2020, January). SpeechMix-Augmenting Deep Sound Recognition Using Hidden Space Interpolations. In INTERSPEECH (pp. 861-865).

[2] Tokozume, Y., Ushiku, Y., & Harada, T. (2018, February). Learning from Between-class Examples for Deep Sound Recognition. In International Conference on Learning Representations.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
dataset_gen		dataset_gen
models		models
Paper_ICASSP_2022_InterMix.pdf		Paper_ICASSP_2022_InterMix.pdf
Poster_ICASSP_2022_InterMix.pdf		Poster_ICASSP_2022_InterMix.pdf
README.md		README.md
dataset.py		dataset.py
main.py		main.py
opts.py		opts.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

InterMix: An Interference-based Data Augmentation And Regularization Technique For Automatic Deep Sound Classification

Environment & Installation Steps

Run

Command Line Arguments

Datasets

ESC-50 and ESC-10 setup

UrbanSound8K setup

Cite

References

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

InterMix: An Interference-based Data Augmentation And Regularization Technique For Automatic Deep Sound Classification

Environment & Installation Steps

Run

Command Line Arguments

Datasets

ESC-50 and ESC-10 setup

UrbanSound8K setup

Cite

References

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages