Infoscience

Logo EPFL, École polytechnique fédérale de Lausanne

    master thesis

    Structured Auto-Encoder with application to Music Genre Recognition

    Defferrard, Michaël  
    2015

    In this work, we present a technique that learns discriminative audio features for Music Information Retrieval (MIR). The novelty of the proposed technique is to design auto-encoders that exploit the structure of the data to learn enhanced sparse representations. The structure is borrowed from the field of manifold learning: the data are assumed to be sampled from smooth manifolds, which are represented here by proximity graphs of the input data. As a consequence, the proposed auto-encoders find sparse data representations that are quite robust to perturbations. The model is formulated as a non-convex optimization problem. However, it can be decomposed into convex sub-problems that are solved iteratively, and for which well-posed iterative schemes are provided within the Fast Iterative Shrinkage-Thresholding Algorithm (FISTA) framework. Our numerical experiments show two main results. First, on the popular GTZAN music dataset, our graph-based auto-encoders improve the classification accuracy by 2% over auto-encoders without graph structure. Second, our model is significantly more robust: it is 8% more accurate than the standard model in the presence of 10% of perturbations.
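
    The abstract does not state the objective function, so the following is only a minimal sketch of the kind of convex sub-problem it describes, not the thesis's actual model. It assumes a fixed dictionary D, a k-nearest-neighbour proximity graph with Laplacian L, and the objective 0.5*||X - Z D||_F^2 + 0.5*lam_graph*tr(Z^T L Z) + lam_sparse*||Z||_1, minimized over the sparse codes Z with FISTA. All function names, parameters, and the choice of graph are hypothetical.

```python
import numpy as np

def soft_threshold(v, t):
    """Proximal operator of t * ||.||_1 (element-wise shrinkage)."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def knn_graph_laplacian(X, k=5):
    """Combinatorial Laplacian of a symmetrized k-NN graph (assumed graph model)."""
    n = X.shape[0]
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T  # squared Euclidean distances
    np.fill_diagonal(d2, np.inf)                    # exclude self-neighbours
    idx = np.argsort(d2, axis=1)[:, :k]             # k nearest neighbours per sample
    W = np.zeros((n, n))
    W[np.repeat(np.arange(n), k), idx.ravel()] = 1.0
    W = np.maximum(W, W.T)                          # symmetrize the adjacency matrix
    return np.diag(W.sum(axis=1)) - W

def objective(X, D, L, Z, lam_sparse, lam_graph):
    """Value of the assumed objective (illustrative, not the thesis's formulation)."""
    fit = 0.5 * np.sum((X - Z @ D) ** 2)
    smooth = 0.5 * lam_graph * np.trace(Z.T @ L @ Z)
    return fit + smooth + lam_sparse * np.sum(np.abs(Z))

def fista_graph_sparse_coding(X, D, L, lam_sparse=0.1, lam_graph=0.1, n_iter=200):
    """FISTA on the codes Z (n x m) for data X (n x d) and dictionary D (m x d)."""
    # Upper bound on the Lipschitz constant of the gradient of the smooth part.
    lip = np.linalg.norm(D @ D.T, 2) + lam_graph * np.linalg.norm(L, 2)
    step = 1.0 / lip
    Z = np.zeros((X.shape[0], D.shape[0]))
    Y, t = Z.copy(), 1.0
    for _ in range(n_iter):
        grad = (Y @ D - X) @ D.T + lam_graph * (L @ Y)   # gradient of the smooth terms
        Z_next = soft_threshold(Y - step * grad, step * lam_sparse)
        t_next = 0.5 * (1.0 + np.sqrt(1.0 + 4.0 * t * t))
        Y = Z_next + ((t - 1.0) / t_next) * (Z_next - Z)  # momentum extrapolation
        Z, t = Z_next, t_next
    return Z
```

    In this sketch the graph term penalizes codes that differ between neighbouring samples, which is one plausible way a proximity graph could make sparse representations less sensitive to perturbations of the input. A full auto-encoder would alternate such a step with updates of the dictionary and encoder, which are omitted here.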

    Name: report.pdf
    Access type: openaccess
    Size: 559.1 KB
    Format: Adobe PDF
    Checksum (MD5): 14bb0108c34c0b4460cf85656e0a29be


    Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved
