🎵

The Song Search

Deep Learning - CS 7150

Prof. David Bau

September 29, 2022

Team members

  1. Praveen Kumar Sridhar (sridhar.p@northeastern.edu)
  1. Isha Hemant Arora (arora.isha@northeastern.edu)

Literature Review

  1. Main Paper/Blog
    • A brief review of these papers:
      • The blog (and the papers) start with describing the task of Automatic Music Transcription (AMT) which is the task of extracting symbolic representations of music from raw audio.
      • The authors then speak about the course of their research, that initially focused on AMT for pianos (as published in November 2021), but now is gradually extending towards other instruments.
      • To achieve their results, they have implemented a T5 small model.
      • The major focus today is exploring on making a general purpose AMT.
      • For this general purpose AMT they use MT3 (Multi-task Multitrack Music Transcription), which we found to be very interesting (and was the focus of the second paper that was published in March 2022).
  1. Auxiliary Papers/Blogs
    1. https://towardsdatascience.com/3-reasons-why-music-is-ideal-for-learning-and-teaching-data-science-59d892913608 (Max Hilsdorf)
    1. Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems 30 (2017).
    1. Gong, Yuan, Yu-An Chung, and James Glass. "Ast: Audio spectrogram transformer." arXiv preprint arXiv:2104.01778 (2021).
    1. M. Awiszus, “Automatic music transcription using sequence to sequence learning,” Master’s thesis, Karlsruhe Institute of Technology, 2019.
    1. https://magenta.tensorflow.org/onsets-frames

Proposal for the Main Question

Aim

The aim of our project is to build an Information Retrieval system for music.

Flow