-
Problems to solve: automatic cover detection.
-
Method: a neural network architecture. Each track is represented as a
single embeded vector
. -
Question:
- what is
embedding extraction
?
- what is
-
Notes
- Cover detection is not a classification problem. It requires the comparison between tracks.
- Gerenal Ways:
- Compare:
- Dominant melody
- Tonal Progression: a sequence of chromas
- Chords
- Fusion of both
- Compute similarity score:
- cross-correlation
- a variant of the DTW algorithm
- combination of both
- Problems:
- Good on small datasets, expensive with the expansion of the dataset.
- Compare:
- Faster Methods:
- comparison of all possible subsequences pairs between 2D-DFT sequences derived from CQTs overlapping windows.
- Encode track as single scalar or vector: the comparison work can be done offline. Just need to compare the Euclidean distance between embeddings.
-
Words & Phrases
- Stricto sensu: 严格意义上