The document investigates the effects of noisy labels when training convolutional neural networks for music tagging, finding that tags with higher "taggability" due to being more unusual have less noisy labels which leads to better model performance; it also analyzes what the model learns by examining label vectors and their relationship to co-occurrence in the ground truth data.
1 of 38
Download to read offline
More Related Content
The effects of noisy labels on deep convolutional neural networks for music tagging
1. The Effects of Noisy Labels
Keunwoo.Choi
@qmul.ac.uk
on deep convolutional neural networks for music tagging
arXiv:1706.02361
3. @KeunwooChoi
2014--present: PhD, Queen Mary University of London
2016--present: Buzzmusiq lnc.
2016/ 06--12: Visiting PhD, NYU
2015/ 06--09: Intern, Naver Labs
2011--2014: Audio research team, ETRI
2009--2011: Applied Acoustic Lab, EECS, SNU
2005--2009: EECS, SNU
Papers on ISMIR/ICASSP/IEEE Trans./Etc.
Python/Keras/Pytorch
4. The Effects of Noisy Labels
Keunwoo.Choi
@qmul.ac.uk
on deep convolutional neural networks for music tagging
Gy旦rgy Fazekas, Kyunghyun Cho, Mark Sandler
arXiv:1706.02361
1. INTRODUCTION
5. Tagging
Anyone can tag any words (or non-words) to any song
The quality is ****.
Poor, innocent, (鍖nancially) poor researchers need to use it
6. Tagging
(Tag, count)
rock 101071
pop 69159
alternative 55777
indie 48175
electronic 46270
female vocalists 42565
favorites 39921
00s 31432
Awesome 26248
american 22694
seen live 20705
cool 19581
Favorite18864
Favourites 17722
female vocalist 17328
guitar 17302
loved 12483
favorite songs 12392
heard on Pandora 10470
USA 8725
2000s 8671
Favourite Songs 8661
drjazzmrfunkmusic 8364
77davez-all-tracks7278
fav 6155
bass 3364
songs I absolutely love
3293
vocals 2369
drums2281
9. The Effects of Noisy Labels
Keunwoo.Choi
@qmul.ac.uk
on deep convolutional neural networks for music tagging
Gy旦rgy Fazekas, Kyunghyun Cho, Mark Sandler
arXiv:1706.02361
2. HOW NOISY?
IS TRAINING OK?
10. Measuring the noise
We need strongly-labelled re-annotations
Instrumentation labels are (sort of) objective
(instrumental, female vocal, male vocal, guitar)
242K songs are still a lot select a subset (or two)!
I can do it!
..but not
all of them
11. Strongly labelling: Subset100
Subset100: random 50 from True
+ random 50 from False (for each label)
Instrumental
Female vocalists
Male vocalist
Guitar
True False
50songs 50songs
50 50
50 50
50 50
16. Again, with box plots
{Instrumental, female vocalists}
vs.
{male vocalists, guitar}
17. Group A vs B, but why?
Tagging vocals, drums, bass is like..
Theyre not tag-worthy
Lets call it taggability
Female vocalists
Male vocalist
Guitar
Bass
Vocals
Drums
0% 25% 50% 75% 100
True False
***?
Whats on
the desk?
18. The hypothesis
If unusual high taggability.
Instrumental, female vocal :
high taggability
Male vocal, guitar:
low taggability
19. The hypothesis
If unusual high taggability.
If high taggability
less false negative = higher recall (of GT)
Instrumental, female vocal :
high taggability,
less false neg, higher recall
Male vocal, guitar:
low taggability,
more false neg, lower recall
20. The hypothesis
If unusual high taggability.
If high taggability
less false negative = higher recall (of GT)
If higher recall (=more reliable GT),
?
21. [33] Choi et al. 2017, Convolutional recu...
Hypothesis
If unusual high taggability.
If high taggability
less false negative = higher recall (of GT)
If higher recall (=more reliable GT),
?
Performance(AUC)
!!!
22. The hypothesis
If unusual high taggability.
If high taggability
less false negative = higher recall (of GT)
Instrumental, female vocal :
high taggability,
less false neg, higher recall,
better classi鍖cation
Male vocal, guitar:
low taggability,
more false neg, lower recall,
worse classi鍖cation
If higher recall (=more reliable GT),
better classi鍖cation
24. The Effects of Noisy Labels
Keunwoo.Choi
@qmul.ac.uk
on deep convolutional neural networks for music tagging
Gy旦rgy Fazekas, Kyunghyun Cho, Mark Sandler
arXiv:1706.02361
3. IS EVALUATION OK?
29. The Effects of Noisy Labels
Keunwoo.Choi
@qmul.ac.uk
on deep convolutional neural networks for music tagging
Gy旦rgy Fazekas, Kyunghyun Cho, Mark Sandler
arXiv:1706.02361
4. LABEL VECTOR
ANALYSIS
34. Label vector vs co-occurrence (GT)
Mostly, LV reproduces the groundtruth.
Except: similar pairs only by label vector:
(sad, beautiful), (happy, catchy), (rnb, sexy)
Sad songs are beautiful.
Catchy songs are often happy songs.
R&B claims to be sexy.
Makes sense..
35. The Effects of Noisy Labels
Keunwoo.Choi
@qmul.ac.uk
on deep convolutional neural networks for music tagging
Gy旦rgy Fazekas, Kyunghyun Cho, Mark Sandler
arXiv:1706.02361
5. CONCLUSIONS
36. Conclusions
We quanti鍖ed how noisy weakly-labelled groundtruth is.
We conjectured why some labels are noisier.
We showed what happens to the noisier labels on training
and evaluation.
We investigated what a convnet learns.
37. The Effects of Noisy Labels
Keunwoo.Choi
@qmul.ac.uk
on deep convolutional neural networks for music tagging
Gy旦rgy Fazekas, Kyunghyun Cho, Mark Sandler
arXiv:1706.02361