Sunbelt Computer Software

Contrastive Self-supervised Learning of Music Audio

PyTorch implementation of Towards Proper Contrastive Self-supervised Learning Strategies for Music Audio Representation by Jeong Choi et al.

In this work, we focuse on assessing the potential of self-supervised music embeddings as a general representation. We set up experiments to compare the performance in various MIR tasks between different self-supervision strategies. We investigate to what extent we can benefit from music audio representations learned from some of widely used contrastive learning schemes by analyzing the results on three different MIR tasks (instrument classification, genre classification, and music recommendation) which are considered to represent different aspects of music similarity. Our experiments are set up using contrastive learning algorithms with variations in input / target instance settings and model architectures, which are designed to capture different levels the music semantic - global or regional information. Our strategies are categorized in the following table.

We then use the trained models as feature extractors and evaluate on different MIR tasks, where each task represents a certain abstraction level of music audio information. We compare the self-supervised embeddings with MFCCs which has long been a solid baseline feature in audio classification tasks.

Transfer Learning (Linear Probing) Results

Training

1. Pre-training

Run the following command to pre-train the model on the FMA_small dataset.

python main.py --USE_YAML_CONFIG 0

2. Inference audio embedding

To test a trained model, make sure to set the LOAD_WEIGHT_FROM, or specify it as an argument:

python main.py --MODE inference --LOAD_WEIGHT_FROM SomeCheckpointPath

Linear evaluation

To be updated.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
data		data
data_example/FMA_small/info		data_example/FMA_small/info
model		model
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
cssma_config.py		cssma_config.py
cssma_config.yaml		cssma_config.yaml
cssma_manager.py		cssma_manager.py
main.py		main.py

Sunbelt Computer Software

PL/B Language Development and Support

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Contrastive Self-supervised Learning of Music Audio

Transfer Learning (Linear Probing) Results

Training

1. Pre-training

2. Inference audio embedding

Linear evaluation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Sunbelt Computer Software

PL/B Language Development and Support

Folders and files

Latest commit

History

Repository files navigation

Contrastive Self-supervised Learning of Music Audio

Transfer Learning (Linear Probing) Results

Training

1. Pre-training

2. Inference audio embedding

Linear evaluation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages