Run the parameter optimization process#

Parameter optimization uses the nonlinear subplex optimization algorithm (see Rowan, 1990) to set the parameters of the onset detection algorithms. The objective function maximizes the mean F-score between the algorithm results and a ground truth set of reference annotations (contained in .\references\manual_annotations), computed over a fixed percentage (10%) of the dataset. The optimization process sets separate parameters for detecting onsets for each instrument (piano, bass, drums) and for detecting beats in the mixed audio signal.
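To make the objective concrete, here is a minimal sketch of a mean F-score computed between detected and reference onset times. It is an illustration only, not the project's implementation: the function names, the 50 ms matching window, and the simple greedy matching are all assumptions.

```python
from statistics import mean


def onset_f_score(detected, reference, window=0.05):
    """F-score between two lists of onset times, given in seconds.

    `window` (assumed here to be 50 ms) is the largest distance at which a
    detected onset still counts as matching a reference onset.
    """
    detected = sorted(detected)
    reference = sorted(reference)
    i = j = hits = 0
    # Walk both sorted lists, pairing each reference onset with at most one detection
    while i < len(detected) and j < len(reference):
        diff = detected[i] - reference[j]
        if abs(diff) <= window:
            hits += 1
            i += 1
            j += 1
        elif diff < 0:
            i += 1  # detection is too early to match anything: false positive
        else:
            j += 1  # reference onset has no detection nearby: false negative
    if hits == 0:
        return 0.0
    precision = hits / len(detected)
    recall = hits / len(reference)
    return 2 * precision * recall / (precision + recall)


def mean_f_score(tracks, window=0.05):
    """Mean F-score over an iterable of (detected, reference) pairs."""
    return mean(onset_f_score(d, r, window) for d, r in tracks)
```

An optimizer that minimizes rather than maximizes would be given the negative of `mean_f_score` as its objective.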

Note

Creating annotation files

The reference annotations were created with the Sonic Visualiser software (Cannam et al., 2010) by the lead author and two undergraduate research assistants, and all were checked by the lead author. If you’re interested in creating your own annotations to use as part of this process, see Create new manual annotations.

Warning

We only support the use of the optimized parameter set we created ourselves (i.e., the .csv and .json files contained in .\references\parameter_optimisation by default). If you choose to re-run the parameter optimization yourself or use your own ground truth annotations, we cannot guarantee that you’ll be able to reproduce our results.

Setting up#

First, ensure that you’ve followed the installation instructions in Building the database up to the Onset detection section.

Running the parameter optimization#

Inside the virtual environment for the project you created when building the database, run the following command:

python src\detect\optimize_detection_parameters.py
>>> optimizing onset detection for piano, bass, drums ...
>>> optimising parameters across 30 track/instrument combinations ...
>>> ... instrument piano, iteration 0/?, mean F: 0.6402, stdev F: 0.0977, 30 tracks (0 loaded from cache),

This command optimizes the detection parameters for each instrument in turn, then optimizes the beat tracking algorithm. The mean and standard deviation of the F-scores obtained on each iteration of the algorithm are printed directly to the console.

By default, the program will optimize detection for the .\references\corpus_chronology.xlsx corpus file. To change this, pass the -corpus_fname flag to point to another file in this folder, for example:

python src\detect\optimize_detection_parameters.py -corpus_fname corpus_bill_evans
>>> optimizing onset detection for piano, bass, drums ...

Tip

On any given iteration step, if the program detects that a particular combination of parameters has already been tried for an instrument, it will skip this iteration and load the corresponding F-scores directly from disk (from the .\references\parameter_optimisation\{corpus_fname}\*.csv files). To suppress this behavior, delete or rename the .csv files inside .\references\parameter_optimisation\{corpus_fname}.
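The caching behavior described above can be sketched roughly as follows. This is a hypothetical illustration, not the project's code: the helper names, the `f_score` column, and the assumption that every other column holds a numeric parameter are all invented for the example.

```python
import csv
from pathlib import Path


def params_key(params):
    """Hashable key for a parameter set.

    Values are rounded so they still compare equal after a CSV round trip.
    """
    return tuple(sorted((k, round(float(v), 6)) for k, v in params.items()))


def load_cache(csv_path):
    """Read previously evaluated parameter sets; empty dict if the file is absent."""
    path = Path(csv_path)
    if not path.exists():
        return {}
    cache = {}
    with path.open(newline="") as f:
        for row in csv.DictReader(f):
            # Hypothetical layout: one 'f_score' column, the rest are parameters
            f_score = float(row.pop("f_score"))
            cache[params_key(row)] = f_score
    return cache


def evaluate(params, cache, score_fn):
    """Return (score, from_cache), calling score_fn only for unseen parameter sets."""
    key = params_key(params)
    if key in cache:
        return cache[key], True
    score = score_fn(params)
    cache[key] = score
    return score, False
```

In this sketch, deleting or renaming the .csv file makes `load_cache` return an empty dictionary, so every parameter combination is evaluated afresh.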

Check the results#

Once the optimization process has completed, you can find the converged parameters inside .\references\parameter_optimisation\{corpus_fname}\converged_parameters.json. These will then be used by the relevant algorithms when running .\src\detect\detect_onsets.py.
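If you want to inspect the converged parameters yourself, a short sketch along these lines should work. The helper name is hypothetical, and nothing is assumed about the structure of the JSON file beyond it being valid JSON; the example folder name `corpus_chronology` is also an assumption based on the default corpus file.

```python
import json
from pathlib import Path


def load_converged_parameters(corpus_fname, root="references/parameter_optimisation"):
    """Load converged_parameters.json for one corpus (hypothetical helper)."""
    path = Path(root) / corpus_fname / "converged_parameters.json"
    with path.open() as f:
        return json.load(f)


if __name__ == "__main__":
    # Run from the project root; the folder name below is assumed, not confirmed
    params = load_converged_parameters("corpus_chronology")
    print(json.dumps(params, indent=2))
```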