Comparing Results

Usage

The final results obtained by the evaluation script can now be compared among all trained models. To do this interactively, use the following commands:

cd models
ipython3 compare_models.py

Review Results

The results of performance metrics are stored to Pandas data frames. Results are stored in the following Pandas dataframes:

  • global_res: Global performance metrics.
  • class_res: Class-wise performance metrics.
  • global_class_res: Global performance metrics by using optima class-wise thresholds.

By accessing it interactively, it is possible to find the global performance metrics with respect to the data set, model, aggregation strategy and averaging method used.

For example:

global_res.loc['irmas_1024'].loc['model_baseline'].loc['s2'].loc['micro']
class_res.loc['jazz_db_1024'].loc['model_baseline'].loc['s1'].loc['voi']
global_class_res.loc['youtube_1024'].loc['model_leaky'].loc['s1'].loc['macro']

will print the precision, recall, and f-scores of these cases.

Finally, you can uncomment sections of the script to plot the improvements by comparing the models in between them.