January 24, 2020
Polyphonic Sound Detection Score source code is now available on GitHub
First introduced in a research paper in late October 2019, our Polyphonic Sound Detection Score (PSDS) is an industry-standard evaluation framework and metric for polyphonic sound recognition systems.
We’ve now made the PSDS source code available on GitHub, enabling researchers to apply the score to their sound recognition models.
Dr Sacha Krstulović, Director of AA Labs (Audio Analytic’s research group), said: “PSDS solves some fundamental shortcomings of previous evaluation approaches, therefore we believe that the wider sound recognition community will benefit from accessing the source code and using PSDS for their own work.”
The PSDS GitHub repository comprises a python package which contains a library that calculates the PSDS of polyphonic sound event detection systems.
Dr Krstulović continues: “Our approach to evaluating the performance of polyphonic sound event detection systems revisits the definition of system errors, makes the evaluation more robust and expands the evaluation to include the factors which matter to user experience, for example the cross-triggers or the stability across classes.
“It means that with PSDS, the identification of the best performing system becomes grounded in user experience rather than abstract or impractical statistical definitions.”
Read more about the Polyphonic Sound Detection Score and download the research paper ‘A Framework for the Robust Evaluation of Sound Event Detection’, which has been submitted to ICASSP 2020.
You can find the full technical paper on the Polyphonic Sound Detection Score, along with links to GitHub and the Jupyter Notebook, here.
Like this? You can subscribe to our blog and receive an alert every time we publish an announcement, a comment on the industry or something more technical.
About Audio Analytic
Audio Analytic is the pioneer of AI sound recognition technology. The company is on a mission to give machines a compact sense of hearing. This empowers them with the ability to react to the world around us, helping satisfy our entertainment, safety, security, wellbeing, convenience, and communication needs across a huge range of consumer products.
Audio Analytic’s ai3™ and ai3-nano™ sound recognition software enables device manufacturers to equip products at the edge with the ability to recognize and automatically respond to our growing list of sounds and acoustic scenes.