In this talk, Dr Xu summarises his previous work on deep learning-based speech enhancement and noise-robust speech recognition, which focused on improving the generalisation capacity of deep learning-based systems.

The winning system in the ‘Large-scale weakly supervised sound event detection for smart cars’ task of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2017 challenge will also be covered.


You can watch this and previous Audio Analytic Tech Talks on our Audio Analytic Labs YouTube channel.

More on Dr Yong Xu:

Dr Yong Xu is a research fellow at the University of Surrey, UK. He received his PhD from the University of Science and Technology of China (USTC) in 2015, and visited the Georgia Institute of Technology, USA, from September 2014 to May 2015. After completing his PhD, he worked as a researcher at IFLYTEK from April 2015 to April 2016. His papers have received 630+ citations, including two ESI highly cited IEEE journal papers. He won first place in the DCASE 2017 challenge task ‘Large-scale weakly supervised sound event detection for smart cars’.



About Audio Analytic

Audio Analytic is the pioneer of AI sound recognition software. The company is on a mission to map the world of sounds, offering our sense of hearing to consumer technology. By transferring our sense of hearing to consumer products and digital personal assistants we give them the ability to react to the world around us, helping satisfy our entertainment, safety, security, wellbeing and communication needs.

Audio Analytic’s ai3™ sound recognition software enables device manufacturers and chip companies to equip products with Artificial Audio Intelligence, recognizing and automatically responding to our growing list of sound profiles.