Associate Professor · Machine Listening · Audio AI
This is Dr Shengchen Li, an Associate Professor in the Department of Intelligent Science, School of Advanced Technology, Xi’an Jiaotong-Liverpool University. My research focuses on trustworthy and robust machine listening, including acoustic scene analysis, music information retrieval, generative audio tracking, and audio security.
Latest updates
Research overview
My research area is centred on audio and music intelligence, with a broader goal of enabling machines to understand, model, generate, and reason about sound in real-world contexts. The work spans machine listening, acoustic scene and event analysis, audio-language understanding, symbolic music and music generation, expressive performance modelling, speech and voice processing, and healthcare audio. Across these topics, the core motivation is to build computational systems that can extract meaningful structure from complex sound, connect audio with human-centred applications, and support both scientific understanding and practical intelligent systems.
This research area is further strengthened by three connected directions. Robust, trustworthy, and efficient AI addresses the reliability of audio and music systems under domain shift, limited data, computational constraints, privacy risks, and adversarial conditions. Multi-modal and cross-domain perception extends audio intelligence by integrating sound with language, vision, wireless signals, radar, skeleton data, and other sensing modalities. Peripheral review and interdisciplinary work broadens the impact of the research by connecting audio AI, generative AI, signal analysis, education, healthcare, engineering, and applied systems.
Together, these four directions form a coherent research portfolio: audio and music intelligence provides the foundation, robustness and trustworthiness support deployment, multimodal perception expands the scope, and interdisciplinary work connects the research to wider academic and societal contexts.
Selected achievements
Recognition
Research themes
Acoustic scene analysis, music information retrieval, bioacoustics, speech enhancement, and symbolic music generation.
Domain adaptation, few-shot learning, model efficiency, privacy, source tracing, and audio security.
Audio-visual learning, multimodal scene classification, sign language recognition, and radar sensing.
Computer audition review, AI education, optical signal processing, and hardware FPGA systems.
Teaching
Founding module leader at XJTLU, from AY23-24 until now.
Taught at XJTLU from AY20-21 to AY25-26.
Taught at XJTLU in AY21-22 and AY22-23.
Academic service
China Audio Industry Association
Ghent University
Tsinghua University
Dalian Maritime University
Queen Mary University of London
University of Surrey
Fudan University
I am always open to collaborating with researchers, industry partners, and prospective PhD students who share an interest in machine listening, audio AI, and trustworthy machine learning. If you are interested in working together, please get in touch.
Get in touch