Associate Professor · Machine Listening · Audio AI

Machine listening for robust, intelligent, and responsible sound understanding.

This is Dr Shengchen Li, an Associate Professor in the Department of Intelligent Science, School of Advanced Technology, Xi’an Jiaotong-Liverpool University. My research focuses on trustworthy and robust machine listening, including acoustic scene analysis, music information retrieval, generative audio tracking, and audio security.

Latest updates

Recent news

View all news

Research overview

Sound, music, intelligence, and trust.

My research area is centred on audio and music intelligence, with a broader goal of enabling machines to understand, model, generate, and reason about sound in real-world contexts. The work spans machine listening, acoustic scene and event analysis, audio-language understanding, symbolic music and music generation, expressive performance modelling, speech and voice processing, and healthcare audio. Across these topics, the core motivation is to build computational systems that can extract meaningful structure from complex sound, connect audio with human-centred applications, and support both scientific understanding and practical intelligent systems.

This research area is further strengthened by three connected directions. Robust, trustworthy, and efficient AI addresses the reliability of audio and music systems under domain shift, limited data, computational constraints, privacy risks, and adversarial conditions. Multi-modal and cross-domain perception extends audio intelligence by integrating sound with language, vision, wireless signals, radar, skeleton data, and other sensing modalities. Peripheral review and interdisciplinary work broadens the impact of the research by connecting audio AI, generative AI, signal analysis, education, healthcare, engineering, and applied systems.

Together, these four directions form a coherent research portfolio: audio and music intelligence provides the foundation, robustness and trustworthiness support deployment, multimodal perception expands the scope, and interdisciplinary work connects the research to wider academic and societal contexts.

Selected achievements

Research impact

64Publications
770Citations
15h-index
23i10-index

Recognition

Awards and honours

Research themes

Current directions

Audio and Music Intelligence

Acoustic scene analysis, music information retrieval, bioacoustics, speech enhancement, and symbolic music generation.

Robust, Trustworthy and Efficient AI

Domain adaptation, few-shot learning, model efficiency, privacy, source tracing, and audio security.

Multimodal and Cross-Domain Perception

Audio-visual learning, multimodal scene classification, sign language recognition, and radar sensing.

Peripheral Review and Interdisciplinary Works

Computer audition review, AI education, optical signal processing, and hardware FPGA systems.

Teaching

Courses and supervision

INT306 Computer Auditory Systems

Founding module leader at XJTLU, from AY23-24 until now.

INT104 Artificial Intelligence

Taught at XJTLU from AY20-21 to AY25-26.

INT307 Multimedia Security Systems

Taught at XJTLU in AY21-22 and AY22-23.

Student supervision

Academic service

Contributing to the research community

Professional activities

  • Member of Technical Committee, CCF (since 2020)
  • Senior Member, IEEE (since 2023)
  • Meta Reviewer, ISMIR 2026
  • Area Chair, ICASSP 2025–2026
  • Steering Committee, CSMT (since 2018)

Reviewer service

  • Journals: IEEE/ACM TASLP, EURASIP JASMP, Frontiers in Medicine
  • Conferences: ICASSP, INTERSPEECH, ISMIR, ICME, DCASE, EUSIPCO, CMMR, CSMT

Selected invited talks

2024

“Robustness Analysis in Music Informatics”

China Audio Industry Association

2023

“Acoustic Signal Processing with Machine Learning”

Ghent University

2023

“An Overview of Computer Music”

Tsinghua University

2022

“The Development of Machine Learning Algorithms for Audio Signal Processing”

Dalian Maritime University

2019

“Computational Methods of Expressiveness Analysis”

Queen Mary University of London

2019

“Towards Hardware Solutions of Acoustic Signal Processing”

University of Surrey

2018

“Arts or Engineering: Music Perception and Recognition”

Fudan University

Collaboration and supervision

I am always open to collaborating with researchers, industry partners, and prospective PhD students who share an interest in machine listening, audio AI, and trustworthy machine learning. If you are interested in working together, please get in touch.

Get in touch