EPSRC Reference: |
EP/H012842/1 |
Title: |
Multi-Modal Blind Source Separation for Robot Audition |
Principal Investigator: |
Wang, Professor W |
Other Investigators: |
|
Researcher Co-Investigators: |
|
Project Partners: |
|
Department: |
Vision Speech and Signal Proc CVSSP |
Organisation: |
University of Surrey |
Scheme: |
Standard Research |
Starts: |
08 October 2009 |
Ends: |
07 October 2012 |
Value (£): |
115,288
|
EPSRC Research Topic Classifications: |
Digital Signal Processing |
|
|
EPSRC Industrial Sector Classifications: |
Aerospace, Defence and Marine |
|
|
Related Grants: |
|
Panel History: |
Panel Date | Panel Name | Outcome |
28 Apr 2009
|
DSTL-EPSRC Signal Processing
|
Announced
|
|
Summary on Grant Application Form |
This proposal draws on expertise in blind source separation and multimodal (audio-visual) speech processing within the Centre for Vision Speech and Signal Processing at University of Surrey. The objective is to perform source separation of the target speech in the presence of multiple competing sound sources in room environments and thereby ultimately provide progress towards automatic machine perception of auditory scenes within an un-controlled natural environment. The fundamental novelty in this work is to exploit visual cues for enhancing the operation of frequency domain blind source separation algorithms. Exploitation of such audio-visual processing is targeted at mitigating the permutation problem, the underdetermined problem (i.e. when the number of sources is greater than the number of microphones), and the reverberation problem, which currently limits the practical applicability of blind source separation algorithms. The focus of the work is therefore on the signal processing algorithms and software tools that can be used to perform automatic separation of sound signals, e.g., for a robot. The body of work in this proposal is underpinned by the substantial experience of the investigators, two from the areas of blind source separation and digital speech processing, and one from the area of computer vision and pattern recognition. The outcomes of the proposed research will be of considerable value to the UK defence industry working especially in the areas of target separation, detection and multi-path mitigation (or dereverberation), with applications in, for example, human-robot interaction, security surveillance and human-computer interaction.
|
Key Findings |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Potential use in non-academic contexts |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Impacts |
Description |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk |
Summary |
|
Date Materialised |
|
|
Sectors submitted by the Researcher |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Project URL: |
http://mod-udrc.org/project/55 |
Further Information: |
|
Organisation Website: |
http://www.surrey.ac.uk |