Citation

BibTex format

@inproceedings{Neo:2022:10.1109/SSPD54131.2022.9896222,
author = {Neo, VW and Weiss, S and Naylor, PA},
doi = {10.1109/SSPD54131.2022.9896222},
pages = {1--5},
publisher = {IEEE},
title = {A polynomial subspace projection approach for the detection of weak voice activity},
url = {http://dx.doi.org/10.1109/SSPD54131.2022.9896222},
year = {2022}
}

RIS format (EndNote, RefMan)

TY  - CPAPER
AB - A voice activity detection (VAD) algorithm identifies whether or not time frames contain speech. It is essential for many military and commercial speech processing applications, including speech enhancement, speech coding, speaker identification, and automatic speech recognition. In this work, we adopt earlier work on detecting weak transient signals and propose a polynomial subspace projection pre-processor to improve an existing VAD algorithm. The proposed multi-channel pre-processor projects the microphone signals onto a lower dimensional subspace which attempts to remove the interferer components and thus eases the detection of the speech target. Compared to applying the same VAD to the microphone signal, the proposed approach almost always improves the F1 and balanced accuracy scores even in adverse environments, e.g. -30 dB SIR, which may be typical of operations involving noisy machinery and signal jamming scenarios.
AU - Neo,VW
AU - Weiss,S
AU - Naylor,PA
DO - 10.1109/SSPD54131.2022.9896222
EP - 5
PB - IEEE
PY - 2022///
SP - 1
TI - A polynomial subspace projection approach for the detection of weak voice activity
UR - http://dx.doi.org/10.1109/SSPD54131.2022.9896222
UR - https://ieeexplore.ieee.org/document/9896222
UR - http://hdl.handle.net/10044/1/99145
ER -

Contact us

Address

Speech and Audio Processing Lab
CSP Group, EEE Department
Imperial College London

Exhibition Road, London, SW7 2AZ, United Kingdom

Email

p.naylor@imperial.ac.uk