Paper: Improving Speech Privacy in Personal Sound Zones

Presented at ICASSP 2016, we explore ways to improve speech privacy in multizone soundfield reproductions using intelligibility metrics, all while maintaining high speech quality.

Anechoic Results - Improving Speech Privacy in Personal Sound Zones
Multizone Soundfield Layout - Improving Speech Privacy in Personal Sound Zones

Continue reading Paper: Improving Speech Privacy in Personal Sound Zones

  •  
  •  
  •  
  •  
  •  
  •  
  •  
  •  
  •  

Paper: Multizone Reproduction of Speech Soundfields – A Perceptually Weighted Approach

Recent work on perceptual soundfield reproduction for multizone soundfield cases was presented in Hong Kong at the Asia-Pacific Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC).

Results - Multizone Reproduction of Speech Soundfields- A Perceptually Weighted Approach
Spatial_Error_Improvement

Continue reading Paper: Multizone Reproduction of Speech Soundfields – A Perceptually Weighted Approach

A fast MATLAB executable (MEX) compilation of the PESQ measure

After having spent a good while looking for a working copy of the Perceptual Evaluation of Speech Quality measure implemented in MATLAB, I ended up using the readily available “wrapper” functions found on the MATLAB file exchange site. While these wrappers work a treat they are a bit slow as they require a call to the system to run the PESQ binary file. The PESQ binary file is compiled from the source code found on the ITU site for the PESQ standard. The source code is written in C which MATLAB supports for its MATLAB executable (MEX) compilations. That was what then prompt me to look for a MEX compiled version of the PESQ measure but to my surprise I couldn’t find any. I managed to get the ITU standards’ source code compiled and working in a MEX function and I will go through the few steps required to do it. Continue reading A fast MATLAB executable (MEX) compilation of the PESQ measure

The Speech Transmission Index (STI) for MATLAB

There are a few robust algorithms available to objectively measure the intelligibility of speech. They compare what is heard by a listener to the information that was transmitted by the talker, however, only one of these available algorithms works in reverberant rooms and at the time I was looking for an implementation for MATLAB, I couldn’t find any, zilch. So, I decided to code it from scratch myself and will show you how to do it step by step. Continue reading The Speech Transmission Index (STI) for MATLAB

Paper: An Efficient Approach to Dynamically Weighted Multizone Wideband Reproduction of Speech Soundfields

Presented at the 2015 IEEE China Summit & International Conference on Signal and Information Processing (ChinaSIP) this work looks at efficiently determining multizone soundfield zone weights using bi-linear interpolation of sparsely simulated soundfields.

PESQ Results - An Efficient Approach to Dynamically Weighted Multizone Wideband Reproduction of Speech Soundfields
Multizone_Layout_v2

Continue reading Paper: An Efficient Approach to Dynamically Weighted Multizone Wideband Reproduction of Speech Soundfields