Paper: Multizone Soundfield Reproduction With Privacy- and Quality-Based Speech Masking Filters

In this work, the authors present a comprehensive methodology for multizone sound field reproduction using specially designed speech masking filters. The masking filters are design to maximise speech privacy and quality. Trade-offs between speech privacy and quality are shown to exist and parameters are provided in the methods to control their values. An accurate and precise formulation of grating lobes from spatial aliasing in multizone reproduction scenarios is provided and used to enhance the masking filters. The mathematical descriptions and thorough methodology are evaluated using simulations and a real world implementation of a multizone sound field reproduction.

Continue reading Paper: Multizone Soundfield Reproduction With Privacy- and Quality-Based Speech Masking Filters

Paper: On the Comparison of Two Room Compensation / Dereverberation Methods Employing Active Acoustic Boundary Absorption

Using a planar array of microphones and loudspeakers, the authors show in this work a theoretical approach to actively suppressing the reflection of sound from boundaries, such as room walls. The filters that are proposed are applicable to real-time implementations and it is shown that the active absorption is similar to that of passive fibre panel absorption.

Continue reading Paper: On the Comparison of Two Room Compensation / Dereverberation Methods Employing Active Acoustic Boundary Absorption

Paper: Blind Speaker Counting in Highly Reverberant Environments by Clustering Coherence Features

Using only two microphones, like those commonly found on mobile devices, we show in this work how to count the number of people talking in a meeting scenario. This paper has been presented at and published in the proceedings of the 2017 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) in Kuala Lumpur, Malaysia.

Continue reading Paper: Blind Speaker Counting in Highly Reverberant Environments by Clustering Coherence Features

Paper: Active Speech Control using Wave-Domain Processing with a Linear Wall of Dipole Secondary Sources

Ever wondered if you could cancel someones voice without the need for a physical wall or partition? In this work presented at ICASSP 2017 in New Orleans, USA, we investigate the possibilities of cancelling speech over a loudspeaker wall. The method is not limited to speech, in-fact, it works much better for periodic signals as the non-stationarity of speech degrades the performance.

Scenario Layout - ICASSP2017IEEE_ICASSP2017_1

Continue reading Paper: Active Speech Control using Wave-Domain Processing with a Linear Wall of Dipole Secondary Sources

Paper: Towards Real-Time Source Counting by Estimation of Coherent-to-Diffuse Ratios from Ad-Hoc Microphone Array Recordings

Blindly counting the number of speech sources (talkers) in a meeting room can be a difficult task. This paper was presented at HSCMA 2017 at the Google Offices in San Francisco and shows how using coherent-to-diffuse ratios could allow real-time source counting.

Example Layout - Towards Real-Time Source Counting by Estimation of Coherent-to-Diffuse Ratios from Ad-hoc Microphone Array Recordings
Success Rate - Towards Real-Time Source Counting by Estimation of Coherent-to-Diffuse Ratios from Ad-hoc Microphone Array Recordings

Continue reading Paper: Towards Real-Time Source Counting by Estimation of Coherent-to-Diffuse Ratios from Ad-Hoc Microphone Array Recordings