Abstract: In this paper, a new speaker counting algorithm is proposed by novel zig-zag nested array (ZZNA) combining with adaptive generalized cross-correlation (GCC) function (with phase transform ...
Abstract: Active Speaker Detection (ASD) aims to determine whether each candidate in a video frame is speaking. The egocentric dataset Ego4D introduces unique challenges for this task, such as dynamic ...
一些您可能无法访问的结果已被隐去。
显示无法访问的结果