Staff Profile
Dr Iain McCowan
Publications
| [1]
|
I. McCowan, M. Lincoln, and
I. Himawan. Microphone Array Calibration in Diffuse Noise
Fields. To appear in IEEE Transactions on Audio, Speech
and Language Processing, January 2008.
|
| [2]
|
D. Gatica-Perez, G. Lathoud,
J.-M. Odobez, and I. McCowan. Audio-Visual Probabilistic
Tracking of Multiple Speakers in Meetings. IEEE
Transactions on Audio, Speech and Language Processing, 15(2):601-616,
February 2007.
|
| [3]
|
I. Himawan, I. McCowan, and
M. Lincoln. Microphone Array Beamforming Approach to Blind
Speech Separation. Under review, 2007.
|
| [4]
|
H. K. Maganti, D. Gatica-Perez,
and I. McCowan. Speech Enhancement and Recognition in
Meetings with an Audio-Visual Sensor Array. IEEE
Transactions on Audio, Speech and Language Processing,
2007.
|
| [5]
|
I. McCowan, D. Moore,
A. Nguyen, R. Bowman, B. Clarke, E. Duhig, and M-J. Fry.
Collection of Population Cancer Stage Data by Classifying
Free-text Medical Reports. Journal of the American
Medical Informatics Association (JAMIA), 14(6):736-745,
November-December 2007.
|
| [6]
|
D. Zhang, D. Gatica-Perez,
S. Bengio, I. McCowan, and G. Lathoud. Modeling Individual
and Group Actions in Meetings With Layered HMMs. IEEE
Transactions on Multimedia, 8(3):509-520, June 2006.
|
| [7]
|
J. Ajmera, I. McCowan, and
H. Bourlard. Robust speaker change detection. IEEE
Signal Processing Letters, 11(8), August 2004.
|
| [8]
|
I. McCowan, D. Gatica-Perez,
S. Bengio, G. Lathoud, M. Barnard, and D. Zhang. Automatic
Analysis of Multimodal Group Actions in Meetings. IEEE
Transactions on Pattern Analysis and Machine Intelligence,
27(3):305-317, March 2005.
|
| [9]
|
J. Ajmera, I. McCowan, and
H. Bourlard. Speech/music Segmentation using Entropy and
Dynamism Features in a HMM Classification Framework.
Speech Communication, 40:351-363, 2003.
|
| [10]
|
I. McCowan and H. Bourlard.
Microphone Array Post-filter based on Noise Field Coherence.
IEEE Transactions on Speech and Audio Processing,
11(6), November 2003.
|
| [11]
|
I. McCowan, D. Moore, and
S. Sridharan. Near-field Adaptive Beamformer with Application
to Robust Speech Recognition. Digital Signal Processing:
A Review, 12(1):87-106, January 2002.
|
| [12]
|
I. McCowan and
S. Sridharan. Multi-channel Sub-band Speech Recognition.
EURASIP Journal on Applied Signal Processing,
2001(1):45-52, March 2001.
|
| [13]
|
I. Himawan, S. Sridharan,
and I. McCowan. Dealing with Uncertainty in Microphone
Placement in a Microphone Array Speech Recognition System.
To appear in Proceedings of ICASSP 2008, 2008.
|
| [14]
|
I. McCowan and H. Harden.
Towards Automated Observational Analysis of Leadership in
Clinical Networks. In Third International Conference on
Information Technology in Health Care (ITHC2007):
Socio-technical approaches, August 2007.
|
| [15]
|
D. Moore, I. McCowan,
A. Nguyen, and M-J. Fry. Trial evaluation of automatic lung
cancer staging from pathology reports. In Proceedings of
12th International Health (Medical) Informatics Congress (Medinfo),
2007. (Abstract submission).
|
| [16]
|
A. Nguyen, D. Moore, and
I. McCowan. Unsupervised Clustering of Free-Living Human
Activities using Ambulatory Accelerometry. In IEEE
International Conference of the Engineering in Medicine and
Biology Society (EMBC), 2007.
|
| [17]
|
A. Nguyen, D. Moore,
I. McCowan, and M-J. Courage. Multi-class Classification of
Cancer Stages from Free-text Histology Reports using Support
Vector Machines. In IEEE International Conference of the
Engineering in Medicine and Biology Society (EMBC), 2007.
|
| [18]
|
I. McCowan, D. Moore, and
M-J. Fry. Classification of Cancer Stage from Free-text
Histology Reports. In Proceedings IEEE Engineering in
Medicine and Biology Conference (EMBC), 2006.
|
| [19]
|
I. McCowan, D. Moore, and
M-J. Fry. Automated Cancer Stage Classification from
Free-text Histology Reports. In Proceedings of the
Australian Health Informatics Conference (HIC), 2006.
|
| [20]
|
D. Moore, A. Nguyen,
I. McCowan, and M-J. Fry. Collection of population lung
cancer stage data by classifying free-text pathology reports.
In Proceedings of the 6th Annual Health and Medical Research
Conference of Queensland, page 243, 2006. (Abstract
submission).
|
| [21]
|
J. Carletta, S. Ashby, S. Bourban,
M. Flynn, M. Guillemot, T. Hain, J. Kadlec, V. Karaiskos, M. Kronenthal,
W. Kraaij, G. Lathoud, M. Lincoln, A. Lisowska, I. McCowan,
W. Post, D. Reidsma, and P. Wellner. The AMI Meeting Corpus:
A Pre-Announcement. In Proceedings of Workshop on
Multimodal Interaction and Related Machine Learning Algorithms (MLMI).
Springer Lecture Notes in Computer Science, July 2005.
|
| [22]
|
D. Gatica-Perez, G. Lathoud,
J.-M. Odobez, and I. McCowan. Multimodal Multispeaker
Probabilistic Tracking in Meetings. In Proceedings of
International Conference on Multimodal Interfaces (ICMI),
October 2005.
|
| [23]
|
T. Hain, L. Burget,
J. Dines, G. Garau, M. Karafiat, M. Lincoln, I. McCowan,
D. Moore, V. Wan, R. Ordelman, and S. Renals. The 2005 AMI
System for the Transcription of Speech in Meetings. In
Proceedings of the NIST Rich Transcription Meeting Recognition
Evaluation Workshop, July 2005.
|
| [24]
|
M. Lincoln, I. McCowan, J. Vepa,
and H. Krishna Maganti. The Multi-Channel Wall Street Journal
Audio-Visual Corpus (MC-WSJ-AV): Specification and Initial
Experiments. In Proceedings of the IEEE Automatic Speech
Recognition and Understanding Workshop (ASRU), December
2005.
|
| [25]
|
I. McCowan, J. Carletta,
W. Kraaij, S. Ashby, S. Bourban, M. Flynn, M. Guillemot,
T. Hain, J. Kadlec, V. Karaiskos, M. Kronenthal, G. Lathoud,
M. Lincoln, A. Lisowska, W. Post, D. Reidsma, and P. Wellner.
The AMI Meeting Corpus. In Proceedings of the 5th
International Conference on Methods and Techniques in Behavioral
Research, September 2005.
|
| [26]
|
I. McCowan, M. Hari-Krishna,
D. Gatica-Perez, D. Moore, and S. Ba. Speech Acquisition in
Meetings with an Audio-Visual Sensor Array. In
Proceedings of the IEEE International Conference on Multimedia
and Expo (ICME), July 2005.
|
| [27]
|
D. Zhang, D. Gatica-Perez,
S. Bengio, and I. McCowan. Semi-supervised Adapted HMMs for
Unusual Event Detection. In Proceedings of IEEE Conf.
on Computer Vision and Pattern Recognition (CVPR), San
Diego, June 2005.
|
| [28]
|
J. Ajmera, G. Lathoud, and
I. McCowan. Clustering And Segmenting Speakers And Their
Locations In Meetings. In Proceedings of the
International Conference on Acoustics, Speech and Signal
Processing, 2004.
|
| [29]
|
J. Ajmera, I. McCowan, and
H. Bourlard. An Online Audio Indexing System. In
International Conference on Spoken Language Processing (ICSLP),
October 2004.
|
| [30]
|
Daniel Gatica-Perez, Iain
McCowan, D. Zhang, and S. Bengio. Detecting Group
Interest-level in Meetings. In Proceedings of ICASSP
2005, volume I, pages 489-492, 2005.
|
| [31]
|
Guillaume Lathoud and
Iain A. McCowan. A Sector-Based Approach for Localization of
Multiple Speakers with Microphone Arrays. In Proceedings
of the Workshop on Statistical and Perceptual Audio Processing
SAPA'04, October 2004.
|
| [32]
|
Guillaume Lathoud, Iain A.
McCowan, and Jean-Marc Odobez. Unsupervised Location-Based
Segmentation of Multi-Party Speech. In Proceedings of
the 2004 ICASSP-NIST Meeting Recognition Workshop,
Montreal, Canada, May 2004.
|
| [33]
|
Dong Zhang, Daniel Gatica-Perez,
Samy Bengio, Iain McCowan, and Guillaume Lathoud. Modeling
Individual and Group Actions in Meetings: a Two-Layer HMM
Framework. In IEEE Workshop on Event Mining: Detection
and Recognition of Events in Video, In Association with CVPR,
2004.
|
| [34]
|
Dong Zhang, Daniel Gatica-Perez,
Samy Bengio, Iain McCowan, and Guillaume Lathoud. Multimodal
Group Action Clustering in Meetings. In ACM 2nd
International Workshop on Video Surveillance and Sensor Networks
in conjunction with 12th ACM International Conference on
Multimedia, 2004.
|
| [35]
|
D. Gatica-Perez, G. Lathoud,
I. McCowan, and J-M. Odobez. A Mixed-State I-Particle Filter
for Multi-Camera Speaker Tracking. In Proceedings of
Workshop On Multimedia Technologies in E-Learning and
Collaboration (WOMTEC), in conjunction with International
Conference on Computer Vision (ICCV), September 2003.
|
| [36]
|
D. Gatica-Perez, G. Lathoud,
I. McCowan, J-M. Odobez, and D. Moore. Audio-Visual Speaker
Tracking with Importance Particle Filters. In
Proceedings of the IEEE International Conference on Image
Processing, September 2003.
|
| [37]
|
D. Gatica-Perez,
I. McCowan, M. Barnard, S. Bengio, and H. Bourlard. On
Automatic Annotation of Meeting Databases. In
Proceedings of the IEEE International Conference on Image
Processing, September 2003.
|
| [38]
|
G. Lathoud and I. McCowan.
Location Based Speaker Segmentation. In Proceedings
of the International Conference on Acoustics, Speech and Signal
Processing, April 2003.
|
| [39]
|
G. Lathoud, I. McCowan, and
D. Moore. Segmenting Multiple Concurrent Speakers using
Microphone Arrays. In Proceedings of Eurospeech 2003,
September 2003.
|
| [40]
|
I. McCowan, S. Bengio, D. Gatica-Perez,
G. Lathoud, F. Monay, D. Moore, P. Wellner, and H. Bourlard.
Modeling Human Interaction in Meetings. In Proceedings
of the International Conference on Acoustics, Speech and Signal
Processing, April 2003.
|
| [41]
|
I. McCowan, D. Gatica-Perez,
S. Bengio, D. Moore, and H. Bourlard. Towards Computer
Understanding of Human Interactions. In Proceedings of
the European Symposium on Ambient Intelligence. Springer
Lecture Notes in Computer Science, November 2003. Available as
IDIAP RR 03-45.
|
| [42]
|
D. Moore and I. McCowan.
Microphone Array Speech Recognition: Experiments on Overlapping
Speech in Meetings. In Proceedings of the International
Conference on Acoustics, Speech and Signal Processing,
April 2003.
|
| [43]
|
V. Tyagi, S. Ikbahl,
I. McCowan, and H. Bourlard. On Factorizing Spectral Dynamics
for Robust Speech Recognition. In Proceedings of
Eurospeech 2003, September 2003.
|
| [44]
|
V. Tyagi, I. McCowan,
H. Bourlard, and H. Misra. Mel-Cepstrum Modulation Spectrum (MCMS)
Features for Robust ASR. In Proceedings of Workshop on
Automatic Speech Recognition and Understanding (ASRU),
August 2003.
|
| [45]
|
J. Ajmera, H. Bourlard, I. Lapidot,
and I. McCowan. Unknown-multiple Speaker Clustering using HMM.
In Proceedings of the International Conference on Spoken
Language Processing, September 2002.
|
| [46]
|
J. Ajmera, I. McCowan, and
H. Bourlard. Robust HMM-based speech/music segmentation.
In Proceedings of the International Conference on Acoustics,
Speech and Signal Processing, May 2002.
|
| [47]
|
I. McCowan and H. Bourlard.
Microphone Array Post-filter for Diffuse Noise Field. In
Proceedings of the International Conference on Acoustics,
Speech and Signal Processing, May 2002.
|
| [48]
|
I. McCowan, A. Morris, and
H. Bourlard. Robust Speech Recognition with Small Microphone
Arrays using the Missing Data Approach. In Proceedings
of the International Conference on Spoken Language
Processing, September 2002.
|
| [49]
|
I. McCowan, J. Pelecanos,
and S. Sridharan. Robust Speaker Recognition using Microphone
Arrays. In Proceedings of 2001: A Speaker Odyssey,
June 2001.
|
| [50]
|
I. McCowan and
S. Sridharan. Microphone Array Sub-band Speech Recognition.
In Proceedings of the International Conference on Acoustics,
Speech and Signal Processing, May 2001.
|
| [51]
|
I. McCowan and
S. Sridharan. Adaptive Parameter Compensation for Robust
Hands-free Speech Recognition using a Dual-Beamforming
Microphone Array. In Proceedings of the 2001 International
Symposium on Intelligent Multimedia, Video and Speech Processing,
May 2001.
|
| [52]
|
I. McCowan, C. Marro, and
L. Mauuary. Robust Speech Recognition using Near-field
Superdirective Beamforming with Post-filtering. In
Proceedings of the International Conference on Acoustics, Speech
and Signal Processing, pages 1723-1726, 2000.
|
| [53]
|
I. McCowan, D. Moore, and
S. Sridharan. Speech enhancement using Near-field
Superdirectivity with an Adaptive Sidelobe Canceler and
Post-filter. In Proceedings of the 2000 Australian
International Conference on Speech Science and Technology,
pages 268-273, December 2000.
|