Go to CSIRO.AU

ICT Centre - Innovative ICT transforming Australian industries

HOME

 


 

Staff Profile


 

Dr Iain McCowan

Main page

Publications

[1] I. McCowan, M. Lincoln, and I. Himawan. Microphone Array Calibration in Diffuse Noise Fields. To appear in IEEE Transactions on Audio, Speech and Language Processing, January 2008.
[2] D. Gatica-Perez, G. Lathoud, J.-M. Odobez, and I. McCowan. Audio-Visual Probabilistic Tracking of Multiple Speakers in Meetings. IEEE Transactions on Speech and Audio Processing, 15(2):601-616, February 2007.
[3] I. Himawan, I. McCowan, and M. Lincoln. Microphone Array Beamforming Approach to Blind Speech Separation. Under review., 2007.
[4] H. K. Maganti, D. Gatica-Perez, and I. McCowan. Speech Enhancement and Recognition in Meetings with an Audio-Visual Sensor Array. IEEE Transactions on Audio, Speech and Language Processing, 2007.
[5] I. McCowan, D. Moore, A. Nguyen, R. Bowman, B. Clarke, E. Duhig, and M-J. Fry. Collection of Population Cancer Stage Data by Classifying Free-text Medical Reports. Journal of the American Medical Informatics Association (JAMIA), 14(6):736-745, November-December 2007.
[6] D. Zhang, D. Gatica-Perez, S. Bengio, I. McCowan, and G. Lathoud. Modeling Individual and Group Actions in Meetings With Layered HMMs. IEEE Transactions on Multimedia, 8(3):509-520, June 2006.
[7] J. Ajmera, I. McCowan, and H. Bourlard. Robust speaker change detection. IEEE Signal Processing Letters, 11(8), August 2004.
[8] I. McCowan, D. Gatica-Perez, S. Bengio, G. Lathoud, M. Barnard, and D. Zhang. Automatic Analysis of Multimodal Group Actions in Meetings. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(3):305-317, March 2004.
[9] J. Ajmera, I. McCowan, and H. Bourlard. Speech/music Segmentation using Entropy and Dynamism Features in a HMM Classification Framework. Speech Communication, 40:351-363, 2003.
[10] I. McCowan and H. Bourlard. Microphone Array Post-filter based on Noise Field Coherence. IEEE Transactions on Speech and Audio Processing, 11(6), November 2003.
[11] I. McCowan, D. Moore, and S. Sridharan. Near-field Adaptive Beamformer with Application to Robust Speech Recognition. Digital Signal Processing: A Review, 12(1):87-106, January 2002.
[12] I. McCowan and S. Sridharan. Multi-channel Sub-band Speech Recognition. EURASIP Journal on Applied Signal Processing, 2001(1):45-52, March 2001.
[13] I. Himawan, S. Sridharan, and I. McCowan. Dealing with Uncertainty in Microphone Placement in a Microphone Array Speech Recognition System. In To appear in Proceedings of ICASSP 2008, 2008.
[14] I. McCowan and H. Harden. Towards Automated Observational Analysis of Leadership in Clinical Networks. In Third International Conference Information Technology in Health Care (ITHC2007): Socio-technical approaches, August 2007.
[15] D. Moore, I. McCowan, A. Nguyen, and M-J. Fry. Trial evaluation of automatic lung cancer staging from pathology reports. In Proceedings of 12th International Health (Medical) Informatics Congress (Medinfo), 2007. (Abstract submission).
[16] A. Nguyen, D. Moore, and I. McCowan. Unsupervised Clustering of Free-Living Human Activities using Ambulatory Accelerometry. In IEEE International Conference of the Engineering in Medicine and Biology Society (EMBC), 2007.
[17] A. Nguyen, D. Moore, I. McCowan, and M-J. Courage. Multi-class Classification of Cancer Stages from Free-text Histology Reports using Support Vector Machines. In IEEE International Conference of the Engineering in Medicine and Biology Society (EMBC), 2007.
[18] I. McCowan, D. Moore, and M-J. Fry. Classification of Cancer Stage from Free-text Histology Reports. In Proceedings IEEE Engineering in Medicine and Biology Conference (EMBC), 2006.
[19] I. McCowan, D. Moore, and M-J. Fry. Automated Cancer Stage Classification from Free-text Histology Reports. In Proceedings of the Australian Health Informatics Conference (HIC), 2006.
[20] D. Moore, A. Nguyen, I. McCowan, and M-J. Fry. Collection of population lung cancer stage data by classifying free-text pathology reports. In Proceedings of the 6th Annual Health and Medical Research Conference of Queensland, page 243, 2006. (Abstract submission).
[21] J. Carletta, S. Ashby, S. Bourban, M. Flynn, M. Guillemot, T. Hain, J. Kadlec, V. Karaiskos, M. Kronenthal W. Kraaij, G. Lathoud, M. Lincoln, A. Lisowska, I. McCowan, W. Post, D. Reidsma, and P. Wellner. The AMI Meeting Corpus: A Pre-Announcement. In Proceedings of Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI). Springer Lecture Notes in Computer Science, July 2005.
[22] D. Gatica-Perez, G. Lathoud, J.M. Odobez, and I. McCowan. Multimodal Multispeaker Probabilistic Tracking in Meetings. In Proceedings of International Conference on Multimodal Interfaces (ICMI), October 2005.
[23] T. Hain, L. Burget, J. Dines, G. Garau, M. Karafiat, M. Lincoln, I. McCowan, D. Moore, V. Wan, R. Ordelman, and S. Renals. The 2005 AMI System for the Transcription of Speech in Meetings. In Proceedings of the NIST Rich Transcription Meeting Recognition Evaluation Workshop, July 2005.
[24] M. Lincoln, I. McCowan, J. Vepa, and H. Krishna Maganti. The Multi-Channel Wall Street Journal Audio-Visual Corpus (MC-WSJ-AV): Specification and Initial Experiments. In Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), December 2005.
[25] I. McCowan, J. Carletta, W. Kraaij, S. Ashby, S. Bourban, M. Flynn, M. Guillemot, T. Hain, J. Kadlec, V. Karaiskos, M. Kronenthal, G. Lathoud, M. Lincoln, A. Lisowska, W. Post, D. Reidsma, and P. Wellner. The AMI Meeting Corpus. In Proceedings of the 5th International Conference on Methods and Techniques in Behavioral Research, September 2005.
[26] I. McCowan, M. Hari-Krishna, D. Gatica-Perez, D. Moore, and S. Ba. Speech Acquisition in Meetings with an Audio-Visual Sensor Array. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), July 2005.
[27] D. Zhang, D. Gatica-Perez, S. Bengio, and I. McCowan. Semi-supervised Adapted HMMs for Unusual Event Detection. In Proceeedings of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), San Diego, June 2005.
[28] J. Ajmera, G. Lathoud, and I. McCowan. Clustering And Segmenting Speakers And Their Locations In Meetings. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, 2004.
[29] J. Ajmera, I. McCowan, and H. Bourlard. An Online Audio Indexing System. In International Conference on Spoken Language Processing (ICSLP), October 2004.
[30] Daniel Gatica-Perez, Iain McCowan, D. Zhang, and S. Bengio. Detecting Group Interest-level in Meetings. In Proceedings of ICASSP 2005, volume I, pages 489-492, 2004.
[31] Guillaume Lathoud and Iain A. McCowan. A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays. In Proceedings of the Workshop on Statistical and Perceptual Audio Processing SAPA'04, October 2004.
[32] Guillaume Lathoud, Iain A. McCowan, and Jean-Marc Odobez. Unsupervised Location-Based Segmentation of Multi-Party Speech. In Proceedings of the 2004 ICASSP-NIST Meeting Recognition Workshop, Montreal, Canada, May 2004.
[33] Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain McCowan, and Guillaume Lathoud. Modeling Individual and Group Actions in Meetings: a Two-Layer HMM Framework. In IEEE Workshop on Event Mining: Detection and Recognition of Events in Video, In Association with CVPR, 2004.
[34] Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain McCowan, and Guillaume Lathoud. Multimodal Group Action Clustering in Meetings. In ACM 2nd International Workshop on Video Surveillance and Sensor Networks in conjunction with 12th ACM International Conference on Multimedia, 2004.
[35] D. Gatica-Perez, G. Lathoud, I. McCowan, and J-M. Odobez. A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking. In Proceedings of Workshop On Multimedia Technologies in E-Learning and Collaboration (WOMTEC), in conjunction with International Conference on Computer Vision (ICCV), September 2003.
[36] D. Gatica-Perez, G. Lathoud, I. McCowan, J-M. Odobez, and D. Moore. Audio-Visual Speaker Tracking with Importance Particle Filters. In Proceedings of the IEEE International Conference on Image Processing, September 2003.
[37] D. Gatica-Perez, I. McCowan, M. Barnard, S. Bengio, and H. Bourlard. On Automatic Annotation of Meeting Databases. In Proceedings of the IEEE International Conference on Image Processing, September 2003.
[38] G. Lathoud and I. McCowan. Location Based Speaker Segmentation. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, April 2003.
[39] G. Lathoud, I. McCowan, and D. Moore. Segmenting Multiple Concurrent Speakers using Microphone Arrays. In Proceedings of Eurospeech 2003, September 2003.
[40] I. McCowan, S. Bengio, D. Gatica-Perez, G. Lathoud, F. Monay, D. Moore, P. Wellner, and H. Bourlard. Modeling Human Interaction in Meetings. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, April 2003.
[41] I. McCowan, D. Gatica-Perez, S. Bengio, D. Moore, and H. Bourlard. Towards Computer Understanding of Human Interactions. In Proceedings of the European Symposium on Ambient Intelligence. Springer Lecture Notes in Computer Science, November 2003. Available as IDIAP RR 03-45.
[42] D. Moore and I. McCowan. Microphone Array Speech Recognition: Experiments on Overlapping Speech in Meetings. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, April 2003.
[43] V. Tyagi, S. Ikbahl, I. McCowan, and H. Bourlard. On Factorizing Spectral Dynamics for Robust Speech Recognition. In Proceedings of Eurospeech 2003, September 2003.
[44] V. Tyagi, I. McCowan, H. Bourlard, and H. Misra. Mel-Cepstrum Modulation Spectrum (MCMS) Features for Robust ASR. In Proceedings of Workshop on Automatic Speech Recognition and Understanding (ASRU), August 2003.
[45] J. Ajmera, H. Bourlard, I. Lapidot, and I. McCowan. Unknown-multiple Speaker Clustering using HMM. In Proceedings of the International Conference on Speech and Language Processing, September 2002.
[46] J. Ajmera, I. McCowan, and H. Bourlard. Robust HMM-based speech/music segmentation. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, May 2002.
[47] I. McCowan and H. Bourlard. Microphone Array Post-filter for Diffuse Noise Field. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, May 2002.
[48] I. McCowan, A. Morris, and H. Bourlard. Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach. In Proceedings of the International Conference on Speech and Language Processing, September 2002.
[49] I. McCowan, J. Pelecanos, and S. Sridharan. Robust Speaker Recognition using Microphone Arrays. In Proceedings of 2001: A Speaker Odyssey, June 2001.
[50] I. McCowan and S. Sridharan. Microphone Array Sub-band Speech Recognition. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, May 2001.
[51] I. McCowan and S. Sridharan. Adaptive Parameter Compensation for Robust Hands-free Speech Recognition using a Dual-Beamforming Microphone Array. In Proceedings on 2001 International Symposium on Intelligent Multimedia, Video and Speech Processing, May 2001.
[52] I. McCowan, C. Marro, and L. Mauuary. Robust Speech Recognition using Near-field Superdirective Beamforming with Post-filtering. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, pages 1723-1726, 2000.
[53] I. McCowan, D. Moore, and S. Sridharan. Speech enhancement using Near-field Superdirectivity with an Adaptive Sidelobe Canceler and Post-filter. In Proceedings of the 2000 Australian International Conference on Speech Science and Technology, pages 268-273, December 2000.

 

 

 

General enquiries:

csiroict@csiro.au

 

| Legal Notice and Disclaimer | Privacy | Copyright CSIRO 2005 | Last updated Last updated 18-Jan-2008 | to Top