Publications
2005- Gatica-Perez, D, Odobez, J-M, Ba, S, Smith, K and Lathoud, G (2005). Tracking People in Meetings with Particles. Proc. Int. Workshop on Image Analysis for Multimedia Interactive Service (WIAMIS), invited paper, Montreux, Apr. 2005.
- McCowan, I, Gatica-Perez, D, Bengio, S, Lathoud, G, Barnard, M and Zhang, D (2005). Automatic Analysis of Multimodal Group Actions in Meetings. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(3) 305-317.
- Wrigley, SN, Brown, GJ, Wan, V and Renals, S (2005). Speech and Crosstalk Detection in Multi-Channel Audio. IEEE Transactions on Speech and Audio Processing 13(1) 84-91.
- Smith, K and Gatica-Perez, D (2004). Order matters: a distributed sampling method for multi-object tracking. in Proc. British Machine Vision Conf. (BMVC), London, Sep. 2004.
- Zhang, D, Gatica-Perez, D, Bengio, S, McCowan, I and Lathoud, G (2004). Multimodal Group Action Clustering in Meetings. in Proc. ACM Int. Conf. on Multimedia, Workshop on Video Surveillance and Sensor Networks (ACM MM-VSSN), New York, Oct. 2004.
- Zhang, D, Gatica-Perez, D, Bengio, S, McCowan, I and Lathoud, G (2004). Modeling Individual and Group Actions in Meetings: a Two-Layer HMM Framework. in Proc. IEEE Conf. on Computer Vision and Pattern Recognition, Workshop on Event Mining in Video (CVPR-EVENT), Washington DC, Jul. 2004.
- Buist, AH, Kraaij, W and Raaijmakers, S (2004). Feasibility study of extractive meeting summarization. Poster at AMI/PASCAL/IM2/M4 workshop, Martigny, 2004.
- Ba, S and Odobez, J-M (2004). A probabilistic framework for joint head tracking and pose estimation. ICPR 2004.
- Pallotta, V, Ballim, A, Marchand-Maillet, S and Lisowska, A (2004). Towards Meeting Information Sytems: Meeting Knowledge Management. International Conference on Enterprise Information Sytems (ICEIS 04), Porto, Portugal, 2004.
- Moënne-Loccoz, N, Bruno, E and Marchand-Maillet, S (2004). Video Content Representation as Salient Regions of Activity. International Conference on Image and Video Retrieval, Dublin, Ireland, 2004.
- Moënne-Loccoz, N, Janvier, B, Marchand-Maillet, S and Bruno, E (2004). Managing Video Collections at Large. First International Workshop on Computer Vision meets Databases (CVDB 2004), Paris, France, 2004.
- Wrigley, SN and Brown, GJ (2004). Audio-visual source localization and tracking using a network of neural oscillators. British Society of Audiology Short Papers Meeting on Experimental Studies of Hearing and Deafness, University College London, UK, 16-17 September 2004.
- Igor, P, Stanislav, S, Michal, S (2004). Participant activity detection by hands and face movement tracking in the meeting room. Proc. Computer Graphics International 2004, Los Alamitos, US, 2004, p. 4.
- Schwarz, P, Matejka, P, Cernocky, J (2004). Towards Lower Error Rates in Phoneme Recognition, accepted to Text-Speech-Dialogue (TSD), Brno, Sep 2004.
- Karafiat, M, Grezl, F, Cernocky, J (2004). TRAP based features for LVCSR of meeting data, accepted to ICSLP 2004.
- Wallhoff, F, Zobl, M, Rigoll, G and Potucek, I (2004). Face Tracking in Meeting Room Scenarios Using Omnidirectional Views. IEEE Int. Proceedings on International Conference on Pattern Recognition (ICPR), Cambridge, UK, to appear in August 2004.
- Zobl, M, Laika, A, Wallhoff, F and G. Rigoll (2004). Recognition of Partly Occluded Person Actions in Meeting Scenarios. In IEEE Int. Proceedings on International Conference on Image Processing (ICIP), Singapore, to appear in October 2004.
- Reiter, S and Rigoll, G (2004). Segmentation and Classification of Meeting Events using Multiple Classifier Fusion and Dynamic Programming. In IEEE Int. Proceedings on International Conference on Pattern Recognition (ICPR), Cambridge, UK, to appear in August 2004.
- de Jong, FMG (2004). Disclosure of non-scripted video content: InDiCo and M4/AMI. To appear in: Proc. CIVR2004. Lecture Notes in Computer Science.
- Jovanovic, N and Rieks op den Akker (2004). Towards automatic addressee identification in multi-party dialogues. Proc. 5th SIGDial Workshop on Discourse and Dialogue, Boston, April 2004.
- Dielmann, A and Renals, S (2004). Dynamic Bayesian Networks for Meeting Structuring. Proc. IEEE ICASSP 2004.
- Ajmera, J, Lathoud, G and McCowan, I (2004). Clustering and Segmenting Speakers and their Locations in Meeetings. Proc. IEEE ICASSP 2004.
- Nijholt, A (2003). Multimodality and Ambient Intelligence. Chapter 2 in Algorithms in Ambient Intelligence. W.F.J. Verhaegh, E.H.L. Aarts & J. Korst (eds.), Philips Research Book Series, Kluwer Academic Publishers, Boston/Dordrecht/London, 2003, 23-53.
- Igor, P (2003). Tracking movement objects in sequence pictures. ElectronicsLetters.com , Vol. 2003, No. 2, Brno, CZ, p. 10, ISSN 1213-161X.
- Reyes-Gomez, MJ, Raj, B, Ellis, D (2003). Multi-channel Source Separation by Factorial HMMs. Proc. ICASSP-03, Hong Kong, April 2003, pp. I-664--667.
- Jovanovic, N (2003). Recognition of meeting actions using information obtained from different modalities. Report TR-CTIT-03-48, CTIT.
- McCowan, I, Gatica-Perez, D, Bengio, D, Moore, D and Bourlard, H (2003). Towards Computer Understanding of Human Interactions. Proc. European Symposium on Ambient Intelligence (EUSAI) (invited keynote paper), Eindhoven, Nov. 2003.
- Gatica-Perez, D, Lathoud, G, McCowan, I and Odobez, J-M (2003). A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking. Proc. IEEE International Conference on Computer Vision, Workshop on Multimedia Technologies in E-Learning and Collaboration (WOMTEC), Nice, Oct. 2003.
- Wrigley, SN, Brown, GJ, Wan, V and Renals, S (2003). Feature Selection for the Classification of Crosstalk in Multi-Channel Audio. Proc. Eurospeech 2003, Sept. 2003, pp. 469-472.
- Lathoud, G, McCowan, I and Moore, D (2003). Segmenting Multiple Concurrent Speakers Using Microphone Arrays. IDIAP-RR 03-21.
- Gatica-Perez, D, Lathoud, G, McCowan, I, Odobez, J-M and Moore, D (2003). Audio-Visual Speaker Tracking with Importance Particle Filters. Proc. IEEE ICIP, Sept. 2003.
- Gatica-Perez, D, McCowan, I, Barnard, M, Bengio, S and Bourlard, H (2003). On automatic annotation of meeting databases. Proc. IEEE ICIP, Sept. 2003. Publication(s) with similar content: techreport.
- Zobl, M, Wallhoff, F and Rigoll, G (2003). Action Recognition in Meeting Scenarios Using Global Motion Features. Proc. IEEE International Workshop on Performance Evaluation of Tracking and Surveillance 2003.
- Gatica-Perez, D, McCowan, I, Barnard, M, Bengio, S, and Bourlard, H (2003). On automatic annotation of meeting databases. IDIAP-RR 03-06. Publication(s) with similar content: IEEE ICIP paper.
- Lathoud, G and McCowan, I (2003). Location Based Speaker Segmentation. Proc. IEEE ICASSP 2003, Hong Kong.
- Marchand-Maillet, S (2003). Meeting Record Modelling for Enhanced Browsing. Tech. Rep. 03.01, Computer Vision and Multimedia Laboratory, Computing Centre, University of Geneva, Switzerland.
- Marchand-Maillet, S (2003). MRML: Steps towards version 2. Tech. Rep. 03.02, Computer Vision and Multimedia Laboratory, Computing Centre, University of Geneva, Switzerland.
- McCowan, I, Bengio, S, Gatica-Perez, D, Lathoud, G, Monay, F, Moore, D, Wellner, P, and Bourlard, H (2003). Modeling Human Interaction in Meetings. Proc. IEEE ICASSP 2003, Hong Kong.
- Moore, D and McCowan, I (2003). Microphone Array Speech Recognition : Experiments on Overlapping Speech in Meetings. Proc. IEEE ICASSP 2003, Hong Kong.
- Renals, S and Ellis, D (2003). Audio information access from meeting rooms. Proc. IEEE ICASSP 2003, Hong Kong (to appear). (in Special Session on Smart Meeting Rooms)
- Kennedy, L and Ellis, D (2003). Pitch-based emphasis detection for characterization of meeting recordings Automatic Speech Recognition and Understanding Workhop IEEE ASRU 2003, St. Thomas, December 2003.
- Gatica-Perez, D, Lathoud, G, McCowan, I, Odobez, J-M, and Moore, D (2002). Audio-Visual Speaker Tracking with Importance Particle Filters. IDIAP-RR 02-37.
- Kraaij, W, Spitters, M and Hulth, A (2002). Headline extraction based on a combination of uni- and multidocument summarization techniques. Proceedings of the ACL workshop on Automatic Summarization/Document Understanding Conference (DUC 2002) , June 2002, Philadelphia, USA.
- Lapidot, I (2002). Self-Organizing-Maps With BIC For Speaker Clustering. IDIAP-RR 02-60.
- Moore, D (2002). The IDIAP Smart Meeting Room. IDIAP-Com 02-07.
- Janin, A, Baron, D, Edwards, J, Ellis, D, Gelbart, D, Morgan, N, Peskin, B, Pfau, T, Shriberg, E, Stolcke, A and Wooters, C (2003). The ICSI Meeting Corpus. Proc. ICASSP-03, Hong Kong, April 2003.
- Morgan, N, Baron, D, Bhagat, S, Carvey, H, Dhillon, R, Edwards, J, Gelbart, D, Janin, A, Krupski, A, Peskin, B, Pfau, T, Shriberg, E, Stolcke, A and Wooters, C (2003). Meetings about meetings: research at ICSI on speech in multiparty conversations. ICASSP-2003, Hong Kong, April 2003.
- Reyes, M, Raj, B and Ellis, D (2003). Multi-channel Source Separation by Factorial HMMs. Proc. ICASSP-03, Hong Kong, April 2003.
Public Deliverables
D1.1 | Specification of smart room environment and data collection and annotation protocols | September 2002 |
D1.2 | Collection and annotation of meeting room data | March 2004 |
D2.2 | Final report on multimodal recognizers | March 2005 |
D3.3 | Final report on multimodal information access | March 2005 |
D4.3 | Report on final demonstrator | March 2005 |
Publicity Material