Speech and Hearing Seminars

SPandH Seminars usually take place at 12 midday on Wednesdays in G22 or Room 107-108 (the Ada Lovelace Room), Regent Court (directions).

For information on SPandH seminars please contact Ning Ma or Erfan Loweimi.


2018
Wednesday 11th July 12:00, Ada Lovelace Prasanta Ghosh, Indian Institute of Science (IISc), Bangalore, India The orchestra behind speech
Monday 9th July 14:00, Lewin Lab Nicholas Cummins, University of Augsburg, Germany Ubiquitous multisensory health analysis: Opportunities and Challenges
Wednesday 30th May 12:00, Mappin 9 Room 124 Paolo Vecchiotti, Information Engineering, Università Politecnica delle Marche, Italy Voice Activity Detection and Speaker Localization in domestic environment by using Deep Learning
Wednesday 31st January 12:00, Ada Lovelace Dimitrios Pappas, International Faculty, CITY College Thessaloniki Harnessing Emergence for Distributed CASA
Friday 15th January 11:00, Lewin Lab Erfan Loweimi, SPandH Internal Robust Phase-based Speech Signal Processing; From Source-Filter Separation to Model-Based Robust ASR
2017
Friday 7th July 12:00, Ada Lovelace Dr Michael Stone, Marston Senior Research Fellow in Audiology/Hearing Sciences, Manchester Centre for Audiology and Deafness School of Health Sciences, Manchester University The role of envelope cues in the perception of speech in background sounds
Wed 24th May 12:00, Ada Lovelace Dr Nic Lane, Senior Lecturer at University College London (UCL) and Principal Scientist at Nokia Bell Labs Squeezing Deep Learning onto Wearables, Phones and Things
Wed 25th April 12:00, G25 Dr William Whitmer, MRC Institute of Hearing Research (Scottish Section) What we Talk about when we Talk about Speech Intelligibility Benefits
Wed 5th April 13:00, Ada Lovelace Room Dr Bob Sturm, School of Electronic Engineering and Computer Science, Queen Mary University of London Horses in Machine Learning
Wed 29th March 13:00, G22 Professor Kirill V Horoshenkov, Department of Mechanical Engineering, University of Sheffield Acoustic Condition Identification and Classification in Pipes Using Machine Learning Methods
Wed 22nd Feb 13:00, Ada Lovelace Room Professor Andrew Lambourne, School of Computing, Creative Technology and Engineering, Leeds Beckett University Computers and language – Is it All Smoke and Mirrors?
Wed 1st Feb 12:00, Ada Lovelace Room Nataliya Keberle and Hennadii Dobrovolskyi, Zaporozhye National University Language-Independent Pronunciation Quality Assessment by Comparison with Sample
2016
Tue 25th Oct 14:00, Ada Lovelace Room Jose Gonzalez Lopez and Phil Green, SPandH Internal Silent Speech: Reconstructing Speech from Sensor Data by Machine Learning
Wed 12th Oct 12:00, Ada Lovelace Room Christian Füllgrabe, MRC Institute of Hearing Research Beyond audibility - The role of supra-threshold auditory and cognitive processing in speech perception across the adult lifespan
Wed 5th Oct 12:00, Ada Lovelace Room Alessandro Di Nuovo, Sheffield Hallam University Number Understanding Modelling in a Behavioural Embodied Robot
Wed 2016-Aug-31 12:00, G22 Room Rosanna Milner, SPandH Internal DNN-based Speaker Clustering for Speaker Diarisation
Wed 2016-Aug-31 12:30, G22 Room Yulan Liu, SPandH Internal The Sheffield Wargame Corpus - Day Two and Day Three
Wed 2016-Aug-24 12:00, Ada Lovelace Room Thomas Hain, SPandH Internal webASR 2 - Improved cloud based speech technology
Wed 2016-Aug-24 12:30, Ada Lovelace Room Salil Deena, SPandH Internal Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition
Wed 2016-Aug-17 12:00, Ada Lovelace Room Ning Ma, SPandH Internal Speech localisation in a multitalker mixture by humans and machines
Wed 2016-Aug-17 12:30, Ada Lovelace Room Raymond Ng, SPandH Internal Combining weak tokenisers for phonotactic language recognition in a resource-constrained setting
Wed 2016-Aug-10 12:00, Ada Lovelace Room Mortaza Doulaty, SPandH Internal Automatic Genre and Show Identification of Broadcast Media
Wed 2016-Aug-10 12:30, Ada Lovelace Room Yanmeng Guo, SPandH Internal A robust dual-microphone speech source localization algorithm for reverberant environments
Wed 2016-Aug-3 12:00, Ada Lovelace Room Erfan Loweimi, SPandH Internal Use of Generalised Nonlinearity in Vector Taylor Series Noise Compensation for Robust Speech Recognition
Wed 2016-Aug-3 12:30, Ada Lovelace Room Iñigo Casanueva, SPandH Internal Dialogue State Tracking Personalisation for Users with Speech Disorders
Wed 2016-Jun-1 12:00, Ada Lovelace Room Dr Tony Tew, Department of Electronics, University of York Around the head in 80 ways
Mon 2016-Apr-27 12:30, Room G22 Yannis Stylianou, Professor of Speech Processing at the University of Crete and Group Leader of the Speech Technology Group at Toshiba Cambridge Research Lab, UK Speech Intelligibility and Beyond
Mon 2016-Mar-7 14:30, Ada Lovelace Room Dr Cleopatra Pike, Institute of Sound Recording, University of Surrey Compensation for spectral envelope distortion in auditory perception
2015
Wed 2015-Dec-2 12:00, Room 107-108 Hideki Kawahara, Emeritus Professor, Wakayama University; APSIPA Distinguished Lecturer 2015-2016; Visiting Research Scientist in Google UK Ltd. (London) Making Speech Tangible: For Better Understanding of Human Speech Communication
Wed 2015-Aug-19 11:30, Room 107-108 Sarah Al-Shareef, SPandH Internal Conversational Arabic Automatic Speech Recognition
Tue 2015-Aug-18 12:00, Room 107-108 Raymond Ng and Iñigo Casanueva, SPandH Internal A Study on The Stability and Effectiveness of Features in Quality Estimation for Spoken Language Translation / Knowledge Transfer Between Speaker for Personalised Dialogue Management
Wed 2015-June-3 12:30, Room 107-108 Iván López Espejo, Dept. of Signal Theory, Telematics and Communications, University of Granada, Spain Robust Automatic Speech Recognition on Mobile Devices with Small Microphone Array
Tue 2015-May-12 12:00, Room 107-108 Dorothea Kolossa, Cognitive Signal Processing Group, Institute of Communication Acoustics, Ruhr-Universität Bochum Statistical Models for Robust Speech Recognition and Model‐Based Speech Processing
Tue 2015-Apr-7 12:00, Room 107-108 Raymond Ng and Yulan Liu, SPandH Internal Quality Estimation for ASR K-best List Rescoring in Spoken Language Translation / An Investigation into Speaker Informed DNN Front-end for LVCSR
Tue 2015-Mar-24 13:00, G22 Stephen Elliot, Southampton Feedback Control of Sound in Aircraft and in The Ear
Tue 2015-Feb-03 13:00, G22 Ke Chen, Manchester Extracting Speaker Specific Information with a Deep Neural Architecture
Tue 2015-Jan-13 12:30, G22 Keiichi Tokuda, Nagoya Institute of Technology Human-like singing and talking machines
2014
Tue 2014-Dec-9 12:30, G22 Stephen Cox, UEA Read my Lips: Reflections on nearly Ten Years of Research at the University of East Anglia in Automatic Lip Reading
Tue 2014-Nov-18 12:30, G22 Stuart Green, Zoo Digital Ltd Opportunities for applied speech technologies in film and broadcast
Tue 2014-Nov-11 13:00, G22 Raymond Ng and Erfan Loweimi, SPandH Internal The USFD Systems for the IWSLT 2014 Evaluation / Phase information in speech recognition
Tue 2014-Aug-26 midday, G22 Tobias May, Technical University of Denmark A monaural cocktail-party processor: Speech segregation in background noise
Fri 2014-Jun-13 midday, G22 Michael Mandel, Ohio State Detailed models for understanding speech in noise
Tue 2014 Jun 3, midday, G22 Oscar Saz, Charles Fox and Heidi Christensen Natural Speech Technology: project overview
Mon 2014 Feb 10 Tim Jurgens, Carl-von-Ossietzky Universität, Germany Auditory models for better rehabilitative devices
2013
Dec 4 2013 Patrick Naylor, Imperial Dereverberation techniques
Oct 30 2013 midday G30 Jeff Adams, Amazon.com Speech & NLP at Amazon: Unique Challenges, Unique Resources
Sep 23 2013 15:15 G30 Angela Josupeit, U.Oldenburg, Germany Modeling of Speech Localization in a Multitalker Environment using Binaural and Harmonic Cues
Thu July 18 2013 midday G30 Simon Godsill, Cambridge Bayesian statistical methods in audio and music processing
Mon June 23 2013 midday G30 Pete Howell, UCL Screening school-aged children for risk of stuttering and other speech disorders
Mon 2013-Jun-3 midday G30. Alexa Wright, U. Westminster Conversation Piece: Speech technology in art
Tue 2013-May-7 midday G30. Rogier van Dalen, Cambridge Efficient segmental features for speech recognition
Tue 2013-Apr-23 midday G30. Max Little, MIT The Parkinson's voice initiative
Tue 2013-Mar-19 midday G30. Richard Smith, Smith Watkins Ltd Physics of Brass Instrument Design
Tue 2013-Mar-5, midday, G30 Dan Stowell, Queen Mary's London Tracking multiple intermittent sources in noise: inferring a mixture of Markov renewal processes
Tue 2013-Feb-12 midday G30. Steve Renals, Edinburgh Deep neural networks in speech recognition
Tue 2013-Jan-15, midday, G30 Oscar Saz, CMU Speech recognition and evaluation in the presence of severe phonological errors
2012
Tue 2012-Dec-11, midday, G30 Marcelo Rivolta, Sheffield BMS Repairing the ear with stem cells
Tue 2012-Oct-16, midday, G30 Chris Mitchell, Audio Analytic Ltd Sound Recognition in Physical Security Applications
Tue 2012-Oct-2, midday, G30 David Martinez Gonzales, internal iVector-Based Approaches for Spoken Language Identification
Tue 2012-Jul-17, midday, G30 Ray Meddis, Essex Auditory profiles for hearing dummies
Tue 2012-Jun-19, midday, G30 Arnab Ghoshal, Edinburgh Acoustic modeling with Subspace Gaussian Mixture Models
2012-May-22 Chris Sumner, Nottingham MRC Spectral and temporal neural processing: relationships to auditory perception
2012-Apr-24 Bernhard Seeber, Nottingham MRC Assessing and improving hearing with cochlear implants in noisy spaces
17/04/2012 Tim Kempton Machine-Assisted Phonemic Analysis
2011
13/12/2011 Maggie Vance Measuring speech perception in young children and children with language difficulties: Effects of background noise and phonetic contrast
20/10/2011 David Attwater Chasing the dream -- Man-machine conversation and the real world
16/08/2011 Juan A. Morales-Cordovilla Equivalences and Limits of Pitch-based Techniques for Robust Speech Recognition
18/07/2011 Chiori Hori Introduction of U-STAR activity
14/06/2011 Warren Mansell Perceptual Control Theory as a Framework for Computer Modelling Across the Social Sciences
07/06/2011 Parayitam Laxminarayana ASR Performance Over Wired and Wireless Networks
10/05/2011 Peter Wallis Social Engagement with Robots & Agents (SERA) - project report
07/02/2011 Angelo Cangelosi Embodied Language Learning with the Humanoid Robot iCub
12/01/2011 Kalle Palomäki Our recent studies in noise robust ASR in a large vocabulary task
2010
07/12/2010 Amy Beeston Compensation for reverberation
22/10/2010 Tim Jurgens Microscopic modelling of speech recognition for normal-hearing and hearing-impaired listeners
15/09/2010 John Culling Mapping speech intelligibility in noisy rooms
14/07/2010 Charles Fox Mean-field and Monte Carlo Musical Scene Analysis
11/06/2010 Matt Gibson Unsupervised adaptation of HMM-based synthesis models
21/04/2010 Sarah Creer Building personalised synthetic voices for individuals with severe speech impairment
31/03/2010 Patti Adank The role of vocal Imitation in speech comprehension
24/03/2010 Jindong Liu Effect of Inhibition in a Computational Model of the Inferior Colliculus on Sound Localisation
17/02/2010 Simon Makin Spectral- and temporal-envelope room-acoustic cues in attentional tracking
2009
2/12/2009 Guillaume Aimetti A computational model of early language acquisition: Towards a general statistical learning mechanism
18/11/2009 Christopher Peters Synthetic Characters: Behaviour Modelling, Perception and Interaction
11/11/2009 Matthew Robertson Modelling the Performance of Hearing Impaired Listeners Using an Auditory Model and Speech in Noise Test
5/11/2009 Francisco Lacerda An ecological model of early language acquisition
21/10/2009 Steven Greenberg Time Perspective in Spoken Language Processing
21/10/2009 Maurizio Filippone The Probabilistic Approach in Data Modeling
14/10/2009 Jim Hieronymus Exploiting Chinese Character Models to Improve Speech Recognition Performance
14/10/2009 Jim Hieronymus Spoken Dialogue Systems for Space and Lunar Exploration
8/4/2009 Jaydip Ray Modern otology and its links with the scientific community
29/1/2009 Mark Wibrow Acoustic Cues for Sarcasm? How Interesting.
29/1/2009 Sharif Alghowail Keyboard Acoustic Emanations
29/1/2009 Yi Hu The Techniques in Multiple-speaker Localisation
2008
10/12/2008 Michael Mandel Model-based EM Source Separation and Localization in Reverberant Mixtures
3/12/2008 Rob Morse Stochastic neural coding and implications for cochlear implant design
26/11/2008 Robin Hofe An animatronic tongue and vocal tract: AnTon
19/11/2008 Stuart Wrigley The influence of audio presentation style on multitasking during teleconferences
12/11/2008 Roddy Cowie The road to conversation with a computer
5/11/2008 Mark Huckvale Building Computational Models of Perception with Hierarchical Prediction Networks
29/10/2008 Ning Ma Active listening in auditory scenes
22/10/2008 James Carmichael Quantifying speech disorder diagnosis
15/10/2008 Sue Harding Perception of very brief segments of speech
8/10/2008 Heidi Christensen A speech fragment approach to localising multiple speakers in reverberant environments
1/10/2008 Mark Elshaw A gated recurrent self-organisation working memory model for emergent speech representation
9/4/2008 Alain de Cheveigné Cancellation in auditory scene analysis
16/1/2008 Timothy Kempton Language Identification: Insights from the Classification of Hand Annotated Phone Transcripts
2007
31/10/2007 Ian Howard A Computational Model of Infant Speech Development
17/10/2007 Peter van Hengel Verbal Aggression Detection in Complex Acoustical Environments
13/9/2007 Jort Gemmeke On the relation between statistical properties of spectrographic masks and their ability to reduce acoustic mismatch
6/9/2007 Maria Wolters Adapting Dialogue Systems to Older People
3/9/2007 Takatoshi Okuno Development of Frequency Selectivity Map (FSMap) depiction system for hearing impairment
22/8/2007 Jonathan Laidler Model-driven detection of clean speech patches in noise
25/7/2007 Sarah Creer Modern Speech Synthesis
7/6/2007 Piers Messum How children learn to pronounce, but not by imitation
23/5/2007 Robin Hofe Tongues, Trunks and Tentacles: Energetics in Physiology and Speech
2/5/2007 Yanchen Lu Modelling of Binaural Distance Perception
25/4/2007 Russell Mason The role of head movement in perception and measurement of spatial impression
28/3/2007 Mike Carey Mechanisms for Human Speech Acquisition and Processing
7/3/2007 Thomas Poulsen Sound Localization Through Evolutionary Learning Applied to Spiking Neural Networks
21/2/2007 Sue Harding Auditory Gist Perception and Attention
2/2/2007 Kalle Palomäki Speech recognition activities in the Adaptive Informatics Research Centre
31/1/2007 Roger Moore Sensorimotor Overlap in Living Organisms
2006
13/12/2006 Junichi Yamagishi Model Adaptation Approach to Speech Synthesis with Diverse Voice and Styles
29/11/2006 Saeed Vaseghi Speech Enhancement: Noise Reduction, Bandwidth Extension and Lost Packet Recovery
15/11/2006 Yasser H. Abdel-Haleem Conditional Random Fields for Continuous Speech Recognition
26/9/2006 Sadaoki Furui Why is automatic recognition of spontaneous speech so difficult?
6/9/2006 Matt Gibson Hypothesis Spaces For Minimum Bayes Risk Training In Large Vocabulary Speech Recognition
5/7/2006 Iain Murray Expressive Speech Synthesis
14/6/2006 Oscar Saz Speech Technologies at the University of Zaragoza
24/5/2006 Colin Breithaupt Speech feature analysis for robust automatic speech recognition
10/5/2006 Peter Howell A model of fluency breakdown based on data from speakers who stutter
3/5/2006 Philip Jackson Amplitude modulation of frication by voicing: acoustics and perception
26/4/2006 Mahesan Niranjan Introduction to Sequential Monte Carlo methods and their use in estimating formants
29/3/2006 James Carmichael Quantifying Speech Disorder Diagnosis on the Cheap - Computerising the Frenchay Dysarthria Assessment Tests
22/3/2006 Dennis Norris Perceptual learning in speech
8/3/2006 John Bridle It Keeps them on the Knife: Interpretations of HMMs with 'Dynamic Observations'
1/3/2006 Torsten Dau Modeling spectro-temporal processing in the auditory system
23/2/2006 Robert Kirchner Exemplar-Based Speech Processing and Phonological Learning
22/2/2006 Martin Russell A Data-Driven Analysis of Vowels in the ABI (Accents of the British Isles) Speech Corpus
15/2/2006 Christoph Draxler SpeechRecorder - a Platform-Independent Tool for Speech Recordings via the WWW
14/2/2006 Christoph Draxler WebTranscribe - A framework for web-based speech annotation
8/2/2006 Simon King Dynamic Bayesian Networks: the new framework for ASR research?
2005
7/12/2005 Ray Meddis A computer model of absolute threshold performance in human listeners
30/11/2005 Christopher Newell Do we need synthetic speech to sound natural, in order to sound expressive?
23/11/2005 Yan-Chen Lu My Multimedia Innovation Journey
17/11/2005 Alan Newell Systems for Older and Disabled People
16/11/2005 Sue Denham Modelling the representation and classification of natural sounds
2/11/2005 Piers Messum Learning to talk, but not by imitation
19/10/2005 Esmeralda Uraga Experiments using acoustic and articulatory data for speech recognition
12/10/2005 Stephen Cox Automatic Musical Genre Classification
5/10/2005 Richard Lyon History and Future of Electronic Color Photography: Where Vision and Silicon Meet
28/9/2005 Martin Cooke Non-native speech perception in noise
1/9/2005 Khademul Islam Molla Separation of mixed audio signals in the time-frequency domain
26/7/2005 Nobuaki Minematsu Theorem of the Invariant Structure and its Derivation of Speech Gestalt
12/5/2005 Sarah Hawkins Perceptual coherence and speech understanding, what a speech perception model should look like
5/5/2005 Roger Moore How Good Does Automatic Speech Recognition Have to Be? ... and when will it be that good?
16/3/2005 Sarah Simpson Consonant identification in N-talker babble as a function of N
9/3/2005 Ryuichiro Higashinaka Incorporating Discourse Features into Confidence Scoring of Intention Recognition Results in Spoken Dialogue Systems
16/2/2005 Kalle Palomäki Spatial Processing in Human Auditory Cortex: The Effects of 3D, ITD and ILD Stimulation Technique
31/1/2005 Sadaoki Furui Large-Scale Knowledge Resources
2004
3/11/2004 Ning Ma Whole Word Duration Modeling in Noise-Robust Recognition
20/10/2004 Andre Coy So where do the Fragments come from?
6/10/2004 Simon Makin The Nature of the Beats (the problem with static approaches to modelling concurrent vowel segregation)
26/5/2004 Barry Edmonds The role of sound localisation in speech intelligibility in noise
6/5/2004 Les Atlas Modulation Spectra: A New Tool for Acoustic Signal Analysis
14/4/2004 Tony Watkins Perceptual separation of the sounds of words and of rooms
4/2004 Stuart Cunningham Speech Recognition for Electronic Assistive Technology
24/3/2004 Eva Bjorkner Voice source characteristics in different registers in Classically Trained Female Musical Theatre Singers
18/2/2004 Stuart Wrigley M4 Overview
2003
22/10/2003 Azra Ali Audio-visual Data Fusion Errors In Syllables And Words