Speech and Hearing Seminars

SPandH Seminars usually take place at 12 midday on Wednesdays in G22 or Room 107-108 (the Ada Lovelace Room), Regent Court (directions).

For information on SPandH seminars please contact Ning Ma or Erfan Loweimi.

2018
Wednesday 11th July 12:00, Ada Lovelace	Prasanta Ghosh, Indian Institute of Science (IISc), Bangalore, India	The orchestra behind speech
Monday 9th July 14:00, Lewin Lab	Nicholas Cummins, University of Augsburg, Germany	Ubiquitous multisensory health analysis: Opportunities and Challenges
Wednesday 30th May 12:00, Mappin 9 Room 124	Paolo Vecchiotti, Information Engineering, Università Politecnica delle Marche, Italy	Voice Activity Detection and Speaker Localization in domestic environment by using Deep Learning
Wednesday 31st January 12:00, Ada Lovelace	Dimitrios Pappas, International Faculty, CITY College Thessaloniki	Harnessing Emergence for Distributed CASA
Friday 15th January 11:00, Lewin Lab	Erfan Loweimi, SPandH Internal	Robust Phase-based Speech Signal Processing; From Source-Filter Separation to Model-Based Robust ASR
2017
Friday 7th July 12:00, Ada Lovelace	Dr Michael Stone, Marston Senior Research Fellow in Audiology/Hearing Sciences, Manchester Centre for Audiology and Deafness School of Health Sciences, Manchester University	The role of envelope cues in the perception of speech in background sounds
Wed 24th May 12:00, Ada Lovelace	Dr Nic Lane, Senior Lecturer at University College London (UCL) and Principal Scientist at Nokia Bell Labs	Squeezing Deep Learning onto Wearables, Phones and Things
Wed 25th April 12:00, G25	Dr William Whitmer, MRC Institute of Hearing Research (Scottish Section)	What we Talk about when we Talk about Speech Intelligibility Benefits
Wed 5th April 13:00, Ada Lovelace Room	Dr Bob Sturm, School of Electronic Engineering and Computer Science, Queen Mary University of London	Horses in Machine Learning
Wed 29th March 13:00, G22	Professor Kirill V Horoshenkov, Department of Mechanical Engineering, University of Sheffield	Acoustic Condition Identification and Classification in Pipes Using Machine Learning Methods
Wed 22nd Feb 13:00, Ada Lovelace Room	Professor Andrew Lambourne, School of Computing, Creative Technology and Engineering, Leeds Beckett University	Computers and language – Is it All Smoke and Mirrors?
Wed 1st Feb 12:00, Ada Lovelace Room	Nataliya Keberle and Hennadii Dobrovolskyi, Zaporozhye National University	Language-Independent Pronunciation Quality Assessment by Comparison with Sample
2016
Tue 25th Oct 14:00, Ada Lovelace Room	Jose Gonzalez Lopez and Phil Green, SPandH Internal	Silent Speech: Reconstructing Speech from Sensor Data by Machine Learning
Wed 12th Oct 12:00, Ada Lovelace Room	Christian Füllgrabe, MRC Institute of Hearing Research	Beyond audibility - The role of supra-threshold auditory and cognitive processing in speech perception across the adult lifespan
Wed 5th Oct 12:00, Ada Lovelace Room	Alessandro Di Nuovo, Sheffield Hallam University	Number Understanding Modelling in a Behavioural Embodied Robot
Wed 2016-Aug-31 12:00, G22 Room	Rosanna Milner, SPandH Internal	DNN-based Speaker Clustering for Speaker Diarisation
Wed 2016-Aug-31 12:30, G22 Room	Yulan Liu, SPandH Internal	The Sheffield Wargame Corpus - Day Two and Day Three
Wed 2016-Aug-24 12:00, Ada Lovelace Room	Thomas Hain, SPandH Internal	webASR 2 - Improved cloud based speech technology
Wed 2016-Aug-24 12:30, Ada Lovelace Room	Salil Deena, SPandH Internal	Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition
Wed 2016-Aug-17 12:00, Ada Lovelace Room	Ning Ma, SPandH Internal	Speech localisation in a multitalker mixture by humans and machines
Wed 2016-Aug-17 12:30, Ada Lovelace Room	Raymond Ng, SPandH Internal	Combining weak tokenisers for phonotactic language recognition in a resource-constrained setting
Wed 2016-Aug-10 12:00, Ada Lovelace Room	Mortaza Doulaty, SPandH Internal	Automatic Genre and Show Identification of Broadcast Media
Wed 2016-Aug-10 12:30, Ada Lovelace Room	Yanmeng Guo, SPandH Internal	A robust dual-microphone speech source localization algorithm for reverberant environments
Wed 2016-Aug-3 12:00, Ada Lovelace Room	Erfan Loweimi, SPandH Internal	Use of Generalised Nonlinearity in Vector Taylor Series Noise Compensation for Robust Speech Recognition
Wed 2016-Aug-3 12:30, Ada Lovelace Room	Iñigo Casanueva, SPandH Internal	Dialogue State Tracking Personalisation for Users with Speech Disorders
Wed 2016-Jun-1 12:00, Ada Lovelace Room	Dr Tony Tew, Department of Electronics, University of York	Around the head in 80 ways
Mon 2016-Apr-27, 12:30 Room G22	Yannis Stylianou, Professor of Speech Processing at the University of Crete and Group Leader of the Speech Technology Group at Toshiba Cambridge Research Lab, UK	Speech Intelligibility and Beyond
Mon 2016-Mar-7 14:30, Ada Lovelace Room	Dr Cleopatra Pike, Institute of Sound Recording, University of Surrey	Compensation for spectral envelope distortion in auditory perception
2015
Wed 2015-Dec-2 12:00, Room 107-108	Hideki Kawahara, Emeritus Professor, Wakayama University; APSIPA Distinguished Lecturer 2015-2016; Visiting Research Scientist in Google UK Ltd. (London)	Making Speech Tangible: For Better Understanding of Human Speech Communication
Wed 2015-Aug-19 11:30, Room 107-108	Sarah Al-Shareef, SPandH Internal	Conversational Arabic Automatic Speech Recognition
Tue 2015-Aug-18 12:00, Room 107-108	Raymond Ng and Iñigo Casanueva, SPandH Internal	A Study on The Stability and Effectiveness of Features in Quality Estimation for Spoken Language Translation / Knowledge Transfer Between Speaker for Personalised Dialogue Management
Wed 2015-June-3 12:30, Room 107-108	Iván López Espejo, Dept. of Signal Theory, Telematics and Communications, University of Granada, Spain	Robust Automatic Speech Recognition on Mobile Devices with Small Microphone Array
Tue 2015-May-12 12:00, Room 107-108	Dorothea Kolossa, Cognitive Signal Processing Group, Institute of Communication Acoustics, Ruhr-Universität Bochum	Statistical Models for Robust Speech Recognition and Model‐Based Speech Processing
Tue 2015-Apr-7 12:00, Room 107-108	Raymond Ng and Yulan Liu, SPandH Internal	Quality Estimation for ASR K-best List Rescoring in Spoken Language Translation / An Investigation into Speaker Informed DNN Front-end for LVCSR
Tue 2015-Mar-24 13:00, G22	Stephen Elliot, Southampton	Feedback Control of Sound in Aircraft and in The Ear
Tue 2015-Feb-03 13:00, G22	Ke Chen, Manchester	Extracting Speaker Specific Information with a Deep Neural Architecture
Tue 2015-Jan-13 12:30, G22	Keiichi Tokuda, Nagoya Institute of Technology	Human-like singing and talking machines
2014
Tue 2014-Dec-9 12:30, G22	Stephen Cox, UEA	Read my Lips: Reflections on nearly Ten Years of Research at the University of East Anglia in Automatic Lip Reading
Tue 2014-Nov-18 12:30, G22	Stuart Green, Zoo Digital Ltd	Opportunities for applied speech technologies in film and broadcast
Tue 2014-Nov-11 13:00, G22	Raymond Ng and Erfan Loweimi, SPandH Internal	The USFD Systems for the IWSLT 2014 Evaluation / Phase information in speech recognition
Tue 2014-Aug-26 midday, G22	Tobias May, Technical University of Denmark	A monaural cocktail-party processor: Speech segregation in background noise
Fri 2014-Jun-13 midday, G22	Michael Mandel, Ohio State	Detailed models for understanding speech in noise
Tue 2014 Jun 3, midday, G22	Oscar Saz, Charles Fox and Heidi Christensen	Natural Speech Technology: project overview
Mon 2014 Feb 10	Tim Jurgens, Carl-von-Ossietzky Universitat, Germany	Auditory models for better rehabilitative devices
2013
Dec 4 2013	Patrick Naylor, Imperial	Dereverberation techniques
Oct 30 2013 midday G30	Jeff Adams, Amazon.com	Speech & NLP at Amazon: Unique Challenges, Unique Resources
Sep 23 2013 15:15 G30	Angela Josupeit, U.Oldenburg, Germany	Modeling of Speech Localization in a Multitalker Environment using Binaural and Harmonic Cues
Thu July 18 2013 midday G30	Simon Godsill, Cambridge	Bayesian statistical methods in audio and music processing
Mon June 23 2013 midday G30	Pete Howell, UCL	Screening school-aged children for risk of stuttering and other speech disorders
Mon 2013-Jun-3 midday G30.	Alexa Wright, U. Westminster	Conversation Piece: Speech technology in art
Tue 2013-May-7 midday G30.	Rogier van Dalen, Cambridge	Efficient segmental features for speech recognition
Tue 2013-Apr-23 midday G30.	Max Little, MIT	The Parkinson's voice initiative
Tue 2013-Mar-19 midday G30.	Richard Smith, Smith Watkins Ltd	Physics of Brass Instrument Design
Tue 2013-Mar-5, midday, G30	Dan Stowell, Queen Mary's London	Tracking multiple intermittent sources in noise: inferring a mixture of Markov renewal processes
Tue 2013-Feb-12 midday G30.	Steve Renals, Edinburgh	Deep neural networks in speech recognition
Tue 2013-Jan-15, midday, G30	Oscar Saz, CMU	Speech recognition and evaluation in the presence of severe phonological errors
2012
Tue 2012-Dec-11, midday, G30	Marcelo Rivolta, Sheffield BMS	Repairing the ear with stem cells
Tue 2012-Oct-16, midday, G30	Chris Mitchell, Audio Analytic Ltd	Sound Recognition in Physical Security Applications
Tue 2012-Oct-2, midday, G30	David Martinez Gonzales, internal	iVector-Based Approaches for Spoken Language Identification
Tue 2012-Jul-17, midday, G30	Ray Meddis, Essex	Auditory profiles for hearing dummies
Tue 2012-Jun-19, midday, G30	Arnab Ghoshal, Edinburgh	Acoustic modeling with Subspace Gaussian Mixture Models
2012-May-22	Chris Sumner, Nottingham MRC	Spectral and temporal neural processing: relationships to auditory perception
2012-Apr-24	Bernhard Seeber, Nottingham MRC	Assessing and improving hearing with cochlear implants in noisy spaces
17/04/2012	Tim Kempton	Machine-Assisted Phonemic Analysis
2011
13/12/2011	Maggie Vance	Measuring speech perception in young children and children with language difficulties: Effects of background noise and phonetic contrast
20/10/2011	David Attwater	Chasing the dream -- Man-machine conversation and the real world
16/08/2011	Juan A. Morales-Cordovilla	Equivalences and Limits of Pitch-based Techniques for Robust Speech Recognition
18/07/2011	Chiori Hori	Introduction of U-STAR activity
14/06/2011	Warren Mansell	Perceptual Control Theory as a Framework for Computer Modelling Across the Social Sciences
07/06/2011	Parayitam Laxminarayana	ASR Performance Over Wired and Wireless Networks
10/05/2011	Peter Wallis	Social Engagement with Robots & Agents (SERA) - project report
07/02/2011	Angelo Cangelosi	Embodied Language Learning with the Humanoid Robot iCub
12/01/2011	Kalle Palomaki	Our recent studies in noise robust ASR in a large vocabulary task
2010
07/12/2010	Amy Beeston	Compensation for reverberation
22/10/2010	Tim Jurgens	Microscopic modelling of speech recognition for normal-hearing and hearing-impaired listeners
15/09/2010	John Culling	Mapping speech intelligibility in noisy rooms
14/07/2010	Charles Fox	Mean-field and Monte Carlo Musical Scene Analysis
11/06/2010	Matt Gibson	Unsupervised adaptation of HMM-based synthesis models
21/04/2010	Sarah Creer	Building personalised synthetic voices for individuals with severe speech impairment
31/03/2010	Patti Adank	The role of vocal Imitation in speech comprehension
24/03/2010	Jindong Liu	Effect of Inhibition in a Computational Model of the Inferior Colliculus on Sound Localisation
17/02/2010	Simon Makin	Spectral- and temporal-envelope room-acoustic cues in attentional tracking
2009
2/12/2009	Guillaume Aimetti	A computational model of early language acquisition: Towards a general statistical learning mechanism
18/11/2009	Christopher Peters	Synthetic Characters: Behaviour Modelling, Perception and Interaction
11/11/2009	Matthew Robertson	Modelling the Performance of Hearing Impaired Listeners Using an Auditory Model and Speech in Noise Test
5/11/2009	Francisco Lacerda	An ecological model of early language acquisition
21/10/2009	Steven Greenberg	Time Perspective in Spoken Language Processing
21/10/2009	Maurizio Filippone	The Probabilistic Approach in Data Modeling
14/10/2009	Jim Hieronymus	Exploiting Chinese Character Models to Improve Speech Recognition Performance
14/10/2009	Jim Hieronymus	Spoken Dialogue Systems for Space and Lunar Exploration
8/4/2009	Jaydip Ray	Modern otology and its links with the scientific community
29/1/2009	Mark Wibrow	Acoustic Cues for Sarcasm? How Interesting.
29/1/2009	Sharif Alghowail	Keyboard Acoustic Emanations
29/1/2009	Yi Hu	The Techniques in Multiple-speaker Localisation
2008
10/12/2008	Michael Mandel	Model-based EM Source Separation and Localization in Reverberant Mixtures
3/12/2008	Rob Morse	Stochastic neural coding and implications for cochlear implant design
26/11/2008	Robin Hofe	An animatronic tongue and vocal tract: AnTon
19/11/2008	Stuart Wrigley	The influence of audio presentation style on multitasking during teleconferences
12/11/2008	Roddy Cowie	The road to conversation with a computer
5/11/2008	Mark Huckvale	Building Computational Models of Perception with Hierarchical Prediction Networks
29/10/2008	Ning Ma	Active listening in auditory scenes
22/10/2008	James Carmichael	Quantifying speech disorder diagnosis
15/10/2008	Sue Harding	Perception of very brief segments of speech
8/10/2008	Heidi Christensen	A speech fragment approach to localising multiple speakers in reverberant environments
1/10/2008	Mark Elshaw	A gated recurrent self-organisation working memory model for emergent speech representation
9/4/2008	Alain de Cheveigné	Cancellation in auditory scene analysis
16/1/2008	Timothy Kempton	Language Identification: Insights from the Classification of Hand Annotated Phone Transcripts
2007
31/10/2007	Ian Howard	A Computational Model of Infant Speech Development
17/10/2007	Peter van Hengel	Verbal Aggression Detection in Complex Acoustical Environments
13/9/2007	Jort Gemmeke	On the relation between statistical properties of spectrographic masks and their ability to reduce acoustic mismatch
6/9/2007	Maria Wolters	Adapting Dialogue Systems to Older People
3/9/2007	Takatoshi Okuno	Development of Frequency Selectivity Map (FSMap) depiction system for hearing impairment
22/8/2007	Jonathan Laidler	Model-driven detection of clean speech patches in noise
25/7/2007	Sarah Creer	Modern Speech Synthesis
7/6/2007	Piers Messum	How children learn to pronounce, but not by imitation
23/5/2007	Robin Hofe	Tongues, Trunks and Tentacles: Energetics in Physiology and Speech
2/5/2007	Yanchen Lu	Modelling of Binaural Distance Perception
25/4/2007	Russell Mason	The role of head movement in perception and measurement of spatial impression
28/3/2007	Mike Carey	Mechanisms for Human Speech Acquisition and Processing
7/3/2007	Thomas Poulsen	Sound Localization Through Evolutionary Learning Applied to Spiking Neural Networks
21/2/2007	Sue Harding	Auditory Gist Perception and Attention
2/2/2007	Kalle Palomäki	Speech recognition activities in the Adaptive Informatics Research Centre
31/1/2007	Roger Moore	Sensorimotor Overlap in Living Organisms
2006
13/12/2006	Junichi Yamagishi	Model Adaptation Approach to Speech Synthesis with Diverse Voice and Styles
29/11/2006	Saeed Vaseghi	Speech Enhancement: Noise Reduction, Bandwidth Extension and Lost Packet Recovery
15/11/2006	Yasser H. Abdel-Haleem	Conditional Random Fields for Continuous Speech Recognition
26/9/2006	Sadaoki Furui	Why is automatic recognition of spontaneous speech so difficult?
6/9/2006	Matt Gibson	Hypothesis Spaces For Minimum Bayes Risk Training In Large Vocabulary Speech Recognition
5/7/2006	Iain Murray	Expressive Speech Synthesis
14/6/2006	Oscar Saz	Speech Technologies at the University of Zaragoza
24/5/2006	Colin Breithaupt	Speech feature analysis for robust automatic speech recognition
10/5/2006	Peter Howell	A model of fluency breakdown based on data from speakers who stutter
3/5/2006	Philip Jackson	Amplitude modulation of frication by voicing: acoustics and perception
26/4/2006	Mahesan Niranjan	Introduction to Sequential Monte Carlo methods and their use in estimating formants
29/3/2006	James Carmichael	Quantifying Speech Disorder Diagnosis on the Cheap - Computerising the Frenchay Dysarthria Assessment Tests
22/3/2006	Dennis Norris	Perceptual learning in speech
8/3/2006	John Bridle	It Keeps them on the Knife: Interpretations of HMMs with 'Dynamic Observations'
1/3/2006	Torsten Dau	Modeling spectro-temporal processing in the auditory system
23/2/2006	Robert Kirchner	Exemplar-Based Speech Processing and Phonological Learning
22/2/2006	Martin Russell	A Data-Driven Analysis of Vowels in the ABI (Accents of the British Isles) Speech Corpus
15/2/2006	Christoph Draxler	SpeechRecorder - a Platform-Independent Tool for Speech Recordings via the WWW
14/2/2006	Christoph Draxler	WebTranscribe - A framework for web-based speech annotation
8/2/2006	Simon King	Dynamic Bayesian Networks: the new framework for ASR research?
2005
7/12/2005	Ray Meddis	A computer model of absolute threshold performance in human listeners
30/11/2005	Christopher Newell	Do we need synthetic speech to sound natural, in order to sound expressive?
23/11/2005	Yan-Chen Lu	My Multimedia Innovation Journey
17/11/2005	Alan Newell	Systems for Older and Disabled People
16/11/2005	Sue Denham	Modelling the representation and classification of natural sounds
2/11/2005	Piers Messum	Learning to talk, but not by imitation
19/10/2005	Esmeralda Uraga	Experiments using acoustic and articulatory data for speech recognition
12/10/2005	Stephen Cox	Automatic Musical Genre Classification
5/10/2005	Richard Lyon	History and Future of Electronic Color Photography: Where Vision and Silicon Meet
28/9/2005	Martin Cooke	Non-native speech perception in noise
1/9/2005	Khademul Islam Molla	Separation of mixed audio signals in the time-frequency domain
26/7/2005	Nobuaki Minematsu	Theorem of the Invariant Structure and its Derivation of Speech Gestalt
12/5/2005	Sarah Hawkins	Perceptual coherence and speech understanding, what a speech perception model should look like
5/5/2005	Roger Moore	How Good Does Automatic Speech Recognition Have to Be? ... and when will it be that good?
16/3/2005	Sarah Simpson	Consonant identification in N-talker babble as a function of N
9/3/2005	Ryuichiro Higashinaka	Incorporating Discourse Features into Confidence Scoring of Intention Recognition Results in Spoken Dialogue Systems
16/2/2005	Kalle Palomäki	Spatial Processing in Human Auditory Cortex: The Effects of 3D, ITD and ILD Stimulation Technique
31/1/2005	Sadaoki Furui	Large-Scale Knowledge Resources
2004
3/11/2004	Ning Ma	Whole Word Duration Modeling in Noise-Robust Recognition
20/10/2004	Andre Coy	So where do the Fragments come from?
6/10/2004	Simon Makin	The Nature of the Beats (the problem with static approaches to modelling concurrent vowel segregation)
26/5/2004	Barry Edmonds	The role of sound localisation in speech intelligibility in noise
6/5/2004	Les Atlas	Modulation Spectra: A New Tool for Acoustic Signal Analysis
14/4/2004	Tony Watkins	Perceptual separation of the sounds of words and of rooms
4/2004	Stuart Cunningham	Speech Recognition for Electronic Assistive Technology
24/3/2004	Eva Bjorkner	Voice source characteristics in different registers in Classically Trained Female Musical Theatre Singers
18/2/2004	Stuart Wrigley	M4 Overview
2003
22/10/2003	Azra Ali	Audio-visual Data Fusion Errors In Syllables And Words

Department of Computer Science