Rank | Title | Journal | Year | Citations |
---|---|---|---|---|
1 | Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems | Speech Communication | 1993 | 1,630 |
2 | Restructuring speech representations using a pitch-adaptive time–frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds | Speech Communication | 1999 | 1,458 |
3 | Vocal communication of emotion: A review of research paradigms | Speech Communication | 2003 | 1,256 |
4 | An overview of text-independent speaker recognition: From features to supervectors | Speech Communication | 2010 | 1,149 |
5 | The analysis of speech in different temporal integration windows: cerebral lateralization as ‘asymmetric sampling in time’ | Speech Communication | 2003 | 1,087 |
6 | Speaker identification and verification using Gaussian mixture speaker models | Speech Communication | 1995 | 1,041 |
7 | Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones | Speech Communication | 1990 | 910 |
8 | Statistical parametric speech synthesis | Speech Communication | 2009 | 894 |
9 | Speech emotion recognition using hidden Markov models | Speech Communication | 2003 | 775 |
10 | Emotional speech recognition: Resources, features, and methods | Speech Communication | 2006 | 707 |
11 | Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge | Speech Communication | 2011 | 595 |
12 | A review of depression and suicide risk assessment using speech analysis | Speech Communication | 2015 | 567 |
13 | Subjective comparison and evaluation of speech enhancement algorithms | Speech Communication | 2007 | 553 |
14 | The role of voice quality in communicating emotion, mood and attitude | Speech Communication | 2003 | 494 |
15 | Robust automatic speech recognition with missing and unreliable acoustic data | Speech Communication | 2001 | 487 |
16 | On multi-level modeling of data from repeated measures designs: a tutorial | Speech Communication | 2004 | 485 |
17 | Speech recognition in noisy environments: A survey | Speech Communication | 1995 | 473 |
18 | Intelligibility of normal speech I: Global and fine-grained acoustic-phonetic talker characteristics | Speech Communication | 1996 | 439 |
19 | Describing the emotional states that are expressed in speech | Speech Communication | 2003 | 439 |
20 | Speech database development at MIT: Timit and beyond | Speech Communication | 1990 | 434 |
21 | Speech recognition by machines and humans | Speech Communication | 1997 | 422 |
22 | Speech emotion recognition: Emotional models, databases, features, preprocessing methods, supporting modalities, and classifiers | Speech Communication | 2020 | 417 |
23 | Spoofing and countermeasures for speaker verification: A survey | Speech Communication | 2015 | 405 |
24 | Joint-sequence models for grapheme-to-phoneme conversion | Speech Communication | 2008 | 399 |
25 | How may I help you? | Speech Communication | 1997 | 394 |
26 | Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition | Speech Communication | 2012 | 382 |
27 | Silent speech interfaces | Speech Communication | 2010 | 364 |
28 | Phone-level pronunciation scoring and assessment for interactive language learning | Speech Communication | 2000 | 357 |
29 | Automatic speech recognition and speech variability: A review | Speech Communication | 2007 | 349 |
30 | Emotional speech: Towards a new generation of databases | Speech Communication | 2003 | 346 |
31 | Glottal wave analysis with Pitch Synchronous Iterative Adaptive Inverse Filtering | Speech Communication | 1992 | 341 |
32 | Quantitative association of vocal-tract and facial behavior | Speech Communication | 1998 | 329 |
33 | Voice transformation using PSOLA technique | Speech Communication | 1992 | 328 |
34 | The importance of phase in speech enhancement | Speech Communication | 2011 | 326 |
35 | Confidence measures for speech recognition: A survey | Speech Communication | 2005 | 322 |
36 | Automatic speech recognition for under-resourced languages: A survey | Speech Communication | 2014 | 320 |
37 | A noise-estimation algorithm for highly non-stationary environments | Speech Communication | 2006 | 319 |
38 | The NIST speaker recognition evaluation – Overview, methodology, systems, results, perspective | Speech Communication | 2000 | 318 |
39 | Automatic speech emotion recognition using modulation spectral features | Speech Communication | 2011 | 309 |
40 | Efficient voice activity detection algorithms using long-term speech information | Speech Communication | 2004 | 308 |
41 | Primitives-based evaluation and estimation of emotions in speech | Speech Communication | 2007 | 308 |
42 | Prosody-based automatic segmentation of speech into sentences and topics | Speech Communication | 2000 | 306 |
43 | ATR Japanese speech database as a tool of speech recognition and synthesis | Speech Communication | 1990 | 304 |
44 | Experiments with a nonlinear spectral subtractor (NSS), Hidden Markov models and the projection, for robust speech recognition in cars | Speech Communication | 1992 | 293 |
45 | The role of intonation in emotional expressions | Speech Communication | 2005 | 276 |
46 | Emotion recognition using a hierarchical binary decision tree approach | Speech Communication | 2011 | 274 |
47 | Language-independent and language-adaptive acoustic modeling for speech recognition | Speech Communication | 2001 | 273 |
48 | Interaction between the native and second language phonetic subsystems | Speech Communication | 2003 | 271 |
49 | Ensemble methods for spoken emotion recognition in call-centres | Speech Communication | 2007 | 267 |
50 | The LIMSI Broadcast News transcription system | Speech Communication | 2002 | 264 |