4.8(top 5%)
impact factor
2.6K(top 10%)
papers
47.8K(top 10%)
citations
82(top 10%)
h-index
4.9(top 5%)
extended IF
3.1K
all documents
51.1K
doc citations
150(top 5%)
g-index

Top Articles

#TitleJournalYearCitations
1Convolutional Neural Networks for Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing20141,596
2A Regression Approach to Speech Enhancement Based on Deep Neural NetworksIEEE/ACM Transactions on Audio Speech and Language Processing2015926
3Conv-TasNet: Surpassing Ideal Time–Frequency Magnitude Masking for Speech SeparationIEEE/ACM Transactions on Audio Speech and Language Processing2019926
4Supervised Speech Separation Based on Deep Learning: An OverviewIEEE/ACM Transactions on Audio Speech and Language Processing2018870
5On Training Targets for Supervised Speech SeparationIEEE/ACM Transactions on Audio Speech and Language Processing2014758
6HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden UnitsIEEE/ACM Transactions on Audio Speech and Language Processing2021551
7Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information RetrievalIEEE/ACM Transactions on Audio Speech and Language Processing2016530
8Multitalker Speech Separation With Utterance-Level Permutation Invariant Training of Deep Recurrent Neural NetworksIEEE/ACM Transactions on Audio Speech and Language Processing2017499
9Complex Ratio Masking for Monaural Speech SeparationIEEE/ACM Transactions on Audio Speech and Language Processing2016495
10Pre-Training With Whole Word Masking for Chinese BERTIEEE/ACM Transactions on Audio Speech and Language Processing2021482
11An Overview of Noise-Robust Automatic Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing2014410
12PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing2020380
13From Feedforward to Recurrent LSTM Neural Networks for Language ModelingIEEE/ACM Transactions on Audio Speech and Language Processing2015359
14A Consolidated Perspective on Multimicrophone Speech Enhancement and Source SeparationIEEE/ACM Transactions on Audio Speech and Language Processing2017359
15Application of Deep Belief Networks for Natural Language UnderstandingIEEE/ACM Transactions on Audio Speech and Language Processing2014354
16Convolutional Recurrent Neural Networks for Polyphonic Sound Event DetectionIEEE/ACM Transactions on Audio Speech and Language Processing2017334
17Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source SeparationIEEE/ACM Transactions on Audio Speech and Language Processing2015332
18Using Recurrent Neural Networks for Slot Filling in Spoken Language UnderstandingIEEE/ACM Transactions on Audio Speech and Language Processing2015317
19An Algorithm for Predicting the Intelligibility of Speech Masked by Modulated Noise MaskersIEEE/ACM Transactions on Audio Speech and Language Processing2016308
20Very Deep Convolutional Neural Networks for Noise Robust Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing2016262
21Determined Blind Source Separation Unifying Independent Vector Analysis and Nonnegative Matrix FactorizationIEEE/ACM Transactions on Audio Speech and Language Processing2016232
22Data Augmentation for Deep Neural Network Acoustic ModelingIEEE/ACM Transactions on Audio Speech and Language Processing2015224
23End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural NetworksIEEE/ACM Transactions on Audio Speech and Language Processing2018203
24Robust Sound Event Classification Using Deep Neural NetworksIEEE/ACM Transactions on Audio Speech and Language Processing2015196
25Multichannel Audio Source Separation With Deep Neural NetworksIEEE/ACM Transactions on Audio Speech and Language Processing2016194
26Power-Normalized Cepstral Coefficients (PNCC) for Robust Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing2016187
27Learning Spectral Mapping for Speech Dereverberation and DenoisingIEEE/ACM Transactions on Audio Speech and Language Processing2015185
28Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 ChallengeIEEE/ACM Transactions on Audio Speech and Language Processing2018185
29A New Framework for CNN-Based Speech Enhancement in the Time DomainIEEE/ACM Transactions on Audio Speech and Language Processing2019176
30Learning Complex Spectral Mapping With Gated Convolutional Recurrent Networks for Monaural Speech EnhancementIEEE/ACM Transactions on Audio Speech and Language Processing2020174
31A Deep Ensemble Learning Method for Monaural Speech SeparationIEEE/ACM Transactions on Audio Speech and Language Processing2016169
32Unsupervised Speech Representation Learning Using WaveNet AutoencodersIEEE/ACM Transactions on Audio Speech and Language Processing2019167
33STFT Phase Reconstruction in Voiced Speech for an Improved Single-Channel Speech EnhancementIEEE/ACM Transactions on Audio Speech and Language Processing2014162
34Speech Emotion Classification Using Attention-Based LSTMIEEE/ACM Transactions on Audio Speech and Language Processing2019160
35An End-to-End Neural Network for Polyphonic Piano Music TranscriptionIEEE/ACM Transactions on Audio Speech and Language Processing2016158
36PEFAC - A Pitch Estimation Algorithm Robust to High Levels of NoiseIEEE/ACM Transactions on Audio Speech and Language Processing2014157
37Multichannel Signal Processing With Deep Neural Networks for Automatic Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing2017156
38Time-Frequency Masking in the Complex Domain for Speech Dereverberation and DenoisingIEEE/ACM Transactions on Audio Speech and Language Processing2017150
39Voice Conversion Using Deep Neural Networks With Layer-Wise Generative TrainingIEEE/ACM Transactions on Audio Speech and Language Processing2014148
40Toward Human Parity in Conversational Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing2017147
41Localization of Multiple Speakers under High Reverberation using a Spherical Microphone Array and the Direct-Path Dominance TestIEEE/ACM Transactions on Audio Speech and Language Processing2014145
42Statistical Parametric Speech Synthesis Incorporating Generative Adversarial NetworksIEEE/ACM Transactions on Audio Speech and Language Processing2018145
43Speaker-Independent Speech Separation With Deep Attractor NetworkIEEE/ACM Transactions on Audio Speech and Language Processing2018142
44A Feature Study for Classification-Based Speech Separation at Low Signal-to-Noise RatiosIEEE/ACM Transactions on Audio Speech and Language Processing2014132
45Exemplar-Based Sparse Representation With Residual Compensation for Voice ConversionIEEE/ACM Transactions on Audio Speech and Language Processing2014130
46Speech Intelligibility Potential of General and Specialized Deep Neural Network Based Speech Enhancement SystemsIEEE/ACM Transactions on Audio Speech and Language Processing2017129
47Deep Convolutional Neural Networks for Predominant Instrument Recognition in Polyphonic MusicIEEE/ACM Transactions on Audio Speech and Language Processing2017128
48Low-Complexity Direction-of-Arrival Estimation Based on Wideband Co-Prime ArraysIEEE/ACM Transactions on Audio Speech and Language Processing2015127
49Text-Independent Speaker Verification Based on Triplet Convolutional Neural Network EmbeddingsIEEE/ACM Transactions on Audio Speech and Language Processing2018126
50TERA: Self-Supervised Learning of Transformer Encoder Representation for SpeechIEEE/ACM Transactions on Audio Speech and Language Processing2021126