IEEE/ACM Transactions on Audio Speech and Language Processing

4.8(top 5%)

impact factor

2.6K(top 10%)

papers

47.8K(top 10%)

citations

82(top 10%)

h-index

4.9(top 5%)

extended IF

3.1K

all documents

51.1K

doc citations

150(top 5%)

g-index

Top Articles

#	Title	Journal	Year	Citations
1	Convolutional Neural Networks for Speech Recognition	IEEE/ACM Transactions on Audio Speech and Language Processing	2014	1,596
2	A Regression Approach to Speech Enhancement Based on Deep Neural Networks	IEEE/ACM Transactions on Audio Speech and Language Processing	2015	926
3	Conv-TasNet: Surpassing Ideal Time–Frequency Magnitude Masking for Speech Separation	IEEE/ACM Transactions on Audio Speech and Language Processing	2019	926
4	Supervised Speech Separation Based on Deep Learning: An Overview	IEEE/ACM Transactions on Audio Speech and Language Processing	2018	870
5	On Training Targets for Supervised Speech Separation	IEEE/ACM Transactions on Audio Speech and Language Processing	2014	758
6	HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units	IEEE/ACM Transactions on Audio Speech and Language Processing	2021	551
7	Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval	IEEE/ACM Transactions on Audio Speech and Language Processing	2016	530
8	Multitalker Speech Separation With Utterance-Level Permutation Invariant Training of Deep Recurrent Neural Networks	IEEE/ACM Transactions on Audio Speech and Language Processing	2017	499
9	Complex Ratio Masking for Monaural Speech Separation	IEEE/ACM Transactions on Audio Speech and Language Processing	2016	495
10	Pre-Training With Whole Word Masking for Chinese BERT	IEEE/ACM Transactions on Audio Speech and Language Processing	2021	482
11	An Overview of Noise-Robust Automatic Speech Recognition	IEEE/ACM Transactions on Audio Speech and Language Processing	2014	410
12	PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition	IEEE/ACM Transactions on Audio Speech and Language Processing	2020	380
13	From Feedforward to Recurrent LSTM Neural Networks for Language Modeling	IEEE/ACM Transactions on Audio Speech and Language Processing	2015	359
14	A Consolidated Perspective on Multimicrophone Speech Enhancement and Source Separation	IEEE/ACM Transactions on Audio Speech and Language Processing	2017	359
15	Application of Deep Belief Networks for Natural Language Understanding	IEEE/ACM Transactions on Audio Speech and Language Processing	2014	354
16	Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection	IEEE/ACM Transactions on Audio Speech and Language Processing	2017	334
17	Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source Separation	IEEE/ACM Transactions on Audio Speech and Language Processing	2015	332
18	Using Recurrent Neural Networks for Slot Filling in Spoken Language Understanding	IEEE/ACM Transactions on Audio Speech and Language Processing	2015	317
19	An Algorithm for Predicting the Intelligibility of Speech Masked by Modulated Noise Maskers	IEEE/ACM Transactions on Audio Speech and Language Processing	2016	308
20	Very Deep Convolutional Neural Networks for Noise Robust Speech Recognition	IEEE/ACM Transactions on Audio Speech and Language Processing	2016	262
21	Determined Blind Source Separation Unifying Independent Vector Analysis and Nonnegative Matrix Factorization	IEEE/ACM Transactions on Audio Speech and Language Processing	2016	232
22	Data Augmentation for Deep Neural Network Acoustic Modeling	IEEE/ACM Transactions on Audio Speech and Language Processing	2015	224
23	End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks	IEEE/ACM Transactions on Audio Speech and Language Processing	2018	203
24	Robust Sound Event Classification Using Deep Neural Networks	IEEE/ACM Transactions on Audio Speech and Language Processing	2015	196
25	Multichannel Audio Source Separation With Deep Neural Networks	IEEE/ACM Transactions on Audio Speech and Language Processing	2016	194
26	Power-Normalized Cepstral Coefficients (PNCC) for Robust Speech Recognition	IEEE/ACM Transactions on Audio Speech and Language Processing	2016	187
27	Learning Spectral Mapping for Speech Dereverberation and Denoising	IEEE/ACM Transactions on Audio Speech and Language Processing	2015	185
28	Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge	IEEE/ACM Transactions on Audio Speech and Language Processing	2018	185
29	A New Framework for CNN-Based Speech Enhancement in the Time Domain	IEEE/ACM Transactions on Audio Speech and Language Processing	2019	176
30	Learning Complex Spectral Mapping With Gated Convolutional Recurrent Networks for Monaural Speech Enhancement	IEEE/ACM Transactions on Audio Speech and Language Processing	2020	174
31	A Deep Ensemble Learning Method for Monaural Speech Separation	IEEE/ACM Transactions on Audio Speech and Language Processing	2016	169
32	Unsupervised Speech Representation Learning Using WaveNet Autoencoders	IEEE/ACM Transactions on Audio Speech and Language Processing	2019	167
33	STFT Phase Reconstruction in Voiced Speech for an Improved Single-Channel Speech Enhancement	IEEE/ACM Transactions on Audio Speech and Language Processing	2014	162
34	Speech Emotion Classification Using Attention-Based LSTM	IEEE/ACM Transactions on Audio Speech and Language Processing	2019	160
35	An End-to-End Neural Network for Polyphonic Piano Music Transcription	IEEE/ACM Transactions on Audio Speech and Language Processing	2016	158
36	PEFAC - A Pitch Estimation Algorithm Robust to High Levels of Noise	IEEE/ACM Transactions on Audio Speech and Language Processing	2014	157
37	Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition	IEEE/ACM Transactions on Audio Speech and Language Processing	2017	156
38	Time-Frequency Masking in the Complex Domain for Speech Dereverberation and Denoising	IEEE/ACM Transactions on Audio Speech and Language Processing	2017	150
39	Voice Conversion Using Deep Neural Networks With Layer-Wise Generative Training	IEEE/ACM Transactions on Audio Speech and Language Processing	2014	148
40	Toward Human Parity in Conversational Speech Recognition	IEEE/ACM Transactions on Audio Speech and Language Processing	2017	147
41	Localization of Multiple Speakers under High Reverberation using a Spherical Microphone Array and the Direct-Path Dominance Test	IEEE/ACM Transactions on Audio Speech and Language Processing	2014	145
42	Statistical Parametric Speech Synthesis Incorporating Generative Adversarial Networks	IEEE/ACM Transactions on Audio Speech and Language Processing	2018	145
43	Speaker-Independent Speech Separation With Deep Attractor Network	IEEE/ACM Transactions on Audio Speech and Language Processing	2018	142
44	A Feature Study for Classification-Based Speech Separation at Low Signal-to-Noise Ratios	IEEE/ACM Transactions on Audio Speech and Language Processing	2014	132
45	Exemplar-Based Sparse Representation With Residual Compensation for Voice Conversion	IEEE/ACM Transactions on Audio Speech and Language Processing	2014	130
46	Speech Intelligibility Potential of General and Specialized Deep Neural Network Based Speech Enhancement Systems	IEEE/ACM Transactions on Audio Speech and Language Processing	2017	129
47	Deep Convolutional Neural Networks for Predominant Instrument Recognition in Polyphonic Music	IEEE/ACM Transactions on Audio Speech and Language Processing	2017	128
48	Low-Complexity Direction-of-Arrival Estimation Based on Wideband Co-Prime Arrays	IEEE/ACM Transactions on Audio Speech and Language Processing	2015	127
49	Text-Independent Speaker Verification Based on Triplet Convolutional Neural Network Embeddings	IEEE/ACM Transactions on Audio Speech and Language Processing	2018	126
50	TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech	IEEE/ACM Transactions on Audio Speech and Language Processing	2021	126

Scientometrics

Overviews

Citing Bodies

IEEE/ACM Transactions on Audio Speech and Language Processing

Top Articles