# | Title | Journal | Year | Citations |
---|
1 | Stochastic gradient boosting | Computational Statistics and Data Analysis | 2002 | 4,655 |
2 | PLS path modeling | Computational Statistics and Data Analysis | 2005 | 4,249 |
3 | ggplot2 | Wiley Interdisciplinary Reviews: Computational Statistics | 2011 | 2,171 |
4 | Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach | Data Mining and Knowledge Discovery | 2004 | 2,034 |
5 | Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values | Data Mining and Knowledge Discovery | 1998 | 1,773 |
6 | Deep learning for time series classification: a review | Data Mining and Knowledge Discovery | 2019 | 1,656 |
7 | Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals | Data Mining and Knowledge Discovery | 1997 | 1,218 |
8 | Response surface methodology | Wiley Interdisciplinary Reviews: Computational Statistics | 2010 | 1,210 |
9 | Experiencing SAX: a novel symbolic representation of time series | Data Mining and Knowledge Discovery | 2007 | 1,190 |
10 | Algorithms and applications for approximate nonnegative matrix factorization | Computational Statistics and Data Analysis | 2007 | 1,162 |
11 | Frequent pattern mining: current status and future directions | Data Mining and Knowledge Discovery | 2007 | 1,109 |
12 | Density-Based Clustering in Spatial Databases: The Algorithm GDBSCAN and Its Applications | Data Mining and Knowledge Discovery | 1998 | 1,059 |
13 | E-Commerce Recommendation Applications | Data Mining and Knowledge Discovery | 2001 | 1,056 |
14 | Goodness-of-fit indices for partial least squares path modeling | Computational Statistics | 2013 | 978 |
15 | Discovery of Frequent Episodes in Event Sequences | Data Mining and Knowledge Discovery | 1997 | 974 |
16 | Partial least squares regression and projection on latent structure regression (PLS Regression) | Wiley Interdisciplinary Reviews: Computational Statistics | 2010 | 961 |
17 | Bursty and Hierarchical Structure in Streams | Data Mining and Knowledge Discovery | 2003 | 898 |
18 | Graph based anomaly detection and description: a survey | Data Mining and Knowledge Discovery | 2015 | 897 |
19 | The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances | Data Mining and Knowledge Discovery | 2017 | 838 |
20 | Robust smoothing of gridded data in one and higher dimensions with missing values | Computational Statistics and Data Analysis | 2010 | 805 |
21 | Empirical characterization of random forest variable importance measures | Computational Statistics and Data Analysis | 2008 | 783 |
22 | Automatic Construction of Decision Trees from Data: A Multi-Disciplinary Survey | Data Mining and Knowledge Discovery | 1998 | 751 |
23 | On Bias, Variance, 0/1—Loss, and the Curse-of-Dimensionality | Data Mining and Knowledge Discovery | 1997 | 729 |
24 | Discretization: An Enabling Technique | Data Mining and Knowledge Discovery | 2002 | 729 |
25 | Levelwise Search and Borders of Theories in Knowledge Discovery | Data Mining and Knowledge Discovery | 1997 | 706 |
26 | A survey of hierarchical classification across different application domains | Data Mining and Knowledge Discovery | 2011 | 693 |
27 | Adaptive Fraud Detection | Data Mining and Knowledge Discovery | 1997 | 649 |
28 | On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration | Data Mining and Knowledge Discovery | 2003 | 649 |
29 | BIRCH: A New Data Clustering Algorithm and Its Applications | Data Mining and Knowledge Discovery | 1997 | 643 |
30 | Consistent and asymptotically normal PLS estimators for linear structural equations | Computational Statistics and Data Analysis | 2015 | 634 |
31 | How many principal components? stopping rules for determining the number of non-trivial axes revisited | Computational Statistics and Data Analysis | 2005 | 626 |
32 | Experimental comparison of representation methods and distance measures for time series data | Data Mining and Knowledge Discovery | 2013 | 612 |
33 | Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem | Data Mining and Knowledge Discovery | 1998 | 582 |
34 | Multicollinearity | Wiley Interdisciplinary Reviews: Computational Statistics | 2010 | 580 |
35 | Testing and dating of structural changes in practice | Computational Statistics and Data Analysis | 2003 | 573 |
36 | Estimating classification error rate: Repeated cross-validation, repeated hold-out and bootstrap | Computational Statistics and Data Analysis | 2009 | 564 |
37 | A classification EM algorithm for clustering and two stochastic versions | Computational Statistics and Data Analysis | 1992 | 550 |
38 | InceptionTime: Finding AlexNet for time series classification | Data Mining and Knowledge Discovery | 2020 | 542 |
39 | An adjusted boxplot for skewed distributions | Computational Statistics and Data Analysis | 2008 | 514 |
40 | Cluster-wise assessment of cluster stability | Computational Statistics and Data Analysis | 2007 | 512 |
41 | Practical variable selection for generalized additive models | Computational Statistics and Data Analysis | 2011 | 512 |
42 | Community detection in Social Media | Data Mining and Knowledge Discovery | 2012 | 509 |
43 | The EM algorithm for graphical association models with missing data | Computational Statistics and Data Analysis | 1995 | 500 |
44 | An application of changepoint methods in studying the effect of age on survival in breast cancer | Computational Statistics and Data Analysis | 1999 | 488 |
45 | Controlled experiments on the web: survey and practical guide | Data Mining and Knowledge Discovery | 2009 | 486 |
46 | Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models | Computational Statistics and Data Analysis | 2003 | 474 |
47 | The Bayesian information criterion: background, derivation, and applications | Wiley Interdisciplinary Reviews: Computational Statistics | 2012 | 473 |
48 | Selecting and estimating regular vine copulae and application to financial returns | Computational Statistics and Data Analysis | 2013 | 467 |
49 | On the exact distribution of maximally selected rank statistics | Computational Statistics and Data Analysis | 2003 | 455 |
50 | Hierarchical Clustering Algorithms for Document Datasets | Data Mining and Knowledge Discovery | 2005 | 452 |
51 | Robust forecasting of mortality and fertility rates: A functional data approach | Computational Statistics and Data Analysis | 2007 | 447 |
52 | Three naive Bayes approaches for discrimination-free classification | Data Mining and Knowledge Discovery | 2010 | 445 |
53 | On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study | Data Mining and Knowledge Discovery | 2016 | 445 |
54 | Training and assessing classification rules with imbalanced data | Data Mining and Knowledge Discovery | 2014 | 444 |
55 | Multiple factor analysis (AFMULT package) | Computational Statistics and Data Analysis | 1994 | 439 |
56 | Advances in Instance Selection for Instance-Based Learning Algorithms | Data Mining and Knowledge Discovery | 2002 | 435 |
57 | Characteristic-Based Clustering for Time Series Data | Data Mining and Knowledge Discovery | 2006 | 435 |
58 | Why the Monte Carlo method is so important today | Wiley Interdisciplinary Reviews: Computational Statistics | 2014 | 423 |
59 | Evaluating latent class analysis models in qualitative phenotype identification | Computational Statistics and Data Analysis | 2006 | 422 |
60 | Computing LTS Regression for Large Data Sets | Data Mining and Knowledge Discovery | 2006 | 415 |
61 | BACON: blocked adaptive computationally efficient outlier nominators | Computational Statistics and Data Analysis | 2000 | 412 |
62 | PARAFAC: Parallel factor analysis | Computational Statistics and Data Analysis | 1994 | 401 |
63 | Bayesian computing with INLA: New features | Computational Statistics and Data Analysis | 2013 | 400 |
64 | Fuzzy set theory | Wiley Interdisciplinary Reviews: Computational Statistics | 2010 | 387 |
65 | Bayesian spatial modeling of genetic population structure | Computational Statistics | 2008 | 377 |
66 | Maximum likelihood estimation in nonlinear mixed effects models | Computational Statistics and Data Analysis | 2005 | 374 |
67 | Relaxed Lasso | Computational Statistics and Data Analysis | 2007 | 372 |
68 | Genetic process mining: an experimental evaluation | Data Mining and Knowledge Discovery | 2007 | 372 |
69 | Ordinal, Continuous and Heterogeneous k-Anonymity Through Microaggregation | Data Mining and Knowledge Discovery | 2005 | 370 |
70 | Overdispersion: Models and estimation | Computational Statistics and Data Analysis | 1998 | 368 |
71 | Classification of time series by shapelet transformation | Data Mining and Knowledge Discovery | 2014 | 368 |
72 | An improved approximation to the precision of fixed effects from restricted maximum likelihood | Computational Statistics and Data Analysis | 2009 | 359 |
73 | ROCKET: exceptionally fast and accurate time series classification using random convolutional kernels | Data Mining and Knowledge Discovery | 2020 | 359 |
74 | Benchmark for filter methods for feature selection in high-dimensional classification data | Computational Statistics and Data Analysis | 2020 | 356 |
75 | maxLik: A package for maximum likelihood estimation in R | Computational Statistics | 2011 | 353 |
76 | FURIA: an algorithm for unordered fuzzy rule induction | Data Mining and Knowledge Discovery | 2009 | 351 |
77 | Time series classification with ensembles of elastic distance measures | Data Mining and Knowledge Discovery | 2015 | 349 |
78 | Community discovery using nonnegative matrix factorization | Data Mining and Knowledge Discovery | 2011 | 348 |
79 | A general class of zero-or-one inflated beta regression models | Computational Statistics and Data Analysis | 2012 | 347 |
80 | An extensive comparison of recent classification tools applied to microarray data | Computational Statistics and Data Analysis | 2005 | 344 |
81 | The Akaike information criterion: Background, derivation, properties, application, interpretation, and refinements | Wiley Interdisciplinary Reviews: Computational Statistics | 2019 | 344 |
82 | Possibility theory and statistical reasoning | Computational Statistics and Data Analysis | 2006 | 343 |
83 | The BOSS is concerned with time series classification in the presence of noise | Data Mining and Knowledge Discovery | 2015 | 340 |
84 | A note on the validity of cross-validation for evaluating autoregressive time series prediction | Computational Statistics and Data Analysis | 2018 | 329 |
85 | On-Line Unsupervised Outlier Detection Using Finite Mixtures with Discounting Learning Algorithms | Data Mining and Knowledge Discovery | 2004 | 323 |
86 | An anova test for functional data | Computational Statistics and Data Analysis | 2004 | 321 |
87 | Statistical analysis of financial networks | Computational Statistics and Data Analysis | 2005 | 321 |
88 | Multiple factor analysis: principal component analysis for multitable and multiblock data sets | Wiley Interdisciplinary Reviews: Computational Statistics | 2013 | 319 |
89 | A comparison of algorithms for fitting the PARAFAC model | Computational Statistics and Data Analysis | 2006 | 316 |
90 | Outlier identification in high dimensions | Computational Statistics and Data Analysis | 2008 | 313 |
91 | Model-based clustering of high-dimensional data: A review | Computational Statistics and Data Analysis | 2014 | 312 |
92 | Mining the customer credit using classification and regression tree and multivariate adaptive regression splines | Computational Statistics and Data Analysis | 2006 | 311 |
93 | Survey on mining subjective data on the web | Data Mining and Knowledge Discovery | 2012 | 307 |
94 | Open-source machine learning: R meets Weka | Computational Statistics | 2009 | 305 |
95 | Ridge regression | Wiley Interdisciplinary Reviews: Computational Statistics | 2009 | 305 |
96 | Adaptive proposal distribution for random walk Metropolis algorithm | Computational Statistics | 1999 | 304 |
97 | Mining Non-Redundant Association Rules | Data Mining and Knowledge Discovery | 2004 | 304 |
98 | Efficient Adaptive-Support Association Rule Mining for Recommender Systems | Data Mining and Knowledge Discovery | 2002 | 301 |
99 | (null) | Data Mining and Knowledge Discovery | 2001 | 294 |
100 | Analysis of Type-II progressively hybrid censored data | Computational Statistics and Data Analysis | 2006 | 292 |