Papers: 4.6K (top 1%)
Citations: 60.9K (top 1%)
h-index: 85 (top 1%)
g-index: 155 (top 1%)
All documents: 27.0K
Doc citations: 3.8K

Top Articles

# | Title | Journal | Year | Citations
1 | A high-performance, portable implementation of the MPI message passing interface standard | Parallel Computing | 1996 | 1,639
2 | Automated empirical optimizations of software and the ATLAS project | Parallel Computing | 2001 | 928
3 | The ganglia distributed monitoring system: design, implementation, and experience | Parallel Computing | 2004 | 903
4 | Hybrid scheduling for the parallel solution of linear systems | Parallel Computing | 2006 | 805
5 | Robust taboo search for the quadratic assignment problem | Parallel Computing | 1991 | 726
6 | Parallel reactive molecular dynamics: Numerical methods and algorithmic techniques | Parallel Computing | 2012 | 716
7 | Genetic algorithms and neural networks: optimizing connections and connectivity | Parallel Computing | 1990 | 622
8 | The parallel genetic algorithm as function optimizer | Parallel Computing | 1991 | 569
9 | Data management and transfer in high-performance computational grid environments | Parallel Computing | 2002 | 467
10 | PyCUDA and PyOpenCL: A scripting-based approach to GPU run-time code generation | Parallel Computing | 2012 | 396
11 | Graph partitioning models for parallel computing | Parallel Computing | 2000 | 371
12 | A dynamic model and parallel tabu search heuristic for real-time ambulance relocation | Parallel Computing | 2001 | 360
13 | Evolution algorithms in combinatorial optimization | Parallel Computing | 1988 | 352
14 | A class of parallel tiled linear algebra algorithms for multicore architectures | Parallel Computing | 2009 | 327
15 | Swift: A language for distributed parallel scripting | Parallel Computing | 2011 | 319
16 | Parallel algorithms for hierarchical clustering | Parallel Computing | 1995 | 314
17 | Bringing skeletons out of the closet: a pragmatic manifesto for skeletal parallel programming | Parallel Computing | 2004 | 311
18 | Particle Swarm based Data Mining Algorithms for classification tasks | Parallel Computing | 2004 | 309
19 | Towards dense linear algebra for hybrid GPU accelerated manycore systems | Parallel Computing | 2010 | 295
20 | SUPERB: A tool for semi-automatic MIMD/SIMD parallelization | Parallel Computing | 1988 | 290
21 | Optimization of sparse matrix–vector multiplication on emerging multicore platforms | Parallel Computing | 2009 | 276
22 | PT-Scotch: A tool for efficient parallel graph ordering | Parallel Computing | 2008 | 271
23 | Parallel recombinative simulated annealing: A genetic algorithm | Parallel Computing | 1995 | 261
24 | Symmetry in interconnection networks based on Cayley graphs of permutation groups: A survey | Parallel Computing | 1993 | 260
25 | Extensible component-based architecture for FLASH, a massively parallel, multiphysics simulation code | Parallel Computing | 2009 | 219
26 | BSPlib: The BSP programming library | Parallel Computing | 1998 | 218
27 | The PVM concurrent computing system: Evolution, experiences, and trends | Parallel Computing | 1994 | 207
28 | The communication challenge for MPP: Intel Paragon and Meiko CS-2 | Parallel Computing | 1994 | 201
29 | From CUDA to OpenCL: Towards a performance-portable solution for multi-platform GPU programming | Parallel Computing | 2012 | 198
30 | A hybrid MPI–OpenMP scheme for scalable parallel pseudospectral computations for fluid turbulence | Parallel Computing | 2011 | 196
31 | DAGuE: A generic distributed DAG engine for High Performance Computing | Parallel Computing | 2012 | 196
32 | A hybrid multi-objective Particle Swarm Optimization for scientific workflow scheduling | Parallel Computing | 2017 | 194
33 | Parallel implementation of the TRANSIMS micro-simulation | Parallel Computing | 2001 | 193
34 | Parallel Tabu search heuristics for the dynamic multi-vehicle dial-a-ride problem | Parallel Computing | 2004 | 192
35 | Parallel GRASP with path-relinking for job shop scheduling | Parallel Computing | 2003 | 191
36 | Matrix algorithms on a hypercube I: Matrix multiplication | Parallel Computing | 1987 | 181
37 | Multiprocessor FFTs | Parallel Computing | 1987 | 179
38 | Component averaging: An efficient iterative parallel algorithm for large and sparse unstructured problems | Parallel Computing | 2001 | 175
39 | A parallel tabu search algorithm for solving the container loading problem | Parallel Computing | 2003 | 172
40 | FFT algorithms for vector computers | Parallel Computing | 1984 | 166
41 | MapReduce in MPI for Large-scale graph algorithms | Parallel Computing | 2011 | 162
42 | Distributed processing of very large datasets with DataCutter | Parallel Computing | 2001 | 161
43 | PaStiX: a high-performance parallel direct solver for sparse symmetric positive definite systems | Parallel Computing | 2002 | 160
44 | High performance computing using MPI and OpenMP on multi-core parallel systems | Parallel Computing | 2011 | 151
45 | Computational aspects of a code to study rotating turbulent convection in spherical shells | Parallel Computing | 1999 | 149
46 | Parallel solution of partial symmetric eigenvalue problems from electronic structure calculations | Parallel Computing | 2011 | 147
47 | New advances in chemistry and materials science with CPMD and parallel computing | Parallel Computing | 2000 | 146
48 | Sparse matrix multiplication: The distributed block-compressed sparse row library | Parallel Computing | 2014 | 143
49 | Cost-efficient task scheduling for executing large programs in the cloud | Parallel Computing | 2013 | 141
50 | Monitors, messages, and clusters: The p4 parallel programming system | Parallel Computing | 1994 | 139
51 | List scheduling with and without communication delays | Parallel Computing | 1993 | 137
52 | High-performance parallel implicit CFD | Parallel Computing | 2001 | 137
53 | Parallel synchronous and asynchronous implementations of the auction algorithm | Parallel Computing | 1991 | 136
54 | Parallel heuristics for scalable community detection | Parallel Computing | 2015 | 136
55 | Probabilistic methods for centroidal Voronoi tessellations and their parallel implementations | Parallel Computing | 2002 | 135
56 | A single-program-multiple-data computational model for EPEX/FORTRAN | Parallel Computing | 1988 | 134
57 | The programming model of ASSIST, an environment for parallel and distributed portable applications | Parallel Computing | 2002 | 134
58 | Sub optimal scheduling in a grid using genetic algorithms | Parallel Computing | 2004 | 134
59 | A parallel hybrid banded system solver: the SPIKE algorithm | Parallel Computing | 2006 | 134
60 | Multiprocessor scheduling with communication delays | Parallel Computing | 1990 | 133
61 | Parallel graph component labelling with GPUs and CUDA | Parallel Computing | 2010 | 132
62 | Data communication in parallel architectures | Parallel Computing | 1989 | 129
63 | Efficient schemes for nearest neighbor load balancing | Parallel Computing | 1999 | 121
64 | A quadtree approach to domain decomposition for spatial interpolation in Grid computing environments | Parallel Computing | 2003 | 121
65 | Multilevel summation of electrostatic potentials using graphics processing units | Parallel Computing | 2009 | 118
66 | Parallel implementation of multifrontal schemes | Parallel Computing | 1986 | 117
67 | A parallel solver for large quadratic programs in training support vector machines | Parallel Computing | 2003 | 117
68 | Maximizing parallelism and minimizing synchronization with affine partitions | Parallel Computing | 1998 | 112
69 | A survey on resource allocation in high performance distributed computing systems | Parallel Computing | 2013 | 112
70 | Hiding global synchronization latency in the preconditioned Conjugate Gradient algorithm | Parallel Computing | 2014 | 107
71 | Computational solution of capacity planning models under uncertainty | Parallel Computing | 2000 | 106
72 | Cellular automata computations and secret key cryptography | Parallel Computing | 2004 | 106
73 | Scheduling for heterogeneous Systems using constrained critical paths | Parallel Computing | 2012 | 106
74 | A novel fault-tolerant scheduling algorithm for precedence constrained tasks in real-time heterogeneous systems | Parallel Computing | 2006 | 104
75 | A high performance, low complexity algorithm for compile-time task scheduling in heterogeneous systems | Parallel Computing | 2005 | 103
76 | Parallel image processing applications on a network of workstations | Parallel Computing | 1995 | 102
77 | Fault diagnosis for airplane engines using Bayesian networks and distributed particle swarm optimization | Parallel Computing | 2007 | 100
78 | Large tridiagonal and block tridiagonal linear systems on vector and parallel computers | Parallel Computing | 1987 | 99
79 | On the impact of the migration topology on the Island Model | Parallel Computing | 2010 | 99
80 | Parallel job scheduling for power constrained HPC systems | Parallel Computing | 2012 | 99
81 | Exploring weak scalability for FEM calculations on a GPU-enhanced cluster | Parallel Computing | 2007 | 98
82 | Parallel clustering algorithms | Parallel Computing | 1989 | 97
83 | Multitasking the conjugate gradient method on the CRAY X-MP/48 | Parallel Computing | 1987 | 96
84 | ScaffCC: Scalable compilation and analysis of quantum programs | Parallel Computing | 2015 | 96
85 | Parallel Gaussian elimination on an MIMD computer | Parallel Computing | 1988 | 92
86 | Performance of parallel processors | Parallel Computing | 1989 | 90
87 | Toward a better parallel performance metric | Parallel Computing | 1991 | 90
88 | The design of a standard message passing interface for distributed memory concurrent computers | Parallel Computing | 1994 | 90
89 | On the versatility of parallel sorting by regular sampling | Parallel Computing | 1993 | 89
90 | Optimizing noncontiguous accesses in MPI–IO | Parallel Computing | 2002 | 89
91 | Two-level dynamic scheduling in PARDISO: Improved scalability on shared memory multiprocessing systems | Parallel Computing | 2002 | 89
92 | Message-passing multi-cell molecular dynamics on the connection machine 5 | Parallel Computing | 1994 | 88
93 | Optimizing a conjugate gradient solver with non-blocking collective operations | Parallel Computing | 2007 | 88
94 | Distributed frameworks and parallel algorithms for processing large-scale geographic data | Parallel Computing | 2003 | 87
95 | A parallel multiphase flow code for the 3D simulation of explosive volcanic eruptions | Parallel Computing | 2007 | 85
96 | Gunrock | ACM Transactions on Parallel Computing | 2017 | 84
97 | Design and performance of a scalable parallel community climate model | Parallel Computing | 1995 | 83
98 | Optimizing data intensive GPGPU computations for DNA sequence alignment | Parallel Computing | 2009 | 83
99 | The REFINE multiprocessor — theoretical properties and algorithms | Parallel Computing | 1995 | 82
100 | Parallel optimisation algorithms for multilevel mesh partitioning | Parallel Computing | 2000 | 82