Scripting for large-scale sequencing based on Hadoop
Distribution of the number of citations over years.