Linear Fitted-Q Iteration with Multiple Reward Functions
Distribution of the number of citations over years.