Suppose you have rows of data in the following form:
City     Population    Average Income
--------------------------------------------------
CityA    100,000       60,000
CityB    300,000       40,000
CityC    40,000        70,000
I would like to generate one row per inhabitant, each holding a data point drawn from a given (typically normal) distribution.
Using the above example, that means generating 100,000 + 300,000 + 40,000 = 440,000 rows, each containing a hypothetical income sampled from the (typically normal) income distribution of the city in question.
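
To make the goal concrete, here is a minimal sketch of the kind of output I am after, using only Python's standard library. The table gives a mean income per city but no spread, so the standard deviation here is an assumption (a hypothetical 25% of the mean); the names `generate_rows` and `REL_STD` are made up for illustration.

```python
import random

# (city, population, average income) taken from the table above
cities = [
    ("CityA", 100_000, 60_000),
    ("CityB", 300_000, 40_000),
    ("CityC", 40_000, 70_000),
]

# Assumption: no spread is given, so the standard deviation is set to a
# hypothetical 25% of each city's mean income.
REL_STD = 0.25


def generate_rows(cities, rel_std=REL_STD, seed=0):
    """Yield one (city, income) row per inhabitant of each city."""
    rng = random.Random(seed)  # seeded so the output is reproducible
    for city, population, mean_income in cities:
        sigma = rel_std * mean_income
        for _ in range(population):
            # A normal draw can be negative; clamp at 0 as a crude guard.
            yield city, max(0.0, rng.gauss(mean_income, sigma))


rows = list(generate_rows(cities))
print(len(rows))  # 440000
```

Each row is a `(city, income)` pair, and the number of rows per city equals its population. Real incomes are usually right-skewed, so a log-normal draw might be a better fit than a normal one, but that is a modelling choice outside what the table specifies.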