Using the environmental data for each province in Canada, calculate the mean temperature and mean precipitation for all of Canada, both annually and for each month. Weight each province's values by the number of cities in that province.
Environmental - v - assignment.xlsx
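One way to read the weighting requirement is as a standard weighted mean. The symbols below are assumptions introduced for illustration and do not appear in the spreadsheet: $x_{p,m}$ is the value (temperature or precipitation) for province $p$ in period $m$ (annual or one month), and $n_p$ is the number of cities listed for province $p$.

```latex
\[
\bar{x}_{m} \;=\; \frac{\sum_{p} n_{p}\, x_{p,m}}{\sum_{p} n_{p}}
\]
```

For example, with two provinces having 1 and 3 cities and values 10 and 20, the weighted mean would be (1·10 + 3·20) / (1 + 3) = 17.5.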
In each case, write the program in Python using PySpark. It makes sense to treat the data as matrices and the computation as matrix manipulation. Do not assume that all of the input data resides on the same node.
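A minimal sketch of how the weighted mean might be expressed so that it does not depend on the data sitting on one node. The row layout `(province, n_cities, values)`, the function names, and the sample numbers are all assumptions for illustration; the real spreadsheet will need its own parsing. The map and reduce steps are written as plain functions so they can be handed to an RDD's `map` and `reduce`.

```python
from functools import reduce

# Hypothetical row layout (an assumption, not the spreadsheet's format):
# (province, n_cities, [annual, jan, feb, ..., dec])

def to_weighted(row):
    """Map step: scale one province's values by its city count."""
    _province, n_cities, values = row
    return ([n_cities * v for v in values], n_cities)

def combine(a, b):
    """Reduce step: element-wise sum of weighted values and sum of weights.

    The operation is associative and commutative, so partial results
    computed on different partitions can be merged in any order.
    """
    sums_a, weight_a = a
    sums_b, weight_b = b
    return ([x + y for x, y in zip(sums_a, sums_b)], weight_a + weight_b)

def finish(total):
    """Divide the summed weighted values by the total weight."""
    sums, weight = total
    return [s / weight for s in sums]

def weighted_mean_rdd(rdd):
    """Weighted mean over anything exposing map() and reduce() like an RDD."""
    return finish(rdd.map(to_weighted).reduce(combine))

def weighted_mean_local(rows):
    """Same computation on a plain Python list, handy for checking results."""
    return finish(reduce(combine, map(to_weighted, rows)))

def run_on_spark(rows, num_slices=2):
    """Distribute the rows across partitions and compute the weighted mean.

    Requires PySpark; the import is local so the helpers above can be
    checked without a Spark installation.
    """
    from pyspark.sql import SparkSession
    spark = SparkSession.builder.appName("canada-weighted-mean").getOrCreate()
    try:
        rdd = spark.sparkContext.parallelize(rows, numSlices=num_slices)
        return weighted_mean_rdd(rdd)
    finally:
        spark.stop()

# Made-up numbers for illustration only (two periods instead of thirteen).
sample_rows = [
    ("Province A", 1, [10.0, 20.0]),
    ("Province B", 3, [20.0, 40.0]),
]
print(weighted_mean_local(sample_rows))
```

Calling `run_on_spark(sample_rows)` should give the same result as the local version. Because each partition only ever produces a pair of (weighted sums, total weight), the reduction works regardless of how the provinces are split across nodes. The same functions could be applied once to the temperature matrix and once to the precipitation matrix.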
Your submission should include the Spark program and the answers. The answers will probably be a matrix of the same form as the input matrices, but without the rows for high and low temperatures and without the columns for number of cities and years.