Exploratory statistical analysis of relationships between water quality variables in Scottish Water drinking water treatment sites and climate, geographic and operational variables
Dataset posted on 01.01.2016 by M Valdivia-Garcia
The stored data comprise exploratory summary statistics (means, medians, variances and standard deviations) derived from water quality data held in the Scottish Water Laboratory Information Management System (LIMS), covering two years (January 2011 to January 2013) and ninety-three drinking water treatment sites. Soil data were obtained from http://www.soils-scotland.gov.uk/data/download-signup, and climatic variables such as rainfall and temperature were obtained from historical weather information available online at http://www.metoffice.gov.uk/public/weather/climate-historic/#?tab=climateHistoric. Statistical analysis was performed using Minitab 16, RStudio and MATLAB R2015b, as described in the methods section of the paper. A soil map layer, classified by carbon richness into six categories, was overlaid with median values for trihalomethanes, chloride, dibromochloromethane, turbidity and DOC using ArcMap 10.1, as described in the paper. These results were linked to the soil maps using coordinate values.
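For readers who want to reproduce this kind of per-site summary, the following is a minimal Python sketch of how means, medians, variances and standard deviations could be computed for each site and parameter. It is not the authors' workflow, which used Minitab, R and MATLAB. The record layout, the site names, the function name `summarise` and all sample values are assumptions invented for illustration and do not come from the dataset.

```python
from collections import defaultdict
from statistics import mean, median, variance, stdev

# Hypothetical records in the form (site_id, parameter, value).
# The values are invented for illustration and are NOT taken from the dataset.
records = [
    ("site_A", "turbidity", 0.2),
    ("site_A", "turbidity", 0.4),
    ("site_A", "turbidity", 0.6),
    ("site_B", "turbidity", 1.0),
    ("site_B", "turbidity", 3.0),
]


def summarise(records):
    """Group values by (site, parameter) and compute summary statistics."""
    grouped = defaultdict(list)
    for site, parameter, value in records:
        grouped[(site, parameter)].append(value)

    summary = {}
    for key, values in grouped.items():
        # Sample variance and standard deviation need at least two values.
        has_spread = len(values) > 1
        summary[key] = {
            "n": len(values),
            "mean": mean(values),
            "median": median(values),
            "variance": variance(values) if has_spread else None,
            "std_dev": stdev(values) if has_spread else None,
        }
    return summary


summary = summarise(records)
for (site, parameter), stats in sorted(summary.items()):
    print(site, parameter, stats)
```

The sketch uses the sample (n - 1) variance and standard deviation from the standard library; whether the stored values use the sample or population form is not stated in this description.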