In this assignment, you will use statistical tests for non-normal data. You may use methods (non-parametric statistics tests) and tools (R, Excel, or SPSS) of your own choice – please don’t rely on one tool or method, variety is expected. It is not necessary to replicate any test you carry out, ie if you perform a test in R it is not necessary to repeat in SPSS and/or Excel. A data file (from the 2016 Census of Ireland) is suggested, though students are permitted to choose a different file if they wish (subject to approval by Dr O’Loughlin/Enda Stafford/Sidra Bashir/Theo Mendonca).
The first file contains raw data based on the 2016 Census of Ireland. The second file contains information such as location names and IDs. You should be able to combine both data sets into one using the GUID field.
The Small Areas CSV file has 18,641 records based on 68 columns of data. You are not expected to use all the data in the file and you may reduce to eliminate unused data if you wish. As there are a lot of data in this file, please be careful about what you decide to report on – it is up to you to choose.
Be aware that this is a statistical report and that Null/Alternate hypotheses, justification of levels of significance, correct reporting of results, and explanations of results are expected (see 8 Simple Rules document in Moodle). Please also explain and justify any statistical test used. State clearly any assumptions made.