Find a news article (that is of interest to you) from any trusted sources published in the last month. Formulate a research question in order to support, object to, or expand on the claim(s) in the selected news article. Find two (2) or more publicly accessible datasets on the web which can be used to answer your research question.
You may need to go through the following tasks (a)-(d) multiple times in order to arrive at a meaningful research question and findings.
(a) Identify the key claim(s) in the selected news article. Identify two (2) or more datasets that are publicly accessible. Analyse the dataset and answer the following questions. What kind of information is present in the datasets? How is the data organised and what common features can be used to relate the datasets? Are there data quality issues in the datasets (such as erroneous data, missing data, etc.)? Do you need to prepare (such as clean, transform, or manipulate) the raw data for analysis?
(b) Describe the measures that you need to calculate in order to answer your research question. Refer to Appendix 1 for an example.
(c) Apply your Python programming skills to generate summary statistics and graphical plots (or other form of visualisations) to address the stated research question. Explain your findings and observations.
(d) Identify some possible limitations of your findings. For example, are they limited to a certain city or country? Justify your assumptions about the data, if any.
Present your work for tasks (a)-(d) in your report using the template provided in Appendix 2. Provide screenshots of the relevant Python codes for each task and its output(s) where appropriate. Keep your report concise and coherent as a self-contained entity. The evaluation criteria also include logical flow of your explanation, variety of the visualisations employed and appropriate summary statistic used. More marks will be awarded to answers with in-depth analyses and practical recommendations. Limit your report file size to a maximum of 4M Bytes.
For a breakdown of the marks, please refer to Appendix 2.
Write the codes you used for implementing tasks (a)-(d) in an .ipynb file. The program should have sufficient comments to describe the corresponding steps and analyse the logical flow for each task.