Make progress on projects

This week you’ll be working on your projects.

Before we start…

To assist with the evaluation of the reproducibility and organization of your repository, please make sure to include the name, student ID and Github username of each team member in the README.md file of your project repository. To do so, please include a text like the one provided below in your README.md file - make sure to replace the text with the correct names and IDs!

## Team members

- Jon Snow (s1234567), johnsnow123
- Arya Stark (s1122334), thefaceless
- Daenerys Targaryen (s4433221), dragon_queen
- Tyrion Lannister (s1111111), t.lannister

Logistics

Review evaluation criteria: When evaluating your project, we will consider the following questions:

Presentation (50 points)

Write-up (30 points)

Reproducibility and Organization (10 points)

Team peer evaluation (10 points)

Project progress

Now back to your project…

  1. Craft your to-do list: Discuss your plan for your project as a team, review existing issues and open new ones as needed. Not every issue needs to have a checklist, but you might want to include checklists in some of them to remind yourselves the exact steps you discussed to tackle the issue. Then assign at least one issue to each team member.

  2. Cite your data: Now is the time to fix up those citations! In your project summary there is a link to a resource for properly citing data. Develop a citation for your dataset and add it under the data section using this guidance. If you have questions, ask a tutor for help!

  3. Strongly recommended: Get a hold of a tutor and run your ideas by them. Give them a 30 second version of your presentation and ask for their feedback.


The following might be useful when writing your report.

Demonstrating key IDS skills

Between your presentation and report, you should be able to demonstrate your skills with each component of the course as part of your investigation. Some specific topics that we will be looking for include:

It is not expected that each individual person to demonstrate their own skills in each of the above areas. However, it is important that you work as a group and to pull on each team member’s strengths so that you, as a group, demonstrate engagement with the key IDS skills. For example, one member may be more skilled than the others in wrangling and summarising data, another may be more suited in constructing and evaluating models, whereas another may be more capable at communicating results effectively. Pull on each other’s strengths to make a fair contribution to your project.

Report structure

The report should be a summary of the investigation that you have performed. It is suggested that you follow the standard IMRD structure (Introduction, Methods, Results, Discussion). Use headings and subheadings for clarity if you need in your Rmd file. The type of questions/points that you should try to address in each section are:

Introduction – What is your data? What the research question(s) you are investigating? Is there any relevant background information/literature that can help give context to your data/questions?

Methods – Summarise what cleaning/pre-processing you have done to your data. Brief description of data science techniques you have used in your investigation.

Results – Tables, figures and summary values from your exploratory investigation. Summary of a fitted statistical model, such as estimates, plots and fit statistic. A brief explanation of what the results mean in the given context of your data.

Discussion – Answers your research question(s). What are the advantages/limitations of the investigation you have performed? If applicable, are there any ethical considerations in relation to your investigation/findings?

Make sure to also check these technical aspects:

General Presentation Tips