class: center, middle, inverse, title-slide .title[ # Recap and Looking Beyond IDS ] .subtitle[ ##
Introduction to Data Science ] .author[ ### University of Edinburgh ] .date[ ###
2024/2025 ] --- layout: true <div class="my-footer"> <span> University of Edinburgh </span> </div> <!-- --- --> <!-- ## Reminders --> <!-- * **9.00-10.30** --> <!-- - Venue1: JCMB 4325c (Groups: 18, 2,20,21,22,23) --> <!-- - Venue2: JCMB 5326 (Groups: 42,43,45,47,48). --> <!-- * **10.30-12.00** --> <!-- - Venue1: JCMB 4325c (Groups: 13, 14, 16, 24, 25, 55) --> <!-- - Venue2: JCMB 5326 (Groups: 27, 7, 9, 52, 26). --> <!-- * **12.00-13.30** --> <!-- - Venue1: JCMB 4325c (Groups: 1,5,6,8,56) --> <!-- - Venue2: JCMB 5326 (Groups: 12,15,19,28,57) --> <!-- - Venue3: JCMB 4325b (Groups: 29,30,41,44,49). --> <!-- * **13.30-15.00** --> <!-- - Venue1: JCMB 1206c (Groups: 10, 11, 17, 3, 53, 54) --> <!-- - Venue2: JCMB 5326 (Groups: 31, 32, 33, 34, 35, 36) --> <!-- - Venue3: JCMB 4325b (Groups: 37, 38, 39, 4, 40, 46, 51). --> --- ## Reminders - Check your html file knits and loads properly -- - HDMI port on laptop, if you need converter let us know -- - Don't forget your peer evaluation form -- - Don't forget to add the course GitHub page to your project <!-- -- --> <!-- - Try to complete each requirement step by step and do submission on time to avoid extra stress! --> <!-- --- --> <!-- ## Topics for today --> <!-- - Recap - what did people find difficult? --> <!-- - Where can you go for more information? --> <!-- - Next steps in R --> <!-- - What else for Data Science? --> --- ## What have you learnt? | Week | Topic | |------|-------------------------------------------| | 1 | Welcome and Toolkit | | 2 | Importing and recoding data | | 3 | Wrangling and tidying data | | 4 | Visualising data | | 5 | Communicating effectively | | 6 | Ethics | | 7 | Programming and Functions | | 8 | Predictive Modelling | | 9 | Classification and model building | | 10 | Validation and uncertainty quantification | | 11 | Looking beyond IDS | <!-- --- ---> <!-- class:middle ---> <!-- ## IDS overall reflection ---> <!-- (hopefully) Positive impacts ---> <!-- - You worked on generally the basics of data science pipeline in R language ---> <!-- - Getting familiar with main steps of data analysis in general ---> <!-- - IDS can be a good starting point for your personal further interest in Data Science ---> <!-- (Can be) Limited ---> <!-- - Not so much space for statistical terms and methods (definitely will learn more later) ---> <!-- - Maybe some re-ordering of the weekly topics can be discussed ---> <!-- - Further details on modeling part is a must (you will learn later) ---> <!-- What do you want to add ? ---> <!-- - Please fill out end-of-semester feedback! ---> --- class:middle ## Looking Beyond IDS --- ## Books - I <div class="figure" style="text-align: center"> <img src="img/IMS.png" alt=" " width="25%" /><img src="img/R4DS.jpeg" alt=" " width="25%" /><img src="img/AdvancedR.png" alt=" " width="25%" /> <p class="caption"> </p> </div> --- ## Books - II <div class="figure" style="text-align: center"> <img src="img/StatLiteracy.jpeg" alt=" " width="25%" /><img src="img/ArtofStats.jpg" alt=" " width="25%" /><img src="img/StatswithoutTears.jpg" alt=" " width="25%" /> <p class="caption"> </p> </div> --- ## Courses - [Statistics (year 2)](http://www.drps.ed.ac.uk/23-24/dpt/cxmath08051.htm) * Estimators, including MLEs * Hypothesis Testing * Linear Regression with theory -- - [Statistical methodology (year 3)](http://www.drps.ed.ac.uk/23-24/dpt/cxmath10095.htm) * maximum likelihood estimation * likelihood ratio tests * Bayes theorem and posterior distribution -- - [Statistical computing (year 3)](http://www.drps.ed.ac.uk/23-24/dpt/cxmath10093.htm) * optimising functions in R * programming statistical techniques and interpreting the results (including bootstrap algorithms) --- ## Courses - [Machine Learning in Python (year 4)](http://www.drps.ed.ac.uk/23-24/dpt/cxmath11205.htm) * Supervised and Unsupervised learning methods * Coding with Python, two different projects. -- - [Statistical Case Studies (year 4)](http://www.drps.ed.ac.uk/23-24/dpt/cxmath10102.htm) * Two real-world statistical projects that are meant to simulate consultancy-style work in a business/research environment -- - [Mathematics Project (year 4)](http://www.drps.ed.ac.uk/24-25/dpt/cxmath10063.htm) --- class:middle ## R capabilities: Special case --- ## Shiny Apps .pull-left[ - Shiny is an R package that makes it easy to build interactive web apps straight from R - You can host standalone apps on a webpage or embed them in R Markdown documents or build dashboards - You can also extend your Shiny apps with CSS themes, htmlwidgets, and JavaScript actions - Learn more at [shiny.rstudio.com](https://shiny.rstudio.com/) ] .pull-right[ <img src="img/shiny.png" width="60%" style="display: block; margin: auto auto auto 0;" /> ] --- ## High level view - Every Shiny app has a webpage that the user visits, and behind this webpage there is a computer that serves this webpage by running R - When running your app locally, the computer serving your app is your computer - When your app is deployed, the computer serving your app is a web server - If you want to try making your own, try working through the reading [here](https://mastering-shiny.org/basic-app.html) --- ## Some others - Dynamic reporting with Rmd files with different programming language - Widely considered the best tool for making beautiful graphs and visualizations. - Possible to find advanced statistics packages for high-level modeling. -- **Key message**: If you learn a programming language to some extent, you can learn other tools intuitively even if there are some differences on syntax. The battle of R vs Python is already over! <img src="img/RPyhton.png" width="20%" style="display: block; margin: auto;" /> --- class:middle ## Research in Data Science --- ## Turing Institute <img src="img/Turing.png" width="100%" style="display: block; margin: auto;" /> --- class:middle ## Beyond your courses --- ## Royal Statistical Society They offer free membership to all students in full-time education with an interest in statistics and data science. The membership is called e-student so find out [benefits from here](https://rss.org.uk/membership/join-now/e-student/) <img src="img/RSSlogo.jpeg" width="50%" style="display: block; margin: auto;" /> --- ## Digital Skills Programme (UoE) They offer various different training courses to develop your digital skills - Introduction to Data Science - Basic Data Visualizations - Data and AI Ethics - Introduction to R or Python You can find more details from [digital skills programme for students](https://information-services.ed.ac.uk/help-consultancy/is-skills/classroom-based-courses-and-webinars) --- ## Good luck! for your project presentations and career after IDS... Please take some time to complete the end-of-semester feedback survey. You can access it [here](https://forms.office.com/e/38hF4Da3Bs) <img src="img/QRCodeIDSfeedback.png" width="30%" style="display: block; margin: auto;" />