In-Class Exercise Please complete the ‘Student Research and Data Management Interests’ survey: link to survey. After class: Be sure you complete and submit the assignment by 9 am Friday Prepare for next session (assigned reading, videos, etc).
BREAKOUT EXCERCISE: FIELD DATA STORAGE AND BACKUP Robin is a graduate student studying Malaria in Tanzania. The research project requires visiting communities, collecting mosquitoes from different sources of standing water (for later identification at the FLMNH and statistical analyses of mosquito diversity and abundance) and conducting semi–structured interviews in the community, where people are asked such things as their age, family income, their access to health care, if they have ever had malaria, their family income, use of mosquito nets.
QA/QC 2: Using OpenRefine to clean data How many ways can you spell the word… .
OpenRefine is a powerful, free, and open source tool that is used to work with and clean messy data.
Metadata & Codebooks 1. Review of Other Researchers' Metadata Data repositories such as Dryad and ICPSR are designed to permanently store the data thatused in research so it is available to future scholars.
Introduction - DMPs Today in class we will start drafting the DMP assigned as part of your semester project. To do so we will use the DMP tool, to which we have access as UF Researchers.
Efficient Data Collection Breakout: Process Audit of Data Forms Look over these forms link and conduct a process audit. Where and why do you think errors are most likely to sneak in?
Transcription & Translation Breakout 1: Translation of Texts into English Download the following .txt files. Note that the link takes you to a page on your browser; use “save this page as” to download it to your computer.
Paperless Data Collection Part 1: Using EpiCollect5 to gather data. Part 1a: Collecting and Uploading Data Download the Epicollect5 app and install it on a phone or tablet.
Automated Data Extraction Part 1: Scraping Data from Tables on Websites Import Wikipedia tables into Google Sheets.
a. Open Wikipedia page of List of Countries by Population (UN)
Data, Ethics, & The Law Before we start… an example from our lab of a real data clean-up situation for which OpenRefine is ideal Part 1: Anonymizing is hard Exercise 1: Finalize this “anonymization log' for one of your interviews.