Resources for Data Collection & Management

R Programming

Essential

  1. Hadley Wickham wrote a book on using the tidyverse and the online version is FREE. This is a phenomenal resource on using R to import, tidy, and visualize data.

  2. Posit Cheat Sheets: help with commands for using the different tidyverse packages, RStudio shortcuts and tricks, help with R commands, and more. You definitely want the ones for Data Import, Work with Strings, Factors, Data Transformation, and Base R.

  3. Where and How to ask for help

Tutorials and Books

  1. R Essential Training: Wrangling and Visualizing Data

  2. Software Carpentry: Using RStudio for Project Organization & Management

  3. Swirl

  4. R Bootcamp

  5. Kieran Healy’s Data Visualization: a practical introduction is my favorite introductory (yet super-comprehensive) book on data visualization with R. If you scroll down to the bottom of the page you can download the datasets and code used to make the figures in the book, which makes life much easier.

  6. So. Many. Resources.

  7. ROpenSci: tools for accessing, manipulating, and visualizing open data

  8. How to clean messy data in R

  9. The Ultimate Guide to Data Cleaning

  10. Learning R
    Swirl

Specific Problems in Data Cleaning and Managemnt

  1. Handling dates and times in R

  2. Text Mining: tidytext package

  3. Working with Qualtrics survey data with the qualtRics package

  4. Optical Character Recognition (OCR): extract text from images: tesseract package

  5. Extract text & metadata from pdf files: pdftools package

  6. Image processing in R: the magick package

Advanced R Packages

  1. DataCurator package: ‘a simple desktop data editor to help describe, validate and share usable open data’.

  2. RegExr: online tool to learn, build, & test Regular Expressions (RegEx / RegExp)

  3. janitor (cleanup of file names, etc.)

  4. knitr overview: reproducible documents with R

  5. qualtRics

Discipline-specific Resources

  1. historydata package: Sample data sets for historians learning R. They include population, institutional, religious, military, and prosopographical data suitable for mapping, quantitative analysis, and network analysis.

  2. The Programming Historian Website: wide range of topics, from text analysis to OpenRefine

Slide Presentations in R

  1. Make slide presentations with R

Data Archives

Qualitative Data Repository

Text Extraction and Organization

Plan for extraction and organization

Form Design

Best Practice for Form Design

Data Security

UF Office of Information Security and Compliance
Cyber Safeguards for UF
UF IRB
UF Data Classification Policy
UF Office of Information Security and Compliance

Platforms for Organization & Collaboration

Open Science Framework