7 Wrap-up and further resources
Let’s look back at what we were aiming to do today:
In this tutorial, we will introduce the audience to the R statistical programming environment and the RStudio Interactive Development Environment (IDE) with the aim of developing sufficient basic skills to conduct multivariate analyses (like Correspondence Analysis) on sensory and consumer datasets. We will provide a learning dataset for the analysis—a set of free response comments and overall liking scores from a central location test on berries. We will teach participants how to import, manipulate, and plot data using user-friendly, “tidy” R programming. All resources used in the tutorial are open-source and will remain available to attendees, including an R script covering the full workflow.
At the end of the tutorial, attendees will be able to prepare raw sensory data for common multivariate analyses or visual representations in R.
We have managed to touch on all of these topics, but of course we have taken the most cursory look at each. I hope what we’ve gone over today has inspired you, sure, but I mostly hope it has shown you how much you can do with just a little knowledge. My journey in terms of learning data science with R has been all about building my coding ability incrementally. My code looks more like this than anything else, but I am able to get so much done:
By developing your ability to code (in R or Python, or whatever works for you–Julia?) you will open up a whole set of analyses that you would otherwise be unable to access.
7.1 Getting help
- Look up the help file for whatever you’re doing. Do this by using the syntax ?<search item> (for example, ?c gets help on the vector command) as a shortcut on the console.
- Search the help files for a term you think is related. Can’t remember the command for making a sequence of integers? Go to the “Help” pane in RStudio and search in the search box for “sequence”. See if some of the top results get you what you need.
- The internet. Seriously. I am not kidding even a little bit. R has one of the most active and (surprisingly) helpful user communities I’ve ever encountered. Try going to Google and searching for “How do I make a sequence of numbers in R?” You will find quite a bit of useful help. I find the following sites particularly helpful:
- Stack Overflow
- Cross Validated/Stack Exchange
- Seriously, Google will get you most of the way to helpful answers for many basic R questions.
I want to emphasize that looking up help is normal. I do it all the time. Learning to ask questions in helpful ways, how to quickly parse the information you find, and how to slightly alter the answers to suit your particular situation are key skills.
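As a minimal sketch of the first two tips above, here is roughly what they look like when typed at the console. The object names odd_numbers and first_five are just illustrative, and the help calls are wrapped in interactive() only so the script also runs cleanly outside RStudio.

```r
# Help pages open in a viewer or pager, so only ask for them in an
# interactive session (for example, the RStudio console).
if (interactive()) {
  ?c                       # open the help file for c(), the vector command
  help("c")                # the same request, written out in full
  ??sequence               # search the installed help files for "sequence"
  help.search("sequence")  # the long form of the ?? shortcut
}

# seq() should be among the search results; try it out.
odd_numbers <- seq(from = 1, to = 9, by = 2)
first_five <- 1:5          # the colon shortcut for consecutive integers

print(odd_numbers)
print(first_five)
```

Once a search points you to a likely function such as seq(), opening its help file with ?seq and trying the examples at the bottom of the page is usually the quickest way to check that it does what you need.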
7.2 Learning more with Sensometrics Society
This workshop was organized and sponsored by the Sensometrics Society. Want to learn more? We are hosting our biennial conference in Paris 3-6 June, 2024:
In general, Sensometrics has a focus on methods and skills for the analysis of sensory data. If you’re interested in contributing to this focus (or just participating), please see our call for papers:
Please submit an abstract for either an oral or poster presentation by following the instructions on the conference website. All submissions need to be received by December 15, 2023. Authors of all accepted contributions are invited to submit a full paper for inclusion in a virtual special issue of Food Quality and Preference (FQAP).
7.3 Further reading/resources
- General R programming
- Data Carpentry’s R for Social Scientists (and, really, the courses from The Carpentries in general)
- Wickham & Grolemund’s R for Data Science
- The stat545 course website
- Healy’s Data Visualization
- My own (somewhat opinionated and eccentric) course from VT: FST 5984
- Text analysis
We will also be presenting more coding demonstrations in R this week, as part of the “Applying Natural Language Processing tools for sensory and consumer data” workshop from 15:45-17:15 on August 23rd in Room 200! We’d love to see you there if you’re at all interested in working with text data (like the free response comments in the berry_data).