Signed in as:
filler@godaddy.com
Signed in as:
filler@godaddy.com
The most surprising part of this course was the detailed data munging (data cleaning) that had to be done to get the data ready for use in R. Great emphasis was placed on not removing data unless there was ample reason to consider it tainted (i.e., not merely because an attention check was missed or the data point was an outlier). Additionally, extreme care and impartiality were used when translating open-response (qualitative) data into numerical values R could interpret.
Finally, the opportunity to apply exploratory data analysis to real-world data was such a great learning experience. Testing variables for normality and comparing data sets with and without observations that missed the attention or effort checks gave me increased familiarity with R coding and graphical interpretation, besides providing valuable insights about the effects of certain flagged observations. Our team also examined outliers, demographics, and correlations. Without a doubt, this course has been a valuable and memorable learning experience.
- Faith V., Student at Ouachita Baptist University, 2021
Over the summer, I was able to get hands-on experience in research and data analytics while studying the integration of Slack into higher education. I was able to apply my introductory knowledge of R to help clean data sets, organize groups of variables, and prepare plots and graphs. The biggest thing that I learned throughout the course of this summer was the importance of having sound data in the beginning. Research will eventually fall through and not hold up to statistical testing if it isn’t collected properly.
I also learned that even though it is extremely time-consuming, cleaning the data properly is an essential step in the research process. Without clean, organized data, you can’t even begin to perform tests or code graphs. In addition to learning how to properly clean data, I also learned how crucial graphs and plots can be. A graph is able to take hundreds of seemingly meaningless data points and put them together and tell a story or demonstrate a relationship. R is an amazing coding language for graphing because of the ggplot2 software in the tidyverse package. I was able to see firsthand how to create complex QQ plots that integrate different variables, and these graphs helped piece the data together. Ultimately, this summer showed me that the process of research starts with data, deals with lots of data and ends with trying to demonstrate how that data is correlated.
- Jeff M., Student at Ouachita Baptist University, 2021
© 2015-2024,
justinbkeeler.com
Website Updated 05-23-2024