Just attended a nice intro to R and RStudio webinare.
Here is a link to the article.
https://thomasmock.netlify.com/post/a-gentle-guide-to-tidy-statistics-in-r/
I'll try to add a link to the video in 3-4 days when it is posted.
However, here is a good start on videos
https://resources.rstudio.com/webinars
You can get access to R Studio in the cloud for free to learn at https://rstudio.cloud.
I'm an intermediate level R user with machine learning and data visualization experience if anyone ever wants to use me as an additional resource! Really happy to see more people using R and look forward to hearing any success stories. Thanks Tom!
How have you been using R at the Aquarium?
I started my position in April and have been working to clean up our data before I get to have any real fun with it, so nothing yet, unfortunately. But, I am looking forward to using R as a way to quickly summarize data and make visualizations of more complex datasets. You have exponentially more power in R for exploratory data analysis than you do in, say, Excel. It's amazing if you want advanced statistics, such as a cluster analysis. It can be especially useful when your data becomes too large to read in Excel, too.
Are you using R for any of your clean up work? Over on the Developers Group here on TessituraNetwork.com we have started a conversation about "Machine learning model for identifying duplicates". Have you tried anything like this with R? There are a few libraries out there for Record Linkage.
However, then there is the problem of what you do about the linked records once you have found them. Which one do you keep? Which to delete, and a host of other questions.
cc: Nick Reilingh
Here is a youtube showing the use of the Record Linkage Package.
https://www.youtube.com/watch?v=Msl1Q5Yv8Ow