The Win-Vector LLC data science blog is starting a new series on using Spark and R to handle big data.
What we want to do with the “
Rand big data” series is:
Give a taste of some of the power of the
Share a “capabilities and readiness” checklist you should apply when evaluating infrastructure.
Start to publicly document
Describe some of the warts and how to work around them.
Share fun tricks and techniques that make working with
Sparkmuch easier and more effective.