New blog series from Win-Vector LLC on R, Spark, and big data

The Win-Vector LLC data science blog is starting a new series on using Spark and R to handle big data.

Our goal

What we want to do with the “R and big data” series is:

  • Give a taste of some of the power of the R/Spark combination.

  • Share a “capabilities and readiness” checklist you should apply when evaluating infrastructure.

  • Start to publicly document R/Spark best practices.

  • Describe some of the warts and how to work around them.

  • Share fun tricks and techniques that make working with R/Spark much easier and more effective.

Leave a Reply

Please log in using one of these methods to post your comment: Logo

You are commenting using your account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s