Breaking Down the Beats: A Comprehensive Guide to Using ML Pipelines to Predict Song Release Years Linear Regression is the most commonly used predictive analysis. It is used to model the relationship between a dependent variable and one or more independent... Archive 01/12/2016
Unlocking the Power of MapReduce: Using Python and Apache Spark for Enhanced Data Processing Hey there! So we decided to create a Word Count application - a classic MapReduce example. But what the heck is a Word Count application,... Archive 26/11/2016
Data Wrangling Made Easy: Leveraging Apache Spark to Transform Raw Data into Valuable Insights Hey, so I'm a big fan of Apache Spark and I've been using it for all of my independent projects. I recently had this idea... Archive 20/11/2016
The Art of Election Forecasting: Analyzing the 2012 US Presidential Election with Data Science Hey there! Let's talk about this dataset from RealClearPolitics and the US Presidential Election. Before we dive in, let's get on the same page about... Archive 08/11/2016
Making Sense of Big Data: A Beginner’s Guide to Logistic Regression Training in SparkR Hey there! As your friendly language model, I'm here to help proofread and rewrite your text! Here's the corrected and rewritten version of your post:... Archive 06/11/2016
Empowering R Programmers: Exploring the Capabilities of SparkR with RStudio Let's talk about SparkR! It's an R package that provides a lightweight frontend to use Apache Spark from R. I used RStudio and Spark 1.6.1... Archive 05/11/2016
Eliminating the Spam Menace: Building an Effective Machine Learning-Based Spam Filter Hey there! Let's talk about spam filters. You know, those annoying emails that keep showing up in your inbox, even though you never signed up... Archive 15/07/2016
Transforming Data Analytics: An Honest Review of MITx’s 15.071x Course, The Analytics Edge Alright, folks! The The Analytics Edge course on edX is almost over and boy, have I learned a lot about Machine Learning in the past... Archive 28/06/2016
Streamlining Your Scala Development: A Guide to Creating SBT Projects in Eclipse Hey there, I'm currently knee-deep in a Scala course on Coursera called "Functional Programming in Scala" taught by none other than Martin Ordersky - the... Archive 06/06/2016
MIT’s Kaggle Competition Sees Fierce Competition Among Enrolled Students, with Overfitting a Concern for Some Top Contenders It's an absolute thrill to be in the top 1% of the Kaggle competition hosted by MIT! This contest is no joke, with some seriously... Archive 06/06/2016