Skip to content
Aakash Sharan
  • Home
  • Blog
  • Papers

Tag: apache spark

Archive

Breaking Down the Beats: A Comprehensive Guide to Using ML Pipelines to Predict Song Release Years

Linear Regression is the most commonly used predictive analysis. It is used to model the relationship between a dependent variable and one or more independent...

Archive
01/12/2016

Unlocking the Power of MapReduce: Using Python and Apache Spark for Enhanced Data Processing

Hey there! So we decided to create a Word Count application - a classic MapReduce example. But what the heck is a Word Count application,...

Archive
26/11/2016

Data Wrangling Made Easy: Leveraging Apache Spark to Transform Raw Data into Valuable Insights

Hey, so I'm a big fan of Apache Spark and I've been using it for all of my independent projects. I recently had this idea...

Archive
20/11/2016

Making Sense of Big Data: A Beginner’s Guide to Logistic Regression Training in SparkR

Hey there! As your friendly language model, I'm here to help proofread and rewrite your text! Here's the corrected and rewritten version of your post:...

Archive
06/11/2016

Empowering R Programmers: Exploring the Capabilities of SparkR with RStudio

Let's talk about SparkR! It's an R package that provides a lightweight frontend to use Apache Spark from R. I used RStudio and Spark 1.6.1...

Archive
05/11/2016

© 2026 Aakash Sharan. All rights reserved.