Home
Aakash Sharan
Cancel

Notes on Iterative Deepening

Depth-First Search(DFS) In DFS, we begin at some node and goes down a path until it reaches a node that has no children. Any time DFS runs out of moves, it backtracks and expands a sibling of the ...

Enrolled in Udacity's Artificial Intelligence Nanodegree

During my Masters Degree, I took an undergraduate CS course. There we created an AI that plays the Connect 4 game. We used a couple of algorithms, I don’t remember all of them. But I do remember we...

Passed Professional Certification on Apache Cassandra

I’ve been working with Apache Cassandra for almost more than a year. And recently I felt that it’s time that I step up and get certified. So I took Datastax’s Professional Certification on Apache C...

ML Pipeline to Predict Release Year of a Song

Linear Regression is the most commonly used predictive analysis. It is used to model the relationship between a dependent variable and one or more independent variables. In this project we create a...

Word Count Application using Apache Spark

Word Count is a classic MapReduce example. So, we decided on creating a Word Count application. But what exactly is a word count application? It is a program that reads data and calculates the most...

Data Wrangling using Apache Spark

I use Apache Spark exhaustively for my independent projects, which led me to this notion of creating a project that describes how to data wrangling with Apache Spark. We used Apache Spark 2.0.2 on...

US Presidential Election 2012 Forecasting

We’ve taken the dataset from RealClearPolitics. Before we dive into analysis, lets get some understanding of United States Presidential Election. The key points are as follows: A president is e...

Training Logistic Regression in SparkR

Lets do some ML with SparkR 1.6. As the package only gives us the option to do either linear or logistic regression, so for this exercise we are going to train a logistic regression model. I have r...

Introduction to SparkR with RStudio

SparkR (R on Spark) is an R package that provides a light-weight frontend to use Apache Spark from R. For this exercise I have used RStudio and Spark 1.6.1. SparkR provides a distributed data frame...

Passed Databricks Certified Spark Developer

After completing the XSeries course on Apache Spark on edx I had been planning on giving the Databricks Spark Developer Certification. Finally, I gave and cleared the exam last week. The course on ...