Skip to content

Aakash Sharan

Agentic AI. Retrieval. Distributed Systems.

  • About
Aakash Sharan
Agentic AI. Retrieval. Distributed Systems.
  • Artificial Intelligence | Machine Learning

    Making Sense of Big Data: A Beginner’s Guide to Logistic Regression Training in SparkR

    ByAakash Sharan November 6, 2016May 7, 2023

    Hey there! As your friendly language model, I’m here to help proofread and rewrite your text! Here’s the corrected and rewritten version of your post: Let’s do some Machine Learning with SparkR 1.6! The package only gives us the option to do linear or logistic regression, so for this exercise, we’re going to train a…

    Read More Making Sense of Big Data: A Beginner’s Guide to Logistic Regression Training in SparkRContinue

  • Archive

    Empowering R Programmers: Exploring the Capabilities of SparkR with RStudio

    ByAakash Sharan November 5, 2016December 17, 2025

    Let’s talk about SparkR! It’s an R package that provides a lightweight frontend to use Apache Spark from R. I used RStudio and Spark 1.6.1 for this exercise. SparkR has a distributed data frame implementation that supports operations like selection, filtering, and more. Cool, right? In RStudio, run the following code to check the system…

    Read More Empowering R Programmers: Exploring the Capabilities of SparkR with RStudioContinue

  • Artificial Intelligence | Machine Learning

    Eliminating the Spam Menace: Building an Effective Machine Learning-Based Spam Filter

    ByAakash Sharan July 15, 2016May 7, 2023

    Hey there! Let’s talk about spam filters. You know, those annoying emails that keep showing up in your inbox, even though you never signed up for them. Yeah, those. Well, a spam filter is a program that filters out those unwanted emails and messages. Pretty cool, right? So, we’re going to build and evaluate a…

    Read More Eliminating the Spam Menace: Building an Effective Machine Learning-Based Spam FilterContinue

  • Artificial Intelligence | Machine Learning

    Transforming Data Analytics: An Honest Review of MITx’s 15.071x Course, The Analytics Edge

    ByAakash Sharan June 28, 2016May 7, 2023

    Alright, folks! The The Analytics Edge course on edX is almost over and boy, have I learned a lot about Machine Learning in the past 2 months! This MOOC is hands down the best one I’ve taken so far, and I hope my other courses can at least live up to its awesomeness. I first…

    Read More Transforming Data Analytics: An Honest Review of MITx’s 15.071x Course, The Analytics EdgeContinue

  • Programming | Scala

    Streamlining Your Scala Development: A Guide to Creating SBT Projects in Eclipse

    ByAakash Sharan June 6, 2016May 7, 2023

    Hey there, I’m currently knee-deep in a Scala course on Coursera called "Functional Programming in Scala" taught by none other than Martin Ordersky – the inventor of Scala. Let me tell you, creating a Scala ecosystem is no walk in the park. Recently, I hit a roadblock while trying to create an Eclipse project using…

    Read More Streamlining Your Scala Development: A Guide to Creating SBT Projects in EclipseContinue

  • Artificial Intelligence | Machine Learning

    MIT’s Kaggle Competition Sees Fierce Competition Among Enrolled Students, with Overfitting a Concern for Some Top Contenders

    ByAakash Sharan June 6, 2016May 7, 2023

    It’s an absolute thrill to be in the top 1% of the Kaggle competition hosted by MIT! This contest is no joke, with some seriously experienced ML implementers throwing their hats into the ring. And let me tell you, the top 3 are on a whole other level – they’ve achieved over 90% accuracy, which…

    Read More MIT’s Kaggle Competition Sees Fierce Competition Among Enrolled Students, with Overfitting a Concern for Some Top ContendersContinue

  • Artificial Intelligence | Machine Learning

    Revolutionizing Baseball Strategy: Validating Moneyball Predictions through Machine Learning Models

    ByAakash Sharan May 11, 2016May 7, 2023

    After diving into regression analysis, I couldn’t wait to test my newfound skills on some real-world data. Luckily, Kaggle has just the thing – they’re hosting a competition called ‘History of Baseball’ and, even better, they’ve provided a dataset for it! I had a blast analyzing Paul dePodesta’s predictions and statistical findings, using linear regression…

    Read More Revolutionizing Baseball Strategy: Validating Moneyball Predictions through Machine Learning ModelsContinue

Page navigation

Previous PagePrevious 1 … 5 6 7

Recent Posts

  • PART II — ReAct: The Architecture That Unified Agentic Reasoning
  • Machine Learning Types Are the Wrong Abstraction for Agentic Systems
  • Statistics: Why Intelligent Systems Must Admit Uncertainty
  • PART I — Why Most Agents Fail: The Architectural Blind Spots Behind CoT and Tool-Only Models
  • What AI Exploit Benchmarks Actually Tell Us About System Architecture

Categories

Archives

© Aakash Sharan. Original work. Attribution appreciated.

  • About