What is AI? (And what are ML, DL, and GenAI?)
The clearest explanation of the AI family — zero jargon, real examples.
Everyone is confused. That's not your fault.
Open any tech news site today. You will read about "AI models," "machine learning algorithms," "deep learning networks," and "generative AI" — sometimes in the same sentence, often meaning different things, always without any explanation of how they relate to each other.
LinkedIn posts say "I'm learning AI." Job postings ask for "ML experience." News headlines scream about "deep learning breakthroughs." Everyone seems to assume you already know the difference between all of these. Almost nobody explains it clearly.
This is not a knowledge gap. It is a teaching gap. These terms are genuinely confusing because they are used interchangeably in popular media even though they refer to very different — but related — things.
Here's the one thing this page will do: by the time you finish reading, you will understand exactly what each term means, how they relate to each other, and why they are taught in a specific order. No jargon. No assumptions. Real examples throughout.
They're not separate subjects. They're nested.
Think of Russian nesting dolls — matryoshka. The outermost doll contains all the others. You can't pull out the innermost doll without also having the outer ones. AI, ML, Deep Learning, and Generative AI work the same way.
AI is the outermost layer — the broadest concept. Machine Learning lives inside AI. Deep Learning lives inside Machine Learning. Generative AI lives inside Deep Learning. Each layer is a more specific version of the one containing it.
The one sentence that unlocks everything: Every GenAI model is a DL model. Every DL model is an ML model. Every ML model is an AI system. But the reverse is not true — not every AI system uses ML, and not every ML model uses deep learning.
Artificial Intelligence — the universe
Artificial Intelligence is the broad goal: make a machine do something that normally requires human intelligence. Understanding language. Recognising faces. Making decisions under uncertainty. Solving problems. AI is the destination. The other terms describe different routes to get there.
Before machine learning existed, AI meant writing rules by hand. Programmers sat down and encoded human knowledge directly into "if this, then that" logic. This is called rule-based AI or expert systems.
```
IF email contains "lottery" AND "winner" AND "bank details" → mark as SPAM
IF email is from known_contacts list                        → mark as NOT SPAM
IF email contains "invoice" AND sender matches company_domain → mark as NOT SPAM

// A human wrote every single one of these rules.
// Add 10 million spam emails and you need 10 million rules.
```
This approach works when the rules are knowable and finite. It breaks down when the problem is too complex, too variable, or when the rules themselves keep changing.
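The rule-based approach can be sketched in a few lines of Python. Everything here is invented for illustration: the contact list, the domain, and the rules themselves, which a programmer had to write by hand.

```python
# A toy rule-based spam filter. Every rule below was written by a human;
# the addresses and domain are illustrative placeholders.
KNOWN_CONTACTS = {"alice@example.com", "bob@example.com"}
COMPANY_DOMAIN = "example.com"

def classify(sender: str, body: str) -> str:
    text = body.lower()
    if "lottery" in text and "winner" in text and "bank details" in text:
        return "SPAM"
    if sender in KNOWN_CONTACTS:
        return "NOT SPAM"
    if "invoice" in text and sender.endswith("@" + COMPANY_DOMAIN):
        return "NOT SPAM"
    return "UNKNOWN"  # every uncovered case needs yet another hand-written rule

print(classify("stranger@mail.ru", "You are a lottery WINNER! Send bank details."))
# prints: SPAM
```

Notice the failure mode: any spam pattern the programmer did not anticipate falls through to `UNKNOWN`, and fixing it means writing yet another rule.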
Machine Learning — let the data write the rules
Machine Learning is a specific approach to AI where instead of writing rules manually, you show the machine thousands — or millions — of labelled examples, and it figures out the rules itself. The machine learns from data. Hence the name.
You do not tell a spam filter what spam looks like. You show it 10 million spam emails and 10 million real ones. It discovers on its own that certain word combinations, sender patterns, and link structures are associated with spam. You never wrote a single rule about Nigerian princes.
The core shift: in rule-based AI, the programmer writes the logic. In ML, the programmer provides the data and the machine writes the logic. The output is a "model" — a mathematical function that maps inputs to outputs based on patterns learned from examples.
| | Rule-Based AI | Machine Learning |
|---|---|---|
| Input | Hand-written rules + data | Labelled examples only |
| Spam filter | List of banned words / senders | Show 10M spam + 10M real emails |
| Delivery time | Write rules for every traffic scenario | Train on 1M past deliveries with outcomes |
| Adapts to new data | Someone must update rules manually | Retrain the model on new examples |
| Works best when | Rules are knowable and finite | Patterns exist but rules are too complex to write |
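At toy scale, the "let the data write the rules" idea looks like this naive Bayes sketch. The six example messages stand in for the millions of labelled emails a real filter would train on; no rule about any specific word appears anywhere in the code.

```python
import math
from collections import Counter

# Toy training data standing in for millions of labelled emails.
spam = ["win money now", "claim your lottery prize now", "free money offer"]
ham = ["meeting moved to noon", "your invoice is attached", "lunch tomorrow"]

# The "rules" are just word frequencies learned from the examples.
spam_counts = Counter(w for msg in spam for w in msg.split())
ham_counts = Counter(w for msg in ham for w in msg.split())

def spam_score(message: str) -> float:
    """Naive Bayes-style score: positive means the words look more like spam."""
    score = 0.0
    for w in message.lower().split():
        # Laplace smoothing so a word unseen in one class doesn't zero things out.
        p_spam = (spam_counts[w] + 1) / (sum(spam_counts.values()) + 2)
        p_ham = (ham_counts[w] + 1) / (sum(ham_counts.values()) + 2)
        score += math.log(p_spam / p_ham)
    return score

print(spam_score("free lottery money") > 0)   # scores as spam-like
print(spam_score("meeting moved to noon") > 0)  # scores as ham-like
```

Swap in different training data and the same code learns different "rules": that is the core shift from rule-based AI to ML.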
Deep Learning — ML with a brain-inspired twist
Classical machine learning is powerful, but it has a limitation: you have to tell it what to look for. To build a cat classifier with classical ML, you would need to manually define features — fur texture, ear shape, whisker presence, body proportions. A human expert decides which measurements matter.
Deep Learning removes that requirement. You show it 5 million photos labelled "cat" or "not cat," and it figures out — entirely on its own — which features distinguish cats from everything else. No human wrote "check for pointy ears." The network discovered that itself.
It does this through many layers of artificial neurons stacked on top of each other. Each layer transforms its input and passes it to the next. This is where the word "deep" comes from — not intelligence, just depth of layers.
What "deep" actually means: Layer 1 detects raw edges and colour gradients in the pixels. Layer 5 recognises basic shapes. Layer 15 assembles shapes into textures — fur, scales, skin. Layer 30 recognises object parts — eyes, ears, wheels. Layer 50 recognises full objects — a cat face, a car door. No human designed these layers. They emerge from training.
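The layer-stacking mechanics can be sketched as a forward pass through a tiny untrained network. The layer sizes and random weights here are arbitrary stand-ins; in a real network the weights are learned from data, and that learning is what makes the layers detect edges, shapes, and objects.

```python
import random

random.seed(0)  # make the random "weights" reproducible

def relu(xs):
    """The standard deep-learning activation: negative values become zero."""
    return [max(0.0, v) for v in xs]

def dense(inputs, weights, biases):
    """One fully connected layer: each neuron takes a weighted sum of all inputs."""
    return [sum(w * i for w, i in zip(ws, inputs)) + b
            for ws, b in zip(weights, biases)]

def random_layer(n_in, n_out):
    return ([[random.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_out)],
            [0.0] * n_out)

# Three stacked layers: 4 inputs -> 8 -> 8 -> 2 outputs.
sizes = [4, 8, 8, 2]
layers = [random_layer(a, b) for a, b in zip(sizes, sizes[1:])]

x = [0.5, -1.2, 3.0, 0.1]  # a toy input, e.g. four pixel intensities
for weights, biases in layers:
    x = relu(dense(x, weights, biases))  # each layer feeds the next: the "depth"

print(x)  # two final scores, e.g. "cat" vs "not cat"
```

Training adjusts those random weights, via backpropagation, until the final two numbers reliably separate cats from non-cats.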
Three things had to converge before deep learning became practical:

- **Data.** DL needs millions of examples; classical ML can work with hundreds. ImageNet had 14 million labelled images — that scale is what made DL possible.
- **Compute.** Training billions of parameters requires parallel computation. GPUs — originally built for video games — turned out to be perfect for this. NVIDIA became an AI company.
- **Algorithms.** Researchers discovered tricks: ReLU activations, dropout regularisation, batch normalisation. Each solved a specific failure mode that had blocked neural networks for decades.
Generative AI — from recognising to creating
Every AI system we have described so far is discriminative — it takes an input and makes a judgement. Is this email spam? What digit is in this image? Will this customer churn? The output is a classification, a number, or a decision.
Generative AI does something fundamentally different: it creates. New text that has never been written. New images that have never been photographed. New code that has never been typed. The output is not a label — it is a new artefact.
The simplest way to think about it: all previous AI was a judge. You showed it something and it gave a verdict — real or fake, cat or dog, fraud or legitimate. Generative AI is a creator. You give it a prompt and it produces something that did not exist before. The shift from judging to creating is what makes GenAI a genuinely different category.
Large Language Models like Claude, GPT-4, and Gemini generate text one token at a time. A token is roughly a word or part of a word. Given everything that came before — your prompt plus its own previous output — the model predicts the most likely next token. It does this billions of times, very fast. What emerges reads like coherent thought.
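The next-token loop can be sketched with a toy lookup table standing in for the network. The table below is invented for illustration, and a real LLM conditions on the entire context window over a vocabulary of tens of thousands of tokens, but the generate-append-repeat loop is the same shape.

```python
# Toy "language model": for each token, the most likely next token.
# This table is a stand-in for a billion-parameter neural network.
next_token = {
    "the": "cat", "cat": "sat", "sat": "on",
    "on": "the_mat", "the_mat": "<end>",
}

def generate(prompt: str, max_tokens: int = 10) -> str:
    tokens = prompt.split()
    for _ in range(max_tokens):
        nxt = next_token.get(tokens[-1], "<end>")  # predict the next token
        if nxt == "<end>":
            break
        tokens.append(nxt)  # feed the output back in as context, then repeat
    return " ".join(tokens)

print(generate("the"))  # prints: the cat sat on the_mat
```

One prediction at a time, each conditioned on everything generated so far: scale that loop up by nine orders of magnitude and coherent paragraphs emerge.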
And Generative AI is not only text. The same "learn to generate from patterns" principle applies to every medium: Midjourney and DALL-E generate images. Suno and Udio generate music. Sora and Runway generate video. ElevenLabs generates voice. The modality changes; the core idea — learn the distribution of real data and sample from it — does not.
The timeline — from concept to ChatGPT
These ideas did not arrive fully formed. Each breakthrough enabled the next. Understanding the sequence explains why we teach things in the order we do.
- **1950–1956:** Alan Turing asks "Can machines think?" and proposes the Turing Test. John McCarthy coins the term "Artificial Intelligence" at the 1956 Dartmouth Conference. The goal is set. The tools do not yet exist.
- **1980s:** Backpropagation is formalised, allowing neural networks to learn from errors. Early networks show promise on small problems. Limited data and weak hardware mean they cannot scale, and interest fades into the "AI winter."
- **1990s–2000s:** Support Vector Machines, Random Forests, and boosting algorithms prove reliable on real problems. The internet generates data at scale for the first time. ML becomes an engineering discipline, not just an academic pursuit.
- **2012:** A deep convolutional neural network (AlexNet) wins the ImageNet competition by a margin that shocks the field — 15.3% error vs 26.2% for the next best. GPU training makes it possible. The modern deep learning era begins.
- **2017:** Google researchers publish a new neural network architecture that processes entire sequences in parallel rather than one token at a time. This architecture — the Transformer — becomes the foundation for every major language model that follows.
- **2022:** OpenAI releases ChatGPT. One million users in five days. One hundred million in two months. For the first time, a general-purpose AI system is accessible to anyone with a browser. The conversation about AI in everyday life begins in earnest.
Indian companies using each layer — right now
This is not theoretical. Every layer of the AI hierarchy is running in production at companies whose apps are on your phone right now.
**How do you show a delivery time estimate the moment someone opens the app — before they have even chosen what to order?**
Gradient boosting models trained on millions of past deliveries, incorporating time of day, weather, kitchen prep patterns, and real-time traffic. No deep learning needed.
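The gradient-boosting idea behind an ETA model like this can be sketched at toy scale: fit a sequence of simple "stumps," each one correcting the residual errors left by the ensemble so far. The data below is fabricated (one feature, distance in km, against delivery minutes); a production system would use a library such as XGBoost over millions of rows and dozens of features.

```python
# Toy 1-D gradient boosting with decision stumps.
# Fabricated data: distance in km -> delivery time in minutes.
X = [1.0, 2.0, 3.0, 5.0, 8.0, 10.0]
y = [12.0, 15.0, 18.0, 25.0, 38.0, 45.0]

def fit_stump(X, residuals):
    """Find the single split that best predicts the current residuals."""
    best = None
    for split in X:
        left = [r for xi, r in zip(X, residuals) if xi <= split]
        right = [r for xi, r in zip(X, residuals) if xi > split]
        if not left or not right:
            continue
        lm, rm = sum(left) / len(left), sum(right) / len(right)
        err = sum((r - (lm if xi <= split else rm)) ** 2
                  for xi, r in zip(X, residuals))
        if best is None or err < best[0]:
            best = (err, split, lm, rm)
    _, split, lm, rm = best
    return lambda x, s=split, a=lm, b=rm: a if x <= s else b

stumps, lr = [], 0.5  # learning rate shrinks each stump's contribution
pred = [0.0] * len(X)
for _ in range(20):
    residuals = [yi - pi for yi, pi in zip(y, pred)]  # what's still unexplained
    stump = fit_stump(X, residuals)
    stumps.append(stump)
    pred = [pi + lr * stump(xi) for pi, xi in zip(pred, X)]

def predict(x):
    return sum(lr * s(x) for s in stumps)

print(round(predict(4.0), 1))  # ETA estimate for an unseen 4 km delivery
```

Each stump is weak on its own; stacked additively, they model the distance-to-time curve, which is exactly the mechanism libraries like XGBoost industrialise.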
**How do you detect fraudulent payments in under 100 milliseconds without blocking legitimate transactions?**
Two-stage system: XGBoost flags suspicious patterns from transaction metadata, a neural net analyses behavioural sequences. Each layer catches what the other misses.
**How do you recommend relevant products to 300 million users across categories as different as groceries and laptops?**
A two-tower neural network learns separate embeddings for users and products. Similarity between embeddings determines recommendations. No explicit rules about what goes with what.
**How do you stock the right products at each dark store without running out or overstocking perishables?**
Time series forecasting using gradient boosting on historical order data, local weather, local events, and day-of-week patterns. Getting this wrong costs money every hour.
**How do you handle 10 million customer service queries per month in multiple Indian languages without proportional headcount growth?**
An LLM fine-tuned on banking knowledge with RAG (Retrieval Augmented Generation) pulling from live policy documents. The model cites sources. Humans review edge cases.
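The retrieval step in a RAG pipeline can be sketched with bag-of-words cosine similarity standing in for real embeddings. The policy snippets below are invented; a production system would use learned embedding vectors, a vector database, and then pass the retrieved text to the LLM so it can answer with citations.

```python
import math
from collections import Counter

# Invented policy snippets standing in for a live document store.
documents = [
    "Debit card replacement takes 7 working days and costs Rs 200.",
    "UPI transfer limits are Rs 100000 per day for savings accounts.",
    "Fixed deposits can be broken early with a 1 percent penalty.",
]

def bag_of_words(text):
    """Crude stand-in for an embedding model: word counts as a vector."""
    return Counter(text.lower().replace(".", "").split())

def cosine(a, b):
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb)

def retrieve(query):
    """Return the most similar document; the LLM would then answer from it."""
    q = bag_of_words(query)
    return max(documents, key=lambda d: cosine(q, bag_of_words(d)))

print(retrieve("what is the daily limit for UPI transfers"))
```

The point of RAG is that the model's answers are grounded in retrieved documents rather than its frozen training data, which is why it can cite live policy.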
**How do you normalise and categorise 100 million product listings uploaded by small sellers who use inconsistent names, photos, and descriptions?**
Convolutional neural networks classify products from images. A separate NLP model standardises titles. The result is a searchable catalogue despite messy inputs.
Which one should you learn?
The answer is all of them — and in order. This is not arbitrary. Each layer genuinely depends on the one below it. You cannot understand why deep learning works without understanding what machine learning is trying to do. You cannot understand Generative AI without understanding Transformers, which are a deep learning architecture.
The track is sequenced to respect these dependencies. Every section assumes you completed the one before it. Nothing is skippable without a cost.
1. **Foundations (math, code, data).** The ground floor. Every ML algorithm is math. Every ML implementation is code. Every ML project starts with data.
2. **Classical Machine Learning.** The bread and butter. Linear regression to XGBoost. How to measure if a model is actually good. Most production ML lives here.
3. **Deep Learning.** Neural networks from first principles to Transformers. The foundation for everything that follows.
4. **NLP and Computer Vision.** Deep learning applied to the two domains it transformed most completely. Text and images.
5. **Generative AI.** GANs, diffusion models, LLMs, RLHF, agents. Now you have the foundation to understand all of it.
6. **MLOps.** How to ship models to production and keep them working. A model nobody uses is a hobby project.
🎯 Key Takeaways
- ✓ AI is the universe — any technique that makes machines mimic human intelligence, including hand-coded rules.
- ✓ ML is a subset of AI where the machine learns rules from data instead of having rules written for it.
- ✓ Deep Learning is a subset of ML using stacked neural network layers to learn features automatically from raw data.
- ✓ Generative AI is a subset of DL where models create new content — text, images, audio — rather than just classifying inputs.
- ✓ They are nested, not parallel. Every GenAI model is a DL model. Every DL model is an ML model. The reverse is not true.
- ✓ This track teaches them in prerequisite order: each section depends on the one before it. Start from section 01.