AI/ML — Module 02

The AI/ML Landscape — Tools, Roles and Career Paths

Every tool mapped. Every role defined. Every career path laid out. Know exactly where you fit before you write a single line of code.

22–28 min March 2026

AI & ML Track · Module 02

Why this page exists

The AI/ML world looks enormous. It's not.

Search "how to learn machine learning" and you'll drown in tool names — TensorFlow, PyTorch, scikit-learn, XGBoost, LangChain, HuggingFace, MLflow, Kubeflow, Vertex AI, SageMaker — and job titles that all sound the same: Data Scientist, ML Engineer, Applied Scientist, AI Engineer, Research Engineer, MLOps Engineer.

Most beginners spend weeks trying to figure out which tool to learn first, which role to target, and whether they're even on the right track. This page ends that confusion permanently.

By the end of this page: you'll have a map of every major tool and where it fits. You'll know what each ML role actually does day-to-day. You'll know the exact path from where you are today to your first ML role. And you'll never feel lost looking at a job description again.

🎯 Pro Tip

You don't need to know all these tools. A working ML engineer uses 5–8 tools regularly. The rest are for specific situations. This page shows you the landscape so you can navigate it — not memorise it.

The tools

Every major ML tool — mapped and explained

The tools in ML are organised by what stage of the workflow they serve. The same workflow from the previous module — collect data, prepare it, train a model, evaluate, deploy, monitor — maps directly to the tools below. Every tool has exactly one job.

Data & Preprocessing

NumPy

Fast numerical arrays — the base layer under everything

Pandas

Load, clean and explore tabular data (CSVs, databases)

Matplotlib / Seaborn

Plot distributions, correlations, model results

Scikit-learn

Preprocessing (scalers, encoders, imputers) + classical ML

Classical Machine Learning

Scikit-learn

The universal classical ML library — every algorithm in one API

XGBoost

Gradient boosting — wins most tabular ML competitions

LightGBM

Faster XGBoost — handles categorical features natively

CatBoost

Best for datasets with many categorical columns

Deep Learning Frameworks

PyTorch

Most popular DL framework in research and industry (2024+)

TensorFlow / Keras

Google's DL framework — strong for production serving

JAX

Research-focused — used by DeepMind, Google Brain

FastAI

High-level PyTorch wrapper — best for learning DL quickly

NLP & Language Models

HuggingFace Transformers

Access every pretrained model (BERT, GPT-2, LLaMA) in 3 lines

LangChain

Build LLM apps — chains, agents, RAG pipelines

LlamaIndex

Connect LLMs to your own data and documents

OpenAI / Anthropic APIs

Production LLM access — GPT-4, Claude, Gemini

MLOps & Production

MLflow

Track experiments, log metrics, register models

Weights & Biases

Richer experiment tracking — dashboards, sweeps

FastAPI

Wrap any model in a REST API in under 20 lines

Docker + Kubernetes

Containerise and scale model serving

Cloud ML Platforms

Azure ML

Microsoft's managed ML platform — connects to your Azure data

AWS SageMaker

Amazon's end-to-end ML platform — training to serving

GCP Vertex AI

Google's ML platform — strong AutoML and BigQuery ML

Databricks

Unified analytics + ML on Spark — popular in large enterprises

The learning order that works: NumPy → Pandas → Matplotlib → scikit-learn → XGBoost → PyTorch → HuggingFace → FastAPI + Docker. That sequence takes you from zero to employable. Every other tool you learn when a specific job requires it.

The roles

Every ML role — what they actually do

Job titles in ML are inconsistent. One company's "Data Scientist" is another company's "ML Engineer." Here's what each role actually means in terms of daily work — based on real Indian job descriptions, not generic definitions.

Machine Learning Engineer

₹18–28 LPA

Build, train, and deploy ML models to production

Day to day

Train and evaluate classification or regression models

Build data pipelines that feed the model clean features

Deploy models as REST APIs serving real-time predictions

Monitor production model accuracy and retrain on schedule

Key skills

Pythonsklearn / XGBoostDockerFastAPISQL

Hires in India

DoorDash, Stripe, Instacart, Shopify, Brex

Data Scientist

₹14–24 LPA

Answer business questions with data and models

Day to day

Analyse data to find patterns the business should act on

Build models that answer specific product questions

Run A/B tests and measure statistical significance

Present findings with charts and plain-English explanations

Key skills

PythonStatisticsSQLPandasStorytelling

Hires in India

Amazon, Ola, Sephora, Uber Eats, Expedia

Applied Scientist / Research Scientist

₹22–40 LPA

Apply cutting-edge research to hard product problems

Day to day

Read and implement techniques from recent papers

Run large-scale experiments to evaluate new architectures

Collaborate with product teams to define ML problem framing

Mentor ML Engineers on advanced implementation

Key skills

Deep LearningPyTorchResearch papersMathsPython

Hires in India

Google, Microsoft, Amazon, Samsung R&D, Qualcomm India

MLOps / ML Platform Engineer

₹16–28 LPA

Build the infrastructure that keeps ML systems running

Day to day

Build and maintain ML training and serving pipelines

Set up experiment tracking and model registry systems

Automate retraining when model accuracy degrades

Reduce model serving latency and infrastructure cost

Key skills

DockerKubernetesMLflowPythonCI/CD

Hires in India

Uber, Grab, Braintree, Venmo, Zendesk

GenAI / LLM Engineer

₹20–35 LPA

Build products powered by large language models

Day to day

Build RAG pipelines that connect LLMs to company data

Evaluate and compare LLM providers for quality and cost

Fine-tune open-source models on domain-specific data

Build and test multi-step AI agents with tool use

Key skills

LangChainRAGPrompt EngineeringFastAPIVector DBs

Hires in India

Every Indian startup building AI features right now

Data Analyst (ML-adjacent)

₹8–16 LPA

Surface insights from data using SQL and basic ML

Day to day

Query databases to answer ad-hoc business questions

Build dashboards tracking key product and revenue metrics

Identify anomalies and trends in user behaviour data

Support ML Engineers with labelling and data validation

Key skills

SQLExcel / Google SheetsPower BI / TableauPython basics

Hires in India

Any Indian company with data — literally all of them

💡 Note

Salary ranges are mid-level (3–6 years experience) at product companies in Seattle. Service companies (Accenture, Deloitte, KPMG) pay 30–40% less. Startups vary wildly. FAANG/GCC roles pay 2–3× these numbers. Freshers start at roughly 50–60% of these figures.

Choosing your role

Which role is right for you?

Don't pick a role by salary alone. Pick by what the day-to-day looks like. These four questions will tell you immediately:

Do you want to write production code and ship systems?

ML Engineer or MLOps Engineer — you're building and deploying models, not just training them in notebooks.

Do you want to answer business questions and present to stakeholders?

Data Scientist — you're closer to the business, doing analysis, A/B tests, and explaining results to non-technical people.

Do you want to work on cutting-edge research and hard technical problems?

Applied Scientist — you need a strong maths background and enjoy reading research papers. Usually requires a Masters or PhD.

Do you want to build AI products with LLMs right now, quickly?

GenAI / LLM Engineer — the fastest-growing role in 2024–2026. Heavy demand, strong pay, and the technical bar is lower than Applied Scientist.

If you still can't decide: aim for ML Engineer. It's the most versatile role — you can pivot to Data Scientist, GenAI Engineer, or MLOps from there. It also has the clearest skill requirements, making it the easiest role to prepare for from scratch.

Career paths

Your exact path — based on where you are today

Four realistic paths. Each one is based on real people who made the transition — not on optimistic YouTube advice. The timelines assume 2–3 hours of focused study per day.

Complete fresher — CS/IT graduate

Python basics (2–3 weeks)

NumPy + Pandas + Matplotlib (2 weeks)

Classical ML with sklearn (6 weeks)

One end-to-end project on Kaggle

Apply for Junior ML Engineer or Data Analyst roles

Realistic timeline4–5 months to first job

Non-IT background switcher (MBA, Science, Arts)

Python from scratch (3–4 weeks)

SQL fundamentals (2 weeks)

Classical ML + evaluation (8 weeks)

Domain-specific project (your old field + ML)

Target Data Analyst or Junior Data Scientist roles

Realistic timeline5–6 months to first role

Software engineer moving into ML

Skip Python basics — you know programming

NumPy + sklearn + ML concepts (3 weeks)

Deep Learning basics — PyTorch (4 weeks)

MLOps — Docker, FastAPI, model serving (3 weeks)

Apply for ML Engineer roles directly

Realistic timeline2–3 months to first ML role

Already in data (analyst / BI / data engineer)

Skip data fundamentals — you know SQL and data

Classical ML with sklearn (4 weeks)

Model evaluation and hyperparameter tuning (2 weeks)

One production-style project using your existing domain

Apply for Data Scientist or ML Engineer roles

Realistic timeline2 months to transition

⚠️ Important

These timelines assume consistent daily practice, not binge-studying for a weekend then stopping for two weeks. 2 hours every day beats 14 hours every Saturday. Consistency matters more than intensity in learning ML.

What to do next

Pick your role. Start the track.

You now have the full map. You know what every tool does, what every role involves, and the exact path from where you are to where you want to be.

The one thing that separates people who get ML jobs from people who don't is not intelligence, not a degree, and not which bootcamp they paid for. It's whether they built something real and can explain every decision they made. That's what this track is designed to produce.

Starting now: the next module begins with the math — specifically vectors and matrices. Don't skip it. Every algorithm in this track uses matrix operations. Understanding them visually before seeing the code is what makes the rest of the track click into place instead of feeling like memorisation.

🎯 Key Takeaways

✓ML tools are organised by workflow stage — data, classical ML, deep learning, NLP/LLMs, MLOps, cloud. Learn them in that order.
✓The learning path that works: NumPy → Pandas → sklearn → XGBoost → PyTorch → HuggingFace → FastAPI + Docker.
✓ML Engineer = build and ship systems. Data Scientist = answer business questions. Applied Scientist = research. GenAI Engineer = LLM products.
✓Salary ranges (mid-level Seattle): ML Engineer ₹18–28 LPA, Data Scientist ₹14–24 LPA, Applied Scientist ₹22–40 LPA, GenAI Engineer ₹20–35 LPA.
✓If you cannot decide which role — target ML Engineer. It is the most versatile entry point and has the clearest preparation path.
✓Realistic timelines: fresher 4–5 months, career switcher 5–6 months, SWE to ML 2–3 months, data professional to ML 2 months.

Discussion

Have a better approach? Found something outdated? Share it — your knowledge helps everyone learning here.

Continue with GitHub