The Math Behind Machine Learning: A Primer

December 12, 2022

David Ernesto Kim

At its core, machine learning is powered by mathematics. Behind every predictive model and intelligent system are well-established mathematical principles. For software engineers aiming to go deeper into machine learning, understanding the math isn't just helpful — it's essential for building better models, diagnosing problems, and innovating beyond standard libraries.

The Key Mathematical Building Blocks

Some of the foundational areas of math that drive machine learning include:

Linear Algebra: Vectors, matrices, eigenvalues — essential for representing data and transformations in models like neural networks and PCA.
Calculus: Especially gradients and partial derivatives, critical for optimization techniques like gradient descent.
Probability and Statistics: Understanding distributions, likelihoods, and uncertainty helps in designing models and evaluating their performance.
Optimization: Core to training models — minimizing loss functions through methods like stochastic gradient descent or second-order techniques.

Even seemingly complex algorithms are often just clever applications of these mathematical tools stitched together to solve specific problems.

Lessons from Electrochemical Cell ML Modeling

Working on the electrochemical cell modeling project highlighted how deeply interconnected math and machine learning really are. We weren't just fitting data to a curve — we were analyzing a high-dimensional space of physical variables like conductivity, porosity, and diffusivity, and trying to predict outcomes like capacity and energy.

Applying Principal Component Analysis (PCA) for dimensionality reduction showed firsthand the beauty of linear algebra in action. Using regression models emphasized the importance of understanding bias-variance tradeoff and loss functions. And interpreting models with techniques like SHAP values connected back to probability and conditional expectations.

Ultimately, math provided the language and structure that allowed us to move from raw physical data to meaningful, explainable insights.

Why It Matters for Engineers

Understanding the mathematical foundation of machine learning empowers engineers to:

Choose appropriate models for different types of problems
Diagnose underfitting, overfitting, or data leakage issues
Optimize hyperparameters more intelligently
Build custom models when off-the-shelf solutions fall short

You don't have to be a pure mathematician to succeed in machine learning. But investing the time to grasp the basics of linear algebra, calculus, probability, and optimization will pay off again and again in your technical career.

The Bottom Line

Machine learning may seem magical from the outside, but at its core, it's a story of elegant mathematics applied to messy real-world problems. The more fluent you become in that mathematical language, the more powerful and creative you can be as a builder, researcher, and innovator.