Why causal machine learning is the next revolution in AI

Get in on the ground floor now.

Causal modeling and inference are at the core of the most interesting questions in data science. A common task for a data scientist at a FAANG company is to query users who were exposed to a feature and calculate the correlation between usage of that feature and engagement on the platform.

However, the data scientist does not care about that correlation; she cares about whether that correlation indicates that the feature drives engagement. In other words, she cares about the causal effect of the feature on engagement. For this reason, experience with causal inference is a highly sought-after skill in marketing and digital experimentation teams at top companies, particularly in tech. I use these FAANG-flavored use cases with my online students in a course called Causal Generative Machine Learning Minicourse on Altdeep.ai.
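To make the gap between correlation and causal effect concrete, here is a minimal simulation sketch. Everything in it is an assumption made up for illustration: a hidden "power user" trait that drives both feature adoption and engagement, a true feature effect of 1.0, and the function names `simulate` and `naive_difference`. It compares the naive gap in engagement between users and non-users of the feature in observational data against the same gap in a randomized experiment.

```python
import random
from statistics import mean

def simulate(n=20000, true_effect=1.0, randomized=False, seed=0):
    """Generate (used_feature, engagement) rows from a toy causal model."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        power_user = rng.random() < 0.5  # hidden confounder (assumed)
        if randomized:
            # A/B test: exposure is a coin flip, independent of the confounder.
            used_feature = rng.random() < 0.5
        else:
            # Observational data: power users adopt the feature far more often.
            used_feature = rng.random() < (0.8 if power_user else 0.2)
        # Engagement depends on BOTH the confounder and the feature.
        engagement = 5.0 * power_user + true_effect * used_feature + rng.gauss(0, 1)
        rows.append((used_feature, engagement))
    return rows

def naive_difference(rows):
    """Mean engagement of feature users minus mean engagement of non-users."""
    treated = [y for used, y in rows if used]
    control = [y for used, y in rows if not used]
    return mean(treated) - mean(control)

observational = naive_difference(simulate(randomized=False))
experimental = naive_difference(simulate(randomized=True))
print(f"observational gap: {observational:.2f}")
print(f"experimental gap:  {experimental:.2f}")
```

Under these assumed numbers, the observational gap should land far above the true effect of 1.0, because it also picks up the engagement that power users would have had anyway. The randomized gap should sit close to 1.0, since the coin flip breaks the link between the confounder and feature usage.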

Not even deep correlation implies causation.

Deep learning has yielded awe-inspiring results in terms of predictive performance on a variety of benchmarks. Deep learning is essentially the state of the art in correlation detection; it can model the complex nonlinear correlation between high-dimensional sets of variables and use the model to generate accurate predictions. Yet the old maxim still holds: correlation does not imply causation, even if that correlation is fancy.

Like the FAANG data scientist, machine learning engineers find themselves needing to go beyond correlation to answer causal questions. Some of these questions emerge when dealing with failure modes of predictive modeling. Robustness is one example; if you could gain insight into the causal relationship between features and labels, you could construct models that perform well across different datasets, since causal relationships stay the same from one dataset to the next even when correlations shift.

Similarly, the core questions of algorithmic fairness are causal. Suppose a hiring algorithm recommended not hiring Billy. Is the algorithm more likely to give lower scores, on average, to members of Billy’s race or gender because of that race or gender? Even if it isn’t, would the outcome the algorithm gave to Billy specifically have been different if he were of a different race or gender?

That latter question is a counterfactual query, a fundamental idea in causal modeling. I believe counterfactual queries have key implications for the advancement of AI.

We’ll get better reinforcement learning with counterfactual regret.

To be clear, bread-and-butter data science is interesting stuff. But for me, the most exciting element of causal machine learning is causal reinforcement learning, or more generally, causal agent modeling. Cognitive scientists argue that causal inference is native to human reasoning: the human mind generates causal explanations for how the data came to be. Consider the fundamentally human emotion of regret.

Regret is that feeling you get when you reason, "I married Jack, the man my mother approved of, and now I'm not happy. Had I married my true love Phil, I would be happy."

This counterfactual statement is odd from a predictive modeling standpoint because the prediction "I would be happy" is based on an imaginary datum "I married my true love Phil". You did not marry Phil; you married Jack. Not even deep learning works on imagined non-existent data. Yet you can perform this counterfactual prediction because your mind can generate probable cause and effect scenarios of what life would have been like with Phil. Further, that generation incorporates data about your experience with Jack. Like that time you got pulled over and Jack mouthed off at the cop, resulting in a ticket; you imagine Phil would have kept his head and you would have gotten off with a warning. Your prediction in a parallel universe incorporates information from this one.
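One standard recipe for this kind of query in a structural causal model has three steps: abduction (infer the unobserved background factors from what actually happened), action (swap in the alternative choice), and prediction (rerun the model). Here is a toy sketch of that recipe. The structural equation, the numbers in `SPOUSE_EFFECT`, and the function names are all hypothetical, chosen only to show how information from this universe carries over to the imagined one.

```python
# Assumed toy structural equation: happiness = SPOUSE_EFFECT[spouse] + u,
# where u bundles everything about you that does not depend on whom you married.
SPOUSE_EFFECT = {"Jack": 2.0, "Phil": 6.0}  # made-up numbers for illustration

def happiness(spouse, u):
    """Structural equation for happiness given a spouse and background term u."""
    return SPOUSE_EFFECT[spouse] + u

def counterfactual_happiness(observed_spouse, observed_happiness, alt_spouse):
    # 1. Abduction: recover the background term from the observed world.
    u = observed_happiness - SPOUSE_EFFECT[observed_spouse]
    # 2. Action: replace the observed spouse with the alternative.
    # 3. Prediction: rerun the same equation with the SAME background term.
    return happiness(alt_spouse, u)

cf = counterfactual_happiness("Jack", observed_happiness=1.5, alt_spouse="Phil")
print(cf)
```

The key design choice is that `u` is inferred from life with Jack and then reused in the world with Phil. That is what separates a counterfactual from an ordinary prediction about a stranger who married Phil.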

It turns out that we can indeed encode that reasoning process into an algorithm. For example, we can build agents that minimize counterfactual regret by reasoning, "I performed an action and got this reward; what reward would I have gotten had I performed a different action?" The agent can use the answer to that query to improve its action selection policy.
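As a rough illustration, here is a sketch of regret matching, a simple building block used in counterfactual regret minimization, rather than the full algorithm. The setup is assumed for the example: rock-paper-scissors against a fixed opponent who favors rock, where the agent sees the opponent's move after each round and can therefore compute exactly what each alternative action would have earned. The names `reward`, `strategy_from_regret`, and `train` are my own.

```python
import random

ACTIONS = ["rock", "paper", "scissors"]
WINS = {("rock", "scissors"), ("paper", "rock"), ("scissors", "paper")}

def reward(mine, theirs):
    """+1 for a win, 0 for a tie, -1 for a loss."""
    if mine == theirs:
        return 0
    return 1 if (mine, theirs) in WINS else -1

def strategy_from_regret(regret):
    """Play each action in proportion to its positive cumulative regret."""
    positive = [max(r, 0.0) for r in regret]
    total = sum(positive)
    if total == 0:
        # No action is regretted yet: fall back to uniform play.
        return [1 / len(regret)] * len(regret)
    return [p / total for p in positive]

def train(rounds=10000, opponent_probs=(0.6, 0.2, 0.2), seed=0):
    """Return how often each action was played against a fixed biased opponent."""
    rng = random.Random(seed)
    regret = [0.0] * len(ACTIONS)
    counts = [0] * len(ACTIONS)
    for _ in range(rounds):
        policy = strategy_from_regret(regret)
        i = rng.choices(range(len(ACTIONS)), weights=policy)[0]
        theirs = rng.choices(ACTIONS, weights=opponent_probs)[0]
        got = reward(ACTIONS[i], theirs)
        # Counterfactual query: what would each other action have earned?
        for j, alt in enumerate(ACTIONS):
            regret[j] += reward(alt, theirs) - got
        counts[i] += 1
    return [c / rounds for c in counts]

frequencies = train()
print(dict(zip(ACTIONS, frequencies)))
```

Against this rock-heavy opponent, paper is the best response, so the agent should end up playing paper most of the time. The only learning signal is the accumulated answer to "what would I have gotten had I acted differently?"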

Algorithmic causal reasoning and inference will power a new set of breakthroughs in machine learning and artificial intelligence. Remember how those folks who got in early on the deep learning train enjoyed a massive career boost? Well, all aboard! The next revolution in AI is leaving the station. If you are interested in learning more, check out our Causal Generative Machine Learning Minicourse at Altdeep.ai.