AI Customer Churn Prediction & Retention

The problem: spreading retention spend thin across everyone

Most companies discover a customer has left only when the orders stop. By then the relationship is gone, and the cost of winning that customer back is far higher than the cost of keeping them would have been. That was the situation facing a major spice exporter we worked with: market preferences were shifting, competitors were circling, and the marketing campaigns meant to hold the base were quietly underperforming. Churn was rising, and nobody could say with confidence which customers were about to walk.

The deeper issue was not a lack of effort — it was a lack of aim. Retention budget was being spread evenly across the whole customer base, treating a loyal, high-value buyer the same as a one-off bargain hunter. That makes campaigns expensive and underwhelming at the same time: money goes to people who were never going to leave, while the customers genuinely on the edge get the same generic message as everyone else. The exporter needed two things before the budget ran out: a reliable signal for who was about to churn, and a clear read on what would actually make them stay.

How the system works

We built a churn-prediction system that turns raw transaction history into a ranked, explainable list of at-risk customers and the actions most likely to retain them. It runs in three stages, each answering a specific business question.

Segment — A K-means clustering model groups customers by behaviour and value, so the business sees natural cohorts rather than one undifferentiated mass. This is the difference between "our customers" and "these five distinct kinds of customer, each worth a different amount."
Score — Decision-tree models score every customer on churn risk and, crucially, surface the drivers behind that score. Instead of a black-box number, marketing sees exactly why a customer looks at risk — falling order frequency, a shrinking basket, a lapsed reorder cycle.
Target — With cohorts and risk drivers in hand, campaigns are tailored to the customers most worth keeping. Retention spend is concentrated where it returns the most, instead of being sprinkled evenly and hoping for the best.

The output is deliberately actionable, not just analytical. A churn score that nobody can act on is a vanity metric. By pairing each at-risk customer with the behavioural reason behind the risk, the model hands marketing a campaign brief, not a spreadsheet to puzzle over.

Data and approach: how we model churn and choose retention actions

Good churn prediction is mostly good data work before it is clever modelling. We start by consolidating the signals that actually predict churn — order frequency and recency, basket size and trend, product mix, seasonality, price sensitivity and response to past campaigns. Getting those features clean, joined and trustworthy is the foundation, and it is where a serious data engineering effort earns its keep. A churn model trained on messy, half-joined data will confidently predict the wrong things.

On the modelling side, we deliberately chose interpretable methods. K-means gives the business a stable set of behavioural and value-based segments — for example, high-value loyalists, steady mid-tier buyers, price-driven occasionals and dormant-but-recoverable accounts. Decision trees then score churn risk within that picture and expose the splits that drive each prediction, which matters for two reasons. First, an explainable model lets the commercial team sanity-check the logic against what they already know about their market. Second, the drivers are the retention levers: if the tree shows that customers churn when their reorder interval stretches past a threshold, the retention action writes itself. This is the machine learning philosophy we bring to every engagement — accuracy that a human can reason about beats a marginally sharper model nobody trusts.

Retention actions are then matched to cohort and cause. A high-value loyalist whose orders are slowing gets a different intervention — a relationship touch, a tailored assortment, a service check — than a price-driven occasional, who may respond best to a timed offer. The point is to stop sending one message to everyone and start sending the right message to the cohorts that move the numbers.

Results

These are real outcomes from this engagement; the client name is withheld pending permission to publish it. On the exporter's own data and campaigns, the system delivered:

−13% customer churn — fewer customers lost, by catching at-risk accounts early enough to act.
+€1.7M additional sales per campaign — by aiming spend at the cohorts most worth keeping, rather than spreading it evenly.

The mechanism behind both numbers is the same: better aim. The same budget, pointed at the right customers with the right message, retains more revenue and generates more incremental sales than a flat, spray-everywhere approach. Churn prediction did not replace the marketing team's judgement — it gave them a sharper target to point it at.

Who it's for, and the ROI case

This approach fits any business with a repeat-purchase relationship and enough transaction history to learn from — typically a few thousand customers and a year or more of orders. It is a natural fit for consumer goods, distribution and B2B sales, and the same churn-and-retention pattern applies cleanly in retail, subscription businesses and any operation where keeping a customer is cheaper than acquiring a new one.

The ROI logic is straightforward and easy to defend internally:

Retention is cheaper than acquisition. Reducing churn protects revenue you have already paid to win, which is almost always the highest-return marketing euro available.
Targeting concentrates spend. The same budget aimed at the right cohorts produces more retained revenue and more incremental sales — the €1.7M-per-campaign uplift above is exactly this effect.
It compounds. Each campaign feeds response data back into the model, so segmentation and scoring sharpen over time rather than going stale.

If you want to see how we frame projected return before any build, our pricing page sets out the fixed-scope tiers and the ROI-first approach we apply to every engagement. For a related, forecasting-led example of the same data-to-decision pattern, see our demand forecasting case study.

How we'd run a pilot

We do not ask you to commit to a full programme on faith. We prove the value on your own data first, with a focused pilot built to produce a clear go / no-go.

Scope and baseline. We agree the success metric up front — churn reduction, retained revenue, or uplift per campaign — and establish the current baseline so the result is measured, not asserted.
Data and features. We consolidate and clean your transaction history into the features that predict churn, the data engineering groundwork that makes everything downstream trustworthy.
Model and validate. We segment with K-means, score churn risk with decision trees, and validate against the baseline so you can see real numbers, not a demo.
Target and integrate. We turn at-risk cohorts and their drivers into campaign-ready segments, and connect the output into the tools your marketing team already uses through our AI implementation work.

Every step has a fixed scope and a price agreed up front, and the pilot reports verified figures from your own customers — no jargon, no vapourware. If the numbers stack up, we scale it into production. If you have a churn problem that looks like this one, book a free consultation and tell us what you are trying to keep.

Client result — name withheld

−13%Customer churn

+€1.7MAdditional sales per campaign

Real result from a Crux Digits engagement; client name withheld pending permission. Figures are from a field deployment on the client's own customer data and campaigns.

Frequently asked questions

How does the model know who is about to churn?

Decision-tree models score each customer on churn risk and surface the behavioural drivers behind that score — falling order frequency, a shrinking basket, a lapsed reorder cycle — so the signal is actionable, not just a number.

Why segment customers first?

K-means groups customers by behaviour and value so campaigns can be tailored to each cohort, concentrating retention spend where it returns the most instead of spreading it evenly across everyone.

How much data do we need to get started?

Typically a few thousand customers and at least a year of transaction history. The richer and cleaner the order data, the sharper the segmentation and churn scoring — which is why a short data-engineering step usually comes first.

Are the results from a real client?

Yes. The −13% churn reduction and +€1.7M additional sales per campaign are real outcomes from a Crux Digits engagement with a major exporter, measured on the client's own data. The client name is withheld pending permission to publish it.

Will this replace our marketing team?

No. It gives your marketing team a sharper target — which customers are at risk and why — so they can point their budget and judgement where it returns the most. The decisions and campaigns remain yours.

← All case studies