EN ES

Netflix Customer Churn: Identifying the Behavioral Drivers of User Retention

en cover capstone 3 netflix

1. Project overview

Netflix faces persistent customer churn, which erodes recurring revenue and limits upselling opportunities. In a highly competitive streaming market with rising acquisition costs and subscription fatigue, retaining existing subscribers is more profitable than constantly acquiring new ones.

This project analyzes Netflix’s 2025 User Behavior Dataset to identify the behavioral drivers of churn, build a prediction model, and deliver actionable recommendations to reduce cancellations.

2. Objectives

  • Develop a churn prediction model despite significant class imbalance (85% active, 15% churn).
  • Identify behavioral drivers of retention versus departure.
  • Provide concrete recommendations to improve retention ROI.

3. Key findings

  • Churn distribution: 85% active, 15% churned.
  • Early tenure is risky: most churn happens within the first 90 days.
  • Engagement drives loyalty: high watch time and completion rates strongly reduce churn risk.
  • Discovery matters: active searchers and recommendation clickers are more likely to stay.
  • Satisfaction predicts churn: low ratings and negative reviews often precede cancellations.
  • Static attributes are weak predictors: demographics and plan tiers have little impact compared to behavior.

4. Modeling results

We compared four models using PR AUC as the primary metric (due to class imbalance).

Interpretation:

  • Random Forest delivered the highest PR AUC (0.241), slightly outperforming Logistic Regression.
  • XGBoost underperformed in PR AUC but showed a more balanced recall/precision trade-off.
  • Overall, all models remain close to the baseline, highlighting the need for richer time-sensitive features before predictions can drive business impact.

5. Recommendations

  • Strengthen onboarding: Offer personalized starter packs, reminders to finish shows, and highlight recommendation features during the first 90 days, when churn risk is highest.
  • Monitor satisfaction: Identify users who give low ratings or negative reviews and reach out with personalized content or offers.
  • Focus on high-value customers: Estimate Customer Lifetime Value (CLV = monthly spend × tenure) and and direct retention resources toward high-value subscribers most at risk.

6. Future work

  • Engineer richer temporal features (recency, engagement decay, inactivity streaks).
  • Explore survival analysis to study how churn risk evolves over time.
  • Validate model-driven interventions through controlled A/B testing to measure real impact on retention and revenue.

Discover more from Éloïse Bouton

Subscribe now to keep reading and get access to the full archive.

Continue reading