This dataset contains sales and behavioral data for 10,000 customers, primarily designed for churn analysis and customer segmentation. It includes metrics tracked over several weeks (labeled W1 through W5) to capture trends in purchasing behavior.
Key Components of the Dataset:
Customer Identification: CUSTOMER_ID uniquely identifies each record.
Sales Metrics: Includes total sales (Total_Sale), standard deviation of sales (STD_Sales), and average purchase value (APV). Note that many sales values appear to be log-transformed or normalized.
Temporal Tracking (Weekly Data): Detailed breakdown for weeks 1–5, including:
Visits: Number of visits per week (W1_Visits, etc.).
Sales Performance: Minimum, maximum, and standard deviation of sales per week (e.g., W3_Max_Sale, W4_Min_Sale).
Activity Flags: Binary indicators for whether a customer was active in specific weeks (week_1, week_2, etc.).
Recency: Days_since_last_visit tracks how long it has been since the customer's last interaction.
Customer Segmentation:
Value-based: Categorizes customers as High_value, Low_value, or Regular.
Engagement-based: Categorizes customers by visit frequency (e.g., Frequent_Visitors, Rare_Visitors).
Target Variable:
CHURN: A binary indicator (0 or 1) typically used to predict if a customer will stop using the service.