Master data cleaning and analytical thinking through interactive games
Pattern Recognition & Cleaning
Race against time to identify and fix data quality issues in customer datasets. Spot duplicates, missing values, inconsistent formats, and outliers.
Missing Value Analysis
Investigate why data is missing and choose the best imputation strategy. Learn when missingness patterns matter for analysis.
Anomaly Detection
Visually identify outliers in scatter plots and decide whether they're errors or valuable insights. Real business data scenarios.
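As a rough companion to the visual approach, the sketch below flags candidate outliers with the common 1.5 × IQR rule. The function name `iqr_outliers`, the multiplier `k`, and the sample order values are illustrative assumptions, not part of the game. A flagged point is only a candidate: whether it is an error or a valuable insight is still a judgment call.

```python
from statistics import quantiles

def iqr_outliers(values, k=1.5):
    """Return values outside [Q1 - k*IQR, Q3 + k*IQR] (hypothetical helper)."""
    q1, _, q3 = quantiles(values, n=4)  # quartiles, default 'exclusive' method
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]

# Made-up order values with one suspicious entry
order_values = [10, 12, 11, 13, 12, 11, 10, 95]
flagged = iqr_outliers(order_values)
```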
Deduplication Strategy
Find and resolve duplicate records using different matching strategies. Handle exact matches, fuzzy matches, and near-duplicates.
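One possible way to tell the three match types apart is sketched below using Python's standard `difflib`. The helper names, the 0.85 threshold, and the example names are assumptions chosen for illustration. Real matching strategies usually compare several fields, not one string.

```python
from difflib import SequenceMatcher

def normalize(name: str) -> str:
    """Lowercase and collapse whitespace so trivial differences disappear."""
    return " ".join(name.lower().split())

def classify_pair(a: str, b: str, threshold: float = 0.85) -> str:
    """Label a pair of strings as exact, near-duplicate, fuzzy, or distinct."""
    if a == b:
        return "exact"
    na, nb = normalize(a), normalize(b)
    if na == nb:
        return "near-duplicate"  # differs only in case or spacing
    if SequenceMatcher(None, na, nb).ratio() >= threshold:
        return "fuzzy"  # similar enough to review as a likely duplicate
    return "distinct"
```

The threshold trades false merges against missed duplicates. A lower value catches more typos but also pairs up records that are genuinely different.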
Distribution Analysis
Assess whether data follows a normal distribution using visual checks and statistical tests. Learn when normality matters for analysis.
Statistical Prerequisites
Weigh statistical assumptions as you would on a scale. Check whether your data meets the requirements of different analytical methods.
Data Standardization
Standardize inconsistent data formats across dates, currencies, addresses, and phone numbers. Fix real-world formatting chaos.
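For dates, a minimal sketch of standardization is to try a list of known input formats and emit ISO 8601. The format list and the function name `to_iso_date` are assumptions. Note in particular that slash-separated dates are treated as day-first here, so a US-style month-first source would need a different list.

```python
from datetime import datetime

# Assumed input formats, tried in order; slashed dates are read as day-first
KNOWN_FORMATS = ["%Y-%m-%d", "%d/%m/%Y", "%d.%m.%Y", "%Y/%m/%d"]

def to_iso_date(raw: str):
    """Return the date as YYYY-MM-DD, or None if no known format matches."""
    text = raw.strip()
    for fmt in KNOWN_FORMATS:
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue  # try the next candidate format
    return None  # leave unparseable values for manual review
```

Returning `None` instead of guessing keeps unparseable values visible, so they can be reviewed by hand.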
Statistical Power
Calculate required sample sizes for different business scenarios. Balance statistical power with practical constraints.
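To make the trade-off concrete, the sketch below uses the standard normal-approximation formula for comparing two group means, n per group = 2 × ((z₁₋α/₂ + z_power) / d)², where d is the standardized effect size. The function name and default values are illustrative assumptions. An exact t-based calculation gives slightly larger numbers.

```python
from math import ceil
from statistics import NormalDist

def sample_size_per_group(effect_size, alpha=0.05, power=0.80):
    """Approximate n per group for a two-sided, two-sample comparison of means.

    effect_size is the standardized difference (mean difference / std dev).
    """
    z = NormalDist()  # standard normal distribution
    z_alpha = z.inv_cdf(1 - alpha / 2)  # critical value for the two-sided test
    z_power = z.inv_cdf(power)          # quantile matching the desired power
    return ceil(2 * ((z_alpha + z_power) / effect_size) ** 2)

n_medium = sample_size_per_group(0.5)  # medium effect, 80% power, alpha 0.05
```

Halving the detectable effect roughly quadruples the required sample, which is where practical constraints usually start to bite.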
Comprehensive Investigation
Solve the case of the messy dataset! Use detective tools to uncover all hidden data quality issues in realistic business datasets.
Method Selection
Choose the right analytical method for real business scenarios. Match techniques to data characteristics and business needs.