← Back to Projects

Customer Sales & Income Prediction (Udacity Guided)

Power BI dashboard analyzing customer income, purchasing behavior, product performance, and geographic trends through structured data modeling and visual analytics.

Dat Tran β€” Data Analyst / BI Developer

Power BI Power Query Data Modeling Regression Analysis Geographic Analytics
View Project Files

Project Overview

This project uses Microsoft Power BI to analyze customer income, purchasing behavior, product performance, and geographic trends. The goal was to transform raw customer and product data into actionable business insights through structured data modeling and visual analytics.

Key Goals

  • Clean and transform raw data using Power Query
  • Build relational data model for cross-filtering
  • Analyze income-to-sales correlation patterns
  • Identify product recommendation trends
  • Visualize geographic income distribution

Data Cleaning (Power Query)

Data preparation was performed in Power Query Editor before building visuals:

  • Removed unnecessary or inconsistent columns
  • Corrected data types for income, sales, and ratings
  • Structured income ranges and age groups
  • Standardized product and state fields
  • Prepared regression variables and calculated fields

Why clean at query stage? Cleaning data in Power Query ensures faster dashboard performance and accurate aggregations inside visuals.

Power Query Data Transformation

Power Query Editor showing data cleaning and transformation steps

Data Modeling & Relationships

After cleaning, tables were loaded into the Power BI Data Model where relationships were created between:

  • Customer table
  • Sales metrics
  • Product data
  • Geographic/state fields

Key benefit: These relationships enabled cross-filtering between visuals, accurate counts, and consistent analytics across maps, charts, and regression visuals.

Power BI Data Model Relationships

Data model showing relationships between Customer, Sales, Product, and Geographic tables

Dashboard Analysis

Income & Sales Regression

Scatter plot analysis shows a strong positive relationship between income and sales:

πŸ“ˆ Correlation: 0.78 (Strong Positive)
Regression Model
y = 0.01073x βˆ’ 722.14
Slope: 0.01073
Intercept: βˆ’722.14

Insight: Higher-income customers generate higher purchasing values. Every $1,000 increase in income corresponds to approximately $10.73 in additional sales.

Income vs Sales Regression Scatter Plot

Scatter plot showing strong positive correlation (0.78) between customer income and sales

Customer Rating vs Return Rate

Return rate and customer rating display a negative correlation:

πŸ“‰ Correlation: βˆ’0.69 (Moderate Negative)

Products with higher ratings show significantly lower return percentages, visible in scatter and product comparison visuals.

Customer Rating vs Return Rate Analysis

Scatter plot showing negative correlation between customer ratings and return rates

Customer Distribution

$80K–$120K
Primary Income Range
35–54
Core Age Group
IL & NJ
Top Earning States

Income heatmap highlights higher earning regions across the country, with concentration in Illinois and New Jersey aligning with higher product recommendation activity.

Geographic Income Heatmap

National heatmap showing income concentration by state with Illinois and New Jersey highlighted

Product Recommendation & Inventory

Top Recommended Products

1
Sweater
660 customers (66%)
2
Shirt
Secondary recommendation
3
Leather Bag
High-income preference

Inventory Highlights

Analysis highlights the following products with highest stock levels:

  • Long Dress β€” Highest inventory
  • Wool Scarf β€” High stock
  • Sweater Dress β€” High stock

Geographic insight: High-income states show strong Leather Bag preference, suggesting targeted marketing opportunities.

Product Recommendation Analysis

Product recommendation breakdown showing Sweater as top recommended item (66%)

Tools & Techniques

πŸ“Š
Microsoft Power BI
πŸ”§
Power Query ETL
πŸ”—
Data Model Relationships
πŸ“ˆ
Scatter Plots & Trendlines
πŸ—ΊοΈ
Geographic Heatmaps
πŸ“‰
Regression Modeling
πŸ“‹
Clustered Bar Charts
πŸ”’
Correlation Analysis

Key Insights

πŸ’° Income Drives Sales

Strong 0.78 correlation confirms higher-income customers generate significantly higher purchasing values.

⭐ Quality Reduces Returns

Products with higher ratings show lower return rates, validating investment in product quality.

🎯 Target Demographics

Core customer base is 35–54 years old with $80K–$120K income β€” ideal for marketing focus.

πŸ“ Geographic Targeting

Illinois and New Jersey show highest income concentration and product recommendation activity.

More Projects