Safety Trends & Risk Insights — Transforming millions of accident records into actionable safety analytics
This project analyzes the US Accidents (2016–2023) dataset from Kaggle to identify nationwide crash trends, high-risk regions, and behavioral patterns across time. The goal was to transform a large real-world public safety dataset into a structured analytical dashboard that highlights when, where, and why accidents occur.
Using Excel analytics, PivotTables, and calculated KPIs, the dashboard converts millions of accident records into executive-level insights that support transportation planning, safety awareness, and risk analysis.
The dataset includes:
This project focuses on trend analysis, crash distribution, and environmental risk factors.
Traffic safety agencies and analysts need a clear understanding of crash patterns to make data-driven decisions. Raw accident data is large and difficult to interpret without structured analysis.
Key questions addressed:
The dashboard presents high-level performance metrics including:
These KPIs provide an instant overview of national safety trends.
A year-by-year crash count visualization highlights the growth in reported accidents from 2016 through 2022, allowing viewers to quickly identify upward trends and potential external influences such as traffic volume or reporting changes.
A choropleth map visualizes crash counts across US states, revealing regional hotspots.
Key observations:
A day-of-week vs hour heatmap reveals behavioral patterns in accident occurrence.
Insights include:
A rate comparison chart evaluates the proportion of crashes linked to weather conditions, night driving, weekend travel, and severe incidents.
Weather-related crashes account for the largest share, suggesting environmental conditions are a dominant risk factor.
A ranked bar chart highlights the Top 10 states by crash count, enabling quick benchmarking between regions.
The dashboard was built using structured Excel analytics techniques:
With 7.7M+ records, the dataset exceeds Excel's native row limit of ~1 million rows. Power Query was used to:
The design focuses on translating millions of records into a clear decision-support interface.
My approach followed a structured analytics pipeline:
This workflow mirrors real-world data analyst responsibilities, combining data cleaning, aggregation, and storytelling.
Crash volume shows a steady upward trend over multiple years, indicating growing traffic exposure or improved reporting coverage.
Weather conditions are associated with a significant percentage of incidents, highlighting the need for weather-aware safety campaigns.
Peak accident times align closely with commuting hours, suggesting targeted enforcement and awareness during rush periods.
High-population states dominate total crash counts, emphasizing exposure risk tied to traffic volume rather than inherently unsafe driving.
This project demonstrates my ability to transform a large public dataset into an interactive analytics dashboard. It highlights my strengths in data modeling, visualization design, and translating complex datasets into clear, actionable insights — essential skills for data analyst and business intelligence roles.