Exploratory Data Analysis on Terrorism

Mar 11, 2026

A large-scale exploratory data analysis project that uses Spark to study patterns in terrorism across regions and time periods.

Highlights

  • Used PySpark for distributed analysis over a large dataset.
  • Explored geographic and temporal patterns through notebook-based investigation.
  • Combined scalable Spark processing with familiar Python analytics tooling such as Pandas.
  • Focused on pattern discovery rather than one-off descriptive charts.

Tech Stack

Python, PySpark, Pandas, and Jupyter.

Why It Matters

This project demonstrates the ability to analyze large datasets and to switch to distributed tooling when dataset size or processing demands increase.