About Me
Hi! I’m Chris, a data scientist and former math teacher. I specialize in using data visualization, predictive modeling, and analytics to drive decisions that make a positive impact on society. With a passion for solving complex problems, I turn data into actionable insights that empower organizations and communities.
Skills
Python | R | Tableau | Power BI | SQL | Excel | InDesign | Git
Data Analysis 2+ years
Data Visualization 2+ years
Data Reporting 1+ years
Predictive Modeling 1+ years
Featured Projects
Professional Experience
DATA/POLICY ANALYST - Cook County Office of Research, Operations & Innovation
VISION ZERO COMMUNITY AMBASSADOR - City of Philadelphia
LEAD STEM CONTENT SPECIALIST - TEACH, LLC.
ARTIFICIAL INTELLIGENCE/MACHINE LEARNING INSTRUCTOR - Kode with Klossy
HIGH SCHOOL MATH TEACHER - School District of Philadelphia
SUPPLY CHAIN & OPERATIONS INTERN - Laborocity
Education

Data, Economics, and Public Policy Certificate, MITx MicroMasters

M.S. Education, University of Pennsylvania

Data Science Certificate, Flatiron School Bootcamp

B.S. Mathematics, University of Pittsburgh
Recent Blog Posts
Python | Machine Learning
Traffic Crash Prediction

OVERVIEW: This project analyzes Chicago’s crash data to identify factors influencing severe crashes, aiming to support Vision Zero initiatives and provide actionable insights for policymakers.
DATA: Over 900k records on crash incidents, 1.98 million on individuals, and 1.84 million on vehicles.
GOALS: Develop an inferential model to predict and identify factors leading to serious crashes, addressing challenges like class imbalance.
MODELING STEPS:
1. Data Cleaning: Handled missing values, duplicates, outliers, and ensured proper formatting.
2. Data Splitting: Prepared training, validation, and test sets to prevent data leakage.
3. Baseline Models: Built logistic regression and decision tree models for initial comparisons.
4. Addressing Class Imbalance: Applied SMOTE, class weighting, and their combinations to address imbalances.
5. Hyperparameter Tuning: Used GridSearchCV to find optimal parameters, optimizing for PR AUC.
6. Model Selection: Chose the decision tree for its interpretability and strong performance.
7. Model Evaluation: Focused on evaluating feature importances from the decision tree to gain insights into key predictors.
- INSIGHT #1: Overrepresented factors in severe crashes include airbag deployment, light condition, and males.

- INSIGHT #2: The top contributing factors identified in severe crashes are airbag deployment, males, and seasonality. The feature importance output from the decision tree model highlights these variables as having the greatest influence on crash severity.
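
As a rough illustration of steps 4 and 5 above, the sketch below shows what class weighting and a PR AUC-style score compute, in plain Python on a hypothetical toy sample. The project itself reports using SMOTE, class weighting, and GridSearchCV; the function names, labels, and scores here are assumptions made for illustration and are not taken from the project code.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count).

    This is a common "balanced" heuristic: rare classes get larger weights.
    """
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {cls: n_samples / (n_classes * cnt) for cls, cnt in counts.items()}


def average_precision(y_true, scores):
    """Average precision: a step-wise summary of the precision-recall curve."""
    # Rank predictions from most to least confident.
    ranked = sorted(zip(scores, y_true), key=lambda pair: pair[0], reverse=True)
    total_pos = sum(y_true)
    hits, ap = 0, 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            ap += hits / rank  # precision at the rank of each true positive
    return ap / total_pos


# Hypothetical toy sample: 1 = severe crash (rare), 0 = not severe.
y = [0, 0, 0, 0, 0, 0, 0, 0, 1, 1]
# Hypothetical model scores for the same ten records.
scores = [0.1, 0.2, 0.15, 0.3, 0.05, 0.4, 0.25, 0.35, 0.9, 0.38]

weights = balanced_class_weights(y)
ap = average_precision(y, scores)
print(weights, round(ap, 3))
```

With only 2 of 10 records in the severe class, accuracy would reward a model that never predicts a severe crash, which is why a precision-recall metric is the more informative tuning target in this kind of setting.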

RECOMMENDATIONS:
1. Male Injury Trends:
- Conduct further research to analyze trends in male driver involvement in serious crashes.
- Develop targeted safety campaigns focused on behaviors like speeding and aggressive driving.
2. Vehicle Size and Speed Regulations:
- Implement city-specific tiered fees for heavier vehicles, using revenue to fund pedestrian and cyclist safety measures.
3. Annual Airbag Inspections:
- Promote policies requiring annual airbag inspections to ensure functionality and compliance with safety standards.
The full project can be viewed on my GitHub page: here
R & Tableau | Regression Analysis
Gun Violence & Education Dashboard
OVERVIEW:
Explored the relationship between youth gun deaths and high school graduation rates across Chicago ZIP codes using publicly available data (2018-2021). Conducted a hypothesis test and regression analysis and created data visualizations, revealing a statistically significant inverse relationship between graduation rates and juvenile gun deaths.
DATA:
Mortality Data: Juvenile (under 18) gun-related deaths in Chicago (2018–2021).
Education Data: 4-year high school graduation rates by ZIP code (2018–2021).
GOALS:
Explore the relationship between youth gun deaths and high school graduation rates by:
1. Conducting a hypothesis test to determine the statistical significance of the relationship.
2. Performing regression analysis to quantify the strength and direction of this relationship.
3. Visualizing the data using maps and scatterplots to illustrate the relationship both geographically and quantitatively.
KEY INSIGHTS:
The relationship between graduation rates and juvenile gun deaths is inverse: as graduation rates increase, juvenile gun deaths decrease.
The relationship is statistically significant (p = 0.0216, below the 0.05 threshold).
For every 1% increase in the graduation rate, there is an associated decrease of 0.349 juvenile gun deaths per 100K youth.
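
To make the slope interpretation concrete, here is a minimal Python sketch (the project itself was done in R and Tableau). It fits a least-squares line to a small hypothetical dataset, then applies the reported coefficient of -0.349 to a larger change in graduation rate. The toy data points and function names are assumptions for illustration only; only the -0.349 figure comes from the project.

```python
def ols_fit(x, y):
    """Simple least-squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical ZIP-level data (NOT the project's data):
# graduation rate (%) and juvenile gun deaths per 100K youth.
grad_rate = [60, 70, 80, 90]
gun_deaths = [20, 15, 12, 5]
toy_slope, toy_intercept = ols_fit(grad_rate, gun_deaths)

# Reported coefficient from the project: change in deaths per 100K youth
# for each 1% increase in the graduation rate.
REPORTED_SLOPE = -0.349


def implied_change(delta_grad_rate, slope=REPORTED_SLOPE):
    """Change in deaths per 100K implied by a linear model."""
    return slope * delta_grad_rate


print(toy_slope, toy_intercept, implied_change(10))
```

Under the linear model, a 10-unit rise in graduation rate would correspond to roughly 3.49 fewer juvenile gun deaths per 100K youth. This describes an association in the data, not a causal effect.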
The full project can be viewed on my GitHub page: here
Python | Machine Learning
Water Well Classification

OVERVIEW:
This project uses machine learning to assist an NGO in identifying damaged water wells across Tanzania.
DATA: The dataset includes 59k water well records with 41 features, such as location, water source, and functionality status.
GOAL: Build a machine learning model that classifies wells as functional or in need of repair, with a focus on maximizing recall to minimize missed repair opportunities.
MODELING STEPS:
1. Data cleaning and preprocessing, including handling null values, reclassification of target variable, and feature engineering.
2. Trained multiple models, including logistic regression, decision trees, and random forests.
3. Evaluated performance using recall to minimize false negatives, with an emphasis on regions most in need of repairs.
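
For readers unfamiliar with the metric in step 3, the short Python sketch below shows how recall is computed and why it tracks missed repairs. The labels and predictions are hypothetical, and treating "needs repair" as the positive class (1) is an assumption made for this example.

```python
def recall(y_true, y_pred, positive=1):
    """Share of actual positives that the model correctly flags."""
    true_pos = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    false_neg = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    return true_pos / (true_pos + false_neg)


# Hypothetical labels: 1 = well needs repair, 0 = functional.
actual = [1, 1, 1, 0, 0, 1]
predicted = [1, 0, 1, 0, 1, 1]

# One broken well (the second) is missed, so recall is 3 / 4.
well_recall = recall(actual, predicted)
print(well_recall)
```

Each false negative here is a broken well that goes unflagged, so a higher recall means fewer missed repair opportunities, at the cost of possibly inspecting some wells that turn out to be functional.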
INSIGHTS:
Areas like Kigoma and Dar es Salaam are high priority for repairs due to a high density of broken wells and low access to functional ones.

RECOMMENDATIONS:
1. Prioritize regions identified for high repair needs.
2. Deploy the initial Random Forest model to predict repair needs, as it offers the best recall despite some overfitting.
3. Refine the model further with better feature selection and hyperparameter tuning to improve performance and reduce overfitting.
The full project can be viewed on my GitHub page: here
Excel & Canva | Data Analysis
ACT Program Evaluation Report

OVERVIEW:
Developed a 5-page data report analyzing the effectiveness of the ACT Prep program over 8 years, summarizing key trends, improvements in scores, and national comparisons.
DATA:
Analyzed data from 8 cohorts, serving 156 students with an average enrollment of 19.5 students per cohort.
INSIGHTS:
93% of students achieved growth
On average, students’ national percentiles increased by 17 points, rising from the 22nd to the 39th percentile during the ACT Prep class


The full report can be viewed: here
Tableau | Data Visualization
Trees & Wealth Dashboard
OVERVIEW:
Developed an interactive Tableau dashboard to explore the relationship between wealth and trees in New York City.
DATA:
Income data across NYC’s 5 boroughs.
Tree census data from over 680k trees
The full dashboard can be viewed on my Tableau Public page: here