Projects

A collection of data science, machine learning, and research projects spanning from experimental physics to industry applications and AI research.

From Physics to AI Safety: Cross-Disciplinary Questions for Transformer Interpretability

July 2025

Technologies: Python Transformers

A systematic exploration of transformer architecture fundamentals, approaching neural attention mechanisms through the lens of physics intuition and structured analysis. This project documents my methodology for rapidly understanding complex ML architectures and identifies physics-inspired research questions for future investigation.

Key Insights:

  • Physics-informed understanding of attention mechanisms
  • Systematic learning framework for complex architectures
  • Hands-on exploration with conceptual testing
  • Physics-inspired research questions for future investigation
Read More

P-Rex: Personalized Load Recommendation System

June 2024

Technologies: Python XGBoost GCP/Vertex.ai Recommendation Systems

Production ML system for personalized load recommendations at CloudTrucks, improving driver load booking rates through intelligent matching. Built end-to-end pipeline from feature engineering to real-time inference, serving hundreds of drivers daily.

Key Achievements:

  • 12% increase in driver load booking rate through personalization
  • XGBoost algorithm selection and optimization for recommendation system
  • Data warehouse pipeline design for ML feature engineering
  • A/B testing framework and continuous model monitoring
Read More | CloudTrucks Engineering Blog

Wave Machine: Sea Conditions Prediction

July 2017

Technologies: Python PyMC3 Bayesian Inference Maritime Analytics

Automated sea condition detection for cargo ships using time series sensor data and Bayesian logistic regression. Developed during Insight Data Science Fellowship to address small data challenges in maritime consulting applications.

Key Achievements:

  • Bayesian approach handling uncertainty with small data
  • Physics-inspired feature engineering for sensor data analysis
  • Successful model validation using posterior predictions
  • Automated reporting foundation enabling future fuel optimization
Read More

More Projects Coming

Additional projects from my physics research, data science work at NBCUniversal, and ongoing interpretability research will be added here.


Research & Professional Background

Physics Research

PhD in Physics from Columbia with postdoctoral research at Yale. Specialized in experimental nuclear and particle physics, dark matter detection, and neutrino physics. Co-authored 25+ peer-reviewed papers with 8,000+ citations.

Industry Data Science

Senior Data Scientist roles at CloudTrucks and NBCUniversal, building ML systems for load recommendations, price forecasing, advertising effectiveness, and customer analytics. Experience scaling models from prototypes to production systems.

Current Focus

Transitioning to AI interpretability research, combining experimental physics rigor with modern ML techniques. Developing systematic approaches to understanding how neural networks actually work.


Find me on LinkedIn | GitHub | Google Scholar