Montreal, QC · EN / FR

I'm Patrick Leduc,
Data Developer.

Building and maintaining scalable ETL pipelines on GCP. 5+ years turning raw data into business-critical insights at a fast-growing AdTech startup.

5+
Years Experience
200+
Brands Scaled
13+
Networks Integrated
$60k
Annual Savings

Data-driven problem solver
with a research backbone.

I'm a Data Developer at Goloot, a fast-growing AdTech startup in Montreal where I lead the design and delivery of our data infrastructure on Google Cloud Platform. My day-to-day centers on building and maintaining ETL pipelines with BigQuery and Dataform, ensuring our reporting database scales reliably as we onboard new affiliate networks and brands.

Before tech I spent five years in academia at the University of Toronto, where I developed a deep foundation in statistical methods, hypothesis testing, and research design. That analytical rigor carries over into everything I build today.

Location
Montreal, QC
Languages
English & French
Current Role
Data Developer
Team
1 Direct Report

Where I've made an impact.

September 2022 — Present
Data Developer
Goloot · Montreal
  • Build and maintain ETL pipelines end-to-end on GCP using BigQuery and Dataform, with 1 data analyst reporting to me.
  • Built a holistic reporting database integrating 13+ affiliate networks (Impact, CJ, Rakuten, and more) with internal data.
  • Scaled reporting from ~10 to 200+ brands across 20+ publisher networks, building Tableau dashboards for self-service analytics across financial, operational, and marketing KPIs.
  • Saved $60k/year on data storage and processing costs by introducing indexing, clustering, partitioning, and incremental processing.
  • Guided growth with ML forecasting models (XGBoost), including data cleaning, feature engineering, EDA, model tuning, and deployment — saving CS and sales 20+ hours/month.
  • Built campaign-publisher matching algorithms in Python that automated a 40-hour/month manual process, freeing ~0.5 FTE.
  • Implemented data governance best practices for quality and consistency across the full pipeline.
  • Mentored colleagues on SQL and Tableau best practices.
September 2016 — January 2022
Researcher & Lecturer
University of Toronto · Toronto
  • Collected and analyzed data from governmental, NGO, and research sources. Developed and tested hypotheses using T-tests and statistical inference. Presented findings to expert audiences.
  • Delivered weekly 2-hour lectures to 200 undergraduate students.

My toolkit.

⚙️
Data Engineering
BigQuery Dataform (dbt) ETL Pipelines Docker GCP
📊
Analytics & Visualization
Tableau Power BI Looker Amplitude Google Analytics Excel
🧠
Languages & ML
Python SQL PostgreSQL Machine Learning XGBoost NLP
📈
Statistics
Causal Inference Hypothesis Testing Longitudinal Studies Panel Data
🔧
Infrastructure
Docker Pihole Grafana Prometheus
🤝
Process & Collaboration
Agile Kanban Mentoring Data Governance

Academic background.

Certificate, Data Science
Concordia University
2022 · Montreal
PhD Candidate, Sociology
University of Toronto
2017 – 2022 · Toronto
MA Sociology
University of Toronto
2017 · Toronto
BA Sociology, Honours
McGill University
2016 · Montreal

Side work & experiments.

Holistic Affiliate Reporting Database
Built a unified reporting database that integrates 13+ affiliate networks (Impact, CJ, Rakuten, and more) with internal company data, enabling cross-network analytics and consolidated business intelligence.
BigQuery Dataform GCP ETL
Beer Brewing Recommendation System
Personalized product recommendation system for beer brewing kits, matching users to recipes based on flavor preferences using natural language processing.
Python NLP SpaCy
Home Networking Hub
Self-hosted home network with ad blocking, monitoring dashboards, and automated services — all running on Docker containers.
Docker Pihole Grafana Prometheus

Let's connect.

Interested in working together?

Whether you have a role in mind, want to discuss a data challenge, or just want to say hello — I'd love to hear from you.