About Me

I’m an Analytics Engineering professional based in New Zealand, specializing in building scalable data pipelines, transforming raw data into actionable insights, and enabling data-driven decision making through modern data stack solutions.

Currently at Canva in Data & Platform Engineering - Scaling warehouse infrastructure and data platforms:

Core Focus: Snowflake optimization • dbt pipeline management • Data architecture (Iceberg, Semantic Layer, Cortex Intelligence)

Engineering Practices: CI/CD automation • Observability • Pipeline orchestration • Stakeholder enablement

Skills & Tech Stack

Languages: Python • SQL • R • Rust

Data Engineering: dbt • Snowflake • Fivetran • Apache Spark • Airflow • PostgreSQL

Machine Learning: Scikit-Learn • Pandas • MLflow • Weights & Biases

Cloud & DevOps: AWS • Terraform • Docker • Kubernetes • CI/CD

Tools: Git • Looker • Jupyter • Power BI • FastAPI • Flask

ML & Data Engineering Portfolio

Data Engineering dbt Automated Pipeline

dbt Automated Data Pipeline

Modern data stack implementation combining dbt Core for SQL transformations with Meltano for ELT orchestration. Ingests cryptocurrency API data into PostgreSQL, featuring multi-layer transformations and automated GitHub Actions workflows.

View Project

Cloud ML

Machine Learning Model Deployment to the Cloud

Learn how to deploy and test a trained salary prediction model to the cloud using a Flask web API/endpoint and Heroku platform.

View Project

AWS & BI

AWS RDS Automated Setup & Power BI Dashboard

Discover how to set up an AWS RDS Postgres instance using Terraform, deploy a database, and connect it to build a comprehensive Power BI dashboard.

View Project

NLP & ML

Disaster Response NLP Pipeline

Build an NLP pipeline that classifies disaster messages into categories to help relief organizations quickly identify and prioritize emergency responses. Features Flask web app deployment on Azure.

View Project

MLOps

NYC Rental Price Prediction ML Pipeline

Deploy an end-to-end machine learning pipeline for predicting NYC short-term rental prices using MLOps best practices with MLflow, Weights & Biases, and Hydra for experiment tracking and artifact management.

View Project

ML DevOps

Dynamic Customer Churn Risk Assessment Pipeline

Automated ML workflow that continuously computes customer attrition risk with new data, featuring automated retraining, deployment, and monitoring through Flask API endpoints and cron-based orchestration.

View Project

CI/CD

Census Bureau Salary Classification Pipeline

Complete CI/CD pipeline for deploying an ML salary classifier using GitHub Actions, DVC for data versioning, FastAPI for serving predictions, and automated Heroku deployment with testing and linting.

View Project

Data Science Portfolio

ML & Analytics Market Basket Analysis

Market Basket Analysis of Instacart Dataset

Explore how market basket analysis using Association Rules and the Apriori algorithm can identify customer purchase patterns and provide personalized item recommendations.

View Project

Classification Predictive Maintenance

Predicting Operational Status of Water Assets

Investigate the application of classification models to predict the operational state (functional, repair needed, non-functional) of water pumps in Tanzania.

View Project

Clustering Customer Segmentation

Wholesale Distributor Customer Spending Analysis

Examine the use of unsupervised learning techniques (PCA, Gaussian Mixture Model) to analyze customer spending patterns and optimize a new delivery service.

View Project

Chris Cochet

About Me

Skills & Tech Stack

ML & Data Engineering Portfolio

dbt Automated Data Pipeline

Machine Learning Model Deployment to the Cloud

AWS RDS Automated Setup & Power BI Dashboard

Disaster Response NLP Pipeline

NYC Rental Price Prediction ML Pipeline

Dynamic Customer Churn Risk Assessment Pipeline

Census Bureau Salary Classification Pipeline

Data Science Portfolio

Market Basket Analysis of Instacart Dataset

Predicting Operational Status of Water Assets

Wholesale Distributor Customer Spending Analysis