Analytics Engineering Portfolio

End-to-End Data Pipeline | Real Google Analytics Data | Interview-Ready

**by Nicolas Klein**
Data Engineer | Analytics Engineer | Perth, Australia

Data Pipeline Architecture

Raw Events: Google Analytics
Warehouse: BigQuery
Staging: dbt Models
Marts: Aggregations
Data Lake: Embedded Parquet
Dashboards: Evidence.dev
AI ChatBot: NL-to-SQL

💡 Data Lake: 700KB Parquet files embedded in the Docker image for instant queries (no external DB calls)

Data Warehouse Schema

📊 SOURCE
• ga_sessions_* (Public Dataset, 37.8k rows)
🔄 STAGING
• stg_ga_sessions (View, Cleaned & Typed)
📈 MARTS
• mart_kpis_daily (Table, 636 rows)
• sessions_enriched (View, 37.8k rows)
• kpis_by_country (View, 202 rows)
Data Flow: Raw Events → Staging (cleaning) → Marts (aggregations) → Evidence Dashboards
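
The staging and mart layers named above can be illustrated with a small, self-contained sketch. It is not the project's dbt code: it uses Python's built-in sqlite3 as a stand-in for BigQuery, and the `raw_sessions` table, its columns, and the sample rows are all hypothetical. Only the model names `stg_ga_sessions` (a view) and `mart_kpis_daily` (a table) come from the schema above.

```python
import sqlite3

# Hypothetical raw rows: (date, channel, transactions, revenue), all stored as text.
RAW_ROWS = [
    ("20170101", "Organic Search", "1", "25.00"),
    ("20170101", "Organic Search", None, None),
    ("20170101", "Direct", "2", "80.50"),
    ("20170102", "Direct", None, None),
]


def build_pipeline(conn):
    """Create a raw table, a staging view, and a daily mart table."""
    conn.execute(
        "CREATE TABLE raw_sessions "
        "(date TEXT, channel TEXT, transactions TEXT, revenue TEXT)"
    )
    conn.executemany("INSERT INTO raw_sessions VALUES (?, ?, ?, ?)", RAW_ROWS)

    # Staging (view): reformat the date, replace NULLs, cast text to numbers.
    conn.execute(
        """
        CREATE VIEW stg_ga_sessions AS
        SELECT
            substr(date, 1, 4) || '-' || substr(date, 5, 2) || '-' || substr(date, 7, 2)
                AS session_date,
            channel,
            CAST(COALESCE(transactions, '0') AS INTEGER) AS transactions,
            CAST(COALESCE(revenue, '0') AS REAL) AS revenue
        FROM raw_sessions
        """
    )

    # Mart (table): one row per day and channel.
    conn.execute(
        """
        CREATE TABLE mart_kpis_daily AS
        SELECT
            session_date,
            channel,
            COUNT(*) AS sessions,
            SUM(transactions) AS transactions,
            SUM(revenue) AS revenue
        FROM stg_ga_sessions
        GROUP BY session_date, channel
        """
    )


def daily_kpis(conn):
    """Return the mart rows ordered by date and channel."""
    return conn.execute(
        "SELECT session_date, channel, sessions, transactions, revenue "
        "FROM mart_kpis_daily ORDER BY session_date, channel"
    ).fetchall()


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    build_pipeline(connection)
    for row in daily_kpis(connection):
        print(row)
```

Keeping staging as a view and the daily mart as a materialized table mirrors the View/Table split shown in the schema; the grain of one row per day and channel is an assumption made for this sketch.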

Evidence Query Layer

📊 KPI Queries
• kpis_daily.sql
• kpis_summary
• channel_performance
• revenue_trend
• top_channels
Source: mart_kpis_daily
🌍 Geographic Queries
• kpis_by_country.sql
• country_performance
• pareto_analysis
• geo_concentration
• country_drilldown
Source: kpis_by_country
🔍 Session Queries
• sessions_enriched.sql
• source_medium_attribution
• device_breakdown
• drilldown_channel_country
• acquisition_analysis
Source: sessions_enriched
📈 Analytics Queries
• data_quality_daily.sql
• sql_interview_patterns.sql
• anomaly_detection
• control_charts
• statistical_analysis
Source: Multiple marts
12 Evidence Pages powered by 15+ SQL queries across 3 data sources
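
As a rough idea of what a query such as `top_channels` could look like against `mart_kpis_daily`, here is a hedged sketch. The SQL text, the column names, and the sample rows are assumptions rather than the project's actual query, and sqlite3 stands in for the DuckDB engine so the example runs with the Python standard library alone.

```python
import sqlite3

# Assumed query shape: total revenue and conversion rate per channel, top 3 only.
TOP_CHANNELS_SQL = """
SELECT
    channel,
    SUM(revenue) AS total_revenue,
    ROUND(100.0 * SUM(transactions) / SUM(sessions), 2) AS conversion_pct
FROM mart_kpis_daily
GROUP BY channel
ORDER BY total_revenue DESC
LIMIT 3
"""


def run_top_channels(rows):
    """Load hypothetical mart rows into an in-memory database and run the query."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            "CREATE TABLE mart_kpis_daily "
            "(session_date TEXT, channel TEXT, sessions INTEGER, "
            "transactions INTEGER, revenue REAL)"
        )
        conn.executemany("INSERT INTO mart_kpis_daily VALUES (?, ?, ?, ?, ?)", rows)
        return conn.execute(TOP_CHANNELS_SQL).fetchall()
    finally:
        conn.close()


# Made-up sample data, not taken from the real dataset.
SAMPLE_ROWS = [
    ("2017-01-01", "Referral", 200, 10, 1500.0),
    ("2017-01-02", "Referral", 300, 10, 1000.0),
    ("2017-01-01", "Direct", 400, 4, 600.0),
    ("2017-01-01", "Organic Search", 1000, 5, 400.0),
    ("2017-01-01", "Social", 500, 0, 0.0),
]

if __name__ == "__main__":
    for channel, revenue, conversion in run_top_channels(SAMPLE_ROWS):
        print(channel, revenue, conversion)
```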

Performance Overview (Q1 2017)

Total Sessions

196,817

90 days

Transactions

2,439

1.24% conversion

Revenue

$337,599

8 channels
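
The headline figures above are related by simple arithmetic. The short sketch below recomputes the conversion rate from the session and transaction counts, and derives revenue per transaction as an extra illustrative metric that the page itself does not report. The function names are made up for this example.

```python
def conversion_rate(transactions: int, sessions: int) -> float:
    """Share of sessions that ended in a transaction, as a percentage."""
    if sessions <= 0:
        raise ValueError("sessions must be positive")
    return 100.0 * transactions / sessions


def revenue_per_transaction(revenue: float, transactions: int) -> float:
    """Average revenue per transaction (illustrative, not shown on the page)."""
    if transactions <= 0:
        raise ValueError("transactions must be positive")
    return revenue / transactions


# Q1 2017 figures from the overview above.
SESSIONS = 196_817
TRANSACTIONS = 2_439
REVENUE = 337_599

rate = conversion_rate(TRANSACTIONS, SESSIONS)
print(f"{rate:.2f}% conversion")  # prints "1.24% conversion"
print(f"${revenue_per_transaction(REVENUE, TRANSACTIONS):.2f} per transaction")
```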


Revenue Trend (Q1 2017)

Channel Performance

Quick Navigation

Executive Summary

AI-Powered Insights

Auto-generated insights • Strategic recommendations • Storytelling

Perfect for: Executive presentations & business acumen

Advanced Analytics

Statistical Maturity

Control charts • Anomaly detection • Distribution analysis

Perfect for: Demonstrating senior analytical thinking

Geographic Analysis

Strategic Insights

Pareto 80/20 • Country drilldowns • Concentration metrics

Perfect for: BI thinking & executive reporting

KPI Dashboard

Interactive Features

Date filters • Channel selector • Real-time updates

Perfect for: Demonstrating dashboard skills
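
For readers unfamiliar with the control charts and anomaly detection mentioned under Advanced Analytics, the following is a minimal sketch of one common approach: flag points outside the mean plus or minus three standard deviations. Whether the portfolio uses exactly this rule is an assumption, and the daily session counts are invented for illustration.

```python
from statistics import mean, pstdev


def control_limits(values, sigmas=3.0):
    """Return (lower, center, upper) limits around the mean of the series."""
    center = mean(values)
    spread = pstdev(values)  # population standard deviation of the series
    return center - sigmas * spread, center, center + sigmas * spread


def flag_anomalies(values, sigmas=3.0):
    """Return the indexes of points that fall outside the control limits."""
    lower, _, upper = control_limits(values, sigmas)
    return [i for i, value in enumerate(values) if value < lower or value > upper]


# Hypothetical daily session counts: a stable baseline plus one spike.
daily_sessions = [2100, 2200] * 10 + [9000]

if __name__ == "__main__":
    lower, center, upper = control_limits(daily_sessions)
    print(f"limits: {lower:.0f} to {upper:.0f}, center {center:.0f}")
    print("anomalous days:", flag_anomalies(daily_sessions))
```

Because the limits are computed from the same series that contains the spike, a large outlier widens them. Production control charts often estimate limits from a clean baseline period instead.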


All Pages

Executive Summary

Automated insights, strategic recommendations, AI-powered analysis

Overview

Business context, KPIs, revenue trends, channel performance

Data Model

Grain definition, column dictionary, modeling notes

ETL & Lineage

Sankey diagram, architecture docs, best practices

Drilldowns

Multi-dimensional analysis by channel, country, device

Data Quality

5 automated checks, outlier detection, gap analysis

SQL Patterns

10 interview patterns, window functions, live queries

Geographic Analysis

Pareto 80/20, country deep dives, concentration metrics

Advanced Analytics

Control charts, anomaly detection, statistical distributions

Marketing Performance

ROI proxy, channel efficiency, strategic recommendations

Attribution & Acquisition

Source/medium mapping, conversion by source, acquisition trends
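
The Pareto 80/20 analysis listed under Geographic Analysis can be sketched as follows: rank countries by revenue and keep adding them until their cumulative share reaches a threshold. The function name, the 0.8 default, and the revenue figures are assumptions for illustration and are not taken from the dataset.

```python
def pareto_countries(revenue_by_country, threshold=0.8):
    """Return the smallest top-ranked set of countries covering `threshold` of revenue."""
    total = sum(revenue_by_country.values())
    if total <= 0:
        return []

    # Rank countries from highest to lowest revenue.
    ranked = sorted(revenue_by_country.items(), key=lambda item: item[1], reverse=True)

    selected = []
    running = 0.0
    for country, revenue in ranked:
        selected.append(country)
        running += revenue
        if running / total >= threshold:
            break
    return selected


# Made-up revenue figures per country.
sample_revenue = {
    "United States": 600.0,
    "Canada": 250.0,
    "Australia": 100.0,
    "Germany": 50.0,
}

if __name__ == "__main__":
    top = pareto_countries(sample_revenue)
    print(f"{len(top)} of {len(sample_revenue)} countries reach 80% of revenue: {top}")
```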


Technology Stack

Data Warehouse

Google BigQuery
Public Dataset

Transformation

dbt-core 1.11
Jinja + YAML

Query Engine

DuckDB
SQL Analytics

Visualization

Evidence.dev
Svelte + Vite

Containerization

Docker
Node.js 18

CI/CD

GitHub Actions
Auto-Deploy

Deployment

Render.com
Free Tier

Environment

Python 3.12
Virtual Env

Version Control

Git
GitHub

AI Integration

OpenAI GPT-4
NL-to-SQL

Backend API

FastAPI
Python

🚀 Full DevOps Pipeline: Git push → Docker build → Auto-deploy → Live in 3 minutes

What This Portfolio Demonstrates

Architecture

Medallion design

Raw → Staging → Marts

Version controlled

Analytics

Interactive dashboards

Dimensional analysis

Business metrics

Engineering

Data quality checks

Automated testing

SQL best practices


About the Author

Nicolas Klein
Data Engineer | Analytics Engineer

📍 Perth, Australia
🌐 Website: karrotskode.com
🔗 Portfolio: nicolasklein.vercel.app
📧 Email: nicolas.klein.g@gmail.com

This portfolio demonstrates end-to-end analytics engineering capabilities using real Google Analytics data, dbt transformations, and Evidence.dev visualizations.