Lead Data Engineer · GCP · dbt · PySpark · Airflow

Building data platforms
that are fast, trusted,
and actually used.

10+ years engineering scalable data platforms across FMCG, Banking, E-commerce, and Agritech. End-to-end — from event-driven ingestion and Medallion Architecture to dashboards CXOs rely on daily.

Dhananjay Hawal
10+
Years engineering data platforms
FMCG · Banking · Agritech
$5K
Monthly infra cost cut
Airbyte + Airflow · GCP
60%
Faster data processing
Automation · Fractal
Tech stack
Cloud & Infrastructure
GCP · BigQuery · Cloud Storage · Cloud SQL · Dataproc · Pub/Sub · Cloud Run · Cloud Functions · Eventarc
Ingestion & Orchestration
Apache Airflow · Cloud Composer · Airbyte · Fivetran · Apache Kafka
Transformation & Big Data
dbt · PySpark · Apache Spark · Hive · HDFS · Impala
Languages
Python · SQL · Scala · R
Databases & Warehouses
BigQuery · Snowflake · PostgreSQL · MySQL · HBase
BI & Observability
Looker Studio · Tableau · Power BI · Metabase · Mixpanel · CleverTap
Engineering projects
01
Weather Data Pipeline on GCP
Event-driven pipeline ingesting weather data for 15 Indian cities. Cloud Composer (Airflow) orchestrates hourly runs; data lands in partitioned & clustered BigQuery tables; raw JSON archived to GCS for audit/backfill. Looker Studio dashboards for trend analysis.
Cloud Composer · BigQuery · GCS · Looker Studio · OpenWeather API · Partitioning
02
Modern Data Stack on GCP — Superstore
Cloud-native ELT pipeline: IP-restricted Cloud SQL → Airbyte CDC ingestion → dbt transformations with full test coverage → Docker-orchestrated Airflow → Tableau dashboards served directly off dbt Gold models.
Cloud SQL · Airbyte CDC · dbt · Airflow · Tableau · Docker
03
End-to-End Medallion Pipeline — Open Source Stack
Production-grade pipeline using Bronze → Silver → Gold architecture. Bronze: raw ingestion into PostgreSQL. Silver: dbt cleaning, PII masking, test coverage. Gold: dimensional models optimised for analytics. Airflow on Docker for orchestration. Metabase on Gold layer.
dbt · PostgreSQL · Airflow · Metabase · Medallion · PII Handling
04
dbt + Snowflake: Call Center Analytics
Multi-layer dbt project (staging → intermediate → marts) on Snowflake. KPIs: conversion rate, AOV, revenue, rolling 7-day trends. Rep × product performance cross-analysis, funnel metrics, SCD2 snapshots, custom macros, and auto-generated dbt docs.
dbt · Snowflake · SCD2 · Macros · Looker Studio · dbt docs
Data stories by DJ
Architecture
ETL vs ELT: Which One Should You Choose?
A field guide with real scenarios, diagrams, and worked examples — Twitter API, Mixpanel events, and dbt.
Product Analytics
Choosing the Right Visualizations for User Behavior Analysis
When to use Insights, Funnels, and Flows in Mixpanel/CleverTap, plus drop-off formulas.
Data Modeling
Star Schema vs Snowflake Schema: A Hands-On Project
Designing and populating STAR and SNOWFLAKE schemas in MySQL/Postgres with Superstore data and SQL scripts.
SQL
Window Functions Demystified
Every frequently used SQL window function — all in one reference query. The only cheat sheet you'll need.
Work experience
Oct 2025 — Present
8 months
Lead Data Engineer
Coforge
Pune, India
  • Delivered event-driven pipelines using Pub/Sub and Cloud Run for real-time, scalable data ingestion on GCP.
  • Built and integrated DAG-based orchestration pipelines combining PySpark and dbt for large-scale batch transformation workloads.
  • Engineered complex JSON processing workflows and implemented Medallion Architecture (Bronze → Silver → Gold) using Spark and dbt.
GCP · Pub/Sub · Cloud Run · PySpark · dbt · Airflow · Medallion
Dec 2022 — Sep 2025
2 years 10 months
Analytics Engineer
Jiva (Agritech)
Pune, India
  • Designed and built a multi-source data integration pipeline using Airbyte + Airflow, cutting monthly GCP billing by $5K.
  • Developed a predictive model for farmer and MC loan defaults, achieving F1 score of 0.90 for non-defaulters.
  • Built satellite-data pipeline for vegetation analysis, surfacing new harvest pattern insights for the product team.
  • Delivered monthly CXO-level insights and strategic recommendations; owned Mixpanel & CleverTap user behaviour analysis to reduce drop-offs in the app funnel.
↓ $5K/mo infra cost · F1 0.90 on loan default model
Python · SQL · Airbyte · dbt · Airflow · GCP · Metabase · Tableau
Apr 2021 — Dec 2022
1 year 9 months
Senior Data Analyst
Tata Consultancy Services
Pune, India · Client: Credit Suisse (IAM)
  • Built SQL-based Tableau dashboards generating 10,000+ views/month for Credit Suisse's Identity & Access Management compliance team.
  • Developed a SQL review automation tool in Python + Streamlit, significantly cutting manual analyst effort.
  • Automated IAM governance processes, reducing errors and access-review turnaround time.
↑ 10K+ dashboard views/month · IAM automation
SQL · Tableau · Python · Streamlit · Impala
Nov 2019 — Apr 2021
1 year 6 months
Assistant Manager, Data Science
Wipro Limited
Pune, India
  • Optimised order management for an international music company using automated anomaly detection in R.
  • Applied outlier detection for product delay identification, improving on-time delivery rates.
SQL · Power BI · R · VBA
Sep 2015 — Oct 2019
4 years 2 months
Consultant, Data Analytics
Fractal Analytics
Mumbai, India · Clients: P&G, Colgate-Palmolive (US, UK, Canada)
  • Led data analytics projects for Fortune 500 FMCG clients — Procter & Gamble and Colgate-Palmolive — across US, UK, and Canada.
  • Developed ETL pipelines and Tableau dashboards; automated data processing with Talend + SQL, cutting turnaround time by 60%.
  • Designed a data modelling + dashboarding solution (Analytics CoE) pitched to 3 clients, directly generating additional revenue.
  • Recognised with 'Great Samaritan' and 'Eureka' awards for impactful cross-team contributions.
↓ 60% turnaround · Revenue-generating CoE pitch · 2 awards
ETL · Talend · SQL · Tableau · Python · VBA
Education & certifications
2025 · In Progress
MS in Data Engineering
Scaler Academy
2021 – 2022
PG Diploma in AI & Machine Learning
University of Hyderabad
84%
2011 – 2015
B.Tech, Mechanical Engineering
Sardar Patel College of Engineering, Mumbai
8.4 / 10 CGPA
Certifications
Industry Credentials
· Big Data Masters Program · Data Science Masters Program · Big Data Fundamentals with PySpark · dbt Fundamentals
Verified LinkedIn recommendations
LinkedIn recommendation from Arnav Pandey, CPO at Jiva Ag
LinkedIn recommendation from Karan Patel, Jiva Ag
Contact
Let's build something
data-driven together.

Open to Lead / Senior Data Engineer, Analytics Engineer, and Data Platform roles.
dhananjay.hawal@gmail.com · +91 8898890243