▸ INTERVIEW SKILLS · before the interview starts

Resume bullet improvements.

Your resume determines which interviews you get. Before-and-after rewrites for data engineers, analytics engineers, ML engineers, and staff-level roles — with annotations on exactly what changed and why.

The formula behind a strong bullet

THE STRUCTURE

Every strong resume bullet has three parts, in this order:

  1. Action verb + what you built/did — specific, past tense, owns the work
  2. The mechanism — what technical approach made it work
  3. The impact — a number, a business outcome, or a scope that signals level
The test: Remove the impact from your bullet. If what's left is still impressive, you have a good bullet. If what's left is generic, the impact was the only interesting part — which means the bullet depends entirely on the number, and numbers get questioned in interviews.
ElementWeak versionStrong version
Action verbWorked on, helped with, assisted, involved inDesigned, built, migrated, reduced, replaced, owned, led, enforced
Specificity"a data pipeline""the Kafka-to-Iceberg ingestion pipeline for real-time event attribution"
Impact"improved performance""reduced P99 query latency from 14s to 900ms"
Scope signalnone"serving 400 internal analysts across 6 product teams"
· · ·

Data engineering bullets

BEFORE AND AFTER
Data Engineer
Before
After
Worked on data pipelines using Python and Airflow to move data between systems.
Redesigned 14 Airflow DAGs from single-threaded Python loops to vectorized Pandas + Spark batch jobs, cutting nightly ETL runtime from 9h to 55min and eliminating 3am on-call pages.
Added: specific count (14 DAGs), technical approach (vectorized + Spark), before/after metric (9h → 55min), business outcome (eliminated pages)
Before
After
Built a real-time data pipeline for streaming events.
Architected a Kafka → Flink → Iceberg streaming pipeline processing 8M events/day with exactly-once semantics and sub-3-minute freshness SLAs, replacing a fragile cron-based batch system that caused daily data gaps.
Added: full stack named (Kafka/Flink/Iceberg), scale (8M/day), technical property (exactly-once), SLA (sub-3-min), and the business context (replacing what, and why it mattered)
Before
After
Helped with data quality monitoring and alerts.
Implemented a data contract enforcement layer on top of dbt — schema tests, null-rate monitors, and row-count SLOs — reducing bad-data incidents surfaced by analytics from 3/week to 0.2/week over 6 months.
Changed: "helped with" → "implemented"; named the approach (contracts, dbt, specific test types); quantified the outcome (incident rate before/after)
Before
After
Worked on migrating our data warehouse to Snowflake.
Led the zero-downtime migration of 400TB data warehouse from Redshift to Snowflake, designing the phased cutover strategy that maintained dual-write for 3 weeks and reduced Redshift costs by $280K/year.
Added: ownership ("led"), scale (400TB), technical approach (dual-write phased cutover), timeline (3 weeks), and cost impact ($280K)
· · ·

Analytics engineering bullets

BEFORE AND AFTER
Analytics Engineer
Before
After
Created dbt models for business reporting.
Built 60+ dbt models forming the company's canonical semantic layer — revenue, retention, and funnel metrics used by 8 product teams and cited in the board deck — reducing metric inconsistency disputes from weekly occurrences to zero.
Added: count (60+ models), what it is (semantic layer), which metrics (revenue/retention/funnel), downstream scope (8 teams, board deck), and the outcome (disputes to zero)
Before
After
Improved data documentation and made it easier for analysts to find data.
Drove company-wide dbt documentation initiative: added column-level descriptions, freshness expectations, and ownership tags to 120 models, cutting analyst time-to-first-query on new datasets from 3 days to 4 hours.
Added: specific scope (column-level, freshness, ownership), count (120 models), and a concrete before/after on analyst productivity (3 days → 4 hours)
· · ·

ML / AI engineering bullets

BEFORE AND AFTER
ML / AI Engineer
Before
After
Built feature engineering pipelines for machine learning models.
Designed and maintained 200+ features in the company's Feast feature store, including 15 real-time features served under 20ms P99 latency, supporting 4 production models with a combined $40M revenue attribution.
Added: count (200+ features, 15 real-time), latency SLA (20ms P99), downstream scope (4 models), and business context ($40M attribution)
Before
After
Worked on training data pipelines for LLM fine-tuning.
Built the deduplication and quality-filtering stage for pre-training data — MinHash LSH deduplication at 2T-token scale, perplexity-based quality filtering, and toxic content classification — producing the 800B-token clean dataset used in three model training runs.
Added: specific techniques (MinHash LSH, perplexity, toxic classification), scale (2T tokens, 800B clean), and downstream use (3 training runs) — shows you know the vocabulary of training data work
Before
After
Improved model performance through better evaluation.
Designed an LLM evaluation harness with 12 domain-specific benchmarks and a human preference annotation pipeline (500 labeled examples/week), increasing model selection precision and catching 3 regressions that had previously shipped to production.
Added: what the harness consisted of (12 benchmarks, annotation pipeline), operational scale (500 examples/week), and a concrete outcome (3 regressions caught)
· · ·

Staff / principal level bullets

BEFORE AND AFTER — ORG-LEVEL IMPACT
Staff / Principal

At staff+ level, bullets need to show organizational impact — not just technical execution. The scope of the work, the number of teams affected, and the lasting change matter as much as the technical approach.

Before
After
Led the data platform team and helped set technical direction.
Defined and drove adoption of the company's Lakehouse architecture across 12 data teams — migrating from 6 fragmented Redshift clusters to a unified Iceberg + Spark platform on S3, reducing compute costs by 62% and enabling cross-team data sharing for the first time.
Added: scope (12 teams), specifics of the problem (6 fragmented clusters), specifics of the solution (Iceberg + Spark on S3), cost impact (62%), and the new capability (cross-team sharing)
Before
After
Mentored junior engineers and improved team processes.
Built the data engineering hiring process from scratch — interview rubric, leveling guide, and take-home design problem — increasing offer acceptance rate from 40% to 74% and growing the team from 4 to 11 in 18 months.
Added: what specifically was built (rubric, leveling, problem), outcome metrics (acceptance rate, headcount growth, timeline)
· · ·

Words to cut from your resume

THE HIT LIST
Cut thisReplace with
Worked onBuilt, designed, implemented, migrated, owned
Helped withName what you specifically did
Assisted inYour role, specifically
UtilizedUsed
LeveragedUsed
SpearheadedLed, initiated, drove
VariousName them. "3" is better than "various"
"Collaborated with teams to…"Skip the collaboration preamble — start with what you did
"Responsible for…"Past tense action verb: Built, Owned, Maintained
Passionate aboutShown by what you built, not stated

Red Flags  ·  Common Mistakes →