Faster to root-cause
No more downloading archives, unzipping, and estimating where the fault might be. It surfaces the exact failing cases, and the sessions behind a complaint, so you inspect the right ones, not a random sample.
Data & Platform Engineering
I think as a Data Engineer
Given an open-ended goal, I find the real problem underneath it, then design and own the system that solves it. Solutions are shaped around how people operate, aiming to make their work easier.
Ingest the data, learn the schemas, build dashboards. A starting point, but the capability teams truly needed was still unclear.
Worked beside the people closest to the data to find where it actually hurt.
Architecture, stack, deployment, roadmap, vision: every call, mine.
Give me a direction, not a spec.
I turn ambiguity into infrastructure, and own every decision in between.
No more downloading archives, unzipping, and estimating where the fault might be. It surfaces the exact failing cases, and the sessions behind a complaint, so you inspect the right ones, not a random sample.
Postgres, MongoDB, Elasticsearch, cloud storage, SharePoint and Jira, all reachable through one interface.
Adopted across engineering, product, and customer-facing teams.
Test results reach the dashboards and the team's channel on their own.
One tool across 5 device families · 20+ modules · shipped solo in ~6 months.
Telemetry that used to be locked away is now a self-serve tool teams open every day. Built solo in about six months.
Hand-bundled vanilla JS with no framework, so there is almost nothing to maintain.
Each feature is walled off behind its own markers, so any one can be removed in a single pass without touching the rest.
If a dependency or module fails to load, the rest of the page keeps working.
Moved hot-path queries off live federation into denormalized ClickHouse tables, so common lookups stay fast.
Disk-backed result cache, pre-warmed on the pipeline's refresh cadence, so we recompute only when the data changes.
Search fan-out splits into chunks and reclassifies heavy vs light at runtime.
On-demand extraction, cross-source search, live analytics, and self-serve sampling, all on one FastAPI backend.
Production .tar / .tar.gz / .zip pulled from cloud storage and unpacked on demand.
Look up a user, org, or device, then stream its sessions from across every source.
Interactive dashboards for data rate, quality tiers, comparisons, and AI-written insights.
Filter, count, and pull every matching archive with one generated command.
Trino federates the sources; a daily, Dockerized batch job lands rollups in ClickHouse for the dashboards.
Image frames · video · radar / IQ signal · run logs · nested JSON blobs.
A pipeline I built so quality reaches the team with no human in the loop.
I start with the problem, not a job title. Solving it well usually means owning the whole thing: the data, the platform, the interface, the deployment, and what it costs to run. Each part sized to what the company needs now, and built to scale later.
Turning raw data into quality metrics teams can track.
Building the pipelines and aggregates that keep the numbers fresh.
Federation, caching, and streaming APIs that make data quick to reach.
Designing schemas and changing them without breaking what already exists.
The app itself: its layout, interactions, and motion.
Self-serve tools that remove the manual bottlenecks.
I do my best work where the scope is unclear and the decisions are mine to make.