10+ years taking Data and GenAI products from ambiguous problem to launch. As Data product lead for Sparky, Walmart's AI shopping assistant (used by ~50% of Walmart app users and a publicly cited driver of ~35% larger orders), I own the evaluation, experimentation, and quality systems that steer the roadmap.
My edge is the combination of hands-on technical depth (LLM evaluation, RAG, observability, experimentation) with the product judgment to weigh customer experience, safety, cost, and scale in one call.
Take customer-facing GenAI products from problem definition and requirements through launch, experimentation, and iteration.
Define the quality KPIs, evaluation standards, and experiments that tell Product and Engineering what to build next.
Grounding, observability, and human-in-the-loop governance so AI systems are measurable, auditable, and safe at consumer scale.
Five live, self-contained apps spanning the AI-product lifecycle: build, evaluate, experiment, monitor, explain. Each runs on synthetic or real public data, so there's nothing proprietary. Just click and explore.
Model-health monitoring across quality, safety, performance, cost & drift, on a SQL-backed pipeline with alerting and PDF/PPTX export.
An LLM-as-a-judge evaluation that scores conversations on a 4-dimension rubric, calibrated against human labels.
Tracks AI recommendation relevance week over week and surfaces the drivers behind any change.
Hypothesis design, randomization, guardrail metrics, and ship / iterate / stop decisioning.
A finance-ops RAG agent over two sources (real SEC EDGAR filings and FP&A planning documents) with grounded, cited answers that refuse when out-of-corpus, plus token-minimization controls and MCP retrieval servers.
Data product lead for Sparky, Walmart's AI shopping assistant. Defined the platform's first standardized quality KPI and its greenfield evaluation standards from zero; own the analytics, experimentation, and measurement strategy that steer the roadmap. Two-time Bravo Award recipient.
Owned the BI and data-reporting infrastructure powering financial budgets, forecasts, and analyses, partnering closely with Finance and Business Operations.
Built strategic frameworks and automated reporting (SQL, R, Tableau) for mobile-studio workforce planning, translating leadership questions into decision-ready analytics.
Led a self-service analytics platform, managing a team of 6–7 and driving adoption across US and global partner firms. Innovation Challenge Winner (selected from 218 submissions).