BRIEF Index
An open benchmark for weighted, multi-constraint instruction following on enterprise knowledge-worker tasks, measuring what single-instruction leaderboards miss. 25 models, 150 tasks, live leaderboard and methodology.
VP, Implementation Lead at Citibank, guiding enterprise merchants through Spring by Citi payment gateway integrations across North America. Former PM at Citibank, Capital One, Bank of America, and Wolters Kluwer. Builder of AI benchmarks, analytics platforms, and agent systems.
Applied AI research and product builds outside the day job, evaluating how frontier and open-source models perform on real enterprise work.
Post-sale technical lead for Spring by Citi's global eCommerce payment gateway, guiding enterprise clients from discovery and solutioning through integration, certification, and go-live.
Owned documentation compliance and release validation for a consumer investments portfolio spanning digital onboarding and physical customer touchpoints.
Led cloud migration and international expansion of a legacy platform, defining integration architecture and region-specific solution design for regulated markets.
Architected B2B2C integration platforms and partnership solutions for CitiPay, connecting APIs, merchant onboarding, and digital wallet strategy to measurable adoption.
Shipped AI/ML-powered fraud detection and third-party marketplace integrations in an API-first payments environment.
Delivered fintech and ecommerce applications for multiple clients, combining agile delivery with low-code tooling to accelerate solution deployment.
Analytics for TikTok Live leaderboards, built for my creator agency. Tracks rankings and performance over time so creators and operators can make decisions from data instead of vibes.
Hermes Agent deployment with a Telegram gateway, running in a cloud workspace as a daily driver assistant.
Retrieval pipeline with pgvector and the Claude API for synthesizing insights from market research reports.
Python backtesting infrastructure on a full year of 1-minute futures data with honest out-of-sample validation.
I'm looking for solutions engineering, forward-deployed, product management, and enablement roles at AI companies, where enterprise integration experience and hands-on AI building meet.