Data & Infrastructure Engineer
I work with robust monitoring platforms, automate infrastructure, and solve complex technical challenges to improve system reliability and reduce operational overhead.
With over 13 years of experience in systems engineering and applications development, I've built my career at leading financial technology companies including Gemini, Two Sigma Investments, Jane Street, and Highbridge Capital Management.
My work focuses on building robust monitoring platforms, automating infrastructure operations, and developing data-driven solutions. I specialize in creating full-stack monitoring systems, migrating legacy infrastructure to modern cloud platforms, and building automation tools that reduce manual work and improve system reliability. I've worked extensively with Python for automation and data analysis, cloud infrastructure (AWS, Kubernetes), and monitoring tools (Datadog, Airflow) to solve complex technical challenges and improve operational efficiency.
An ETL pipeline that scrapes trending GitHub repositories, processes the data, stores it in a database, and visualizes it in a Streamlit app. The pipeline uses Apache Airflow for automation, running hourly to capture snapshots of trending repos over time. Features include historical tracking, time-series analysis, and interactive visualizations.
A full-stack web application that aggregates free and discounted events across New York City from over a dozen sources — including Lincoln Center, Screen Slate, Do NYC, and Club Free Time. A Python/FastAPI backend runs scheduled scrapers every few hours, normalizes event data into SQLite, and enriches events with venue details and images via deep-scrape jobs. A custom ML scoring engine ranks events using BERT semantic embeddings, source reputation weights, social proof indicators, spam penalties, and freshness to surface the best events first. The home feed features curated carousel sections including a personalized "For You" feed driven by each user's taste profile. Users can save events, build filter presets, and export to Google Calendar or .ics. Includes an analytics dashboard with per-source health metrics, scrape run history, and scoring heatmaps.