GPU fleet infrastructure (500k+ NVIDIA H200/GB200) and LLM inference engine for Copilot. Cluster orchestration, GDPR-compliant prompt routing, sub-ms tail latency. Microsoft Certified: Azure AI Engineer Associate.
Experience
Optimized phishing detection algorithms, reducing AWS server costs by 75% ($3M in savings). Refactored Go microservices and Terraform infrastructure.
Real-time video ML pipelines for human movement recognition. Company advised by Yoshua Bengio, acquired by Qualcomm.
Seeds for the Future fellow. Sponsored research fellowship to study telecom infrastructure in Shenzhen and Beijing.
Eight-month internship optimizing delivery speed promises across Amazon's EU fulfillment network. Modeled product placement and routing across fulfillment centers, distribution nodes, and last-mile handoffs to minimize delivery time, directly impacting the 1-day and 2-day delivery promises that drive conversion.
Built real-time ETF data visualization tools using D3.js and Python, supporting the launch of new financial products.
Replaced Excel workflows with scikit-learn and Keras models for APR price optimization on SMB transactional data.