GPU cluster management (500k+ NVIDIA H200/GB200). LLM inference engines for Copilot. ML platform infrastructure at petabyte scale.
Diego Iruretagoyena
Software Engineer · Microsoft Azure High Performance Computing · diegoiru@stanford.edu / diiruo@gmail.com
I care deeply about creating products that make people's lives better. I'm passionate about machine learning, engineering, and societal wellbeing.
Experience
AI chip design (Astrus) and healthtech marketplace (Gotta). 50+ customer interviews, 3 government grants.
Cybersecurity cloud infra. Distributed Go microservices, Terraform, AWS cost optimization (−75%).
Deep learning video processing for human movement recognition. Company advised by Yoshua Bengio, acquired by Qualcomm.
Supply chain optimization algorithms for the EU logistics network. Zero-carbon routing engines.
ETF visualization engine (D3.js, Python). Interactive financial dashboards for client conversion.
Education
AI Graduate Certificate (SCPD). CS231N Deep Learning for Computer Vision, CS229 Machine Learning, CS230 Deep Learning. Corporate sponsorship by Microsoft.
ML Research Scholar. Deep learning on ConvGANs with Alex Dimakis. First undergrad awarded the AI Research Scholarship.
B.Sc. Computer Science, Minor in Statistics. Chapter author, 'ChatGPT und andere Quatschmaschinen' (2024).
Projects
Nahvi — Private LLM Platform
Self-hosted LLM inference with RAG on Azure GPUs. DeepSeek + Ollama + Qdrant + FastAPI. Multi-tenant API for privacy-sensitive industries.
Health — AI Health Tracking Platform
Full-stack health platform with blood work analysis, wearable sync (Apple Health, Garmin, WHOOP, Oura), and GPT-4 insights. Next.js 15 + Expo.
LLM Inference at Scale
Aggregate-merge inference engine processing millions of prompts/hour with <1ms latency and GDPR compliance.
Deep Learning for Human Movement
Real-time video recognition pipelines for an AI health coach. PyTorch, computer vision, temporal modeling.
ConvGAN Research
Deep learning research on Convolutional GANs (PyTorch/TensorFlow). First undergrad AI Research Scholarship.
EEG-Based Recommender System
Recommender system driven by EEG brain signals. Signal processing, feature extraction, neural data classification.
LLM Fine-Tuning for Scientific Code Generation
Fine-tuning language models to generate Python code for scientific computing — simulations, numerical methods, data visualization.
Neural Networks & Reinforcement Learning
Implementations of neural network architectures and RL algorithms from scratch.
Drone Mapping & UAV Engineering
Aerial mapping of two municipalities with DJI fleets. Custom-built UAVs for experimental flight.
Founding & Ventures
Interests
Outside of engineering, I'm drawn to drone building and aerial mapping, 3D printing, and exploring the physical world. I believe in building things that bridge software and the real world.