Duy Nguyen
Seattle University Seal

About Me

I'm Duy Nguyen, an MS Data Science student at Seattle University (GPA 4.0 — College of Science and Engineering Dean's Graduate Student Honor Roll, Winter 2026). My focus is building data systems that researchers can actually use: pipelines that are auditable, databases that reproduce the original analysis exactly, and infrastructure that outlasts the person who built it.

At Seattle University, I hold two research positions. With Dr. Brian Fischer (Mathematics), I designed a 7-table normalized MySQL database for barn owl auditory neuroscience by ingesting data from two researchers across 110 neurons, 8 owls, and ~261 experiments stored as 14,000+ files in six proprietary formats with no consistent structure. I identified 8 data quality problems, built a 6-phase ETL pipeline to resolve them, and implemented 4 separate loaders to handle the completely different internal .mat file formats each researcher used. The fitting methods alone cover two-sided asymmetric Gaussians, rate-level sigmoids with 5 physiological parameters, Akima spline interpolation, and SVD for response separability. The end result: a 40-line analysis loop that previously required 228 file loads now runs as a single SQL query. With Dr. Wenjing Yang, I'm investigating a quieter problem in medical AI: most vision RAG pipelines for clinical imaging skip measuring whether the retrieval step actually works. I'm trying to quantify that gap on mammography data and understand what it takes to fix it.

I'm looking for Summer 2026 data science or AI/ML engineering internships where the work has clear stakes and the feedback is real.

660K+ Users Served MOSAIC Immigration Chatbot
$30.4M Projected Savings UC Berkeley Capstone
95.9% Variance Explained NASA Flight Analysis

Technical Skills

Languages & Tools

Python
SQL
R

ML & Deep Learning

PyTorch
scikit-learn
TensorFlow

Specialties

Causal Inference A/B Testing Statistical Modeling NLP RAG Systems Knowledge Graphs Transfer Learning Experimental Design

Other Projects

Garbage Classification - Deep Learning

94% accuracy, 100% minority class recall. ResNet34 transfer learning with live demo.

Duy Integral Theorem - ML Theory

Novel mathematical framework for understanding generalization in neural networks.

SFU Faisal Lab - Medical RAG

RAG system translating natural language to JSON for CT/MRI scan retrieval.

AI Agent - ML-Business Alignment

Agent bridging ML teams and business stakeholders for strategic alignment.

Algorithm Learning Tool

Interactive visualization for mastering tree algorithms with AI feedback.

Programmatic Business Card

Print-ready business cards built with HTML/CSS/JS. Dynamic QR codes, professional design.

Contact

Email: dcnguyen060899@gmail.com

LinkedIn: https://www.linkedin.com/in/duwe-ng/

GitHub: https://github.com/dcnguyen060899

Resume: https://duyng-portfolio.com/docs/index_resume.html

Drag to resize ⇘
Portfolio AI Assistant Click to toggle • Drag to move