I'm an engineer specializing in AI / ML performance, compilers, optimization, and quantization.
I co-created and maintain AI Hub Models and AI Hub Apps - open-source projects providing 175+ optimized ML models and sample applications for AI deployment on the edge.
For inquiries or to otherwise get in contact, shoot me an email at [email protected]. Thanks!
- Co-created AI Hub Models and AI Hub Apps, the largest collection of edge-deployable models on Huggingface with fully open source recipes on GitHub
- Managed model contributions from 16+ engineers; developers can optimize, validate, and deploy AI models on Qualcomm-powered devices within minutes
- Built an ML benchmarking suite that drove improvements in graph compilation coverage, latency, and accuracy across all Qualcomm software and hardware
- Drove performance improvements by identifying and resolving root causes of performance regressions, resulting in significant latency reductions across many graphs
- Member of founding team focused on Neural Network Optimization for Mobile and Edge
- Made core contributions to the Qualcomm AI Hub edge neural network compiler
- Designed core services and developer-facing APIs that power the Qualcomm AI Hub user experience today
- Worked with customers including Microsoft, Verizon, Roblox, and Sonos on latency-sensitive real-time video, image, and speech applications
- Helped customers deploy models with 2-10x improvements in latency, battery life, and memory in real workloads
- Built a compiler and runtime for blazing-fast model inference and training on the edge
- Core contributor to MIL, an intermediate representation similar to MLIR for ML models; designed a general ML opset
- Designed & implemented a language serializer / parser to translate to/from a readable C-like format
- Enhanced lowering code for hardware compiler IRs, implementing optimizations for significant performance gains
- Helped design software that breaks a model into pieces that run on optimal hardware backends
University of WashingtonComputer Engineering2016 - 2019
| Apple |
Applied ML Research |
Machine Learning Engineer Intern, Vision |
2019 |
| Apple |
Applied ML Research |
Software / Machine Learning Engineer Intern, Vision |
2018 |
| Google |
Cloud Billing |
Software Engineer Intern |
2017 |
| Microsoft |
SQL Server |
Software Engineer Intern |
2016 |
| Blue Origin |
Avionics |
Software Engineer Intern |
2015 |
| Boeing |
P8 Program |
Software Engineer Intern |
2014 |
| SeaTec Consulting |
Airborne Wifi |
Lead Software Engineer Intern |
2014 |
| CSE 401 |
Intro to Compiler Construction |
Abstract Syntax Trees, Finite State Machines, Grammars, x86 Assembly, IRs, etc. |
Fall 2019 |
| CSE 333 |
Systems Programming |
C and C++ Intro, Berkeley Networking API |
Spring 2019 |
| CSE 351 |
The Hardware/Software Interface |
x86 Assembly, Computer Architecture, Caching, Virtual Memory, Memory Allocation |
Autumn 2018, Winter 2019 |