I am a passionate AI student and engineer focused on deep multimodal models, generative frameworks, and robotics intelligence.
My experience spans implementing and training deep models, researching how they work under the hood, and integrating them into pipelines.
Please feel free to view my recent work below.
AI research and development for agents, NLP, and robotics. Based in Portland, Oregon.
Exploring the use of diffusion models to generate priors for Thompson Sampling in the non-stationary multi-armed bandit setting.
Investigating an automated method for detecting action divergence during inference and prompting human redirection to capture finetuning data.
We provide a mathematical framework for reward poisoning attacks on stochastic dueling bandit algorithms, proving that effective spoofing is achievable with sublinear cost bounds while forcing linear regret.
Paper available on OSU Archives; code available on GitHub.
Thompson Sampling with Diffusion Priors for Non-Stationary Bandits
Automated Finetuning of Robot Policies Post Deployment
Attacks on Stochastic Dueling Bandits
Research
Bandits, Robotics, Diffusion
Discrete Diffusion and Auto-Regressive Language Model
Denoising Diffusion and Flow Matching Image Generation
Math Machine Translation
A from-scratch implementation of both traditional auto-regressive and discrete diffusion generative transformer-based language models.
Trained on OpenWebText using a DGX system.
Open-source code available on GitHub.
A from-scratch implementation of denoising diffusion and flow matching image generative models (CNN U-Net and latent vision transformer).
Trained on Stanford Cars, CelebA, and ImageNet using a DGX system.
Open-source code available on GitHub.
Implementation of a novel seq2seq transformer model that translates math word problems into computational graphs.
Trained on the MathQA dataset using a DGX system.
Open-source code and report available on GitHub.
Projects
NLP, Generation, Translation
Designed and deployed a video conferencing application enabling real-time VR embodiment of robots for teleoperation. Implemented with WebXR, GPU-accelerated encoding, and an SFU server.
Used the Hugging Face LeRobot framework for teleoperation, 100+ episode dataset recording, and compression.
Employed MLOps for automated multi-day training of transformer-based robot policies on a DGX system using the collected datasets.
Teaching assistant for Programming Language Fundamentals, covering Haskell and Prolog.
Learning assistant for Operating Systems I, Data Structures, and Gen Engineering.
REVOBOTS AI Intern
Graduate Teaching Assistant
Undergraduate Learning Assistant
Experience
Industry, Educational