Josh Negreanu
Portland, Oregon
SOCIALS
GITHUB LINKEDIN RESUME
I am a passionate AI student and engineer focused on deep multimodal models, generative frameworks, and robotics intelligence.

My experience spans deep model implementation and training, under-the-hood knowledge and research, as well as pipeline integration.


Please feel free to view my recent work below.
AI research and development for agents, NLP and robotics based in Portland, Oregon.
Exploring the usage of diffusion models to generate priors during Thompson Sampling in the non-stationary multi-armed bandit setting.


Investigating an automated method of detecting action divergence during inference, and prompting human redirection to capture finetuning data.


We provide a mathematical framework for reward poisoning on stochastic dueling bandits algorithms, proving effective spoofing with sublinear cost bounds and forced linear regret. Paper available on OSU Archives, code available on Github.
Thompson Sampling with Diffusion Priors for Non-Stationary Bandits


Automated Finetuning of Robot Policies Post Deployment



Attacks on Stochastic Dueling Bandits
Research
Bandits, Robotics, Diffusion
Discrete Diffusion and Auto-Regressive Language Model





Denoising Diffusion and Flow Matching Image Generation






Math Machine Translation
A from-scratch implementation of both traditional auto-regressive and discrete diffusion generative transformer-based language models. Trained on openwebtext utilizing DGX system. Open-source code available on Github.


A from-scratch implementation of denoising diffusion and flow matching image generative models (CNN U-Net and latent vision transformer). Trained on Stanford Cars, CelebA, ImageNet utilizing DGX system. Open-source code available on Github.


Implementation of novel seq2seq transformer model for math word problem to computational graph construction. Trained on MathQA dataset utilizing DGX system. Open-source code and report available on Github.
Projects
NLP, Generation, Translation
Designed and deployed video conferencing application allowing real-time VR embodiment of robots for teleoperation. Implemented with WebXR, GPU accelerated encoding, SFU server.

Utilized Hugging Face LeRobot framework for teleoperation, 100+ episode dataset recording, compression.

Employed MLOps for automated multi-day training of transformer-based robot policies on DGX system using collected datasets.


Teaching assistant for Programming Language Fundamentals, covering Haskell and Prolog.


Learning assistant for Operating Systems I, Data Structures, and Gen Engineering.
REVOBOTS AI Intern













Graduate Teaching Assistant



Undergraduate Learning Assistant
Experience
Industry, Educational