Josh Negreau / Portfolio

I am a passionate AI student and engineer focused on deep multimodal models, generative frameworks, and robotics intelligence.

My experience spans deep model implementation and training, under-the-hood knowledge and research, as well as pipeline integration.

Please feel free to view my recent work below.

AI research and development for agents, NLP and robotics based in Portland, Oregon.

Exploring the usage of diffusion models to generate priors during Thompson Sampling in the non-stationary multi-armed bandit setting.

Investigating an automated method of detecting action divergence during inference, and prompting human redirection to capture finetuning data.

We provide a mathematical framework for reward poisoning on stochastic dueling bandits algorithms, proving effective spoofing with sublinear cost bounds and forced linear regret. Paper available on OSU Archives, code available on Github.

Thompson Sampling with Diffusion Priors for Non-Stationary Bandits

Automated Finetuning of Robot Policies Post Deployment

Attacks on Stochastic Dueling Bandits

Research
Bandits, Robotics, Diffusion

Discrete Diffusion and Auto-Regressive Language Model

Denoising Diffusion and Flow Matching Image Generation

Math Machine Translation

A from-scratch implementation of both traditional auto-regressive and discrete diffusion generative transformer-based language models. Trained on openwebtext utilizing DGX system. Open-source code available on Github.

A from-scratch implementation of denoising diffusion and flow matching image generative models (CNN U-Net and latent vision transformer). Trained on Stanford Cars, CelebA, ImageNet utilizing DGX system. Open-source code available on Github.

Implementation of novel seq2seq transformer model for math word problem to computational graph construction. Trained on MathQA dataset utilizing DGX system. Open-source code and report available on Github.

Projects
NLP, Generation, Translation

Designed and deployed video conferencing application allowing real-time VR embodiment of robots for teleoperation. Implemented with WebXR, GPU accelerated encoding, SFU server.

Utilized Hugging Face LeRobot framework for teleoperation, 100+ episode dataset recording, compression.

Employed MLOps for automated multi-day training of transformer-based robot policies on DGX system using collected datasets.

Teaching assistant for Programming Language Fundamentals, covering Haskell and Prolog.

Learning assistant for Operating Systems I, Data Structures, and Gen Engineering.

REVOBOTS AI Intern

Graduate Teaching Assistant

Undergraduate Learning Assistant

Experience
Industry, Educational