
Drive innovation in multi-modal reinforcement learning to advance intelligent systems. Develop and scale reinforcement learning techniques within complex multi-modal architectures. Conduct research on reinforcement learning algorithms for multimodal models, including diffusion-based approaches for image and autoregressive models for multimodal understanding. Design and build reinforcement learning infrastructure that supports scalable, distributed training across multimodal systems. Develop and refine reward modeling strategies that improve training stability and mitigate reward hacking. Create and curate multimodal simulation environments and datasets to support robust training, evaluation, and benchmarking of reinforcement learning systems. Publish research findings in top-tier conferences.