summary.md
ML Research Engineer specializing in agent post-training and multi-agent reasoning systems. 5+ years leading production LLM training—GRPO, DPO, LoRA fine-tuning on distributed infrastructure—with deployment across AWS Bedrock, Modal, and vLLM. Designed multi-agent architectures that learn from both process and outcome rewards, including debate-driven systems for structured reasoning. Research published on AI pluralism and multi-agent alignment. M.S. Machine Learning, Columbia.