Cross-Cutting Themes

Publications

Hejna, Joey, Rafailov, Rafael, Sikchi, Harshit, Finn, Chelsea, Niekum, Scott, Knox, W. Bradley, and Sadigh, Dorsa. 2024. “Contrastive Preference Learning: Learning from Human Feedback without RL.” https://doi.org/10.48550/ARXIV.2310.13639

Jensen, Jared and Murthy, Dhiraj. 2025. “Communicating for collaboration in AI development teams.”

Jensen, Jared and Murthy, Dhiraj. 2025. “What is being reimagined? Creativity, aura, and generative AI as the automation of remix.”

Jensen, Jared, Murthy, Dhiraj, and Baker, Samuel. 2025. “Automating remix: Generative AI, creative labor, and the decay of aura.”

Muslimani, Calarina, Chandramouli, Suyog, Booth, Serena, Knox, W. Bradley, and Taylor, Matthew E. 2024. “Analyzing Reward Functions via Trajectory Alignment.” https://openreview.net/pdf?id=Shnso8m57C

Muslimani, Calarina, Johnstonbaugh, Kerrick, Chandramouli, Suyog, Booth, Serena, Knox, W. Bradley, and Taylor, Matthew E. 2025. “Towards Improving Reward Design in RL: A Reward Alignment Metric for RL Practitioners.” https://doi.org/10.48550/arXiv.2503.05996

Rafailov, Rafael, Chittepu, Yaswanth, Park, Ryan, Sikchi, Harshit, Hejna, Joey, Knox, W. Bradley, Finn, Chelsea, and Niekum, Scott. 2024. “Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms.” https://doi.org/10.48550/ARXIV.2406.02900

Zhang, Michael J. Q., Knox, W. Bradley, and Choi, Eunsol. 2025. “Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions.” https://doi.org/10.48550/arXiv.2410.13788