Publications
Hejna, Joey, Rafailov, Rafael, Sikchi, Harshit, Finn, Chelsea, Niekum, Scott, Knox, W. Bradley, and Sadigh, Dorsa. 2024. “Contrastive Preference Learning: Learning from Human Feedback without RL.” https://doi.org/10.48550/ARXIV.2310.13639
Jensen, Jared and Murthy, Dhiraj. 2025. “Communicating for collaboration in AI development teams.”
Jensen, Jared and Murthy, Dhiraj. 2025. “What is being reimagined? Creativity, aura, and generative AI as the automation of remix.”
Jensen, Jared, Murthy, Dhiraj, and Baker, Samuel. 2025. “Automating remix: Generative AI, creative labor, and the decay of aura.”
Muslimani, Calarina, Chandramouli, Suyog, Booth, Serena, Knox, W. Bradley, and Taylor, Matthew E. 2024. “Analyzing Reward Functions via Trajectory Alignment.” https://openreview.net/pdf?id=Shnso8m57C
Muslimani, Calarina, Johnstonbaugh, Kerrick, Chandramouli, Suyog, Booth, Serena, Knox, W. Bradley, and Taylor, Matthew E. 2025. “Towards Improving Reward Design in RL: A Reward Alignment Metric for RL Practitioners.” https://doi.org/10.48550/arXiv.2503.05996
Rafailov, Rafael, Chittepu, Yaswanth, Park, Ryan, Sikchi, Harshit, Hejna, Joey, Knox, Bradley, Finn, Chelsea, and Niekum, Scott. 2024. “Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms.” https://doi.org/10.48550/ARXIV.2406.02900
Zhang, Michael J. Q., Knox, W. Bradley, and Choi, Eunsol. 2025. “Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions.” https://doi.org/10.48550/arXiv.2410.13788