Publications

Cross-Cutting Themes

Hejna, J., Rafailov, R., Sikchi, H., Finn, C., Niekum, S., Knox, W. B., & Sadigh, D. (2024). Contrastive Preference Learning: Learning from Human Feedback without RL. In The Twelfth International Conference on Learning Representations. https://doi.org/10.48550/ARXIV.2310.13639

Jensen, J. T., Murthy, D., & Baker, S. (2025). Automating remix: generative AI, creative labor, and the decay of aura. Information, Communication & Society, 1-17. https://doi.org/10.1080/1369118X.2025.2609779

Muslimani, C., Chandramouli, S., Booth, S., Knox, W. B., & Taylor, M. E. (2024). Analyzing Reward Functions via Trajectory Alignment. In NeurIPS 2024 Workshop on Behavioral Machine Learning. https://openreview.net/pdf?id=Shnso8m57C

Muslimani, C., Johnstonbaugh, K., Chandramouli, S., Booth, S., Knox, W. B., & Taylor, M. E. (2025). Towards Improving Reward Design in RL: A Reward Alignment Metric for RL Practitioners. In Reinforcement Learning Conference (RLC). https://rlj.cs.umass.edu/2025/papers/Paper280.html

Rafailov, R., Chittepu, Y., Park, R., Sikchi, H., Hejna, J., Knox, W. B., Finn, C., & Niekum, S. (2024). Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms. In NeurIPS 2024 Workshop on Behavioral Machine Learning. https://doi.org/10.48550/ARXIV.2406.02900

Zhang, M. J. Q., Knox, W. B., & Choi, E. (2025). Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions. In The Thirteenth International Conference on Learning Representations. https://doi.org/10.48550/arXiv.2410.13788