Constructive AI Alignment

Developing a systems-level alignment agenda that treats humans and AI as co-adapting agents.

This research agenda argues for alignment methods that account for interaction, adaptation, and feedback between humans and AI systems rather than treating behavior as static or one-shot. It is a long-running project I’ve been developing with Max Kanwal since our UC Berkeley days.

  1. AAAI Workshop Constructive Alignment
    Max Kanwal* and Caryn Tran*
    In AAAI-26 Workshop on Machine Ethics: from formal methods to emergent machine ethics, Singapore, Jan 2026
  2. AAAI Workshop Bounded Morality
    Max Kanwal*, Caryn Tran*, and Patrick Mineault
    In AAAI-26 Workshop on Machine Ethics: from formal methods to emergent machine ethics, Singapore, Jan 2026