Skip to content

alignment

Training or evaluation work focused on making model behavior better match intended goals, policies, preferences, or safety constraints.

Loading posts…