alignment
Training or evaluation work focused on making model behavior better match intended goals, policies, preferences, or safety constraints.
Loading postsā¦
Training or evaluation work focused on making model behavior better match intended goals, policies, preferences, or safety constraints.