Alignment pretraining: AI discourse creates self-fulfilling (mis)alignment

by anigbrowl | View on Hacker News