6 items with this tag.

12/17/2022: Positive Values Seem More Robust and Lasting than Prohibitions
Tags: shard theory, human values, AI

11/29/2022: Alignment Allows “Non-Robust” Decision-Influences and Doesn’t Require Robust Grading
Tags: shard theory, human values, AI

9/9/2022: Understanding and Avoiding Value Drift
Tags: human values, shard theory, rationality, AI

9/4/2022: The Shard Theory of Human Values
Tags: understanding the world, shard theory, human values, rationality, AI

7/14/2022: Humans Provide an Untapped Wealth of Evidence About Alignment
Tags: shard theory, human values, AI

7/7/2022: Human Values & Biases Are Inaccessible to the Genome
Tags: understanding the world, shard theory, human values