Alignment Research Center

[1] Established by former OpenAI researcher Paul Christiano, ARC focuses on recognizing and comprehending the potentially harmful capabilities of present-day AI models.

[2][3] ARC's mission is to ensure that powerful machine learning systems of the future are designed and developed safely and for the benefit of humanity.

It was founded in April 2021 by Paul Christiano and other researchers focused on the theoretical challenges of AI alignment.

"[9] In March 2023, OpenAI asked the ARC to test GPT-4 to assess the model's ability to exhibit power-seeking behavior.

[10] ARC evaluated GPT-4's ability to strategize, reproduce itself, gather resources, stay concealed within a server, and execute phishing operations.