Governance frameworks should address the prospect of AI systems that cannot

In this Policy Forum, Michael Cohen and colleagues highlight the unique risks presented by a particular class of artificial intelligence (AI) systems: reinforcement learning (RL) agents that plan more effectively than humans over long horizons. “Giving [such] an advanced AI system the objective to maximize its reward and, at some point, withholding reward from it, strongly incentivizes the AI system to take humans out of the loop,” write Cohen and colleagues. This incentive also arises for long-term planning agents (LTPAs) more generally, say the authors, and in ways empirical testing is unlikely to cover. It is thus critical to address extinction risk from these systems, say Cohen et al., and this will require new forms of government intervention. Although governments have expressed some concern about existential risks from AI and taken promising first steps in the U.S. and U.K, in particular, regulatory proposals to date do not adequately address this particular class of risk – losing control of advanced LTPAs. Even empirical safety testing – the prevailing regulatory approach for AI – is likely to be either dangerous or uninformative, for a sufficiently capable LTPA, say the authors. Accordingly, Cohen and colleagues propose that developers not be permitted to build sufficiently capable LTPAs, and that the resources required to build them be subject to stringent controls. When it comes to determining how capable is “sufficiently capable,” for an LTPA, the authors offer insight to guide regulators and policymakers. They note they do not believe that existing AI systems exhibit existentially dangerous capabilities, nor do they exhibit several of the capabilities mentioned in President Biden’s recent executive order on AI, “and it is very difficult to predict when they could.” The authors note that although their proposal for governing LTPAs fills an important gap, “further institutional mechanisms will likely be needed to mitigate the risks posed by advanced artificial agents.”

Journal

Science

DOI

10.1126/science.adl0625

Article Title

Regulating advanced artificial agents

Article Publication Date

5-Apr-2024

Governance frameworks should address the prospect of AI systems that cannot be safely tested

Sylvester physician co-authors global plan to combat prostate cancer

Cells engineered to produce immune-boosting amino acids in prizewinning research

Related Posts

Comprehensive Global Analysis: Merging Finance, Technology, and Governance Essential for Just Climate Action

Revolutionary Genetic Technology Emerges to Combat Antibiotic Resistance

Nanophotonic Two-Color Solitons Enable Two-Cycle Pulses

Insilico Medicine Welcomes Dr. Halle Zhang as New Vice President of Clinical Development for Oncology

Novel Gene Editing Technique Targets Tumors Overloaded with Oncogenes

New Study Uncovers Microscopic Sources of Surface Noise Affecting Diamond Quantum Sensors

Cells engineered to produce immune-boosting amino acids in prizewinning research

Mothers who receive childcare support from maternal grandparents show more parental warmth, finds NTU Singapore study

University of Seville Breaks 120-Year-Old Mystery, Revises a Key Einstein Concept

Bee body mass, pathogens and local climate influence heat tolerance

Researchers record first-ever images and data of a shark experiencing a boat strike

Groundbreaking Clinical Trial Reveals Lubiprostone Enhances Kidney Function

RECENT NEWS

Categories

Subscribe to Blog via Email

Welcome Back!

Retrieve your password

Governance frameworks should address the prospect of AI systems that cannot be safely tested

Journal

DOI

Article Title

Article Publication Date

Sylvester physician co-authors global plan to combat prostate cancer

Cells engineered to produce immune-boosting amino acids in prizewinning research

Related Posts

RECENT NEWS

Categories

Subscribe to Blog via Email

Welcome Back!

Retrieve your password

Discover more from Science