Resource Library

Curated collection of articles, videos, podcasts, books, and papers about AGI, AI safety, and alignment.

The Singularity Is Near

2005

by Ray Kurzweil

A landmark book exploring the anticipated merging of humans and machines and the exponential growth of technology leading to the Singularity.

AGI Foundations

book

Superintelligence: Paths, Dangers, Strategies

2014

by Nick Bostrom

A foundational work examining the potential paths to machine superintelligence and the existential risks it poses.

AGI Foundations

book

AGI Safety Fundamentals

2023

by AI Safety Fundamentals Team

Comprehensive introduction to AGI safety concepts and core readings for newcomers to the field.

AGI Foundations

article

Concrete Problems in AI Safety

2016

by Amodei et al.

Identifies practical safety problems that need to be solved before deploying AI systems in the real world.

AI Safety

paper

AI Alignment Podcast

2023

by Lucas Perry

Weekly podcast featuring conversations with researchers working on AI alignment and existential risk.

AI Safety

podcast

Risks from Learned Optimization

2019

by Hubinger et al.

Explores the dangers of mesa-optimization and inner alignment problems in machine learning systems.

AI Safety

paper

AI Safety Explained

2023

by Robert Miles

Accessible video series explaining AI safety concepts, alignment problems, and existential risk.

AI Safety

video

The Ethics of Artificial Intelligence

2014

by Nick Bostrom & Eliezer Yudkowsky

Philosophical examination of the ethical considerations surrounding artificial intelligence development.

Philosophy

article

Consciousness and Artificial Intelligence

2010

by David Chalmers

Explores the hard problem of consciousness in relation to artificial intelligence and machine sentience.

Philosophy

article

Human Compatible: AI and the Problem of Control

2019

by Stuart Russell

Proposes a new framework for building AI systems aligned with human values and preferences.

Philosophy

book

Attention Is All You Need

2017

by Vaswani et al.

The groundbreaking paper introducing the Transformer architecture that revolutionized deep learning.

Technical

paper

Deep Learning

2016

by Ian Goodfellow, Yoshua Bengio, Aaron Courville

Comprehensive textbook covering the mathematical foundations and practical implementations of deep learning.

Technical

book

Scaling Laws for Neural Language Models

2020

by Kaplan et al.

Empirical study of how language model performance scales with model size, dataset size, and compute.

Technical

paper

The Malicious Use of Artificial Intelligence

2018

by Brundage et al.

Forecasts potential security threats from malicious uses of AI and proposes policy responses.

Policy & Governance

paper

Artificial Intelligence and International Security

2018

by Michael Horowitz

Analyzes the strategic implications of AI for national security and international relations.

Policy & Governance

article

AI Governance Podcast

2023

by GovAI

Discussions with experts on AI governance, policy, and strategy for managing transformative AI.

Policy & Governance

podcast

The Alignment Problem

2020

by Brian Christian

Explores how to align machine values with human values as AI systems become more powerful.

Alignment

book

RLHF: Reinforcement Learning from Human Feedback

2017

by Christiano et al.

Introduces reinforcement learning from human feedback, a key technique for AI alignment.

Alignment

paper

Eliciting Latent Knowledge

2021

by ARC Theory Team

Proposes methods for extracting truthful information from AI systems even when they have incentives to deceive.

Alignment

article

AI Alignment Forum

2023

by LessWrong Community

Online forum for technical AI alignment research discussion and collaboration.

Alignment

article
