Beyond ARC-AGI: GAIA and the search for a real intelligence benchmark
GUEST: Intelligence is pervasive, yet its measurement seems subjective. At best, we approximate its measure through tests and benchmarks. Think of college entrance exams: Every year, countless student...