ARC-AGI - N | Robots Science Gamedev

Published: August 21, 2024. Updated: September 04, 2024

Table of contents

Path to AGI
The challenge
- Helpful links
- Tools
Papers/results

The deadline for the Kaggle competition is in November.

Path to AGI

What do researchers miss when they try to design a system that conforms to AGI principles? (besides the part where they focus on LLMs too much)

When I search for “multiple neural network systems”, I mostly find articles about networks trained with different parameters. In other words, multiple networks as an ensemble: functional copies that perform the same calculation, followed by some strategy for choosing the final answer.

In my mind, multiple networks work in conjunction. One work seems close to my view: A theoretical framework for multiple neural network systems by Mike Shields and Matthew Casey. Model-Agnostic Meta-Learning (MAML) also looks like a multi-network system.

There is also a biologically plausible system based on Anokhin’s theory of functional systems, which is about enclosing every component by its function. Every component can be thought of as an RL agent (but is not limited to that; I have seen another approach used in practice).

Components in the system would perceive the world perfectly and look like a conscious entity, but no one could prove whether it is really conscious or only imitating consciousness. This is a reference to David Chalmers’s thought experiment about philosophical zombies (p-zombies).

The challenge

Data is here.

For a neural network, when inference improvements are finished on one image, you verify it on other images. Or, more likely, the next training samples will be used to formulate the transformation properly. For the ARC challenge there should be a network that can recognize a region of one color surrounded by a line or other shape of another color. I think a network of 5 neurons should be sufficient. It would be similar to an RNN: it starts from one point and follows the edge until it finds out whether the edge breaks or connects back to the starting point. There should be an external network that waits for an answer. But how do we form the question? (A side question: why would such a network exist?)
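To make the "region surrounded by another color" idea concrete, here is a minimal classical sketch, not a neural network and not the edge-following procedure itself. It swaps in a flood fill: starting from one cell, it collects all connected cells of the same color and treats the region as enclosed if it never reaches the grid border. The function name `enclosed_region`, the 4-neighbor connectivity, and the "does not touch the border" criterion are all my assumptions for illustration.

```python
from collections import deque

def enclosed_region(grid, start):
    """Flood-fill the same-colored region containing `start`.

    Returns the set of (row, col) cells if the region never touches the
    grid border (assumed here to mean "fully surrounded"), else None.
    """
    rows, cols = len(grid), len(grid[0])
    color = grid[start[0]][start[1]]
    seen = {start}
    queue = deque([start])
    touches_border = False
    while queue:
        r, c = queue.popleft()
        # A cell on the outer edge means the surrounding shape has a gap
        # (or there is no surrounding shape at all).
        if r in (0, rows - 1) or c in (0, cols - 1):
            touches_border = True
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if (0 <= nr < rows and 0 <= nc < cols
                    and (nr, nc) not in seen
                    and grid[nr][nc] == color):
                seen.add((nr, nc))
                queue.append((nr, nc))
    return None if touches_border else seen

closed = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 0, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]
# Same shape with one cell of the outline removed, so the inside leaks out.
broken = [row[:] for row in closed]
broken[1][2] = 0

print(enclosed_region(closed, (2, 2)))
print(enclosed_region(broken, (2, 2)))
```

The answer this function returns (a set of cells or None) is the kind of signal the external network would be waiting for.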

I think the question should appear after comparing input and output and trying to transform one into the other pixel by pixel, where every step or group of steps describes a specific transformation. We will call the full sequence of transformation steps a question about the current task. This question will then find or create networks for the transformation, and they will be applied to the test data.
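A rough sketch of what comparing input and output pixel by pixel could look like. It assumes both grids have the same shape (many ARC tasks change the grid size, which this does not handle), and the names `pixel_diff` and `summarize` are made up for illustration. Each step records where a cell changed and from which color to which; grouping the steps by color change is one possible way to turn them into a description of the transformation.

```python
from collections import Counter

def pixel_diff(inp, out):
    """List every changed cell as (row, col, old_color, new_color)."""
    if len(inp) != len(out) or any(len(a) != len(b) for a, b in zip(inp, out)):
        raise ValueError("this sketch only handles grids of equal shape")
    return [
        (r, c, old, new)
        for r, (row_in, row_out) in enumerate(zip(inp, out))
        for c, (old, new) in enumerate(zip(row_in, row_out))
        if old != new
    ]

def summarize(steps):
    """Group steps by (old_color, new_color) and count them."""
    return Counter((old, new) for _, _, old, new in steps)

inp = [[0, 1],
       [1, 0]]
out = [[0, 2],
       [2, 0]]

steps = pixel_diff(inp, out)
print(steps)
print(summarize(steps))
```

Here the summary says "color 1 became color 2, twice", which is a candidate for the "question" about this task.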

The network will apply transformations if it understands them. There will be a unique network for every transformation. They will be like heads in multi-head attention: the input goes to all networks, and each applies its transformation and produces an output. Then we compare each network's output to the expected output and mark the best transformation network for further fine-tuning.
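The selection step can be sketched without any training. In this toy version, plain Python functions stand in for the per-transformation networks, and the score is simply the fraction of cells that match the expected output. The candidate set, the scoring rule, and the names `CANDIDATES`, `score`, and `best_transformation` are my assumptions, not part of the ARC tooling.

```python
def identity(grid):
    return [row[:] for row in grid]

def flip_horizontal(grid):
    return [row[::-1] for row in grid]

def flip_vertical(grid):
    return [row[:] for row in grid[::-1]]

def transpose(grid):
    return [list(col) for col in zip(*grid)]

# Hypothetical pool of "transformation networks".
CANDIDATES = {
    "identity": identity,
    "flip_horizontal": flip_horizontal,
    "flip_vertical": flip_vertical,
    "transpose": transpose,
}

def score(pred, expected):
    """Fraction of matching cells; 0.0 if the shapes differ."""
    if len(pred) != len(expected) or any(
            len(p) != len(e) for p, e in zip(pred, expected)):
        return 0.0
    total = sum(len(row) for row in expected)
    matches = sum(p == e
                  for p_row, e_row in zip(pred, expected)
                  for p, e in zip(p_row, e_row))
    return matches / total

def best_transformation(pairs, candidates=CANDIDATES):
    """Feed every input to every candidate and return the best name."""
    def mean_score(fn):
        return sum(score(fn(i), o) for i, o in pairs) / len(pairs)
    return max(candidates, key=lambda name: mean_score(candidates[name]))

train_pairs = [([[1, 2], [3, 4]], [[2, 1], [4, 3]])]
best = best_transformation(train_pairs)
print(best)
print(CANDIDATES[best]([[5, 6]]))
```

In the real system the winner would then be fine-tuned rather than used as is; the sketch only covers the "apply all, compare, mark the best" part.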