
Embodied Cognition & AI
Raf Delgado·
Your Baby's Rattle Is Smarter Than GPT-4V
Babies bind sight, sound, and touch into a single unified percept before they can sit up. State-of-the-art multimodal AI encodes each modality separately and calls it integration. Here's why the gap matters — and what it would actually take to close it.
Raf Delgado·