<?xml version="1.0" encoding="UTF-8"?>
<?latexml searchpaths="/home/japhy/scienceReplication.artiswrong.com/paper_files/arxiv/2504.17939/latex_extracted"?>
<?latexml class="IEEEtran" options="lettersize,journal"?>
<?latexml package="amsmath,amsfonts"?>
<?latexml package="algorithmic"?>
<?latexml package="algorithm"?>
<?latexml package="array"?>
<?latexml package="subfig" options="caption=false,font=normalsize,labelfont=sf,textfont=sf"?>
<?latexml package="textcomp"?>
<?latexml package="stfloats"?>
<?latexml package="url"?>
<?latexml package="verbatim"?>
<?latexml package="graphicx"?>
<?latexml package="cite"?>
<?latexml package="xcolor"?>
<?latexml package="quotes"?>
<?latexml package="placeins"?>
<?latexml package="graphicx"?>
<?latexml package="epstopdf"?>
<?latexml package="tikz-cd"?>
<!--  %sergiu --><!--  %sergiu --><?latexml package="microtype"?>
<!--  %updated with editorial comments 8/9/2021 --><!--  %**** Mobile˙paradigm˙model.tex Line 25 **** --><?latexml RelaxNGSchema="LaTeXML"?>
<document xmlns="http://dlmf.nist.gov/LaTeXML" class="ltx_authors_1line">
  <resource src="LaTeXML.css" type="text/css"/>
  <resource src="ltx-article.css" type="text/css"/>
  <title>A computational model of infant sensorimotor exploration in the mobile paradigm</title>
  <creator role="author">
    <personname>Josua Spisak<Math mode="inline" tex="{}^{1,\dagger}" text="^list@(1, dagger)" xml:id="m1">
        <XMath>
          <XMApp role="FLOATSUPERSCRIPT" scriptpos="1">
            <XMDual>
              <XMApp>
                <XMTok meaning="list"/>
                <XMRef idref="m1.1"/>
                <XMRef idref="m1.2"/>
              </XMApp>
              <XMWrap>
                <XMTok fontsize="70%" meaning="1" role="NUMBER" xml:id="m1.1">1</XMTok>
                <XMTok fontsize="70%" role="PUNCT">,</XMTok>
                <XMTok fontsize="70%" name="dagger" role="MULOP" xml:id="m1.2">†</XMTok>
              </XMWrap>
            </XMDual>
          </XMApp>
        </XMath>
      </Math>, Sergiu Tcaci Popescu<Math mode="inline" tex="{}^{2,\dagger,*}" text="^list@(2, dagger, *)" xml:id="m2">
        <XMath>
          <XMApp role="FLOATSUPERSCRIPT" scriptpos="1">
            <XMDual>
              <XMApp>
                <XMTok meaning="list"/>
                <XMRef idref="m2.1"/>
                <XMRef idref="m2.2"/>
                <XMRef idref="m2.3"/>
              </XMApp>
              <XMWrap>
                <XMTok fontsize="70%" meaning="2" role="NUMBER" xml:id="m2.1">2</XMTok>
                <XMTok fontsize="70%" role="PUNCT">,</XMTok>
                <XMTok fontsize="70%" name="dagger" role="MULOP" xml:id="m2.2">†</XMTok>
                <XMTok fontsize="70%" role="PUNCT">,</XMTok>
                <XMTok fontsize="70%" meaning="times" role="MULOP" xml:id="m2.3">*</XMTok>
              </XMWrap>
            </XMDual>
          </XMApp>
        </XMath>
      </Math>, Stefan Wermter<Math mode="inline" tex="{}^{1}" text="^1" xml:id="m3">
        <XMath>
          <XMApp role="FLOATSUPERSCRIPT" scriptpos="1">
            <XMTok fontsize="70%" meaning="1" role="NUMBER">1</XMTok>
          </XMApp>
        </XMath>
      </Math>, Matej Hoffmann<Math mode="inline" tex="{}^{2,*}" text="^list@(2, *)" xml:id="m4">
        <XMath>
          <XMApp role="FLOATSUPERSCRIPT" scriptpos="1">
            <XMDual>
              <XMApp>
                <XMTok meaning="list"/>
                <XMRef idref="m4.1"/>
                <XMRef idref="m4.2"/>
              </XMApp>
              <XMWrap>
                <XMTok fontsize="70%" meaning="2" role="NUMBER" xml:id="m4.1">2</XMTok>
                <XMTok fontsize="70%" role="PUNCT">,</XMTok>
                <XMTok fontsize="70%" meaning="times" role="MULOP" xml:id="m4.2">*</XMTok>
              </XMWrap>
            </XMDual>
          </XMApp>
        </XMath>
      </Math>, J. Kevin O’Regan<Math mode="inline" tex="{}^{3}" text="^3" xml:id="m5">
        <XMath>
          <XMApp role="FLOATSUPERSCRIPT" scriptpos="1">
            <XMTok fontsize="70%" meaning="3" role="NUMBER">3</XMTok>
          </XMApp>
        </XMath>
      </Math>






</personname>
    <contact role="thanks">
S.T.P. and M.H. were supported by the Czech Science Foundation (GA CR), project no. 25-18113S. S.T.P. was additionally supported by the project Mobility ČVUT MSCA-F-CZ-I, no. CZ.02.01.01/00/22_010/0003405. J.S. and S.W. were supported by the DFG (MoReSpace), part of the DFG Priority Program 2134 “The Active Self” (DFG PA2302/13-1).</contact>
    <contact role="thanks"><Math mode="inline" tex="{}^{1}" text="^1" xml:id="m6">
        <XMath>
          <XMApp role="FLOATSUPERSCRIPT" scriptpos="1">
            <XMTok fontsize="70%" meaning="1" role="NUMBER">1</XMTok>
          </XMApp>
        </XMath>
      </Math>Josua Spisak and Stefan Wermter are with the University of Hamburg, Informatics Department, Knowledge Technology Group, Germany.</contact>
    <contact role="thanks"><Math mode="inline" tex="{}^{2}" text="^2" xml:id="m7">
        <XMath>
          <XMApp role="FLOATSUPERSCRIPT" scriptpos="1">
            <XMTok fontsize="70%" meaning="2" role="NUMBER">2</XMTok>
          </XMApp>
        </XMath>
      </Math>Sergiu T. Popescu and Matej Hoffmann are with Department of Cybernetics, Faculty of Electrical Engineering, Czech Technical University in Prague, Czech Republic.</contact>
    <contact role="thanks"><Math mode="inline" tex="{}^{3}" text="^3" xml:id="m8">
        <XMath>
          <XMApp role="FLOATSUPERSCRIPT" scriptpos="1">
            <XMTok fontsize="70%" meaning="3" role="NUMBER">3</XMTok>
          </XMApp>
        </XMath>
      </Math>J. Kevin O’Regan is with Université Paris Cité and CNRS, Paris, France.</contact>
    <contact role="thanks"><Math mode="inline" tex="{}^{\dagger}" text="^dagger" xml:id="m9">
        <XMath>
          <XMApp role="FLOATSUPERSCRIPT" scriptpos="1">
            <XMTok fontsize="70%" name="dagger" role="MULOP">†</XMTok>
          </XMApp>
        </XMath>
      </Math> Equal contribution.</contact>
    <contact role="thanks"><text class="ltx_markedasmath" font="bold">*</text> Corresponding authors:
Matej Hoffmann <text font="typewriter" fontsize="90%">matej.hoffmann@fel.cvut.cz</text> and Sergiu T. Popescu
<text font="typewriter" fontsize="90%">sergiu.t.popescu@gmail.com</text>.</contact>
  </creator>
  <abstract name="Abstract">
<!--  %In this work, we present a computational model of the cognitive mechanism that may be at work in determining infants’ behavior in the so-called ”mobile paradigm”. This paradigm has been used for more than 50 years in developmental psychology to explore how infants perceive and learn the sensory effects of their actions. Concretely, when a mobile suspended over an infant’s crib is connected to one of its limbs, the infant will move that ‘‘connected” limb more than the other limbs.  Presumably, the ability to detect such ‘‘sensorimotor contingencies” is a core cognitive mechanism essential in infants’ cognitive and social development. Our model incorporates a simple neural network and several components that we think are present in infants: an action-outcome prediction mechanism, an exploration mechanism, motor noise, and motor control involving multiple muscles. Simulations generated with our model in the mobile paradigm show behavior very similar to the main findings observed in infant studies: The model rapidly begins making actions that create ‘‘interesting” sensory results, moving the connected limb more than the others. More detailed comparisons of the model with moment-to-moment recordings from two recent studies with 6-month-old infants revealed close correspondence in the temporal course of detection of the contingency. Similar to infants, the model also shows a dependence on whether the mobile moves proportionately or in an all-or-none way as a function of the connected limb’s movements. In some simulations we observed that on disconnecting the connected limb, the model shows a burst of activity. This is also a phenomenon sometimes observed in infants. Several ablation studies we performed demonstrated that action-outcome prediction and exploration, motor noise, and motor control involving multiple muscles are all essential to accurately replicate infant behavior. This supports our hypothesis that similar mechanisms may also be used by infants. -->    <p>We present a computational model of the mechanisms that may determine infants’ behavior in the ”mobile paradigm”. This paradigm has been used in developmental psychology to explore how infants learn the sensory effects of their actions. In this paradigm, a mobile (an articulated and movable object hanging above an infant’s crib) is connected to one of the infant’s limbs, prompting the infant to preferentially move that ”connected” limb. This ability to detect a ”sensorimotor contingency” is considered to be a foundational cognitive ability in development. To understand how infants learn sensorimotor contingencies, we built a model that attempts to replicate infant behavior. Our model incorporates a neural network, action-outcome prediction, exploration, motor noise, preferred activity level, and biologically-inspired motor control. We find that simulations with our model replicate the classic findings in the literature showing preferential movement of the connected limb. An interesting observation is that the model sometimes exhibits a burst of movement after the mobile is disconnected, casting light on a similar occasional finding in infants. In addition to these general findings, the simulations also replicate data from two recent more detailed studies using a connection with the mobile that was either gradual or all-or-none. A series of ablation studies further shows that the inclusion of mechanisms of action-outcome prediction, exploration, motor noise, and biologically-inspired motor control was essential for the model to correctly replicate infant behavior. This suggests that these components are also involved in infants’ sensorimotor learning.</p>
    <note mark="4" role="footnotetext" xml:id="footnotex1">The code of the model is published at: <ref class="ltx_url" font="typewriter" href="https://github.com/ctu-vras/mobile-paradigm-model.">https://github.com/ctu-vras/mobile-paradigm-model.</ref></note>
  </abstract>
  <keywords>
mobile paradigm, sensorimotor contingency, learning, surprise, prediction
</keywords>
<!--  %¡-this % stops a space 
     %style guide: https://docs.google.com/document/d/1OalFYqVfxKKMelF79U3B93SjmIEiTnJC12wPJrKeSDM/edit?tab=t.0
     %The paper headers
     %**** Mobile˙paradigm˙model.tex Line 50 ****
     %“IEEEpubid–0000__0000/00“$00.00~“copyright~2021 IEEE˝
     %Remember, if you use this you must call “IEEEpubidadjcol in the second
     %column for its text to clear the IEEEpubid mark.
     %For IEEE TCDS
     %The abstract must be self-contained, without abbreviations, footnotes, or references. It should be a microcosm of the full article.
     %The abstract must be between 150 and 250 words. Be sure that you adhere to these limits; otherwise, you will need to edit your abstract accordingly.-->  <section inlist="toc" xml:id="S1">
    <tags>
      <tag>I</tag>
      <tag role="refnum">I</tag>
      <tag role="typerefnum">§I</tag>
    </tags>
    <title><tag close=" ">I</tag><text font="smallcaps">Introduction</text></title>
<!--  %“IEEEPARstart–F˝or 
     %**** Mobile˙paradigm˙model.tex Line 75 ****-->    <para xml:id="S1.p1">
      <p>For a developing infant, detecting the effect of its own actions on its sensory input (i.e. detecting “sensorimotor contingencies”) is presumably a core mechanism by which the infant first learns how to distinguish its body from its environment, and then learns how to control the body and use it to grasp and manipulate external objects (for reviews, see <cite class="ltx_citemacro_cite">[<bibref bibrefs="jacquey_fagard_2020" separator="," yyseparator=","/>]</cite> and <cite class="ltx_citemacro_cite">[<bibref bibrefs="sen_making_2021" separator="," yyseparator=","/>]</cite>). Becoming a social being, perceiving itself and its caregivers as agents with causal effects is also an essential aspect of infant development that requires noting contingencies between its actions and other people’s reactions <cite class="ltx_citemacro_cite">[<bibref bibrefs="tarabulsy_contingency_1996" separator="," yyseparator=","/>]</cite>,<cite class="ltx_citemacro_cite">[<bibref bibrefs="adamson_stillface_2003" separator="," yyseparator=","/>]</cite>.</p>
    </para>
    <para xml:id="S1.p2">
      <p>In the literature on infant development, a variety of paradigms have been used to study detection of sensorimotor contingencies, using different types of actions on the part of the infant (limb movements, head movements, facial expressions, vocalizations) and different types of sensory consequences (sounds, voices, visual stimulation, including human or schematic faces).</p>
    </para>
    <para xml:id="S1.p3">
      <p>In what follows, we present the key findings from mobile paradigm studies focused on sensorimotor contingency detection and learning. We then provide a brief overview of previous models that address target infant behavior and highlight the main advances and novel aspects of our model.</p>
    </para>
    <subsection inlist="toc" xml:id="S1.SS1">
      <tags>
        <tag>I-A</tag>
        <tag role="refnum">I-A</tag>
        <tag role="typerefnum">§I-A</tag>
      </tags>
      <title><tag close=" ">I-A</tag><text font="italic">The mobile paradigm</text></title>
      <para xml:id="S1.SS1.p1">
        <p>One of the most frequently used methods to study the detection of sensorimotor contingencies has been the “mobile paradigm” introduced in the late 1960’s by Rovee &amp; Rovee <cite class="ltx_citemacro_cite">[<bibref bibrefs="rovee_conjugate_1969" separator="," yyseparator=","/>]</cite> and by Watson &amp; Ramey<cite class="ltx_citemacro_cite">[<bibref bibrefs="watson_reactions_1972" separator="," yyseparator=","/>]</cite>. In the classic version, a ”mobile” (a movable, articulated object with attached decorations) that hangs above an infant’s crib is connected by a ribbon to one of the infant’s legs or arms so that the movement of the connected limb influences the motion of the mobile.</p>
      </para>
      <para xml:id="S1.SS1.p2">
        <p>Early studies demonstrated that infants from at least 2 months onward can rapidly detect the contingency between their actions and the mobile (e.g. <cite class="ltx_citemacro_cite">[<bibref bibrefs="greco_ontogeny_1986" separator="," yyseparator=","/>]</cite>,<cite class="ltx_citemacro_cite">[<bibref bibrefs="hayne_ontogeny_1986" separator="," yyseparator=","/>]</cite>, <cite class="ltx_citemacro_cite">[<bibref bibrefs="lewis_emotional_1985" separator="," yyseparator=","/>]</cite>, <cite class="ltx_citemacro_cite">[<bibref bibrefs="alessandri_violation_1990" separator="," yyseparator=","/>]</cite>), showing more motor activity than when there is no connection. Five- to six-month-old infants start to reliably distinguish which is the ”connected” limb and move it more than the other limbs <cite class="ltx_citemacro_cite">[<bibref bibrefs="jacquey_popescu_2020" separator="," yyseparator=","/>]</cite>, <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite>. Later research by Rovee-Collier’s group (e.g. <cite class="ltx_citemacro_cite">[<bibref bibrefs="rovee-collier_development_1999" separator="," yyseparator=","/>]</cite>,<cite class="ltx_citemacro_cite">[<bibref bibrefs="rovee-collier_long-term_1999" separator="," yyseparator=","/>]</cite>) turned away from contingency detection itself. Instead, they used the phenomenon to test the duration of retention of the contingency, the particular visual cues that an infant will use, how the infant will generalize to other cues and environmental parameters.</p>
      </para>
      <para xml:id="S1.SS1.p3">
        <p>Recently several studies have gone back to more carefully reviewing the literature and re-investigating the basic contingency detection phenomenon itself to confirm the exact conditions and time-course of the infant’s reaction to the establishment of a contingency <cite class="ltx_citemacro_cite">[<bibref bibrefs="sen_making_2021" separator="," yyseparator=","/>]</cite>, <cite class="ltx_citemacro_cite">[<bibref bibrefs="jacquey_popescu_2020" separator="," yyseparator=","/>]</cite>, <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite>,<cite class="ltx_citemacro_cite">[<bibref bibrefs="sen_methodological_2024" separator="," yyseparator=","/>]</cite>, <cite class="ltx_citemacro_cite">[<bibref bibrefs="sloan_meaning_2023" separator="," yyseparator=","/>]</cite>. This recent work has shown that infants’ ability to detect contingencies is not as easy to demonstrate as the earlier literature might have led one to believe. However, the main empirical results that appear to be reliable are the following.</p>
      </para>
      <para xml:id="S1.SS1.p4">
        <p><text font="bold">Specificity of connected limb:</text> For infants aged 5-6 months, when one limb is connected to the mobile, that limb’s activity rapidly becomes overall higher than that of the other limbs. This effect appears within the first minute of the experiment, showing that infants detect the contingency very quickly and are able to activate the appropriate limb selectively <cite class="ltx_citemacro_cite">[<bibref bibrefs="watanabe_general_2006" separator="," yyseparator=","/>]</cite>, <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite>.</p>
      </para>
      <para xml:id="S1.SS1.p5">
        <p><text font="bold">Comparison with a control group:</text> In addition to the main experimental group where the mobile is activated contingently on the infant’s movements, some studies use a control group where the mobile is activated non-contingently, i.e. with a similar activation rate but in a random fashion independent of the infant’s movements (for example, <cite class="ltx_citemacro_cite">[<bibref bibrefs="rovee_conjugate_1969" separator="," yyseparator=","/>]</cite>, <cite class="ltx_citemacro_cite">[<bibref bibrefs="rovee-collier_topographical_1978" separator="," yyseparator=","/>]</cite>, <cite class="ltx_citemacro_cite">[<bibref bibrefs="jacquey_popescu_2020" separator="," yyseparator=","/>]</cite>,<cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite>,<cite class="ltx_citemacro_cite">[<bibref bibrefs="angulo-kinzler_three-month-old_2002" separator="," yyseparator=","/>]</cite>). By showing that infants’ activity in the contingent group is still higher than in the non-contingent control group, these studies show that the infants are sensitive to the contingency itself, and not just the activity of the mobile.</p>
      </para>
      <para xml:id="S1.SS1.p6">
        <p>In addition to these two main empirical results, there are two interesting but less reliable findings, which we will consider in this paper.</p>
      </para>
      <para xml:id="S1.SS1.p7">
        <p><text font="bold">Extinction burst:</text> When, at the end of the experiment, the connected limb is disconnected from the mobile, there sometimes is an ”extinction burst”—the activity of the connected limb briefly increases and then decreases again, to a level similar to that of the other limbs. The existence of an extinction burst is not systematically observed in the literature <cite class="ltx_citemacro_cite">[<bibref bibrefs="alessandri_violation_1990" separator="," yyseparator=","/>]</cite>,<cite class="ltx_citemacro_cite">[<bibref bibrefs="heathcock_relative_2005" separator="," yyseparator=","/>]</cite>,<cite class="ltx_citemacro_cite">[<bibref bibrefs="lewis_violation_1990" separator="," yyseparator=","/>]</cite>,<cite class="ltx_citemacro_cite">[<bibref bibrefs="rovee-collier_topographical_1978" separator="," yyseparator=","/>]</cite>, and might be indicative of the presence of a predictive mechanism at work in the infant’s behavior.</p>
      </para>
      <para xml:id="S1.SS1.p8">
        <p><text font="bold">Greater effect for binary vs. non-binary:</text> When the stimulation caused by the infant’s limb movements is non-binary or “conjugate” with the movement of the mobile—that is, when the connected limb’s activity determines the mobile movement in a proportionate way—the effects described above are less clearly visible than in a “binary” condition, where the connected limb always triggers the maximum mobile movement when the connected limb movement passes a threshold. This finding from empirical work <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite> is worth noting because it might be predicted from the computational model to be presented here.</p>
      </para>
    </subsection>
    <subsection inlist="toc" xml:id="S1.SS2">
      <tags>
        <tag>I-B</tag>
        <tag role="refnum">I-B</tag>
        <tag role="typerefnum">§I-B</tag>
      </tags>
      <title><tag close=" ">I-B</tag><text font="italic">Computational models of the mobile paradigm</text></title>
<!--  %**** Mobile˙paradigm˙model.tex Line 100 **** -->      <para xml:id="S1.SS2.p1">
        <p>Surprisingly, despite the substantial literature and the importance of contingency detection as a mechanism underlying infant development, there have been few attempts to model the findings observed in the mobile paradigm, either mathematically or by computer simulation.</p>
      </para>
      <para xml:id="S1.SS2.p2">
        <p>An exception was the work of John S. Watson and his collaborators Butko &amp; Movellan <cite class="ltx_citemacro_cite">[<bibref bibrefs="butko_detecting_2010" separator="," yyseparator=","/>]</cite>,<cite class="ltx_citemacro_cite">[<bibref bibrefs="movellan_Watson_development_2002" separator="," yyseparator=","/>]</cite>,<cite class="ltx_citemacro_cite">[<bibref bibrefs="butko_learningToLearn_2007" separator="," yyseparator=","/>]</cite>. They were struck by the fact that infants in the mobile paradigm often coo and smile in the presence of the connected mobile, suggesting that they might be putting to work a mechanism designed to seek out social contingencies. They proposed that infants in the mobile paradigm might be deploying a similar kind of action/waiting-for-reaction behavior that might be used to detect whether one is communicating with a social partner rather than a random or a completely deterministic entity (like an echo). They modeled an agent that maximizes the information gain or reduces the uncertainty and combined it with reinforcement learning to conceive an optimal infomax controller for detecting social contingencies. They showed that this approach accounts for infants’ behavior in a situation in which a simple robot was animated when the infant generated vocalizations. However, as pointed out by the authors, the infomax calculation is probably not practical for a brain to make, and a simpler approximation would be more realistic.</p>
      </para>
      <para xml:id="S1.SS2.p3">
        <p>Two other attempts to explicitly model behavior in the mobile paradigm have also appeared recently.</p>
      </para>
      <para xml:id="S1.SS2.p4">
        <p>Zaadnoordijk et al.’s <cite class="ltx_citemacro_cite">[<bibref bibrefs="zaadnoordijk_can_2018" separator="," yyseparator=","/>]</cite> goal was to demonstrate that the most basic effects observed in the mobile paradigm could be explained <text font="italic">without</text> assuming that the infant was explicitly searching for contingencies. They were, therefore, taking the opposite viewpoint from Butko &amp; Movellan, trying to show that the mobile paradigm is <text font="italic">not</text> a good test for a notion of causality or agency in infants. Zaadnoordijk et al. <cite class="ltx_citemacro_cite">[<bibref bibrefs="zaadnoordijk_can_2018" separator="," yyseparator=","/>]</cite> set out to explain the observed infant behavior using a simple “babybot” model in the form of a state machine with four limbs that could each be in three different positions (top, middle, and bottom) and three possible actions (move up, move down, hold still). They showed how a very simple operant conditioning mechanism can fairly accurately simulate the temporal course of infant learning in the mobile paradigm. Their mechanism worked simply by repeating movements that produce a sensory effect, i.e. by moving the connected limb. The controller (“brain”) part of the model was deliberately designed in a minimalist and non-representational fashion—the agent had no internal machinery to represent or predict the consequences of its actions (action-effect relationship). Simple reinforcement of actions triggering the movement of the mobile was enough to reproduce the core behavior, i.e. more movement in the limb connected to the mobile. However, because their model contains no predictive element, it does not seem compatible with the possibility of an ”extinction burst” following the removal of the contingency <cite class="ltx_citemacro_cite">[<bibref bibrefs="rovee-collier_topographical_1978" separator="," yyseparator=","/>]</cite>.</p>
      </para>
      <para xml:id="S1.SS2.p5">
        <p>Kelso &amp; Fuchs <cite class="ltx_citemacro_cite">[<bibref bibrefs="kelso_coordination_2016" separator="," yyseparator=","/>]</cite> also proposed a model without agency and with no predictive component. Unlike the discrete steps in the state machine in <cite class="ltx_citemacro_cite">[<bibref bibrefs="zaadnoordijk_can_2018" separator="," yyseparator=","/>]</cite>, Kelso &amp; Fuchs’s approach was rooted in the dynamical systems framework (see also <cite class="ltx_citemacro_cite">[<bibref bibrefs="fujihira2023dynamical" separator="," yyseparator=","/>]</cite> for another model), paying close attention to the temporal dynamics and the properties of the physical movements of the infants’ limbs, the mobile, and their coupling (the connected limb being physically attached to the mobile with a ribbon). The changes in the infant behavior were explained by phase transitions in this coupled dynamical system—a mechanistic explanation that does not require any rewards, reinforcements, or cognitive mechanisms (contingency detection, prediction, etc.). Since, again, no predictive element is present in this approach, it is unclear whether an ”extinction burst” could be explained in this framework.</p>
      </para>
    </subsection>
    <subsection inlist="toc" xml:id="S1.SS3">
      <tags>
        <tag>I-C</tag>
        <tag role="refnum">I-C</tag>
        <tag role="typerefnum">§I-C</tag>
      </tags>
      <title><tag close=" ">I-C</tag><text font="italic">Our model</text></title>
      <para xml:id="S1.SS3.p1">
        <p>The purpose of the present paper is to propose a first attempt at a detailed computational model, shown in Fig. <ref labelref="LABEL:fig:Flowchart"/>, of the mobile paradigm that addresses the main two empirical findings observed in infants at age 6 months: the increase in activity specific to the ”connected” limb, and a significant effect for a contingent group but not for a control group with random stimulation. Additionally, we examine how the model might simulate the controversial phenomenon of the extinction burst and the difference between binary (non-conjugate) and non-binary (conjugate) stimulation.</p>
      </para>
      <para xml:id="S1.SS3.p2">
        <p>In our model, there were two important constraints that we wanted to adhere to. The first constraint was the fact that contrary to Zaadnordijk et al. <cite class="ltx_citemacro_cite">[<bibref bibrefs="zaadnoordijk_can_2018" separator="," yyseparator=","/>]</cite> and to Kelso &amp; Fuchs <cite class="ltx_citemacro_cite">[<bibref bibrefs="kelso_coordination_2016" separator="," yyseparator=","/>]</cite>, we wanted to include the idea that the infant is attempting to predict the outcome of its actions in our model. Similar to Butko &amp; Movellan <cite class="ltx_citemacro_cite">[<bibref bibrefs="butko_detecting_2010" separator="," yyseparator=","/>]</cite>, we assume this is an elementary aspect of infant interaction with the world that underlies the search for causation and agency. Within the context of current models of intrinsically motivated learning <cite class="ltx_citemacro_cite">[<bibref bibrefs="baldassarre2013intrinsically" separator="," yyseparator=","/>]</cite>, the approach we used could be said to be <text font="italic">knowledge-based</text> rather than <text font="italic">competence-based</text> and to involve predictive novelty motivation (cf.<cite class="ltx_citemacro_cite">[<bibref bibrefs="oudeyer2007intrinsic" separator="," yyseparator=","/>]</cite>). The central element of our model is a neural network and a table of the possible limb activities and their degree of “<text font="italic">interest</text>”. From that table, at each iteration of the simulation, the neural net chooses and attempts to execute the <text font="italic">currently most interesting</text> limb activity and, at the same time, makes a prediction for the effect that this limb activity will produce on the mobile. When the prediction is wrong, this generates <text font="italic">surprise</text>, and the degree of <text font="italic">interest</text> of the issued command is increased in the interest table. If the prediction is right, the level of <text font="italic">interest</text> for that command decreases in the table. Note that this scheme is distinct from reinforcement learning <cite class="ltx_citemacro_cite">[<bibref bibrefs="sutton2018reinforcement" separator="," yyseparator=","/>]</cite> where an agent seeks to maximize a sum of future and past rewards. In our model, there are no rewards.</p>
      </para>
      <para xml:id="S1.SS3.p3">
        <p>The second constraint on our model was to include a motor system that was biologically more realistic than that used by Zaadnordijk et al. <cite class="ltx_citemacro_cite">[<bibref bibrefs="zaadnoordijk_can_2018" separator="," yyseparator=","/>]</cite>. We wanted to simulate that in real infants, narrowing down actions to the particular limb that is connected to the mobile presumably represents a complicated problem. For that reason, our model assumed a realistic motor control mechanism whose effects on sensory feedback the infant has to learn.</p>
      </para>
      <para xml:id="S1.SS3.p4">
        <p>To train the neural net, our model optimizes its functioning by minimizing three differences (referred to as <text font="italic">losses</text>): the difference between its current activity and a baseline level of activity, the difference between the executed and the most interesting limb activity, and the difference between the predicted sensory feedback and the actual sensory feedback.</p>
      </para>
      <para xml:id="S1.SS3.p5">
        <p>We shall see below that our model broadly reproduces infants’ behavior. It quickly learns to differentiate and selectively increase the activity in the connected limb as compared to the unconnected limbs. The model also reproduces the control experiments showing that mere activity of the mobile does not suffice to produce increased limb activity: a proper contingency must exist between the infant’s movements and the activation of the mobile. Our simulations also predict better contingency detection in the binary versus the conjugate condition, and provide insights into the controversial extinction burst.</p>
      </para>
      <para xml:id="S1.SS3.p6">
        <p>In addition to these main results, we perform several “ablation” studies, where we observe how the model performs when its main components are selectively removed. These studies demonstrate that most of the components of the model are necessary ingredients: numerous muscles, motor noise, and the exploration mechanism attempting to minimize prediction error.</p>
      </para>
    </subsection>
  </section>
  <section inlist="toc" xml:id="S2">
    <tags>
      <tag>II</tag>
      <tag role="refnum">II</tag>
      <tag role="typerefnum">§II</tag>
    </tags>
    <title><tag close=" ">II</tag><text font="smallcaps">Methods</text></title>
<!--  %**** Mobile˙paradigm˙model.tex Line 125 **** -->    <para xml:id="S2.p1">
      <p>In this section, we describe how, in our model, we simulate the infant body, the environment, and the learning mechanisms. For this purpose, we start by introducing background information about the infant body and motor control. Then we describe the simulated mobile paradigm. Finally, we detail how we modeled infant learning and how our simulations match the number of participants and durations in infant studies <cite class="ltx_citemacro_cite">[<bibref bibrefs="jacquey_popescu_2020" separator="," yyseparator=","/>]</cite>, <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite>.</p>
    </para>
    <subsection inlist="toc" labels="LABEL:body" xml:id="S2.SS1">
      <tags>
        <tag>II-A</tag>
        <tag role="refnum">II-A</tag>
        <tag role="typerefnum">§II-A</tag>
      </tags>
      <title><tag close=" ">II-A</tag><text font="italic">Body</text></title>
      <para xml:id="S2.SS1.p1">
        <p>Human motor control is highly complex and involves the interplay of a hierarchy of neural connections—cortical, subcortical and spinal—and a complex structure involving approximately 600 skeletal muscles <cite class="ltx_citemacro_cite">[<bibref bibrefs="edwards_motor_2010" separator="," yyseparator=","/>]</cite>, composed of muscle fibers, with a complex geometry (some acting over multiple joints) and nonlinear properties. The structures and their control show some redundancy—different motor commands can bring about the same movement; same effector movement can be instantiated through different joint configurations. We believe that the complexity and redundancy of this control mechanism is an important constraint that shapes sensorimotor learning. In a real infant, through neural, mechanical or inertial connections, activating one muscle can affect multiple limbs. Direct effects arise because nerve fibers may innervate single or multiple muscles that affect the limb and even other limbs through multi-limb muscle-chain synergies. Indirect effects occur because limbs have masses and the body is in contact with the ground, and resultant inertial interactions mean that moving one limb may affect other limbs or the whole body.</p>
      </para>
      <para xml:id="S2.SS1.p2">
        <p>Copying the entirety of the human motor control would have increased the complexity of our model beyond our current scope. Instead, we simplify the elements used in human motor control and abstract the entire chain by using 600 ”muscle commands”. In the model these 600 muscle commands are transmitted via 600 output neurons of the neural network. The muscle commands are mapped by a function M to the four limbs of our simulated agent. This mapping works in an overlapping fashion across the limbs—a single command can have an effect on several limbs and limbs are controlled by multiple muscle commands.</p>
      </para>
      <para xml:id="S2.SS1.p3">
        <p>Also in order to simplify, we used a single scalar value between 0 and 1 as a proxy for each limb’s activity. This general definition of ’limb activity’ allows us to make the link with data in the experimental literature where various measures of limb activity were used, for example, the frequency of kicks in Rovee &amp; Rovee <cite class="ltx_citemacro_cite">[<bibref bibrefs="rovee_conjugate_1969" separator="," yyseparator=","/>]</cite> or the change in limb acceleration over time in Popescu et al. <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite>.</p>
      </para>
    </subsection>
    <subsection inlist="toc" xml:id="S2.SS2">
      <tags>
        <tag>II-B</tag>
        <tag role="refnum">II-B</tag>
        <tag role="typerefnum">§II-B</tag>
      </tags>
      <title><tag close=" ">II-B</tag><text font="italic">Environment</text></title>
      <para xml:id="S2.SS2.p1">
        <p>In a typical mobile paradigm experiment with infants, a limb movement may produce a sound or a visual change from the mobile. To model this, we represent the sensory feedback received by the model as a single value: either 0 or 1 in the binary condition, and a number between 0 and 1 in the non-binary (conjugate) condition (see next paragraph). For simplicity in our simulation, we excluded all sources of sensory input to the agent except the mobile.</p>
      </para>
      <para xml:id="S2.SS2.p2">
        <p>As in the mobile paradigm, we assumed that only one of the four limbs of the agent was ”connected” to the mobile. For comparison with the empirical work of Popescu et al.<cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite> we explored two variants of the relation between the connected limb activity and the sensory feedback the mobile produces: a binary relation and a conjugate (or non-binary) relation.</p>
      </para>
      <para xml:id="S2.SS2.p3">
        <p>For the binary relation, when the activity of the connected limb exceeded a threshold, the mobile was activated for two iterations of the simulation. For compatibility with Popescu et al. <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite>, where the threshold for triggering the mobile activation was increased in a few steps before reaching its maximum value, we did the same in our simulation (raising it from 0.5 to 0.6 in increments of 0.02).</p>
      </para>
      <para xml:id="S2.SS2.p4">
        <p>For the conjugate (or non-binary) relation, the intensity of the sensory feedback emitted by the mobile was directly proportional to the value of the connected limb activity.</p>
      </para>
      <para xml:id="S2.SS2.p5">
        <p>After the contingent phase of the experiment where the mobile response was present, we simulated the extinction phase included in some empirical work (e.g. <cite class="ltx_citemacro_cite">[<bibref bibrefs="zaadnoordijk_movement_2020,heathcockPerformanceInfantsBorn2004" separator="," yyseparator=","/>]</cite>; for a review, see <cite class="ltx_citemacro_cite">[<bibref bibrefs="bednarski_infants_2022" separator="," yyseparator=","/>]</cite>) by removing sensory feedback from the mobile for the last steps of each run.</p>
      </para>
      <figure inlist="lof" labels="LABEL:fig:Flowchart" xml:id="S2.F1">
        <tags>
          <tag>Fig. 1</tag>
          <tag role="refnum">1</tag>
          <tag role="typerefnum">Fig. 1</tag>
        </tags>
        <graphics candidates="Images/GiantFigureKevinForSim.drawio-1.pdf" class="ltx_centering" graphic="Images/GiantFigureKevinForSim.drawio-1.pdf" options="width=433.62pt" xml:id="S2.F1.g1"/>
        <toccaption class="ltx_centering"><tag close=" ">1</tag><text font="bold">Architecture of our model.</text> The architecture shows each component of our model. The red arrows show the aspects of the model that have a direct influence on the network’s weights. The green arrow depicts the flow of the new limb activations. The purple arrow shows the flow of the new sensory feedback prediction and the blue arrows show the flow of the input data of the neural network. We separate the model into four components, on the top left is the neural network, on the bottom left is the deviation from baseline module, on the bottom right is the exploration module and on the the top right is the sensory feedback module.</toccaption>
        <caption class="ltx_centering"><tag close=": ">Fig. 1</tag><text font="bold">Architecture of our model.</text> The architecture shows each component of our model. The red arrows show the aspects of the model that have a direct influence on the network’s weights. The green arrow depicts the flow of the new limb activations. The purple arrow shows the flow of the new sensory feedback prediction and the blue arrows show the flow of the input data of the neural network. We separate the model into four components, on the top left is the neural network, on the bottom left is the deviation from baseline module, on the bottom right is the exploration module and on the the top right is the sensory feedback module.</caption>
<!--  %**** Mobile˙paradigm˙model.tex Line 150 **** -->      </figure>
    </subsection>
    <subsection inlist="toc" xml:id="S2.SS3">
      <tags>
        <tag>II-C</tag>
        <tag role="refnum">II-C</tag>
        <tag role="typerefnum">§II-C</tag>
      </tags>
      <title><tag close=" ">II-C</tag><text font="italic">Learning Mechanism</text></title>
      <para xml:id="S2.SS3.p1">
        <p>As shown in Fig. <ref labelref="LABEL:fig:Flowchart"/>, our learning mechanism consists of four modules: the neural network, a sensory prediction module, an exploration module, and a deviation from baseline module. Its purpose is to control the system’s behavior over successive time steps in order to take actions and make predictions about the sensory effect that it expects from those actions. On the one hand, the mechanism tries to continuously improve the predictions of the sensory feedback, therefore reducing the amount of surprise it experiences. On the other hand, it intentionally chooses actions that are likely to result in novel experiences, in this way making surprises more likely. The result of these two opposing forces is to make the model undertake a continuous non-random exploratory activity, seeking out the most surprising actions it can perform.</p>
      </para>
      <subsubsection inlist="toc" labels="LABEL:NeuralNetworks" xml:id="S2.SS3.SSS1">
        <tags>
          <tag>II-C1</tag>
          <tag role="refnum">II-C1</tag>
          <tag role="typerefnum">§II-C1</tag>
        </tags>
        <title><tag close=" ">II-C1</tag>Neural network</title>
        <para xml:id="S2.SS3.SSS1.p1">
          <p>At each step of a simulation, the neural network takes the previous state of the system as its input and generates both new limb activities and a prediction for the sensory input that the network expects to receive. The previous state is represented by four neurons with activations between 0 and 1 corresponding to the activity of each of the four limbs and a fifth neuron with activation between 0 and 1 corresponding to the sensory feedback received from the environment. The five input neurons are fully connected via learnable weights to 8 hidden neurons, which are in turn fully connected via learnable weights to 601 output neurons (or fewer in the ablation studies). 600 of these correspond to 600 muscle commands (see Section <ref labelref="LABEL:body"/>), and one corresponds to the predicted sensory effect.</p>
        </para>
        <para xml:id="S2.SS3.SSS1.p2">
          <p>The 600 muscle commands simulate the human motor control of limbs. We attribute four weights to each of the 600 muscle commands, each weight determining the degree to which that muscle command influences each of the four limbs. We sample the four weights from a distribution between zero and one, such that on average about 550 out of 600 samples would be near 0, 25 out of 600 samples would be near one and 25 would be between zero and one. The beta function (shown in Figure <ref labelref="LABEL:fig:BetaDistributions"/>):</p>
        </para>
<!--  %In humans, limbs are operated in an ’overlapping’ way by multiple muscles with complex synergies and inertial interactions. A small proportion of the total 600 muscles of the body exclusively activate each limb, but some muscles, because of synergies and inertial interactions, influence two or more limbs. The way we instantiate this in our model was to attribute four weights to each of the 600 muscle commands, each weight determining the degree to which that muscle command influences each of the four limbs. We sample the four weights from a distribution which ranged between zero and one, such that on average about 550 out of 600 samples would be near 0, 25 out of 600 samples would be near one and 25 would be in between zero and one. The beta function (shown in Figure “ref–fig:BetaDistributions˝): -->        <para xml:id="S2.SS3.SSS1.p3">
          <equation xml:id="S2.E1">
            <tags>
              <tag>(1)</tag>
              <tag role="refnum">1</tag>
            </tags>
            <Math mode="display" tex="B(z_{1},z_{2})=\int_{0}^{1}t^{z_{1}-1}(1-t)^{z_{2}-1}dt" text="B * open-interval@(z _ 1, z _ 2) = ((integral _ 0) ^ 1)@(t ^ (z _ 1 - 1) * (1 - t) ^ (z _ 2 - 1) * differential-d@(t))" xml:id="S2.E1.m1">
              <XMath>
                <XMApp>
                  <XMTok meaning="equals" role="RELOP">=</XMTok>
                  <XMApp>
                    <XMTok meaning="times" role="MULOP">⁢</XMTok>
                    <XMTok font="italic" role="UNKNOWN">B</XMTok>
                    <XMDual>
                      <XMApp>
                        <XMTok meaning="open-interval"/>
                        <XMRef idref="S2.E1.m1.1"/>
                        <XMRef idref="S2.E1.m1.2"/>
                      </XMApp>
                      <XMWrap>
                        <XMTok role="OPEN" stretchy="false">(</XMTok>
                        <XMApp xml:id="S2.E1.m1.1">
                          <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                          <XMTok font="italic" role="UNKNOWN">z</XMTok>
                          <XMTok fontsize="70%" meaning="1" role="NUMBER">1</XMTok>
                        </XMApp>
                        <XMTok role="PUNCT">,</XMTok>
                        <XMApp xml:id="S2.E1.m1.2">
                          <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                          <XMTok font="italic" role="UNKNOWN">z</XMTok>
                          <XMTok fontsize="70%" meaning="2" role="NUMBER">2</XMTok>
                        </XMApp>
                        <XMTok role="CLOSE" stretchy="false">)</XMTok>
                      </XMWrap>
                    </XMDual>
                  </XMApp>
                  <XMApp>
                    <XMApp>
                      <XMTok role="SUPERSCRIPTOP" scriptpos="post1"/>
                      <XMApp>
                        <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                        <XMTok mathstyle="display" meaning="integral" name="int" role="INTOP">∫</XMTok>
                        <XMTok fontsize="70%" meaning="0" role="NUMBER">0</XMTok>
                      </XMApp>
                      <XMTok fontsize="70%" meaning="1" role="NUMBER">1</XMTok>
                    </XMApp>
                    <XMApp>
                      <XMTok meaning="times" role="MULOP">⁢</XMTok>
                      <XMApp>
                        <XMTok role="SUPERSCRIPTOP" scriptpos="post1"/>
                        <XMTok font="italic" role="UNKNOWN">t</XMTok>
                        <XMApp>
                          <XMTok fontsize="70%" meaning="minus" role="ADDOP">-</XMTok>
                          <XMApp>
                            <XMTok role="SUBSCRIPTOP" scriptpos="post2"/>
                            <XMTok font="italic" fontsize="70%" role="UNKNOWN">z</XMTok>
                            <XMTok fontsize="50%" meaning="1" role="NUMBER">1</XMTok>
                          </XMApp>
                          <XMTok fontsize="70%" meaning="1" role="NUMBER">1</XMTok>
                        </XMApp>
                      </XMApp>
                      <XMApp>
                        <XMTok role="SUPERSCRIPTOP" scriptpos="post1"/>
                        <XMDual>
                          <XMRef idref="S2.E1.m1.3"/>
                          <XMWrap>
                            <XMTok role="OPEN" stretchy="false">(</XMTok>
                            <XMApp xml:id="S2.E1.m1.3">
                              <XMTok meaning="minus" role="ADDOP">-</XMTok>
                              <XMTok meaning="1" role="NUMBER">1</XMTok>
                              <XMTok font="italic" role="UNKNOWN">t</XMTok>
                            </XMApp>
                            <XMTok role="CLOSE" stretchy="false">)</XMTok>
                          </XMWrap>
                        </XMDual>
                        <XMApp>
                          <XMTok fontsize="70%" meaning="minus" role="ADDOP">-</XMTok>
                          <XMApp>
                            <XMTok role="SUBSCRIPTOP" scriptpos="post2"/>
                            <XMTok font="italic" fontsize="70%" role="UNKNOWN">z</XMTok>
                            <XMTok fontsize="50%" meaning="2" role="NUMBER">2</XMTok>
                          </XMApp>
                          <XMTok fontsize="70%" meaning="1" role="NUMBER">1</XMTok>
                        </XMApp>
                      </XMApp>
                      <XMApp>
                        <XMTok font="italic" meaning="differential-d" role="DIFFOP">d</XMTok>
                        <XMTok font="italic" role="UNKNOWN">t</XMTok>
                      </XMApp>
                    </XMApp>
                  </XMApp>
                </XMApp>
              </XMath>
            </Math>
          </equation>
          <p>with <Math mode="inline" tex="z_{1}=0.01" text="z _ 1 = 0.01" xml:id="S2.SS3.SSS1.p3.m1">
              <XMath>
                <XMApp>
                  <XMTok meaning="equals" role="RELOP">=</XMTok>
                  <XMApp>
                    <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                    <XMTok font="italic" role="UNKNOWN">z</XMTok>
                    <XMTok fontsize="70%" meaning="1" role="NUMBER">1</XMTok>
                  </XMApp>
                  <XMTok meaning="0.01" role="NUMBER">0.01</XMTok>
                </XMApp>
              </XMath>
            </Math> and <Math mode="inline" tex="z_{2}=0.1" text="z _ 2 = 0.1" xml:id="S2.SS3.SSS1.p3.m2">
              <XMath>
                <XMApp>
                  <XMTok meaning="equals" role="RELOP">=</XMTok>
                  <XMApp>
                    <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                    <XMTok font="italic" role="UNKNOWN">z</XMTok>
                    <XMTok fontsize="70%" meaning="2" role="NUMBER">2</XMTok>
                  </XMApp>
                  <XMTok meaning="0.1" role="NUMBER">0.1</XMTok>
                </XMApp>
              </XMath>
            </Math> has this property.</p>
        </para>
        <para xml:id="S2.SS3.SSS1.p4">
          <p>Fig. <ref labelref="LABEL:fig:BetaDistributions"/>A shows an example of an attribution of muscle command weights to the four limbs following this scheme. Fig. <ref labelref="LABEL:fig:BetaDistributions"/>B reorders the muscle commands in order of increasing activation for the right arm only, to show that we have obtained the desired connectivity. Muscle commands 1 to 550 have a weight of zero, meaning they do not influence the right arm. Muscle commands 550 to 575 have intermediate weights and muscle commands 575 to 600 have weights of 1, meaning they strongly activate the right arm. Similar findings would apply to the other limbs but for different muscle commands. Note that during each run of the model corresponding to an experiment with one infant, the weights linking muscle commands to limbs stayed the same. To simulate different infants from infant experiments <cite class="ltx_citemacro_cite">[<bibref bibrefs="jacquey_popescu_2020" separator="," yyseparator=","/>]</cite>, <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite>, the weights were resampled randomly for each run.</p>
        </para>
        <figure inlist="lof" labels="LABEL:fig:BetaDistributions" xml:id="S2.F2">
          <tags>
            <tag>Fig. 2</tag>
            <tag role="refnum">2</tag>
            <tag role="typerefnum">Fig. 2</tag>
          </tags>
          <graphics candidates="Images/exunsortedBeta.pdf" class="ltx_centering" graphic="Images/exunsortedBeta.pdf" options="width=433.62pt" xml:id="S2.F2.g1"/>
          <graphics candidates="Images/exsortedBeta.pdf" class="ltx_centering" graphic="Images/exsortedBeta.pdf" options="width=433.62pt" xml:id="S2.F2.g2"/>
          <toccaption class="ltx_centering"><tag close=" ">2</tag><text font="bold">Plots of example distributions of the used beta function.</text> (Upper) Muscle commands unordered by weight, color-coded separately for each limb (see legend). (Lower) An example of muscle commands ordered by weight for the left arm.</toccaption>
          <caption class="ltx_centering"><tag close=": ">Fig. 2</tag><text font="bold">Plots of example distributions of the used beta function.</text> (Upper) Muscle commands unordered by weight, color-coded separately for each limb (see legend). (Lower) An example of muscle commands ordered by weight for the left arm.</caption>
<!--  %**** Mobile˙paradigm˙model.tex Line 175 **** -->        </figure>
<!--  %These four values for a given muscle correspond to weights determining how much that muscle affects each limb. During each run of the model corresponding to an experiment with one infant the weights stayed the same. To simulate different infants from the previous experiments in –“cite–jacquey˙popescu˙2020˝ and “cite–popescu˙6-month-old˙2021˝, the weights were resampled randomly for each run. -->        <para xml:id="S2.SS3.SSS1.p5">
          <p>After the 600 muscle commands have been mapped to their respective limbs, the resultant total limb activation is calculated for each limb and motor noise is added. We chose a wide range of motor noise, specifically a random value between -0.3 and 0.3, in order to increase the difficulty of the task and enforce some amount of exploration through variability. Finally, after the generation of muscle commands activations, their mapping to the limbs, and the addition of the motor noise, the network has produced the new limb activity.</p>
        </para>
        <para xml:id="S2.SS3.SSS1.p6">
          <p>All layers in the network are fully connected linear layers, meaning that each neuron in one layer is connected via a particular weight to each neuron in the previous and succeeding layers. Linear layers apply a linear transformation to incoming data: <Math mode="inline" tex="y=xA^{T}+b" text="y = x * A ^ T + b" xml:id="S2.SS3.SSS1.p6.m1">
              <XMath>
                <XMApp>
                  <XMTok meaning="equals" role="RELOP">=</XMTok>
                  <XMTok font="italic" role="UNKNOWN">y</XMTok>
                  <XMApp>
                    <XMTok meaning="plus" role="ADDOP">+</XMTok>
                    <XMApp>
                      <XMTok meaning="times" role="MULOP">⁢</XMTok>
                      <XMTok font="italic" role="UNKNOWN">x</XMTok>
                      <XMApp>
                        <XMTok role="SUPERSCRIPTOP" scriptpos="post1"/>
                        <XMTok font="italic" role="UNKNOWN">A</XMTok>
                        <XMTok font="italic" fontsize="70%" role="UNKNOWN">T</XMTok>
                      </XMApp>
                    </XMApp>
                    <XMTok font="italic" role="UNKNOWN">b</XMTok>
                  </XMApp>
                </XMApp>
              </XMath>
            </Math>, where y is a vector of output values for the next layer, x the vector of input values for the layer, <Math mode="inline" tex="A^{T}" text="A ^ T" xml:id="S2.SS3.SSS1.p6.m2">
              <XMath>
                <XMApp>
                  <XMTok role="SUPERSCRIPTOP" scriptpos="post1"/>
                  <XMTok font="italic" role="UNKNOWN">A</XMTok>
                  <XMTok font="italic" fontsize="70%" role="UNKNOWN">T</XMTok>
                </XMApp>
              </XMath>
            </Math> is a matrix of weights and b is a vector of the biases of each neuron in the layer. The weights and biases in each layer are learnable parameters of the model. The weights and biases are constantly adjusted in order to optimize a defined loss. This adjustment is done through the so-called backpropagation, which means that the loss is propagated through each connection and neuron of the model and adjusts weights and biases in a way that will reduce future losses. This means that if the model receives the same input as before, it will compute an output that is slightly different from the previous one and that would be closer to the output expected by the loss function.</p>
        </para>
        <para xml:id="S2.SS3.SSS1.p7">
          <p>The loss function of our model consists of three terms, each specifying a distance using the mean square error (MSE). The first of these terms evaluates the accuracy of the predicted sensory feedback. The second term encourages exploration within the model. The third term ensures that the activities of the limbs remain close to a preferred baseline level. The three terms are combined as shown in the following equation:</p>
          <equation xml:id="S2.Ex1">
            <Math mode="display" tex="\textit{Loss}=\ \textit{MSE}(\textit{predicted\_sensory\_feedback},\textit{%&#10;sensory\_feedback})" text="[Loss] = [MSE] * open-interval@([predicted_sensory_feedback], [sensory_feedback])" xml:id="S2.Ex1.m1">
              <XMath>
                <XMApp>
                  <XMTok meaning="equals" role="RELOP" rpadding="5.0pt">=</XMTok>
                  <XMText><text font="italic">Loss</text></XMText>
                  <XMApp>
                    <XMTok meaning="times" role="MULOP">⁢</XMTok>
                    <XMText><text font="italic">MSE</text></XMText>
                    <XMDual>
                      <XMApp>
                        <XMTok meaning="open-interval"/>
                        <XMRef idref="S2.Ex1.m1.1"/>
                        <XMRef idref="S2.Ex1.m1.2"/>
                      </XMApp>
                      <XMWrap>
                        <XMTok role="OPEN" stretchy="false">(</XMTok>
                        <XMText xml:id="S2.Ex1.m1.1"><text font="italic">predicted_sensory_feedback</text></XMText>
                        <XMTok role="PUNCT">,</XMTok>
                        <XMText xml:id="S2.Ex1.m1.2"><text font="italic">sensory_feedback</text></XMText>
                        <XMTok role="CLOSE" stretchy="false">)</XMTok>
                      </XMWrap>
                    </XMDual>
                  </XMApp>
                </XMApp>
              </XMath>
            </Math>
          </equation>
          <equation xml:id="S2.Ex2">
            <Math mode="display" tex="+\textit{ MSE}(\textit{activity},\textit{desired\_activity})" text="+ [ MSE] * open-interval@([activity], [desired_activity])" xml:id="S2.Ex2.m1">
              <XMath>
                <XMApp>
                  <XMTok meaning="plus" role="ADDOP">+</XMTok>
                  <XMApp>
                    <XMTok meaning="times" role="MULOP">⁢</XMTok>
                    <XMText><text font="italic"> MSE</text></XMText>
                    <XMDual>
                      <XMApp>
                        <XMTok meaning="open-interval"/>
                        <XMRef idref="S2.Ex2.m1.1"/>
                        <XMRef idref="S2.Ex2.m1.2"/>
                      </XMApp>
                      <XMWrap>
                        <XMTok role="OPEN" stretchy="false">(</XMTok>
                        <XMText xml:id="S2.Ex2.m1.1"><text font="italic">activity</text></XMText>
                        <XMTok role="PUNCT">,</XMTok>
                        <XMText xml:id="S2.Ex2.m1.2"><text font="italic">desired_activity</text></XMText>
                        <XMTok role="CLOSE" stretchy="false">)</XMTok>
                      </XMWrap>
                    </XMDual>
                  </XMApp>
                </XMApp>
              </XMath>
            </Math>
          </equation>
          <equation xml:id="S2.E2">
            <tags>
              <tag>(2)</tag>
              <tag role="refnum">2</tag>
            </tags>
            <Math mode="display" tex="+\textit{ MSE}(\textit{activity},\textit{baseline\_activity})" text="+ [ MSE] * open-interval@([activity], [baseline_activity])" xml:id="S2.E2.m1">
              <XMath>
                <XMApp>
                  <XMTok meaning="plus" role="ADDOP">+</XMTok>
                  <XMApp>
                    <XMTok meaning="times" role="MULOP">⁢</XMTok>
                    <XMText><text font="italic"> MSE</text></XMText>
                    <XMDual>
                      <XMApp>
                        <XMTok meaning="open-interval"/>
                        <XMRef idref="S2.E2.m1.1"/>
                        <XMRef idref="S2.E2.m1.2"/>
                      </XMApp>
                      <XMWrap>
                        <XMTok role="OPEN" stretchy="false">(</XMTok>
                        <XMText xml:id="S2.E2.m1.1"><text font="italic">activity</text></XMText>
                        <XMTok role="PUNCT">,</XMTok>
                        <XMText xml:id="S2.E2.m1.2"><text font="italic">baseline_activity</text></XMText>
                        <XMTok role="CLOSE" stretchy="false">)</XMTok>
                      </XMWrap>
                    </XMDual>
                  </XMApp>
                </XMApp>
              </XMath>
            </Math>
          </equation>
        </para>
<!--  %REMOVING THE FIGURE OF THE NN 
     %“begin–figure˝
     %“centering
     %“includegraphics[width=“linewidth]–Images/NeuralPictureBold.png˝
     %“caption–“textbf–Detail of the neural network.˝ The input of the neural network is the previous state. The neural network consists of 5 input neurons, a hidden layer of 8 neurons, 601 output neurons: 600 neurons that link motor commands to limb activity and one neuron that predicts sensory feedback.  Because we used a beta function to map output neurons to limbs, approximately 10“% of the neurons are connected to a given limb, meaning that many of the neurons are not connected to any limbs and only a very small number to multiple limbs. In the figure we have ordered the 600 neurons by the weight of their connection to limb 1. For this reason the neurons connected to limb 1 are all grouped together at the top of the network (and make it look like the area is shaded). Note that for simplicity the effects of motor noise and other components shown in Figure “ref–fig:Flowchart˝ are omitted here. ˝
     %**** Mobile˙paradigm˙model.tex Line 200 ****
     %“label–fig:NeuralNetwork˝
     %“end–figure˝-->        <para xml:id="S2.SS3.SSS1.p8">
          <p>To simulate that there were different infants in the behavioral experiments, the weights and biases are initialized randomly for each simulation. Each simulation begins with an initialization period of 1000 steps without any contingency in order to force the model’s baseline activity to approximate real infants’ starting activity. Based on infant data of previous studies <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite>, <cite class="ltx_citemacro_cite">[<bibref bibrefs="jacquey_popescu_2020" separator="," yyseparator=","/>]</cite>, we chose an activity with a mean of 0.15 and a standard deviation of 0.15 for the baseline. Time “0” of each run then starts after the initialization period.</p>
        </para>
        <para xml:id="S2.SS3.SSS1.p9">
          <p>The learning rate of our model is another important parameter determining the results and perhaps the parameter hardest to translate to the experiments in the real world. For infants, a higher learning rate could mean that they are able to learn more quickly. In neural networks, the learning rate is generally defined as the amount by which the network’s weights are changed at each learning step. A higher learning rate means that each learning step has a higher impact. However, a compromise in learning rate must be reached because a learning rate which is too high often gives rise to oscillatory behavior instead of converging to a dynamic equilibrium, while a very low learning rate would mean that the model would take a long time to find the equilibrium. We used a constant learning rate of 0.00075 for all simulations. We chose this value after empirical testing and found that it gave results that best matched those of infant studies <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite>, <cite class="ltx_citemacro_cite">[<bibref bibrefs="jacquey_popescu_2020" separator="," yyseparator=","/>]</cite>. Fig. <ref labelref="LABEL:fig:LRApendix"/> in the Appendix (Section <ref labelref="LABEL:Appendix"/>) shows examples of results with other learning rates.</p>
        </para>
      </subsubsection>
      <subsubsection inlist="toc" xml:id="S2.SS3.SSS2">
        <tags>
          <tag>II-C2</tag>
          <tag role="refnum">II-C2</tag>
          <tag role="typerefnum">§II-C2</tag>
        </tags>
        <title><tag close=" ">II-C2</tag>Sensory feedback</title>
        <para xml:id="S2.SS3.SSS2.p1">
          <p>The sensory feedback module has two purposes: (1) the simulation of the mobile and (2) the calculation of the prediction error. The simulated sensory feedback produced by the mobile is based on the activity of the connected limb. The sensory feedback is a single scalar value that might represent any sort of sensory feedback used in experimental work, be it auditory or visual.</p>
        </para>
        <para xml:id="S2.SS3.SSS2.p2">
          <p>In our model, out of the four limbs, only one limb is connected to the mobile. In the binary condition, when the activity of the connected limb exceeds a threshold, the mobile is activated for two steps. In the non-binary (conjugate) condition, the intensity of the sensory feedback emitted by the mobile is directly proportional to the value of the connected limb activity. After the contingent phase of the experiment comes the extinction phase. In this phase, the connected limb was disconnected from the mobile (in practice, we removed the sensory feedback from the mobile). The extinction phase lasts for the last 240 steps of each run.</p>
        </para>
        <para xml:id="S2.SS3.SSS2.p3">
          <p>In addition to activating the mobile as a function of limb movements, the sensory feedback module must calculate the error that the neural network has made in predicting the sensory feedback produced by the mobile. This error is defined as the mean squared error between the predicted sensory feedback and the actual sensory feedback. This value is then fed back into the neural network, where it acts as another loss used to adjust the network’s weights to improve its predictions.</p>
        </para>
      </subsubsection>
      <subsubsection inlist="toc" xml:id="S2.SS3.SSS3">
        <tags>
          <tag>II-C3</tag>
          <tag role="refnum">II-C3</tag>
          <tag role="typerefnum">§II-C3</tag>
        </tags>
        <title><tag close=" ">II-C3</tag>Exploration</title>
        <para xml:id="S2.SS3.SSS3.p1">
          <p>The exploration module uses the sensory prediction error calculated by the sensory feedback module and the new limb activities from the neural network to update the <text font="italic">activity interest map</text>. It also calculates an ”Exploration Error” used as one of the losses to modify the weights of the neural network, guiding it to a more efficient exploration.
<!--  %“subsection–Activity Interest Map˝ --></p>
        </para>
        <figure inlist="lof" labels="LABEL:fig:AIM" xml:id="S2.F3">
          <tags>
            <tag>Fig. 3</tag>
            <tag role="refnum">3</tag>
            <tag role="typerefnum">Fig. 3</tag>
          </tags>
          <graphics candidates="Images/Aimex.pdf" class="ltx_centering" graphic="Images/Aimex.pdf" options="width=433.62pt" xml:id="S2.F3.g1"/>
          <toccaption class="ltx_centering"><tag close=" ">3</tag><text font="bold">An example process of three timesteps and corresponding changes in the activity interest map of limb 1.</text> The activity is divided into 10 ranges: 0 to 0.1, 0.1 to 0.2 and so forth. Each range can have a different amount of interest from 0 to 1. At Timestep 1, the activity range 0.4-0.5 has an interest value of 0.6 that is the highest value (shown on green background). The model produces an activity in this range, 0.46 and the sensory feedback is unsurprising. Therefore, the interest value of the range 0.4-0.5 is reduced from 0.6 to 0.5 in Timestep 2. Now, in Timestep 2 the highest interest value among all activity ranges is 0.5 and there are two ranges with that value, 0.4-0.5 and 0.3-0.4. When there is more than one activity range with the highest interest value, one of them is randomly chosen. In this example that is the range 0.3-0.4 which is highlighted in green. The model produces an activity in this range, 0.39, and is surprised by the sensory feedback. Due to being surprised the interest in this range is raised to 1 in Timestep 3.
</toccaption>
          <caption class="ltx_centering"><tag close=": ">Fig. 3</tag><text font="bold">An example process of three timesteps and corresponding changes in the activity interest map of limb 1.</text> The activity is divided into 10 ranges: 0 to 0.1, 0.1 to 0.2 and so forth. Each range can have a different amount of interest from 0 to 1. At Timestep 1, the activity range 0.4-0.5 has an interest value of 0.6 that is the highest value (shown on green background). The model produces an activity in this range, 0.46 and the sensory feedback is unsurprising. Therefore, the interest value of the range 0.4-0.5 is reduced from 0.6 to 0.5 in Timestep 2. Now, in Timestep 2 the highest interest value among all activity ranges is 0.5 and there are two ranges with that value, 0.4-0.5 and 0.3-0.4. When there is more than one activity range with the highest interest value, one of them is randomly chosen. In this example that is the range 0.3-0.4 which is highlighted in green. The model produces an activity in this range, 0.39, and is surprised by the sensory feedback. Due to being surprised the interest in this range is raised to 1 in Timestep 3.
</caption>
<!--  %**** Mobile˙paradigm˙model.tex Line 225 **** -->        </figure>
        <para xml:id="S2.SS3.SSS3.p2">
          <p>The activity interest map keeps track of two elements: 1) which limb actions ranges the model has explored before and 2) the extent to which it can predict the resulting sensory feedback for those ranges. When the model successfully predicts the sensory feedback resulting from an action, it should become less interested in the corresponding action range. However, when the sensory feedback is unexpected, we want the model to become highly interested in that action range. To accomplish this, we use an Activity Interest Map array to record whether the agent is ”surprised” (see following paragraph) by the sensory result of its previous limb action. We discretize the continuous values for each limb activity into 10 action ranges of equal size, the first one for action values between 0 and 0.1 and so on, resulting in an overall array of dimensions (4,10) (four limbs and 10 ranges for each). At each simulation step, if a limb action generates surprise, the value in the table for the corresponding action range is directly set to 1, indicating high interest. However, if the limb action generates no surprise, the value in the array for the corresponding action range is decreased by 0.1, implementing a gradually decreasing interest. An example of the progress in the Activity Interest Map is shown in Fig. <ref labelref="LABEL:fig:AIM"/>. The array is initialized with values of 0.1 for each range. This can be interpreted as having a baseline level of interest in all possible actions.</p>
        </para>
        <para xml:id="S2.SS3.SSS3.p3">
          <p>As the model’s prediction is not binary, we introduce a ”Novelty Threshold” that is set to 0.1. The model becomes ”surprised” by the result of its action when the sensory prediction error generated by its limb action exceeds this novelty threshold. The novelty threshold determines the agent’s sensitivity to discrepancies between actual and predicted sensory feedback.</p>
        </para>
        <para xml:id="S2.SS3.SSS3.p4">
          <p>In the exploration module, once the activity interest map has been updated as a function of whether surprise has occurred, the model must modify the neural net so that in the future the agent will tend to make actions within ranges of high interest. For this, the model calculates the mean square error (MSE) as the distance between the current action and the range of activity with the highest interest (should there be more than one range of equal interest, one of them is randomly chosen). This distance is used as a loss to adjust the weights of the neural network so that future muscle commands lead to an activity closer to the area of highest interest.</p>
        </para>
      </subsubsection>
      <subsubsection inlist="toc" xml:id="S2.SS3.SSS4">
        <tags>
          <tag>II-C4</tag>
          <tag role="refnum">II-C4</tag>
          <tag role="typerefnum">§II-C4</tag>
        </tags>
        <title><tag close=" ">II-C4</tag>Deviation from baseline activity</title>
        <para xml:id="S2.SS3.SSS4.p1">
          <p>We incorporated a ”deviation from baseline” module to emulate the fact that, on the one hand, infants presumably act to keep their activity in a restricted range around a certain baseline, but, on the other hand, they also tend to get globally more agitated, with that baseline increasing as the experiment proceeds. Deviation from this baseline is used as another loss that modifies the functioning of the neural network.</p>
        </para>
        <para xml:id="S2.SS3.SSS4.p2">
          <p>The baseline is defined as four random values between 0 and a maximum value (set at 0.3 at the start), one for each limb, chosen at each simulation step. These emulate how each of the infant’s four limbs would tend to move naturally in the absence of any sensory stimulation. To emulate that infants tend to grow fussier over time, the maximum value is increased by 0.0001 at every simulation step up to 0.454 at the end. The deviation from baseline is measured using the mean squared error between the current four baseline values and the current four limb activity values. Note that we assume that infants are not only fatigued by moving above their baseline activity but also that they are fatigued by trying to keep still.</p>
        </para>
      </subsubsection>
    </subsection>
    <subsection inlist="toc" xml:id="S2.SS4">
      <tags>
        <tag>II-D</tag>
        <tag role="refnum">II-D</tag>
        <tag role="typerefnum">§II-D</tag>
      </tags>
      <title><tag close=" ">II-D</tag><text font="italic">Number of simulation runs and their duration</text></title>
      <subsubsection inlist="toc" xml:id="S2.SS4.SSS1">
        <tags>
          <tag>II-D1</tag>
          <tag role="refnum">II-D1</tag>
          <tag role="typerefnum">§II-D1</tag>
        </tags>
        <title><tag close=" ">II-D1</tag>Correspondence between the number of participants in the experiments and the number of simulations</title>
        <para xml:id="S2.SS4.SSS1.p1">
          <p>To parallel the number of infants in the two empirical studies, 20 in the binary condition in <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite> and 18 in the non-binary condition in <cite class="ltx_citemacro_cite">[<bibref bibrefs="jacquey_popescu_2020" separator="," yyseparator=","/>]</cite>, we performed 20 simulations for each of the two conditions.</p>
        </para>
      </subsubsection>
      <subsubsection inlist="toc" xml:id="S2.SS4.SSS2">
        <tags>
          <tag>II-D2</tag>
          <tag role="refnum">II-D2</tag>
          <tag role="typerefnum">§II-D2</tag>
        </tags>
        <title><tag close=" ">II-D2</tag>Correspondence between experimental and simulation time</title>
        <para xml:id="S2.SS4.SSS2.p1">
          <p>When comparing the model outputs with real infant experiments, we must decide how to link the time steps of the simulation (i.e. each pass through the loop of the simulation) with time in a real experiment. In our simulation of the binary condition, the mobile activation lasts for 2 simulation steps. In the original study <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite>, the mobile activation lasted 0.5 s. We took this as a reference and therefore decided to consider that a simulation step is equivalent to 0.25 s. Therefore, to simulate the duration of the original study of 300 s, the simulation included 1200 steps, or 5 <text font="italic">pseudo-minutes</text>. To allow the model to reach the baseline activity level, we added 1000 steps without the mobile at the beginning of the simulation (excluded from data analysis). To check if there is an extinction burst, we also added 240 steps without the mobile at the end of the simulation. Therefore, an entire “run” of the simulation involved 2440 steps or loops through the model. This ensures that the durations of infant experiments and of our simulations are comparable.</p>
        </para>
      </subsubsection>
    </subsection>
  </section>
  <section inlist="toc" xml:id="S3">
    <tags>
      <tag>III</tag>
      <tag role="refnum">III</tag>
      <tag role="typerefnum">§III</tag>
    </tags>
    <title><tag close=" ">III</tag><text font="smallcaps">Results</text></title>
    <para xml:id="S3.p1">
      <p>We start this section by checking whether our model replicates the two main empirical results, <text font="bold">Specificity of connected limb</text> and <text font="bold">Comparison with a control group</text>. We then look at what can be deduced from our simulations of the <text font="bold">Extinction burst</text>. Regarding the replication of the difference between the binary and conjugate stimulation, we report it within each of the first three subsections (we compare model simulation data with empirical data on 6-month-old infants separately for the binary and non-binary<note mark="1" role="footnote" xml:id="footnote1"><tags>
            <tag>1</tag>
            <tag role="refnum">1</tag>
            <tag role="typerefnum">footnote 1</tag>
          </tags>Note that we show graphs for both binary and non-binary conditions for every result, we will discuss the difference between these in subsection <ref labelref="LABEL:binaryVsNonBinary"/>.</note>, taken respectively from  <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite> and <cite class="ltx_citemacro_cite">[<bibref bibrefs="jacquey_popescu_2020" separator="," yyseparator=","/>]</cite>) and conclude on in the subsection <text font="bold">Greater effect for binary versus non-binary</text>. In the last subsection, we consider the results of ablation studies where we do simulations in which we remove different components of the model to test which of them are essential for the results.</p>
    </para>
    <subsection inlist="toc" xml:id="S3.SS1">
      <tags>
        <tag>III-A</tag>
        <tag role="refnum">III-A</tag>
        <tag role="typerefnum">§III-A</tag>
      </tags>
      <title><tag close=" ">III-A</tag><text font="italic">Specificity of connected limb</text></title>
      <para xml:id="S3.SS1.p1">
        <p>Fig. <ref labelref="LABEL:fig:SimVsInfant"/>, shows the graphs of the moment-to-moment activity of the four limbs in the simulations and a comparison with the corresponding infant data of the two arms. The most important fact is that the model does what it was expected to do: like infants, it <text font="bold">is able to differentiate the connected and unconnected limb(s)</text>. However, the model is somewhat slower in doing this, requiring more than one minute, whereas infants seem to differentiate the limbs at the start of the first minute.
<!--  %**** Mobile˙paradigm˙model.tex Line 250 **** --></p>
      </para>
      <para xml:id="S3.SS1.p2">
        <p>An obvious difference between the model and the infants is that the overall variability of activity is much greater for the infants than for the model. Compared to the model, between-participant variability may be greater in infants because they vary more in intrinsic parameters like body morphology, degree of maturation, and learning rate. Within-participant variability in infants may also be greater because they suffer moment-to-moment variations in attention, changes in interest and distractions that we have not modeled in our simulation.</p>
      </para>
      <figure inlist="lof" labels="LABEL:fig:SimVsInfant" xml:id="S3.F4">
        <tags>
          <tag>Fig. 4</tag>
          <tag role="refnum">4</tag>
          <tag role="typerefnum">Fig. 4</tag>
        </tags>
        <graphics candidates="Images/modelVsInfantsv1.pdf" class="ltx_centering" graphic="Images/modelVsInfantsv1.pdf" options="width=433.62pt" xml:id="S3.F4.g1"/>
        <toccaption class="ltx_centering"><tag close=" ">4</tag><text font="bold">Comparison of limb activity in simulations and infants (mean activity per limb and per 10-s bin).</text> Activity in simulations is expressed in arbitrary units and activity in infants is expressed in gravitation acceleration units. The thick curves show the mean activities across individual data, of the connected limb (red) and the unconnected limb(s) (gray). Thinner pale curves show individual data (individual simulation runs or infants). (A) Model in the Binary condition (20 simulation runs). (B) Infants in the Binary condition (data on 20 six-month-old infants in Popescu et al. <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite>). (C) Model in the Non-Binary condition (20 simulation runs). (D) Infants in the Non-Binary condition (data on 18 six-month-old infants in Jacquey et al. <cite class="ltx_citemacro_cite">[<bibref bibrefs="jacquey_popescu_2020" separator="," yyseparator=","/>]</cite>; note that this experiment lasted only 4 minutes and that the data on the attention-getter corresponding to the first bin of every minute was removed).</toccaption>
        <caption class="ltx_centering"><tag close=": ">Fig. 4</tag><text font="bold">Comparison of limb activity in simulations and infants (mean activity per limb and per 10-s bin).</text> Activity in simulations is expressed in arbitrary units and activity in infants is expressed in gravitation acceleration units. The thick curves show the mean activities across individual data, of the connected limb (red) and the unconnected limb(s) (gray). Thinner pale curves show individual data (individual simulation runs or infants). (A) Model in the Binary condition (20 simulation runs). (B) Infants in the Binary condition (data on 20 six-month-old infants in Popescu et al. <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite>). (C) Model in the Non-Binary condition (20 simulation runs). (D) Infants in the Non-Binary condition (data on 18 six-month-old infants in Jacquey et al. <cite class="ltx_citemacro_cite">[<bibref bibrefs="jacquey_popescu_2020" separator="," yyseparator=","/>]</cite>; note that this experiment lasted only 4 minutes and that the data on the attention-getter corresponding to the first bin of every minute was removed).</caption>
      </figure>
      <figure inlist="lof" labels="LABEL:fig:heatmapConUnCon" placement="t" xml:id="S3.F5">
        <tags>
          <tag>Fig. 5</tag>
          <tag role="refnum">5</tag>
          <tag role="typerefnum">Fig. 5</tag>
        </tags>
        <graphics candidates="Images/allHeatMap.pdf" class="ltx_centering" graphic="Images/allHeatMap.pdf" options="width=433.62pt" xml:id="S3.F5.g1"/>
<!--  %“includegraphics[width=0.49“linewidth]–Images/heatmapLR2NB.png˝ 
     %“includegraphics[width=0.49“linewidth]–Images/heatmapLR1NB.png˝
     %“includegraphics[width=0.49“linewidth]–Images/heatmapLR0NB.png˝-->        <toccaption class="ltx_centering"><tag close=" ">5</tag>Evolution of the activity interest map for each limb, example of a run in the non-binary condition. For each limb, the figure shows how the values of interest for each activity range evolve over time. The X-axis shows the time and the Y-axis shows the activity ranges. The color of each (x,y) cell shows the interest of that activity range at that specific time. The blue-yellow spectrum represents the scalar value corresponding to the interest, blue color corresponds to lowest interest values, greenish color–to intermediary and yellow color–to highest interest values.</toccaption>
        <caption class="ltx_centering"><tag close=": ">Fig. 5</tag>Evolution of the activity interest map for each limb, example of a run in the non-binary condition. For each limb, the figure shows how the values of interest for each activity range evolve over time. The X-axis shows the time and the Y-axis shows the activity ranges. The color of each (x,y) cell shows the interest of that activity range at that specific time. The blue-yellow spectrum represents the scalar value corresponding to the interest, blue color corresponds to lowest interest values, greenish color–to intermediary and yellow color–to highest interest values.</caption>
<!--  %“vspace–-2cm˝ -->      </figure>
      <para xml:id="S3.SS1.p3">
        <p>To provide a deeper look into the mechanics of the model and how it can identify the connected limb, let us zoom in on the Exploration module (see Fig. <ref labelref="LABEL:fig:Flowchart"/>), specifically on the Activity Interest Map. The main goal of this sub-module is to guide exploration in our model. It does so by calculating how interested the model is in each limb activity. Fig. <ref labelref="LABEL:fig:heatmapConUnCon"/> shows how the values of ”interest” in the Activity Interest Map change over time for each limb. The difference between the connected (lower right panel) and the unconnected limbs is evident. This shows how the model identifies which limb is connected and explores its activity distinctively.
<!--  %**** Mobile˙paradigm˙model.tex Line 275 **** --></p>
      </para>
      <para xml:id="S3.SS1.p4">
        <p>In Fig. <ref labelref="LABEL:fig:heatmapConUnCon"/>, the colors from blue to yellow represent rising values of ”interest”, with blue corresponding to lowest interest, green shades to intermediate, and yellow to highest interest. For the connected limb, there is a clear separation between blue and yellow areas: blue areas (little interest) are found near where most of the limb’s activity is occurring, namely near values of about 0.5 (see Fig. <ref labelref="LABEL:fig:SimVsInfant"/>). For activity ranges for which the model has learned to predict the sensory feedback, the model is no longer ”surprised” and will explore them less than the very high and very low activity ranges. For the unconnected limbs, there is no statistically reliable link between the limb’s activity and the sensory feedback, and so no reliable predictions can be made for any activity range; we see fairly high interest (yellow shades) for all activity ranges. The region in the heatmaps after minute 5 corresponds to the time when the contingency has been removed. The model always receives the same sensory feedback set at 0, and so becomes able to predict the results of all of its actions. There is no interest in any activity region.</p>
      </para>
<!--  %One minor aspect worth mentioning is due to the way we implemented the variations of activity about the average baseline level, are not correlated between themselves from one step to the next, potentially resulting in a higher frequency and lower amplitude of limb activity variation. -->    </subsection>
    <subsection inlist="toc" xml:id="S3.SS2">
      <tags>
        <tag>III-B</tag>
        <tag role="refnum">III-B</tag>
        <tag role="typerefnum">§III-B</tag>
      </tags>
      <title><tag close=" ">III-B</tag><text font="italic">Comparison with a control group</text></title>
      <para xml:id="S3.SS2.p1">
        <p>We not only want to verify that the model correctly simulates what infants do in the presence of contingent sensory feedback (Figures <ref labelref="LABEL:fig:contingentvsNot"/> A and C), but also what they do when the sensory feedback is not contingent on their actions, that is, when the stimuli are triggered independently of limb movements. Experimental data obtained for infants in this case is presented in Fig. <ref labelref="LABEL:fig:contingentvsNot"/> B and D. For the non-contingent condition, the stimuli from the contingent group were ”replayed”. In this way, the frequency and intensity of stimuli in the contingent and non-contingent groups were equated. We did the same in our model. The most important finding is that our model correctly simulates the higher activity in the contingent group compared to the non-contingent (control) group, as shown in Fig. <ref labelref="LABEL:fig:contingentvsNot"/>. It indicates that, <text font="bold">like infants, the model is sensitive to the contingency itself, not just the sensory stimulation coming from the mobile.</text></p>
      </para>
      <figure inlist="lof" labels="LABEL:fig:contingentvsNot" placement="t" xml:id="S3.F6">
        <tags>
          <tag>Fig. 6</tag>
          <tag role="refnum">6</tag>
          <tag role="typerefnum">Fig. 6</tag>
        </tags>
        <graphics candidates="Images/contingentVsNot1000v1.pdf" graphic="Images/contingentVsNot1000v1.pdf" options="width=433.62pt" xml:id="S3.F6.g1"/>
        <toccaption><tag close=" ">6</tag><text font="bold">Mean limb activity in Contingent and Non-Contingent conditions (mean over joint activity of the connected and unconnected limbs per 10-s bin).</text> Note that for consistency with infant data, in the complete model we selected one unconnected limb among the three available). The thick curves show the mean activities, across individual data, of the Contingent group (green) and the Non-Contingent group (orange). Thinner pale curves show individual data (individual simulation runs or infants). (A) Complete model in the Binary condition (20 simulation runs). (B) Infants in the Binary condition (data on 20 6-month-old infants in Popescu et al. <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite>). (C) Complete model in the Non-Binary condition (20 simulation runs). (D) Infants in the Non-Binary condition (data on 18 6-month-old infants in Jacquey et al. <cite class="ltx_citemacro_cite">[<bibref bibrefs="jacquey_popescu_2020" separator="," yyseparator=","/>]</cite>); note that this experiment was shorter (4 minutes), and that, similar to the original study, the data on the attention-getter corresponding to the first bin of every minute was removed.</toccaption>
        <caption><tag close=": ">Fig. 6</tag><text font="bold">Mean limb activity in Contingent and Non-Contingent conditions (mean over joint activity of the connected and unconnected limbs per 10-s bin).</text> Note that for consistency with infant data, in the complete model we selected one unconnected limb among the three available). The thick curves show the mean activities, across individual data, of the Contingent group (green) and the Non-Contingent group (orange). Thinner pale curves show individual data (individual simulation runs or infants). (A) Complete model in the Binary condition (20 simulation runs). (B) Infants in the Binary condition (data on 20 6-month-old infants in Popescu et al. <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite>). (C) Complete model in the Non-Binary condition (20 simulation runs). (D) Infants in the Non-Binary condition (data on 18 6-month-old infants in Jacquey et al. <cite class="ltx_citemacro_cite">[<bibref bibrefs="jacquey_popescu_2020" separator="," yyseparator=","/>]</cite>); note that this experiment was shorter (4 minutes), and that, similar to the original study, the data on the attention-getter corresponding to the first bin of every minute was removed.</caption>
      </figure>
      <para xml:id="S3.SS2.p2">
        <p>The difference between the contingent and non-contingent groups are somewhat smaller in the model than in infants. Also, for the model, the shape of the curves for contingent and non-contingent conditions is fairly similar, starting with a rising slope and then flattening off. On the other hand, for infants, the curves are overall flatter and may (in the case of the non-contingent condition) not have a rising part at all. This could suggest that in the non-contingent condition, infants realize from the beginning that they have no impact on the stimuli and so move less than in the contingent condition. The model, in contrast, ”believes” for longer that it affects the stimuli and searches longer for a way to use its limbs to manipulate the sensory feedback. Another difference between the model and the infants is that the variability is much more significant in the infant data, similar to what we saw in the previous subsection.</p>
      </para>
      <para xml:id="S3.SS2.p3">
        <p>Interestingly, there is a peculiar aspect of the curves for the binary condition, where at the beginning of the experiment the non-contingent group actually shows greater activity than the contingent condition, but at around 2 minutes into the experiment it falls below the curves for the contingent group. This behavior is unexpected, as there should be no reason for a higher activity in the non-contingent group at the start of the simulation.</p>
      </para>
<!--  %This difference also suggests that the model’s exploration mechanism may in some way be somewhat different from that in infants.% WAIT FOR JOSUA’s IDEAS!!! 
     %__???-->    </subsection>
    <subsection inlist="toc" xml:id="S3.SS3">
      <tags>
        <tag>III-C</tag>
        <tag role="refnum">III-C</tag>
        <tag role="typerefnum">§III-C</tag>
      </tags>
      <title><tag close=" ">III-C</tag><text font="italic">Extinction burst</text></title>
      <para xml:id="S3.SS3.p1">
        <p>In the literature on the mobile paradigm, an extinction burst is sometimes observed when, after the infant has learned to activate the mobile with the connected limb, this limb is disconnected from the mobile causing a sudden and transient burst of activity <cite class="ltx_citemacro_cite">[<bibref bibrefs="alessandri_violation_1990" separator="," yyseparator=","/>]</cite>,<cite class="ltx_citemacro_cite">[<bibref bibrefs="heathcock_relative_2005" separator="," yyseparator=","/>]</cite>,<cite class="ltx_citemacro_cite">[<bibref bibrefs="lewis_violation_1990" separator="," yyseparator=","/>]</cite>,<cite class="ltx_citemacro_cite">[<bibref bibrefs="rovee-collier_topographical_1978" separator="," yyseparator=","/>]</cite>. Such extinction bursts can be interpreted as the reaction to a violation of expectations and loss of control <cite class="ltx_citemacro_cite">[<bibref bibrefs="lewis_violation_1990" separator="," yyseparator=","/>]</cite>. When in our model, we disconnect the connected limb, we also observe extinction bursts but they do not occur systematically, as shown in Fig. <ref labelref="LABEL:fig:Extinction"/>. This is similar to results on extinction bursts in infants (for a review, see <cite class="ltx_citemacro_cite">[<bibref bibrefs="bednarski_infants_2022" separator="," yyseparator=","/>]</cite>).</p>
      </para>
<!--  %To clarify this question, we have examined what happens to the behavior of our model when we disconnect the connected limb. As in infant studies, after some time (in our case, 5 pseudo-minutes), we disconnected from the mobile the previously connected limb. Similar to findings in infants, extinction bursts ware non-systematic, and happened in some but not all simulation runs. 
     %**** Mobile˙paradigm˙model.tex Line 300 ****
     %Since the model is attempting to find contingencies between its actions and their effect, we expected that when the contingency is removed, the model would immediately start a wider exploration of its possible actions in order to re-establish the contingency. We should not expect a burst in activity but rather a greater variability of activity, including also lower activity.-->      <para xml:id="S3.SS3.p2">
        <p>Figure <ref labelref="LABEL:fig:Extinction"/> shows the average results of our simulations along with two examples of individual runs. The vertical line in the graphs shows the moment of extinction of the contingency, marked as time 5 minutes. Whereas in the binary condition there seems to be no evidence for an extinction burst, there seems to be a clear one in the non-binary condition. The data for individual simulated infants are quite variable (see Fig. <ref labelref="LABEL:fig:individualStandardB"/> and <ref labelref="LABEL:fig:individualStandardNB"/> in the Appendix, Section <ref labelref="LABEL:Appendix"/>), with two examples shown in Fig. <ref labelref="LABEL:fig:Extinction"/> B-C and <ref labelref="LABEL:fig:Extinction"/> E-F, one with a possible extinction burst, and one without an extinction burst. In the individual data there does not seem to be clear evidence for a sudden increase in activity after the extinction of the contingency. Our findings are consistent with the fact that the existence of an extinction burst is debated in the literature, since different infants may have different learning rates, and depending on the sample, extinction bursts may be more or less evident.</p>
      </para>
      <figure inlist="lof" labels="LABEL:fig:Extinction" placement="!t" xml:id="S3.F7">
        <tags>
          <tag>Fig. 7</tag>
          <tag role="refnum">7</tag>
          <tag role="typerefnum">Fig. 7</tag>
        </tags>
        <graphics candidates="Images/extinction1000v1.pdf" class="ltx_centering" graphic="Images/extinction1000v1.pdf" options="width=433.62pt" xml:id="S3.F7.g1"/>
        <toccaption class="ltx_centering"><tag close=" ">7</tag><text font="bold">Limb activity before and after removing the contingency (mean activity per limb and per 10-s bin).</text> On the Time axis, the moment of contingency removal is emphasized with a vertical line. The one-minute period after contingency removal is shown on contrasting background. (A &amp; D) The thick curves show the mean activities, across individual data, of the connected limb (red) and the unconnected limbs (gray). Thinner pale curves show data on individual simulation runs. (A) Model in the Binary condition (20 simulation runs). Examples of individual runs in the Binary condition (B) with an extinction burst and (C) without an extinction burst. (D) Model in the Non-Binary condition (20 simulation runs). Examples of individual runs in the Non-Binary condition (E) with an extinction burst and (F) without an extinction burst.</toccaption>
        <caption class="ltx_centering"><tag close=": ">Fig. 7</tag><text font="bold">Limb activity before and after removing the contingency (mean activity per limb and per 10-s bin).</text> On the Time axis, the moment of contingency removal is emphasized with a vertical line. The one-minute period after contingency removal is shown on contrasting background. (A &amp; D) The thick curves show the mean activities, across individual data, of the connected limb (red) and the unconnected limbs (gray). Thinner pale curves show data on individual simulation runs. (A) Model in the Binary condition (20 simulation runs). Examples of individual runs in the Binary condition (B) with an extinction burst and (C) without an extinction burst. (D) Model in the Non-Binary condition (20 simulation runs). Examples of individual runs in the Non-Binary condition (E) with an extinction burst and (F) without an extinction burst.</caption>
      </figure>
    </subsection>
    <subsection inlist="toc" labels="LABEL:binaryVsNonBinary" xml:id="S3.SS4">
      <tags>
        <tag>III-D</tag>
        <tag role="refnum">III-D</tag>
        <tag role="typerefnum">§III-D</tag>
      </tags>
      <title><tag close=" ">III-D</tag><text font="italic">Greater effect for binary versus non-binary</text></title>
      <para xml:id="S3.SS4.p1">
        <p>Our prediction for our model was that a binary contingency would be more easily learned and more effective in increasing the model’s activity than a non-binary (conjugate) contingency. Indeed, this is what we find when comparing connected and unconnected limbs (Fig. <ref labelref="LABEL:fig:SimVsInfant"/> A and C) and contingent and non-contingent conditions (Fig. <ref labelref="LABEL:fig:contingentvsNot"/> A and C).</p>
      </para>
      <para xml:id="S3.SS4.p2">
        <p>To explain these effects, we suggest that in the binary condition, a small change in activity can cause a large change in the sensory feedback, while in the non-binary condition, the changes in activity and sensory feedback are proportional. In the binary condition, a small limb activity that crosses the stimulus triggering threshold creates a strong change in the sensory feedback. Therefore, in the binary condition, it is easier to detect which is the connected limb, even when limb activity only varies slightly compared to other limbs. While in the non-binary condition, it is harder to detect the limb that is responsible for a small change in sensory feedback. Analogous differences between the two conditions are also observed in the ablation studies of the number of muscles commands and motor noise.</p>
      </para>
<!--  %The surprisingly high activity in the unconnected limbs as compared to the connected limb remains to be explored further.)) -->    </subsection>
    <subsection inlist="toc" xml:id="S3.SS5">
      <tags>
        <tag>III-E</tag>
        <tag role="refnum">III-E</tag>
        <tag role="typerefnum">§III-E</tag>
      </tags>
      <title><tag close=" ">III-E</tag><text font="italic">Ablation studies</text></title>
      <subsubsection inlist="toc" xml:id="S3.SS5.SSS1">
        <tags>
          <tag>III-E1</tag>
          <tag role="refnum">III-E1</tag>
          <tag role="typerefnum">§III-E1</tag>
        </tags>
        <title><tag close=" ">III-E1</tag>No prediction error</title>
        <para xml:id="S3.SS5.SSS1.p1">
          <p>To see how the model works in the absence of the prediction error, we disconnect the prediction loss from the network, not allowing it to modify any of the weights. Therefore, this ablation shows how the model works when it cannot predict the effects of its actions. Fig. <ref labelref="LABEL:fig:predictionAb"/> shows the results of this ablation study. The effect of this ablation is to completely prevent the model from distinguishing connected and unconnected limbs, therefore, the model cannot learn to activate the mobile. Without the prediction loss, the model’s behavior becomes mainly determined by which activity levels are labeled as causing surprise. Note that the model continues to compute ”surprise” and modify the Activity Interest Map as a function of the prediction error. Therefore, in this case, once the model by chance finds a surprising activity, it continues to explore that activity range, with variations caused only by random motor noise.
<!--  %**** Mobile˙paradigm˙model.tex Line 325 **** 
     %The explanation of this effect is clear: with no prediction error loss sent to the neural network, the model cannot learn to predict the effects of its actions.--></p>
        </para>
        <figure inlist="lof" labels="LABEL:fig:predictionAb" placement="t" xml:id="S3.F8">
          <tags>
            <tag>Fig. 8</tag>
            <tag role="refnum">8</tag>
            <tag role="typerefnum">Fig. 8</tag>
          </tags>
          <graphics candidates="Images/noPrediction1000.pdf" class="ltx_centering" graphic="Images/noPrediction1000.pdf" options="width=433.62pt" xml:id="S3.F8.g1"/>
          <toccaption class="ltx_centering"><tag close=" ">8</tag><text font="bold">Ablation of the prediction error.</text> The thick curves show the mean activities (mean activity per limb and per 10-s bin, data on 20 simulation runs), across individual data, of the connected limb (red) and the unconnected limbs (gray). Thinner pale curves show individual simulation runs. (A) Complete model in the Binary condition. (B) Ablated model in the Binary condition. (C) Complete model in the Non-Binary condition. (D) Ablated model the Non-Binary condition.</toccaption>
          <caption class="ltx_centering"><tag close=": ">Fig. 8</tag><text font="bold">Ablation of the prediction error.</text> The thick curves show the mean activities (mean activity per limb and per 10-s bin, data on 20 simulation runs), across individual data, of the connected limb (red) and the unconnected limbs (gray). Thinner pale curves show individual simulation runs. (A) Complete model in the Binary condition. (B) Ablated model in the Binary condition. (C) Complete model in the Non-Binary condition. (D) Ablated model the Non-Binary condition.</caption>
        </figure>
      </subsubsection>
      <subsubsection inlist="toc" xml:id="S3.SS5.SSS2">
        <tags>
          <tag>III-E2</tag>
          <tag role="refnum">III-E2</tag>
          <tag role="typerefnum">§III-E2</tag>
        </tags>
        <title><tag close=" ">III-E2</tag>No novelty-based reinforcement calculator</title>
        <para xml:id="S3.SS5.SSS2.p1">
          <p>Figure <ref labelref="LABEL:fig:expoAb"/> shows the ablation run where the novelty-based reinforcement calculator is not used to modify the weights of the neural network. This ablation not only prevents the model from being able to distinguish between the connected and unconnected limbs but also lowers the overall limb activity. This effect is expected since the model has no incentive to explore, therefore it optimizes its prediction ability and reduces the deviation from baseline limb activity.</p>
        </para>
        <figure inlist="lof" labels="LABEL:fig:expoAb" placement="t" xml:id="S3.F9">
          <tags>
            <tag>Fig. 9</tag>
            <tag role="refnum">9</tag>
            <tag role="typerefnum">Fig. 9</tag>
          </tags>
          <graphics candidates="Images/noExploration1000.pdf" class="ltx_centering" graphic="Images/noExploration1000.pdf" options="width=433.62pt" xml:id="S3.F9.g1"/>
          <toccaption class="ltx_centering"><tag close=" ">9</tag><text font="bold">Ablation of the exploration loss.</text> The thick curves show the mean activities (mean activity per limb and per 10-s bin, data on 20 simulation runs), across individual data, of the connected limb (red) and the unconnected limbs (gray). Thinner pale curves show individual simulation runs. (A) Complete model in the Binary condition. (B) Ablated model in the Binary condition. (C) Complete model in the Non-Binary condition. (D) Ablated model the Non-Binary condition.</toccaption>
          <caption class="ltx_centering"><tag close=": ">Fig. 9</tag><text font="bold">Ablation of the exploration loss.</text> The thick curves show the mean activities (mean activity per limb and per 10-s bin, data on 20 simulation runs), across individual data, of the connected limb (red) and the unconnected limbs (gray). Thinner pale curves show individual simulation runs. (A) Complete model in the Binary condition. (B) Ablated model in the Binary condition. (C) Complete model in the Non-Binary condition. (D) Ablated model the Non-Binary condition.</caption>
        </figure>
      </subsubsection>
      <subsubsection inlist="toc" xml:id="S3.SS5.SSS3">
        <tags>
          <tag>III-E3</tag>
          <tag role="refnum">III-E3</tag>
          <tag role="typerefnum">§III-E3</tag>
        </tags>
        <title><tag close=" ">III-E3</tag>Reducing the number of muscle commands</title>
        <para xml:id="S3.SS5.SSS3.p1">
          <p>In this ablation study, we changed the output layer from 600 muscle commands to 50, 100 or 300. The main finding is that, to consistently simulate infant behavior, the model needs a large number of muscle commands, as shown in Fig. <ref labelref="LABEL:fig:muscleAb"/>. The number of required muscle commands is larger in the non-binary condition (more than 300) compared to the binary condition (<Math mode="inline" tex="\sim 100" text="absent similar-to 100" xml:id="S3.SS5.SSS3.p1.m1">
              <XMath>
                <XMApp>
                  <XMTok meaning="similar-to" name="sim" role="RELOP">∼</XMTok>
                  <XMTok meaning="absent"/>
                  <XMTok meaning="100" role="NUMBER">100</XMTok>
                </XMApp>
              </XMath>
            </Math>).</p>
        </para>
        <para xml:id="S3.SS5.SSS3.p2">
          <p>The effect of this ablation is seen mainly in the variability across individual simulations. For small numbers of muscle commands, there are more individual runs in which the distinction between connected and unconnected is not learned by the model (e.g. Fig. <ref labelref="LABEL:fig:muscleAb"/> B and F). In the binary condition, fewer muscle commands (100, Fig. <ref labelref="LABEL:fig:muscleAb"/> C and G) are needed to reliably elicit a difference in activity between connected and unconnected limbs. In the non-binary condition, even with 300 muscle commands individual runs that do not learn to differentiate the limbs remain.</p>
        </para>
        <figure inlist="lof" labels="LABEL:fig:muscleAb" placement="t" xml:id="S3.F10">
          <tags>
            <tag>Fig. 10</tag>
            <tag role="refnum">10</tag>
            <tag role="typerefnum">Fig. 10</tag>
          </tags>
          <graphics candidates="Images/numberMuscles1000.pdf" class="ltx_centering" graphic="Images/numberMuscles1000.pdf" options="width=433.62pt" xml:id="S3.F10.g1"/>
          <toccaption class="ltx_centering"><tag close=" ">10</tag><text font="bold">Ablation of the output layer from 600 muscle commands to 50, 100 or 300.</text> The thick curves show the mean activities (mean activity per limb and per 10-s bin, data on 20 simulation runs), across individual data, of the connected limb (red) and the unconnected limbs (gray). Thinner pale curves show individual simulation runs. (A) Complete model in the Binary condition that includes 600 muscle commands (mc). (B, C, D) Ablation of the model in the Binary condition to 50, 100, and 300 muscle commands (mc), respectively. (E) Complete model in the Non-Binary condition that includes 600 muscle commands (mc). (F, G, H) Ablation of the model in the Non-Binary condition to 50, 100, and 300 muscle commands (mc), respectively.</toccaption>
          <caption class="ltx_centering"><tag close=": ">Fig. 10</tag><text font="bold">Ablation of the output layer from 600 muscle commands to 50, 100 or 300.</text> The thick curves show the mean activities (mean activity per limb and per 10-s bin, data on 20 simulation runs), across individual data, of the connected limb (red) and the unconnected limbs (gray). Thinner pale curves show individual simulation runs. (A) Complete model in the Binary condition that includes 600 muscle commands (mc). (B, C, D) Ablation of the model in the Binary condition to 50, 100, and 300 muscle commands (mc), respectively. (E) Complete model in the Non-Binary condition that includes 600 muscle commands (mc). (F, G, H) Ablation of the model in the Non-Binary condition to 50, 100, and 300 muscle commands (mc), respectively.</caption>
<!--  %**** Mobile˙paradigm˙model.tex Line 350 **** -->        </figure>
      </subsubsection>
      <subsubsection inlist="toc" xml:id="S3.SS5.SSS4">
        <tags>
          <tag>III-E4</tag>
          <tag role="refnum">III-E4</tag>
          <tag role="typerefnum">§III-E4</tag>
        </tags>
        <title><tag close=" ">III-E4</tag>Changing the amount of motor noise</title>
        <para xml:id="S3.SS5.SSS4.p1">
          <p>In this ablation study, shown in Fig. <ref labelref="LABEL:fig:MotorAb"/>, we explored the effect of changing the amount of motor noise from the value of 0.3 used in the complete model to very low values of 0.01 and 0.1 and a very high value of 0.9. We observe that the model’s learning process is highly sensitive to motor noise. To achieve optimal learning, motor noise is essential; however, both insufficient and excessive levels of motor noise are detrimental.</p>
        </para>
        <para xml:id="S3.SS5.SSS4.p2">
          <p>As shown in Fig. <ref labelref="LABEL:fig:MotorAb"/> B and F, the ablation with a motor noise of 0.001 stops the model from learning limb differentiation, with the connected and unconnected limbs behaving very similarly, with a lot of individual variation. There are additionally some individual cases with almost no exploration. As an explanation of this effect, we suggest that having a motor noise of 0.001 provides no opportunity for the model to notice differences between the limbs. There will be little difference in prediction loss between the limbs, so the model will have difficulty knowing which limb causes the effect on the mobile. Said in another way, since the correlation between the different limbs and the sensory effect is so similar for all limbs, the model cannot discern the causal relationship triggering the sensory effect and is therefore unable to distinguish between the four limbs and find which limb is the connected one.</p>
        </para>
        <para xml:id="S3.SS5.SSS4.p3">
          <p>In the ablation in which the motor noise was lowered to 0.1 (Figure <ref labelref="LABEL:fig:MotorAb"/> C and G), we observe a stark difference between the binary condition and the non-binary condition. In the binary condition, the model quickly learns to identify the connected limb and significantly increases the activity in that limb compared to the unconnected limbs. In contrast, in the non-binary condition, the model seems to have more problems identifying the connected limb. For the first 2.5 pseudo-minutes of the non-binary condition, the connected and unconnected limbs have very similar activity levels. After that, surprisingly, the connected limb has lower activity than the unconnected limbs. The overall activity of the unconnected limbs is also higher than without ablation, and the overall variability in activity of all limbs is increased.</p>
        </para>
        <para xml:id="S3.SS5.SSS4.p4">
          <p>Finally, in the ablation in which the motor noise was raised to 0.9 (Figure <ref labelref="LABEL:fig:MotorAb"/> D and H), learning is hindered even more. The network has very little control over its actions because they become essentially random. This lack of control prevents learning: the average activity of the connected limb and the unconnected limbs progresses similarly, although there is some differentiation of the connected limb in the binary condition. A point worth noting is that when the motor noise is as high as 0.9, the model activity starts lower than 0.2. We assume this occurs because, with such high motor noise, the model will not have attained the correct baseline of about 0.2, even after the initialization period of 1000 steps.</p>
        </para>
        <figure inlist="lof" labels="LABEL:fig:MotorAb" placement="ht" xml:id="S3.F11">
          <tags>
            <tag>Fig. 11</tag>
            <tag role="refnum">11</tag>
            <tag role="typerefnum">Fig. 11</tag>
          </tags>
          <graphics candidates="Images/motorNoise1000.pdf" class="ltx_centering" graphic="Images/motorNoise1000.pdf" options="width=433.62pt" xml:id="S3.F11.g1"/>
          <toccaption class="ltx_centering"><tag close=" ">11</tag><text font="bold">Changing the level of motor noise.</text> The thick curves show the mean activities (mean activity per limb and per 10-s bin, data on 20 simulation runs), across individual data, of the connected limb (red) and the unconnected limbs (gray). Thinner pale curves show individual simulation runs. (A) Complete model in the Binary condition with motor noise mn = 0.3. (B-D) Ablated models in the Binary condition with motor noise of 0.001, 0.1 and 0.9, respectively. (E) Complete model in the Non-Binary condition with motor noise mn = 0.3. (F-H) Ablated models the Non-Binary condition with motor noise of 0.001, 0.1 and 0.9, respectively.</toccaption>
          <caption class="ltx_centering"><tag close=": ">Fig. 11</tag><text font="bold">Changing the level of motor noise.</text> The thick curves show the mean activities (mean activity per limb and per 10-s bin, data on 20 simulation runs), across individual data, of the connected limb (red) and the unconnected limbs (gray). Thinner pale curves show individual simulation runs. (A) Complete model in the Binary condition with motor noise mn = 0.3. (B-D) Ablated models in the Binary condition with motor noise of 0.001, 0.1 and 0.9, respectively. (E) Complete model in the Non-Binary condition with motor noise mn = 0.3. (F-H) Ablated models the Non-Binary condition with motor noise of 0.001, 0.1 and 0.9, respectively.</caption>
        </figure>
        <figure inlist="lof" labels="LABEL:fig:noveltyAb" xml:id="S3.F12">
          <tags>
            <tag>Fig. 12</tag>
            <tag role="refnum">12</tag>
            <tag role="typerefnum">Fig. 12</tag>
          </tags>
          <graphics candidates="Images/highNovelty1000.pdf" class="ltx_centering" graphic="Images/highNovelty1000.pdf" options="width=433.62pt" xml:id="S3.F12.g1"/>
          <toccaption class="ltx_centering"><tag close=" ">12</tag><text font="bold">Increasing the novelty threshold.</text> The thick curves show the mean activities (mean activity per limb and per 10-s bin, data on 20 simulation runs), across individual data, of the connected limb (red) and the unconnected limbs (gray). Thinner pale curves show individual simulation runs. (A) Complete model in the Binary condition. (B) Ablated model in the Binary condition. (C) Complete model in the Non-Binary condition. (D) Ablated model the Non-Binary condition.</toccaption>
          <caption class="ltx_centering"><tag close=": ">Fig. 12</tag><text font="bold">Increasing the novelty threshold.</text> The thick curves show the mean activities (mean activity per limb and per 10-s bin, data on 20 simulation runs), across individual data, of the connected limb (red) and the unconnected limbs (gray). Thinner pale curves show individual simulation runs. (A) Complete model in the Binary condition. (B) Ablated model in the Binary condition. (C) Complete model in the Non-Binary condition. (D) Ablated model the Non-Binary condition.</caption>
        </figure>
      </subsubsection>
      <subsubsection inlist="toc" xml:id="S3.SS5.SSS5">
        <tags>
          <tag>III-E5</tag>
          <tag role="refnum">III-E5</tag>
          <tag role="typerefnum">§III-E5</tag>
        </tags>
        <title><tag close=" ">III-E5</tag>Increasing the novelty threshold</title>
<!--  %**** Mobile˙paradigm˙model.tex Line 375 **** -->        <para xml:id="S3.SS5.SSS5.p1">
          <p>In this ablation study we increased the novelty threshold (from 0.1 to 0.3). The novelty threshold determines whether the model interprets a sensory feedback as surprising or not. The main results of this ablation study is that a higher novelty thresholds leads to a higher variability and an earlier differentiation of the connected and unconnected limbs, as shown in Figure <ref labelref="LABEL:fig:noveltyAb"/>. In the non-binary condition the final difference in activity of the limbs also seems somewhat higher. Presumably, these effects arise because increasing the novelty threshold decreases the effect of motor noise and makes learning easier. It may be possible that carefully adjusting the novelty threshold could at least partially allow us to match the increased variability observed overall in infants, including in the binary condition and from the very beginning of the experiment.</p>
        </para>
        <figure inlist="lof" labels="LABEL:fig:baselineAb" xml:id="S3.F13">
          <tags>
            <tag>Fig. 13</tag>
            <tag role="refnum">13</tag>
            <tag role="typerefnum">Fig. 13</tag>
          </tags>
          <graphics candidates="Images/noFatigue1000.pdf" graphic="Images/noFatigue1000.pdf" options="width=433.62pt" xml:id="S3.F13.g1"/>
          <toccaption><tag close=" ">13</tag><text font="bold">Ablation of the deviation from baseline activity loss.</text> The thick curves show the mean activities (mean activity per limb and per 10-s bin, data on 20 simulation runs), across individual data, of the connected limb (red) and the unconnected limbs (gray). Thinner pale curves show individual simulation runs. (A) Complete model in the Binary condition with the novelty threshold set at 0.1. (B) Ablated model in the Binary condition with the novelty threshold increased to 0.3. (C) Complete model in the Non-Binary condition with the novelty threshold set at 0.1. (D) Ablated model the Non-Binary condition with the novelty threshold increased to 0.3.</toccaption>
          <caption><tag close=": ">Fig. 13</tag><text font="bold">Ablation of the deviation from baseline activity loss.</text> The thick curves show the mean activities (mean activity per limb and per 10-s bin, data on 20 simulation runs), across individual data, of the connected limb (red) and the unconnected limbs (gray). Thinner pale curves show individual simulation runs. (A) Complete model in the Binary condition with the novelty threshold set at 0.1. (B) Ablated model in the Binary condition with the novelty threshold increased to 0.3. (C) Complete model in the Non-Binary condition with the novelty threshold set at 0.1. (D) Ablated model the Non-Binary condition with the novelty threshold increased to 0.3.</caption>
        </figure>
      </subsubsection>
      <subsubsection inlist="toc" xml:id="S3.SS5.SSS6">
        <tags>
          <tag>III-E6</tag>
          <tag role="refnum">III-E6</tag>
          <tag role="typerefnum">§III-E6</tag>
        </tags>
        <title><tag close=" ">III-E6</tag>No deviation from baseline activity error</title>
        <para xml:id="S3.SS5.SSS6.p1">
          <p>In this ablation study (Fig. <ref labelref="LABEL:fig:baselineAb"/>), we remove the deviation from baseline activity loss. This has a very small effect. The reason for introducing the deviation from baseline activity loss in our model was twofold: to limit the overall activity to some activity range around the baseline level and to account for the gradual increase in fussiness observed in infants as the experiments progressed. Presumably, the reason for the small effect of this ablation is that the urge to explore different activity ranges is the main driver of limb activity, and adding a factor that was supposed to simulate ”fussiness” over time was superfluous.</p>
        </para>
      </subsubsection>
    </subsection>
  </section>
  <section inlist="toc" xml:id="S4">
    <tags>
      <tag>IV</tag>
      <tag role="refnum">IV</tag>
      <tag role="typerefnum">§IV</tag>
    </tags>
    <title><tag close=" ">IV</tag><text font="smallcaps">Discussion, Conclusion, and Future Work</text></title>
    <para xml:id="S4.p1">
      <p>This paper introduced a computational model that simulates key mechanisms of contingency learning in the “mobile paradigm”. In this paradigm, an infant observes movements or listens to sounds produced by a ’mobile’ connected to one of its limbs. It has been observed that infants soon move the <text font="italic">connected</text> limb more than others. We compare model simulations with the behavior classically observed in the literature and with behavior of 6-month-old infants in two recent studies: one using a <text font="italic">conjugate</text> (<text font="italic">non-binary</text>) version where the sensory feedback strength varied with the activity of the connected limb <cite class="ltx_citemacro_cite">[<bibref bibrefs="jacquey_popescu_2020" separator="," yyseparator=","/>]</cite> and another using a <text font="italic">binary</text> version where the mobile was activated for a fixed time once activity of the connected limb passed a threshold <cite class="ltx_citemacro_cite">[<bibref bibrefs="popescu_6-month-old_2021" separator="," yyseparator=","/>]</cite>. Both studies included a <text font="italic">contingent</text> group (mobile response tied to limb movement) and a <text font="italic">non-contingent</text> control group. In addition to replicating these results in our simulations, we also investigated the presence of an <text font="italic">extinction burst</text>—an increase in activity when the connection is removed—a phenomenon reported in some studies <cite class="ltx_citemacro_cite">[<bibref bibrefs="alessandri_violation_1990" separator="," yyseparator=","/>]</cite>,<cite class="ltx_citemacro_cite">[<bibref bibrefs="heathcock_relative_2005" separator="," yyseparator=","/>]</cite>, <cite class="ltx_citemacro_cite">[<bibref bibrefs="lewis_violation_1990" separator="," yyseparator=","/>]</cite>,<cite class="ltx_citemacro_cite">[<bibref bibrefs="rovee-collier_topographical_1978" separator="," yyseparator=","/>]</cite> but not consistently observed.</p>
    </para>
    <para xml:id="S4.p2">
      <p>Our model of the mobile paradigm incorporates components that we believe are important in real infants: an action-outcome prediction mechanism and an exploration mechanism, as well as motor noise and motor control involving multiple muscle commands. However, our current model excludes attentional, social, and environmental factors.</p>
    </para>
    <para xml:id="S4.p3">
      <p>The results of our simulations show that for the most part our model exhibits behavior similar to that of infants. In the contingent group, the model rapidly differentiates between the connected and unconnected limbs and increases the activity of its connected limb relative to the unconnected limbs. The model also correctly simulates the higher activity observed for infants in a contingent group compared to a non-contingent control group. Our model also simulates the non-systematic emergence of an extinction burst following the disconnection of contingency. Without any modification, the model also correctly simulates the slightly more robust differentiation of the connected versus unconnected limb in the binary variant of the paradigm compared to the non-binary conjugate variant.</p>
    </para>
    <para xml:id="S4.p4">
      <p>Of course, there are differences between the behavior of the model and infants (see below subsection <ref labelref="LABEL:limitations_and_future"/>). Nonetheless, we hypothesize that our model and ablation studies demonstrate that certain mechanisms in our model appear to be essential to simulate infant behavior correctly. This suggests that similar developmental mechanisms could be necessary in infants.
<!--  %For these, by more carefully adjusting model parameters (for example, learning rate and novelty threshold) we would have been able to achieve better fits with infants’ limb activity. But even without doing this, we believe that our work, coupled with the accompanying ablation studies, demonstrates that certain developmental mechanisms are probably essential in order to replicate the main behavioral findings in the mobile paradigm studies. 
     %For example, infants differentiate the connected limb from the unconnected limbs more quickly than the model. Another example is the fact that the inter- and intra-individual differences that are much larger in infants compared to simulation runs.--></p>
    </para>
<!--  %**** Mobile˙paradigm˙model.tex Line 400 **** 
     %QUESTION: in our case motor noise and sensory noise are the same?
     %Simulations with the model also cast light on findings in the literature about a supposed “extinction burst” following disconnection of the connected limb. Our results suggest that the reason why in the infant literature the extinction burst is not found systematically is that rather than looking for a sudden increase in activity, we should actually look for a sudden increase in the variability of activity.-->    <subsection inlist="toc" xml:id="S4.SS1">
      <tags>
        <tag>IV-A</tag>
        <tag role="refnum">IV-A</tag>
        <tag role="typerefnum">§IV-A</tag>
      </tags>
      <title><tag close=" ">IV-A</tag><text font="italic">Essential components of our model</text></title>
      <para xml:id="S4.SS1.p1">
        <p>Internal <text font="bold">prediction of sensory effects</text> was a component suggested by contemporary literature on motor control and intrinsic motivation <cite class="ltx_citemacro_cite">[<bibref bibrefs="wolpert_motor_2001" separator="," yyseparator=","/>]</cite>, <cite class="ltx_citemacro_cite">[<bibref bibrefs="friston_world_2021" separator="," yyseparator=","/>]</cite>,<cite class="ltx_citemacro_cite">[<bibref bibrefs="poli_curiosity_2024" separator="," yyseparator=","/>]</cite>. Our ablation studies show that without this component the model cannot learn to differentiate the connected and unconnected limbs. The prediction mechanism strongly affects exploration, and removing it prevents the model from properly evaluating the effectiveness of its exploration.
<!--  %activity as compared to the complete model. However, consideration of the individual simulation runs shows that the underlying individual behaviors are very different. When ablating the prediction error, the connected and unconnected limbs are only distinguished in some of the simulation runs, suggesting that the prediction mechanism makes learning more consistent. -->This exploration itself is supported by a second essential component of our model—the <text font="bold">novelty-based reinforcement calculator</text>. Its purpose is to encourage the model to explore new activity levels of its limbs. This mechanism proved essential for the model to learn which of the limbs is connected to the mobile.
<!--  %If possible add REFS in the paragraph above --></p>
      </para>
      <para xml:id="S4.SS1.p2">
        <p>To correctly simulate infant behavior, the model also appears to need a large number of <text font="bold">muscle commands</text>, thereby simulating a population code innervating multiple muscles that control each limb. The ablation studies show that without the numerous muscle commands, the model cannot identify the connected limb. Our intuition is that having multiple ways of moving the limb provides more opportunities for the network to discover an effective movement. This is consistent with some results observed in robot learning, where controlling joint angles provided more accurate learning than directly controlling the position of the end effector <cite class="ltx_citemacro_cite">[<bibref bibrefs="spisak2025dirigent" separator="," yyseparator=","/>]</cite>.</p>
      </para>
      <para xml:id="S4.SS1.p3">
        <p><text font="bold">Motor noise</text> is another component suggested in the literature (see recent reviews <cite class="ltx_citemacro_cite">[<bibref bibrefs="casartelli_neural_2023" separator="," yyseparator=","/>]</cite> and <cite class="ltx_citemacro_cite">[<bibref bibrefs="gliga_telling_2018" separator="," yyseparator=","/>]</cite>) that diversifies exploration by adding variability. Our ablation study showed that while high motor noise and low motor noise can both inhibit learning, a specific intermediate amount of motor noise is needed to enable quick learning. This finding is consistent with a computational model by Caligiore et al. <cite class="ltx_citemacro_cite">[<bibref bibrefs="caligiore_integrating_2014" separator="," yyseparator=","/>]</cite> showing that some variability supports exploration and learning. It is also consistent with the finding by Ossmy et al. <cite class="ltx_citemacro_cite">[<bibref bibrefs="ossmy_variety_2018" separator="," yyseparator=","/>]</cite> that in a robotic simulation of soccer, similarly to when infants learn to walk, greater variability leads to better performance.</p>
      </para>
<!--  %Lastly, our model included a “textbf–baseline activity˝ to imitate the fact that infants’ general activity level must presumably be neither too high nor too low. However the ablation study showed that ignoring the deviation from baseline activity had little effect. In our simulations, the activity level of the model appears to mostly depend on exploration and prediction. %Based on this we assume that the overall loss will be smaller by focusing on exploration and prediction, perhaps simply because these already lead to a small deviation from baseline or because without focusing on them the error will be much higher. -->    </subsection>
    <subsection inlist="toc" labels="LABEL:limitations_and_future" xml:id="S4.SS2">
      <tags>
        <tag>IV-B</tag>
        <tag role="refnum">IV-B</tag>
        <tag role="typerefnum">§IV-B</tag>
      </tags>
      <title><tag close=" ">IV-B</tag><text font="italic">Limitations of our model and future work</text></title>
<!--  %Several aspects of our model’s behavior are notably different from observed infant behavior. -->      <para xml:id="S4.SS2.p1">
        <p>An aspect of infant behavior that our simulations did not replicate well is the variability between individual infants as well as within a single infant. Observing less variability across different simulations is expected because our model is designed to imitate an individual infant, whereas behavioral data is obtained from many different infants. However, the lower variability within individual simulation runs seems to be a limitation of our model. As observed in our ablation studies, tripling the value of the novelty threshold increased the variability of individual runs. Thus, adjusting the novelty threshold of the model would be one way to make the model behave more similarly to infants.</p>
      </para>
      <para xml:id="S4.SS2.p2">
        <p>Our lower observed variability must also be understood from the fact that our model is explicitly designed to be focused solely on the task at hand, whereas real infants are subject to multiple distractions during the course of an experiment and are presumably switching their cognitive resources from one task to another. Future work on our model could implement such concurrent learning opportunities and a mechanism that allows task switching. Similar ideas have been advanced, for example, in <cite class="ltx_citemacro_cite">[<bibref bibrefs="oudeyer2007intrinsic" separator="," yyseparator=","/>]</cite> and <cite class="ltx_citemacro_cite">[<bibref bibrefs="butko_learningToLearn_2007" separator="," yyseparator=","/>]</cite>. Note that variability could also be related to the issues of habituation <cite class="ltx_citemacro_cite">[<bibref bibrefs="sirois_habituation_models_2002" separator="," yyseparator=","/>]</cite>, boredom, fussiness, loss of interest, and, more generally, attentional fluctuations, which were not considered in our current model.</p>
      </para>
      <para xml:id="S4.SS2.p3">
        <p>One other limitation of our model is that we only tested one type of sensorimotor contingency. To determine how general our model is and whether it could learn a wider variety of sensorimotor contingencies encountered in early infancy, it should simulate the transfer of learning to new contingencies or with different environmental variables. For example, future experiments could test the introduction of a time delay between the motor action and its sensory consequences, changing which limbs are connected, a change in the specificity of motor actions required (e.g., specific angle of knee flexion as in <cite class="ltx_citemacro_cite">[<bibref bibrefs="angulo-kinzler_three-month-old_2002" separator="," yyseparator=","/>]</cite>), or a change in the schedule of contingency (e.g. from deterministic to probabilistic or random).
<!--  %**** Mobile˙paradigm˙model.tex Line 425 **** --></p>
      </para>
<!--  %In conclusion, we have introduced a novel biologically inspired model that allowed us to evaluate multiple aspects of infant behavior in the mobile paradigm. 
     %Another limitation is that the current model only includes discrete updates based on previous states. To extend this into continuous time leaky integrate-and-fire (LIF) neurons could be included in future neural network implementations.
     %Our current and future work involves extending the model and integrating it with a physical body__an iCub humanoid robot “cite–metta2010icub˝. Thanks to our model the robot would be able to detect sensorimotor contingencies and learn from them like infants. The robot will then have to face the additional challenges of exploring multiple concurrent channels of sensory feedback and choosing among several learning opportunities.-->      <para xml:id="S4.SS2.p4">
        <p>In conclusion, our model has established that motor noise, exploration, and motor control involving multiple degrees of freedom are important for sensorimotor learning. The model could be extended to include the role of habituation, boredom and attentional fluctuations so as to replicate observed infant variability. By replicating the main findings of the mobile paradigm, our model provides a strong starting point for further exploration of the mechanisms of sensorimotor learning in infants and applications to robotics.</p>
      </para>
<!--  %In conclusion, our model replicates the main findings in infant studies using the mobile paradigm. It is a good starting point for further investigation of sensorimotor learning. 
     %Learning of tactile space of the body using self-touch. Our conclusion from the present simulation is that the basic ingredients of action-effect prediction, ... motor noise, high-dimensional motor control, would possibly be important to include in any such extension of our work.
     %TODO:
     %Short-term: fussiness, habituation, seeing whether we can simulate changes of types of contingency, changing the schedule
     %Grandiose: attention fluctuations
     %Our model did not simulate habituation behavior reported in some infant studies, nor any mechanisms that could simulate sustained attention and boredom. Future studies could explore these questions.
     %Thus it could be said that the model has a stronger tendency to optimize its behavior compared to real infants.
     %The second aspect that we are missing is a component in the model that imitates infants’ growing “fussiness”, perhaps corresponding to growing boredom or habituation, an increasing need for activity or any other desires the infant might have that we did not simulate.
     %**** Mobile˙paradigm˙model.tex Line 450 ****
     %The second aspect of infants’ behavior that we did not fully replicate is the progressively increasing activity in the infants. Although the model incorporated a “baseline activity” parameter that we programmed to increase gradually over time, the actual activity of the model, apart from an initial increase, stayed stable for the remaining part of the run. It might seem that simply increasing the baseline activity of the model over the course of a simulation should easily remedy this defect. However, the baseline activity increase we used in the model is already rather large. It appears (see Figure “ref–fig:baselineAb˝) that this increase is overridden by the fact that the model must actively explore. When exploration loss is removed, the increasing baseline activity becomes evident (see Figure “ref–fig:expoAb˝). In conclusion, it seems that we are missing a component in the model that imitates infants’ growing “fussiness”, perhaps corresponding to growing boredom, an increasing need for activity or any other desires the infant might have that we did not simulate.
     %“subsection–Future work˝
     %TBD An alternative candidate type of model could be the Dynamic Field Theory (see “cite–schoner2006using˝ for its application to infant habituation) or the dynamical systems approach “cite–kelso˙coordination˙2016, fujihira2023dynamical˝.
     %sergiu
     %“bibliography–bibtex/zotero˙references˝-->    </subsection>
  </section>
  <bibliography xml:id="bib">
    <title>References</title>
    <biblist>
      <bibitem key="jacquey_fagard_2020" xml:id="bib.bib1">
        <tags>
          <tag>[1]</tag>
          <tag role="refnum">1</tag>
        </tags>
        <bibblock>
L. Jacquey, J. Fagard, R. Esseily, and J. K. O’Regan, “Detection of sensorimotor contingencies in infants before the age of 1 year: A comprehensive review,” <text font="italic">Developmental Psychology</text>, vol. 56, pp. 1233–1251, July 2020.
</bibblock>
      </bibitem>
      <bibitem key="sen_making_2021" xml:id="bib.bib2">
        <tags>
          <tag>[2]</tag>
          <tag role="refnum">2</tag>
        </tags>
        <bibblock>
U. Sen and G. Gredebäck, “Making the World Behave: A New Embodied Account on Mobile Paradigm,” <text font="italic">Frontiers in Systems Neuroscience</text>, vol. 15, Mar. 2021.
</bibblock>
        <bibblock>Publisher: Frontiers.
</bibblock>
      </bibitem>
      <bibitem key="tarabulsy_contingency_1996" xml:id="bib.bib3">
        <tags>
          <tag>[3]</tag>
          <tag role="refnum">3</tag>
        </tags>
        <bibblock>
G. M. Tarabulsy, R. Tessier, and A. Kappas, “Contingency detection and the contingent organization of behavior in interactions: Implications for socioemotional development in infancy,” <text font="italic">Psychological Bulletin</text>, vol. 120, no. 1, pp. 25–41, 1996.
</bibblock>
        <bibblock>Place: US Publisher: American Psychological Association.
</bibblock>
      </bibitem>
      <bibitem key="adamson_stillface_2003" xml:id="bib.bib4">
        <tags>
          <tag>[4]</tag>
          <tag role="refnum">4</tag>
        </tags>
        <bibblock>
L. Adamson and J. Frick, “The still face: A history of a shared experimental paradigm,” <text font="italic">INFANCY</text>, vol. 4, no. 4, pp. 451–473, 2003.
</bibblock>
        <bibblock>International Conference on Infant Studies, TORONTO, CANADA, APR, 2002.
</bibblock>
      </bibitem>
      <bibitem key="rovee_conjugate_1969" xml:id="bib.bib5">
        <tags>
          <tag>[5]</tag>
          <tag role="refnum">5</tag>
        </tags>
        <bibblock>
C. K. Rovee and D. T. Rovee, “Conjugate reinforcement of infant exploratory behavior,” <text font="italic">Journal of experimental child psychology</text>, vol. 8, no. 1, pp. 33–39, 1969.
</bibblock>
        <bibblock>00334 Citation Key Alias: rovee1969.
</bibblock>
      </bibitem>
      <bibitem key="watson_reactions_1972" xml:id="bib.bib6">
        <tags>
          <tag>[6]</tag>
          <tag role="refnum">6</tag>
        </tags>
        <bibblock>
J. S. Watson and C. T. Ramey, “Reactions to response-contingent stimulation in early infancy,” <text font="italic">Merrill-Palmer Quarterly of Behavior and Development</text>, vol. 18, no. 3, pp. 219–227, 1972.
</bibblock>
        <bibblock>Publisher: Wayne State University Press.
<!--  %**** Mobile˙paradigm˙model.bbl Line 25 **** --></bibblock>
      </bibitem>
      <bibitem key="greco_ontogeny_1986" xml:id="bib.bib7">
        <tags>
          <tag>[7]</tag>
          <tag role="refnum">7</tag>
        </tags>
        <bibblock>
C. Greco, C. K. Rovee-Collier, H. Hayne, P. C. Griesler, and L. A. Earley, “Ontogeny of Early Event Memory: I. Forgetting and Retrieval by 2- and 3-Month-Olds,” <text font="italic">Infant Behavior and Development</text>, vol. 9, pp. 441–460, 1986.
</bibblock>
      </bibitem>
      <bibitem key="hayne_ontogeny_1986" xml:id="bib.bib8">
        <tags>
          <tag>[8]</tag>
          <tag role="refnum">8</tag>
        </tags>
        <bibblock>
H. Hayne, C. Greco, L. A. Earley, P. C. Griesler, and C. Rovee-Collier, “Ontogeny of Early Event Memory: II. Encoding and Retrieval by 2- and 3-Month-Olds,” <text font="italic">Infant Behavior and Development</text>, vol. 9, pp. 461–472, 1986.
</bibblock>
      </bibitem>
      <bibitem key="lewis_emotional_1985" xml:id="bib.bib9">
        <tags>
          <tag>[9]</tag>
          <tag role="refnum">9</tag>
        </tags>
        <bibblock>
M. Lewis, M. W. Sullivan, and J. Brooks-Gunn, “Emotional behaviour during the learning of a contingency in early infancy,” <text font="italic">British Journal of Developmental Psychology</text>, vol. 3, pp. 307–316, Sept. 1985.
</bibblock>
        <bibblock>Citation Key Alias: lewisEmotionalBehaviourLearning1985.
</bibblock>
      </bibitem>
      <bibitem key="alessandri_violation_1990" xml:id="bib.bib10">
        <tags>
          <tag>[10]</tag>
          <tag role="refnum">10</tag>
        </tags>
        <bibblock>
S. M. Alessandri, M. W. Sullivan, and M. Lewis, “Violation of expectancy and frustration in early infancy,” <text font="italic">Developmental Psychology</text>, vol. 26, no. 5, pp. 738–744, 1990.
</bibblock>
        <bibblock>Place: US Publisher: American Psychological Association.
</bibblock>
      </bibitem>
      <bibitem key="jacquey_popescu_2020" xml:id="bib.bib11">
        <tags>
          <tag>[11]</tag>
          <tag role="refnum">11</tag>
        </tags>
        <bibblock>
L. Jacquey, S. T. Popescu, J. Vergne, J. Fagard, R. Esseily, and K. O’Regan, “Development of body knowledge as measured by arm differentiation in infants: From global to local?,” <text font="italic">British Journal of Developmental Psychology</text>, vol. 38, no. 1, pp. 108–124, 2020.
</bibblock>
        <bibblock>_eprint: https://onlinelibrary.wiley.com/doi/pdf/10.1111/bjdp.12309.
</bibblock>
      </bibitem>
      <bibitem key="popescu_6-month-old_2021" xml:id="bib.bib12">
        <tags>
          <tag>[12]</tag>
          <tag role="refnum">12</tag>
        </tags>
        <bibblock>
S. T. Popescu, A. Dauphin, J. Vergne, and J. K. O’Regan, “6-Month-Old Infants’ Sensitivity to Contingency in a Variant of the Mobile Paradigm With Proximal Stimulation Studied at Fine Temporal Resolution in the Laboratory,” <text font="italic">Frontiers in Psychology</text>, vol. 12, 2021.
</bibblock>
        <bibblock>Publisher: Frontiers.
</bibblock>
      </bibitem>
      <bibitem key="rovee-collier_development_1999" xml:id="bib.bib13">
        <tags>
          <tag>[13]</tag>
          <tag role="refnum">13</tag>
        </tags>
        <bibblock>
C. Rovee-Collier, “The Development of Infant Memory,” <text font="italic">Current Directions in Psychological Science</text>, vol. 8, no. 3, p. 6, 1999.
<!--  %**** Mobile˙paradigm˙model.bbl Line 50 **** --></bibblock>
      </bibitem>
      <bibitem key="rovee-collier_long-term_1999" xml:id="bib.bib14">
        <tags>
          <tag>[14]</tag>
          <tag role="refnum">14</tag>
        </tags>
        <bibblock>
C. Rovee-Collier, K. Hartshorn, and M. DiRubbo, “Long-term maintenance of infant memory,” <text font="italic">Developmental Psychobiology</text>, vol. 35, pp. 91–102, Sept. 1999.
</bibblock>
      </bibitem>
      <bibitem key="sen_methodological_2024" xml:id="bib.bib15">
        <tags>
          <tag>[15]</tag>
          <tag role="refnum">15</tag>
        </tags>
        <bibblock>
U. Sen and G. Gredebäck, “Methodological integrity assessment in the mobile paradigm literature: A lesson for understanding opportunistic use of researcher degrees of freedom in psychology,” <text font="italic">Child Development</text>, vol. 95, no. 2, pp. 338–353, 2024.
</bibblock>
        <bibblock>Place: United Kingdom Publisher: Wiley-Blackwell Publishing Ltd.
</bibblock>
      </bibitem>
      <bibitem key="sloan_meaning_2023" xml:id="bib.bib16">
        <tags>
          <tag>[16]</tag>
          <tag role="refnum">16</tag>
        </tags>
        <bibblock>
A. T. Sloan, N. A. Jones, and J. A. S. Kelso, “Meaning from movement and stillness: Signatures of coordination dynamics reveal infant agency,” <text font="italic">Proceedings of the National Academy of Sciences</text>, vol. 120, p. e2306732120, Sept. 2023.
</bibblock>
        <bibblock>Publisher: Proceedings of the National Academy of Sciences.
</bibblock>
      </bibitem>
      <bibitem key="watanabe_general_2006" xml:id="bib.bib17">
        <tags>
          <tag>[17]</tag>
          <tag role="refnum">17</tag>
        </tags>
        <bibblock>
H. Watanabe and G. Taga, “General to specific development of movement patterns and memory for contingency between actions and events in young infants,” <text font="italic">Infant Behavior and Development</text>, vol. 29, no. 3, pp. 402–422, 2006.
</bibblock>
        <bibblock>Citation Key Alias: watanabe2006a, watanabe2006b.
</bibblock>
      </bibitem>
      <bibitem key="rovee-collier_topographical_1978" xml:id="bib.bib18">
        <tags>
          <tag>[18]</tag>
          <tag role="refnum">18</tag>
        </tags>
        <bibblock>
C. K. Rovee-Collier, B. A. Morrongiello, M. Aron, and J. Kupersmidt, “Topographical response differentiation and reversal in 3-month-old infants,” <text font="italic">Infant Behavior and Development</text>, vol. 1, pp. 323–333, Jan. 1978.
</bibblock>
      </bibitem>
      <bibitem key="angulo-kinzler_three-month-old_2002" xml:id="bib.bib19">
        <tags>
          <tag>[19]</tag>
          <tag role="refnum">19</tag>
        </tags>
        <bibblock>
R. M. Angulo-Kinzler, B. Ulrich, and E. Thelen, “Three-Month-Old Infants Can Select Specific Leg Motor Solutions,” <text font="italic">Motor Control</text>, vol. 6, pp. 52–68, Jan. 2002.
</bibblock>
        <bibblock>Publisher: Human Kinetics, Inc. Section: Motor Control.
</bibblock>
      </bibitem>
      <bibitem key="heathcock_relative_2005" xml:id="bib.bib20">
        <tags>
          <tag>[20]</tag>
          <tag role="refnum">20</tag>
        </tags>
        <bibblock>
J. C. Heathcock, A. N. Bhat, M. A. Lobo, and J. C. Galloway, “The Relative Kicking Frequency of Infants Born Full-term and Preterm During Learning and Short-term and Long-term Memory Periods of the Mobile Paradigm,” <text font="italic">Physical Therapy</text>, vol. 85, pp. 8–18, 2005.
<!--  %**** Mobile˙paradigm˙model.bbl Line 75 **** --></bibblock>
      </bibitem>
      <bibitem key="lewis_violation_1990" xml:id="bib.bib21">
        <tags>
          <tag>[21]</tag>
          <tag role="refnum">21</tag>
        </tags>
        <bibblock>
M. Lewis, S. M. Alessandri, and M. W. Sullivan, “Violation of expectancy, loss of control, and anger expressions in young infants.,” <text font="italic">Developmental Psychology</text>, vol. 26, no. 5, p. 745, 1990.
</bibblock>
      </bibitem>
      <bibitem key="butko_detecting_2010" xml:id="bib.bib22">
        <tags>
          <tag>[22]</tag>
          <tag role="refnum">22</tag>
        </tags>
        <bibblock>
N. J. Butko and J. R. Movellan, “Detecting contingencies: an infomax approach,” <text font="italic">Neural Networks: The Official Journal of the International Neural Network Society</text>, vol. 23, no. 8-9, pp. 973–984, 2010.
</bibblock>
      </bibitem>
      <bibitem key="movellan_Watson_development_2002" xml:id="bib.bib23">
        <tags>
          <tag>[23]</tag>
          <tag role="refnum">23</tag>
        </tags>
        <bibblock>
J. R. Movellan and J. S. Watson, “The development of gaze following as a Bayesian systems identification problem,” in <text font="italic">Development and Learning, 2002. Proceedings. The 2nd International Conference on</text>, pp. 34–40, IEEE, 2002.
</bibblock>
      </bibitem>
      <bibitem key="butko_learningToLearn_2007" xml:id="bib.bib24">
        <tags>
          <tag>[24]</tag>
          <tag role="refnum">24</tag>
        </tags>
        <bibblock>
N. J. Butko and J. R. Movellan, “Learning to learn,” in <text font="italic">2007 IEEE 6th International Conference on Development and Learning</text>, pp. 151–156, July 2007.
</bibblock>
      </bibitem>
      <bibitem key="zaadnoordijk_can_2018" xml:id="bib.bib25">
        <tags>
          <tag>[25]</tag>
          <tag role="refnum">25</tag>
        </tags>
        <bibblock>
L. Zaadnoordijk, M. Otworowska, J. Kwisthout, and S. Hunnius, “Can infants’ sense of agency be found in their behavior? Insights from babybot simulations of the mobile-paradigm,” <text font="italic">Cognition</text>, vol. 181, pp. 58–64, Dec. 2018.
</bibblock>
      </bibitem>
      <bibitem key="kelso_coordination_2016" xml:id="bib.bib26">
        <tags>
          <tag>[26]</tag>
          <tag role="refnum">26</tag>
        </tags>
        <bibblock>
J. A. S. Kelso and A. Fuchs, “The coordination dynamics of mobile conjugate reinforcement,” <text font="italic">Biological Cybernetics</text>, vol. 110, pp. 41–53, Feb. 2016.
</bibblock>
      </bibitem>
      <bibitem key="fujihira2023dynamical" xml:id="bib.bib27">
        <tags>
          <tag>[27]</tag>
          <tag role="refnum">27</tag>
        </tags>
        <bibblock>
R. Fujihira and G. Taga, “Dynamical systems model of development of the action differentiation in early infancy: a requisite of physical agency,” <text font="italic">Biological Cybernetics</text>, vol. 117, no. 1, pp. 81–93, 2023.
</bibblock>
      </bibitem>
      <bibitem key="baldassarre2013intrinsically" xml:id="bib.bib28">
        <tags>
          <tag>[28]</tag>
          <tag role="refnum">28</tag>
        </tags>
        <bibblock>
G. Baldassarre, M. Mirolli, <text font="italic">et al.</text>, <text font="italic">Intrinsically motivated learning in natural and artificial systems</text>.
</bibblock>
        <bibblock>Springer, 2013.
<!--  %**** Mobile˙paradigm˙model.bbl Line 100 **** --></bibblock>
      </bibitem>
      <bibitem key="oudeyer2007intrinsic" xml:id="bib.bib29">
        <tags>
          <tag>[29]</tag>
          <tag role="refnum">29</tag>
        </tags>
        <bibblock>
P.-Y. Oudeyer and F. Kaplan, “What is intrinsic motivation? a typology of computational approaches,” <text font="italic">Frontiers in neurorobotics</text>, vol. 1, p. 108, 2007.
</bibblock>
      </bibitem>
      <bibitem key="sutton2018reinforcement" xml:id="bib.bib30">
        <tags>
          <tag>[30]</tag>
          <tag role="refnum">30</tag>
        </tags>
        <bibblock>
R. S. Sutton, “Reinforcement learning: An introduction,” <text font="italic">A Bradford Book</text>, 2018.
</bibblock>
      </bibitem>
      <bibitem key="edwards_motor_2010" xml:id="bib.bib31">
        <tags>
          <tag>[31]</tag>
          <tag role="refnum">31</tag>
        </tags>
        <bibblock>
W. H. Edwards, <text font="italic">Motor Learning and Control: From Theory to Practice</text>.
</bibblock>
        <bibblock>Belmont, CA: Cengage Learning, 1st edition ed., Aug. 2010.
</bibblock>
      </bibitem>
      <bibitem key="zaadnoordijk_movement_2020" xml:id="bib.bib32">
        <tags>
          <tag>[32]</tag>
          <tag role="refnum">32</tag>
        </tags>
        <bibblock>
L. Zaadnoordijk, M. Meyer, M. Zaharieva, F. Kemalasari, S. van Pelt, and S. Hunnius, “From movement to action: An EEG study into the emerging sense of agency in early infancy,” <text font="italic">Developmental Cognitive Neuroscience</text>, vol. 42, p. 100760, Apr. 2020.
</bibblock>
      </bibitem>
      <bibitem key="heathcockPerformanceInfantsBorn2004" xml:id="bib.bib33">
        <tags>
          <tag>[33]</tag>
          <tag role="refnum">33</tag>
        </tags>
        <bibblock>
J. C. Heathcock, A. N. Bhat, M. A. Lobo, and J. C. Galloway, “The performance of infants born preterm and full-term in the mobile paradigm: Learning and memory,” <text font="italic">Physical Therapy</text>, vol. 84, no. 9, pp. 808–821, 2004.
</bibblock>
      </bibitem>
      <bibitem key="bednarski_infants_2022" xml:id="bib.bib34">
        <tags>
          <tag>[34]</tag>
          <tag role="refnum">34</tag>
        </tags>
        <bibblock>
F. M. Bednarski, K. Musholt, and C. Grosse Wiesmann, “Do infants have agency? – The importance of control for the study of early agency,” <text font="italic">Developmental Review</text>, vol. 64, p. 101022, June 2022.
</bibblock>
      </bibitem>
      <bibitem key="wolpert_motor_2001" xml:id="bib.bib35">
        <tags>
          <tag>[35]</tag>
          <tag role="refnum">35</tag>
        </tags>
        <bibblock>
D. M. Wolpert and J. R. Flanagan, “Motor prediction,” <text font="italic">Current Biology</text>, vol. 11, pp. R729–R732, Sept. 2001.
</bibblock>
      </bibitem>
      <bibitem key="friston_world_2021" xml:id="bib.bib36">
        <tags>
          <tag>[36]</tag>
          <tag role="refnum">36</tag>
        </tags>
        <bibblock>
K. Friston, R. J. Moran, Y. Nagai, T. Taniguchi, H. Gomi, and J. Tenenbaum, “World model learning and inference,” <text font="italic">Neural Networks</text>, Sept. 2021.
<!--  %**** Mobile˙paradigm˙model.bbl Line 125 **** --></bibblock>
      </bibitem>
      <bibitem key="poli_curiosity_2024" xml:id="bib.bib37">
        <tags>
          <tag>[37]</tag>
          <tag role="refnum">37</tag>
        </tags>
        <bibblock>
F. Poli, J. X. O’Reilly, R. B. Mars, and S. Hunnius, “Curiosity and the dynamics of optimal exploration,” <text font="italic">Trends in Cognitive Sciences</text>, vol. 0, Feb. 2024.
</bibblock>
        <bibblock>Publisher: Elsevier.
</bibblock>
      </bibitem>
      <bibitem key="spisak2025dirigent" xml:id="bib.bib38">
        <tags>
          <tag>[38]</tag>
          <tag role="refnum">38</tag>
        </tags>
        <bibblock>
J. Spisak, M. Kerzel, and S. Wermter, “Dirigent: End-to-end robotic imitation of human demonstrations based on a diffusion model,” <text font="italic">arXiv preprint arXiv:2501.16800</text>, 2025.
</bibblock>
      </bibitem>
      <bibitem key="casartelli_neural_2023" xml:id="bib.bib39">
        <tags>
          <tag>[39]</tag>
          <tag role="refnum">39</tag>
        </tags>
        <bibblock>
L. Casartelli, C. Maronati, and A. Cavallo, “From neural noise to co-adaptability: Rethinking the multifaceted architecture of motor variability,” <text font="italic">Physics of Life Reviews</text>, vol. 47, pp. 245–263, Dec. 2023.
</bibblock>
      </bibitem>
      <bibitem key="gliga_telling_2018" xml:id="bib.bib40">
        <tags>
          <tag>[40]</tag>
          <tag role="refnum">40</tag>
        </tags>
        <bibblock>
T. Gliga, “Telling Apart Motor Noise and Exploratory Behavior, in Early Development,” <text font="italic">Frontiers in Psychology</text>, vol. 9, 2018.
</bibblock>
      </bibitem>
      <bibitem key="caligiore_integrating_2014" xml:id="bib.bib41">
        <tags>
          <tag>[41]</tag>
          <tag role="refnum">41</tag>
        </tags>
        <bibblock>
D. Caligiore, D. Parisi, and G. Baldassarre, “Integrating reinforcement learning, equilibrium points, and minimum variance to understand the development of reaching: a computational model,” <text font="italic">Psychological Review</text>, vol. 121, pp. 389–421, July 2014.
</bibblock>
      </bibitem>
      <bibitem key="ossmy_variety_2018" xml:id="bib.bib42">
        <tags>
          <tag>[42]</tag>
          <tag role="refnum">42</tag>
        </tags>
        <bibblock>
O. Ossmy, J. E. Hoch, P. MacAlpine, S. Hasan, P. Stone, and K. E. Adolph, “Variety Wins: Soccer-Playing Robots and Infant Walking,” <text font="italic">Frontiers in Neurorobotics</text>, vol. 12, 2018.
</bibblock>
      </bibitem>
      <bibitem key="sirois_habituation_models_2002" xml:id="bib.bib43">
        <tags>
          <tag>[43]</tag>
          <tag role="refnum">43</tag>
        </tags>
        <bibblock>
S. Sirois and D. Mareschal, “Models of habituation in infancy,” <text font="italic">Trends in Cognitive Sciences</text>, vol. 6, pp. 293–298, July 2002.
</bibblock>
        <bibblock>Publisher: Elsevier.
</bibblock>
      </bibitem>
    </biblist>
  </bibliography>
  <pagination role="newpage"/>
  <section inlist="toc" labels="LABEL:Appendix" xml:id="S5">
    <tags>
      <tag>V</tag>
      <tag role="refnum">V</tag>
      <tag role="typerefnum">§V</tag>
    </tags>
    <title><tag close=" ">V</tag><text font="smallcaps">Appendix</text></title>
<!--  %“vspace–0.7cm˝ % uncomment this to have both figs low -->    <figure inlist="lof" labels="LABEL:fig:individualStandardB" placement="H" xml:id="S5.F14">
      <tags>
        <tag>Fig. 14</tag>
        <tag role="refnum">14</tag>
        <tag role="typerefnum">Fig. 14</tag>
      </tags>
<!--  %“vspace–-4.5em˝ -->      <graphics candidates="Images/individualBFig1000v1.pdf" graphic="Images/individualBFig1000v1.pdf" options="width=433.62pt" xml:id="S5.F14.g1"/>
      <toccaption><tag close=" ">14</tag><text font="bold">Binary model - 20 individual runs.</text> The activity is shown per limb and per 10-s bin. The red curve shows the activity of the connected limb and the gray curves show the activities of the unconnected limbs. The time of contingency removal is emphasized with a vertical line.
</toccaption>
      <caption><tag close=": ">Fig. 14</tag><text font="bold">Binary model - 20 individual runs.</text> The activity is shown per limb and per 10-s bin. The red curve shows the activity of the connected limb and the gray curves show the activities of the unconnected limbs. The time of contingency removal is emphasized with a vertical line.
</caption>
    </figure>
<!--  %**** Mobile˙paradigm˙model.tex Line 475 **** -->    <figure inlist="lof" labels="LABEL:fig:individualStandardNB" xml:id="S5.F15">
      <tags>
        <tag>Fig. 15</tag>
        <tag role="refnum">15</tag>
        <tag role="typerefnum">Fig. 15</tag>
      </tags>
<!--  %play with this figure to align the two Figs -->      <graphics candidates="Images/individualNBFig1000v1.pdf" graphic="Images/individualNBFig1000v1.pdf" options="width=433.62pt" xml:id="S5.F15.g1"/>
      <toccaption><tag close=" ">15</tag><text font="bold">Non-Binary model - 20 individual runs.</text> The activity is shown per limb and per 10-s bin. The red curve shows the activity of the connected limb and the gray curves show the activities of the unconnected limbs. The time of contingency removal is emphasized with a vertical line.</toccaption>
      <caption><tag close=": ">Fig. 15</tag><text font="bold">Non-Binary model - 20 individual runs.</text> The activity is shown per limb and per 10-s bin. The red curve shows the activity of the connected limb and the gray curves show the activities of the unconnected limbs. The time of contingency removal is emphasized with a vertical line.</caption>
<!--  %comment this out and remove the comment in 526 to get them both a low -->    </figure>
    <figure inlist="lof" labels="LABEL:fig:LRApendix" placement="ht" xml:id="S5.F16">
      <tags>
        <tag>Fig. 16</tag>
        <tag role="refnum">16</tag>
        <tag role="typerefnum">Fig. 16</tag>
      </tags>
      <graphics candidates="Images/LR1000.pdf" graphic="Images/LR1000.pdf" options="width=433.62pt" xml:id="S5.F16.g1"/>
      <toccaption><tag close=" ">16</tag><text font="bold">Models with different learning rates.</text> The thick curves show the mean activities, across 20 individual simulations runs, of the connected limb (red) and the unconnected limbs (gray). Thinner pale curves
show 20 individual simulation runs. Note that our ”standard” model has a learning rate = .00075. (A, B, C, D, E) Binary simulations with learning rates of respectively .0005, .00075, .001, .005, and .05. (F, G, H, I, J) Non-Binary simulations with learning rates of respectively .0005, .00075, .001, .005, and .05.</toccaption>
      <caption><tag close=": ">Fig. 16</tag><text font="bold">Models with different learning rates.</text> The thick curves show the mean activities, across 20 individual simulations runs, of the connected limb (red) and the unconnected limbs (gray). Thinner pale curves
show 20 individual simulation runs. Note that our ”standard” model has a learning rate = .00075. (A, B, C, D, E) Binary simulations with learning rates of respectively .0005, .00075, .001, .005, and .05. (F, G, H, I, J) Non-Binary simulations with learning rates of respectively .0005, .00075, .001, .005, and .05.</caption>
<!--  %so its at the top of the page -->    </figure>
  </section>
</document>
