<?xml version="1.0" encoding="UTF-8"?>
<?latexml searchpaths="/home/japhy/scienceReplication.artiswrong.com/paper_files/arxiv/1902.07579/latex_extracted"?>
<?latexml class="article" options="10pt,letterpaper"?>
<!--  %Original : Ashwin Ram (ashwin@cc.gatech.edu)       04/01/1994 --><!--  %Modified : Johanna Moore (jmoore@cs.pitt.edu)      03/17/1995 --><!--  %Modified : David Noelle (noelle@ucsd.edu)          03/15/1996 --><!--  %Modified : Pat Langley (langley@cs.stanford.edu)   01/26/1997 --><!--  %Latex2e corrections by Ramin Charles Nakisa        01/28/1997 --><!--  %Modified : Tina Eliassi-Rad (eliassi@cs.wisc.edu)  01/31/1998 --><!--  %Modified : Trisha Yannuzzi (trisha@ircs.upenn.edu) 12/28/1999 (in process) --><!--  %Modified : Mary Ellen Foster (M.E.Foster@ed.ac.uk) 12/11/2000 --><!--  %Modified : Ken Forbus                              01/23/2004 --><!--  %Modified : Eli M. Silk (esilk@pitt.edu)            05/24/2005 --><!--  %Modified : Niels Taatgen (taatgen@cmu.edu)         10/24/2006 --><!--  %Modified : David Noelle (dnoelle@ucmerced.edu)     11/19/2014 --><!--  %% Change ”letterpaper” in the following line to ”a4paper” if you must. --><?latexml package="cogsci"?>
<?latexml package="pslatex"?>
<?latexml package="apacite"?>
<?latexml package="comment"?>
<?latexml package="graphicx"?>
<?latexml package="graphbox"?>
<?latexml package="nicefrac"?>
<?latexml package="lipsum"?>
<?latexml package="amsmath,amssymb"?>
<?latexml package="url" options="hyphens"?>
<?latexml RelaxNGSchema="LaTeXML"?>
<document xmlns="http://dlmf.nist.gov/LaTeXML" class="ltx_authors_1line">
  <resource src="LaTeXML.css" type="text/css"/>
  <resource src="ltx-article.css" type="text/css"/>
  <title>Emulating Human Developmental Stages with Bayesian Neural Networks</title>
  <creator role="author">
    <personname><text font="bold" fontsize="120%">Marcel Binz (binz@staff.uni-marburg.de)</text> <break/>Department of Psychology, Theoretical Neuroscience Group <break/>Philipps-Universität Marburg
<ERROR class="undefined">\AND</ERROR><text font="bold" fontsize="120%">Dominik Endres (dominik.endres@staff.uni-marburg.de)</text> <break/>Department of Psychology, Theoretical Neuroscience Group <break/>Philipps-Universität Marburg</personname>
  </creator>
  <abstract name="Abstract">
    <p>We compare the acquisition of knowledge in humans and machines. Research from the field of developmental psychology indicates, that human-employed hypothesis are initially guided by simple rules, before evolving into more complex theories. This observation is shared across many tasks and domains. We investigate whether stages of development in artificial learning systems are based on the same characteristics. We operationalize developmental stages as the size of the data-set, on which the artificial system is trained. For our analysis we look at the developmental progress of Bayesian Neural Networks on three different data-sets, including occlusion, support and quantity comparison tasks. We compare the results with prior research from developmental psychology and find agreement between the family of optimized models and pattern of development observed in infants and children on all three tasks, indicating common principles for the acquisition of knowledge.</p>
    <p><text font="bold">Keywords:</text>
Core knowledge; developmental psychology; intuitive physics; approximate number system; machine learning, deep learning, variational inference, normative models</p>
  </abstract>
<!--  %**** cogsci˙template.tex Line 50 **** -->  <section inlist="toc" xml:id="S1">
    <tags>
      <tag>1</tag>
      <tag role="refnum">1</tag>
      <tag role="typerefnum">§1</tag>
    </tags>
    <title><tag close=" ">1</tag>Introduction</title>
    <para xml:id="S1.p1">
      <p>The theory of core knowledge in developmental psychology identifies several domains, that build the foundations of human cognition <cite class="ltx_citemacro_cite"><bibref bibrefs="spelke2007core,lake2017building" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
            <bibrefphrase>(</bibrefphrase>
            <bibrefphrase>)</bibrefphrase>
          </bibref></cite>. Typically physics, actions, numbers, space and social interactions are listed among the core domains. Knowledge in these areas is present starting from early stages of childhood and serves as the basis for learning during later life. Research in developmental psychology over the past decades equipped us with a solid understanding about the acquisition of such knowledge. Different stages of development have been identified for a wide range of phenomena. Insights across studies suggest, that established rules generally start with a simple hypothesis before becoming more sophisticated over time <cite class="ltx_citemacro_cite"><bibref bibrefs="baillargeon2002acquisition" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
            <bibrefphrase>(</bibrefphrase>
            <bibrefphrase>)</bibrefphrase>
          </bibref></cite>. <break/></p>
    </para>
    <para xml:id="S1.p2">
      <p>We investigate whether current machine learning systems show generalization behavior reminiscent of human infants and children at different stages of development. For this purpose, we assume that the amount of data available to the learning algorithm is proportional to human age. The class of models we focus on are Bayesian Neural Networks (BNNs), that are trained through variational inference. Neural networks have the desirable property of being able to approximate an arbitrary complex mapping given enough capacity, while Bayesian inference captures normative principles of how to update an initial belief in the light of new evidence. The specific choice of variational inference and neural networks is mainly due to convenience reasons, and we hypothesize, that many different combinations of universal function approximators and Bayesian learning would lead to comparable results. <break/></p>
    </para>
    <para xml:id="S1.p3">
      <p>Our experiments focus on two of the established core domains: physics and numbers. We consider two experiments involving intuitive reasoning about the laws of physics <cite class="ltx_citemacro_cite"><bibref bibrefs="baillargeon2002acquisition" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
            <bibrefphrase>(</bibrefphrase>
            <bibrefphrase>)</bibrefphrase>
          </bibref></cite> and one examining the approximate number system, which is responsible for forming fast, but imprecise, representations of quantities <cite class="ltx_citemacro_cite"><bibref bibrefs="halberda2008developmental" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
            <bibrefphrase>(</bibrefphrase>
            <bibrefphrase>)</bibrefphrase>
          </bibref></cite>. In all three cases we observe pattern in our models, that share similarities with the development progress during childhood, as we increase the data-set size. <break/></p>
    </para>
    <para xml:id="S1.p4">
      <p>There has been a recent interest in replicating reasoning capabilities from the core domains in artificial systems. Prior work regarding intuitive physics has considered generative <cite class="ltx_citemacro_cite"><bibref bibrefs="battaglia2013simulation,chang2016compositional" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
            <bibrefphrase>(</bibrefphrase>
            <bibrefphrase>)</bibrefphrase>
          </bibref></cite> as well as discriminative models <cite class="ltx_citemacro_cite"><bibref bibrefs="lerer2016learning" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
            <bibrefphrase>(</bibrefphrase>
            <bibrefphrase>)</bibrefphrase>
          </bibref></cite>. Both classes of models are often able to reach performance levels comparable to those of adults on specific tasks. Another core domain, that has received some attention within the computational modelling community, is the one of intuitive psychology. Here, for example, <ERROR class="undefined">\citeA</ERROR>baker2009action suggest to employ Bayesian inverse planning for inferring mental states of other agents. In contrast to the aforementioned prior work, we are interested in the differences between optimal models for varying data-set sizes and how these differences compare to observations made in developmental psychology. Existing work on modelling the <emph font="italic">development</emph> of intuitive physics is limited to descriptive models, such as list of rules <cite class="ltx_citemacro_cite"><bibref bibrefs="siegler1998developmental" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
            <bibrefphrase>(</bibrefphrase>
            <bibrefphrase>)</bibrefphrase>
          </bibref></cite> or decision trees <cite class="ltx_citemacro_cite"><bibref bibrefs="baillargeon2009account" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
            <bibrefphrase>(</bibrefphrase>
            <bibrefphrase>)</bibrefphrase>
          </bibref></cite>. In contrast to this, our approach is based on normative principles and we ask the question, whether observed stages emerge naturally in complex artificial learning systems. <break/></p>
    </para>
    <para xml:id="S1.p5">
      <p>In the next section we provide a short technical overview of neural networks and variational inference. This is followed by a description of the three experiments under examination. For each experiment we outline the given task, the empirical observations made in the developmental psychology literature and how we construct an artificial data-set. We finally provide a comparison between the developmental progress of children at different ages and that of optimized models for different data-set sizes. We conclude the article with a discussion of the obtained results and an outlook of the future interaction between the areas of machine learning and developmental psychology.</p>
    </para>
  </section>
  <section inlist="toc" xml:id="S2">
    <tags>
      <tag>2</tag>
      <tag role="refnum">2</tag>
      <tag role="typerefnum">§2</tag>
    </tags>
    <title><tag close=" ">2</tag>Methods</title>
    <subsection inlist="toc" xml:id="S2.SS1">
      <tags>
        <tag>2.1</tag>
        <tag role="refnum">2.1</tag>
        <tag role="typerefnum">§2.1</tag>
      </tags>
      <title><tag close=" ">2.1</tag>Deep Learning</title>
<!--  %**** cogsci˙template.tex Line 75 **** -->      <para xml:id="S2.SS1.p1">
        <p>Neural networks are parametric function approximators, that combine linear transformations <Math mode="inline" tex="\mathbf{W}" text="W" xml:id="S2.SS1.p1.m1">
            <XMath>
              <XMTok font="bold" role="UNKNOWN">W</XMTok>
            </XMath>
          </Math> and non-linear activation functions <Math mode="inline" tex="f" text="f" xml:id="S2.SS1.p1.m2">
            <XMath>
              <XMTok font="italic" role="UNKNOWN">f</XMTok>
            </XMath>
          </Math> in alternating fashion:</p>
        <equation xml:id="S2.Ex1">
          <Math mode="display" tex="\mathbf{h}_{l}=f_{l}\left(\mathbf{W}_{l}^{\top}\mathbf{h}_{l-1}\right)" text="h _ l = f _ l * (W _ l) ^ top * h _ (l - 1)" xml:id="S2.Ex1.m1">
            <XMath>
              <XMApp>
                <XMTok meaning="equals" role="RELOP">=</XMTok>
                <XMApp>
                  <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                  <XMTok font="bold" role="UNKNOWN">h</XMTok>
                  <XMTok font="italic" fontsize="70%" role="UNKNOWN">l</XMTok>
                </XMApp>
                <XMApp>
                  <XMTok meaning="times" role="MULOP">⁢</XMTok>
                  <XMApp>
                    <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                    <XMTok font="italic" role="UNKNOWN">f</XMTok>
                    <XMTok font="italic" fontsize="70%" role="UNKNOWN">l</XMTok>
                  </XMApp>
                  <XMDual>
                    <XMRef idref="S2.Ex1.m1.1"/>
                    <XMWrap>
                      <XMTok role="OPEN" stretchy="true">(</XMTok>
                      <XMApp xml:id="S2.Ex1.m1.1">
                        <XMTok meaning="times" role="MULOP">⁢</XMTok>
                        <XMApp>
                          <XMTok role="SUPERSCRIPTOP" scriptpos="post2"/>
                          <XMApp>
                            <XMTok role="SUBSCRIPTOP" scriptpos="post2"/>
                            <XMTok font="bold" role="UNKNOWN">W</XMTok>
                            <XMTok font="italic" fontsize="70%" role="UNKNOWN">l</XMTok>
                          </XMApp>
                          <XMTok fontsize="70%" meaning="top" role="ADDOP">⊤</XMTok>
                        </XMApp>
                        <XMApp>
                          <XMTok role="SUBSCRIPTOP" scriptpos="post2"/>
                          <XMTok font="bold" role="UNKNOWN">h</XMTok>
                          <XMApp>
                            <XMTok fontsize="70%" meaning="minus" role="ADDOP">-</XMTok>
                            <XMTok font="italic" fontsize="70%" role="UNKNOWN">l</XMTok>
                            <XMTok fontsize="70%" meaning="1" role="NUMBER">1</XMTok>
                          </XMApp>
                        </XMApp>
                      </XMApp>
                      <XMTok role="CLOSE" stretchy="true">)</XMTok>
                    </XMWrap>
                  </XMDual>
                </XMApp>
              </XMApp>
            </XMath>
          </Math>
        </equation>
        <p>where <Math mode="inline" tex="l\in\{1,\ldots,L\}" text="l element-of set@(1, ldots, L)" xml:id="S2.SS1.p1.m3">
            <XMath>
              <XMApp>
                <XMTok meaning="element-of" name="in" role="RELOP">∈</XMTok>
                <XMTok font="italic" role="UNKNOWN">l</XMTok>
                <XMDual>
                  <XMApp>
                    <XMTok meaning="set"/>
                    <XMRef idref="S2.SS1.p1.m3.1"/>
                    <XMRef idref="S2.SS1.p1.m3.2"/>
                    <XMRef idref="S2.SS1.p1.m3.3"/>
                  </XMApp>
                  <XMWrap>
                    <XMTok role="OPEN" stretchy="false">{</XMTok>
                    <XMTok meaning="1" role="NUMBER" xml:id="S2.SS1.p1.m3.1">1</XMTok>
                    <XMTok role="PUNCT">,</XMTok>
                    <XMTok name="ldots" role="ID" xml:id="S2.SS1.p1.m3.2">…</XMTok>
                    <XMTok role="PUNCT">,</XMTok>
                    <XMTok font="italic" role="UNKNOWN" xml:id="S2.SS1.p1.m3.3">L</XMTok>
                    <XMTok role="CLOSE" stretchy="false">}</XMTok>
                  </XMWrap>
                </XMDual>
              </XMApp>
            </XMath>
          </Math>. <Math mode="inline" tex="\mathbf{h}_{0}" text="h _ 0" xml:id="S2.SS1.p1.m4">
            <XMath>
              <XMApp>
                <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                <XMTok font="bold" role="UNKNOWN">h</XMTok>
                <XMTok fontsize="70%" meaning="0" role="NUMBER">0</XMTok>
              </XMApp>
            </XMath>
          </Math> corresponds to the input <Math mode="inline" tex="\mathbf{x}" text="x" xml:id="S2.SS1.p1.m5">
            <XMath>
              <XMTok font="bold" role="UNKNOWN">x</XMTok>
            </XMath>
          </Math> and <Math mode="inline" tex="\mathbf{h}_{L}" text="h _ L" xml:id="S2.SS1.p1.m6">
            <XMath>
              <XMApp>
                <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                <XMTok font="bold" role="UNKNOWN">h</XMTok>
                <XMTok font="italic" fontsize="70%" role="UNKNOWN">L</XMTok>
              </XMApp>
            </XMath>
          </Math> to an estimate of the target <Math mode="inline" tex="\hat{y}" text="hat@(y)" xml:id="S2.SS1.p1.m7">
            <XMath>
              <XMApp>
                <XMTok name="hat" role="OVERACCENT" stretchy="false">^</XMTok>
                <XMTok font="italic" role="UNKNOWN">y</XMTok>
              </XMApp>
            </XMath>
          </Math>. Parameters of the model are commonly updated via gradient descent on a loss function, usually some form of negative log-likelihood. The power of neural networks stems from their ability to approximate any continuous function on a compact subset of <Math mode="inline" tex="\mathbb{R}^{n}" text="R ^ n" xml:id="S2.SS1.p1.m8">
            <XMath>
              <XMApp>
                <XMTok role="SUPERSCRIPTOP" scriptpos="post1"/>
                <XMTok font="blackboard" role="UNKNOWN">R</XMTok>
                <XMTok font="italic" fontsize="70%" role="UNKNOWN">n</XMTok>
              </XMApp>
            </XMath>
          </Math>.</p>
      </para>
    </subsection>
    <subsection inlist="toc" xml:id="S2.SS2">
      <tags>
        <tag>2.2</tag>
        <tag role="refnum">2.2</tag>
        <tag role="typerefnum">§2.2</tag>
      </tags>
      <title><tag close=" ">2.2</tag>Variational Inference</title>
      <para xml:id="S2.SS2.p1">
        <p>The task of learning model parameters <Math mode="inline" tex="\mathbf{W}" text="W" xml:id="S2.SS2.p1.m1">
            <XMath>
              <XMTok font="bold" role="UNKNOWN">W</XMTok>
            </XMath>
          </Math> can also be stated as a Bayesian inference problem:</p>
        <equation labels="LABEL:eq:Bayes" xml:id="S2.E1">
          <tags>
            <tag>(1)</tag>
            <tag role="refnum">1</tag>
          </tags>
          <Math mode="display" tex="\underbrace{p(\mathbf{W}|\mathcal{D})}_{\text{posterior}}=\frac{\overbrace{p(%&#10;\mathbf{y}|\mathbf{X},\mathbf{W})}^{\text{likelihood}}\overbrace{p(\mathbf{W})%&#10;}^{\text{prior}}}{\underbrace{p(\mathbf{y}|\mathbf{X})}_{\text{evidence}}}" text="(underbrace@(p * conditional@(W, D))) _ [posterior] = ((overbrace@(p * conditional@(y, list@(X, W)))) ^ [likelihood] * (overbrace@(p * W)) ^ [prior]) / (underbrace@(p * conditional@(y, X))) _ [evidence]" xml:id="S2.E1.m1">
            <XMath>
              <XMApp>
                <XMTok meaning="equals" role="RELOP">=</XMTok>
                <XMApp scriptpos="mid">
                  <XMTok role="SUBSCRIPTOP" scriptpos="mid1"/>
                  <XMApp scriptpos="mid">
                    <XMTok name="underbrace" role="UNDERACCENT">⏟</XMTok>
                    <XMApp>
                      <XMTok meaning="times" role="MULOP">⁢</XMTok>
                      <XMTok font="italic" role="UNKNOWN">p</XMTok>
                      <XMDual>
                        <XMRef idref="S2.E1.m1.1"/>
                        <XMWrap>
                          <XMTok role="OPEN" stretchy="false">(</XMTok>
                          <XMApp xml:id="S2.E1.m1.1">
                            <XMTok meaning="conditional" role="MODIFIEROP" stretchy="false">|</XMTok>
                            <XMTok font="bold" role="UNKNOWN">W</XMTok>
                            <XMTok font="caligraphic" role="UNKNOWN">D</XMTok>
                          </XMApp>
                          <XMTok role="CLOSE" stretchy="false">)</XMTok>
                        </XMWrap>
                      </XMDual>
                    </XMApp>
                  </XMApp>
                  <XMText><text fontsize="70%">posterior</text></XMText>
                </XMApp>
                <XMApp>
                  <XMTok mathstyle="display" meaning="divide" role="FRACOP"/>
                  <XMApp>
                    <XMTok meaning="times" role="MULOP">⁢</XMTok>
                    <XMApp scriptpos="mid">
                      <XMTok role="SUPERSCRIPTOP" scriptpos="mid2"/>
                      <XMApp scriptpos="mid">
                        <XMTok name="overbrace" role="OVERACCENT">⏞</XMTok>
                        <XMApp>
                          <XMTok meaning="times" role="MULOP">⁢</XMTok>
                          <XMTok font="italic" role="UNKNOWN">p</XMTok>
                          <XMDual>
                            <XMRef idref="S2.E1.m1.4"/>
                            <XMWrap>
                              <XMTok role="OPEN" stretchy="false">(</XMTok>
                              <XMApp xml:id="S2.E1.m1.4">
                                <XMTok meaning="conditional" role="MODIFIEROP" stretchy="false">|</XMTok>
                                <XMTok font="bold" role="UNKNOWN">y</XMTok>
                                <XMDual>
                                  <XMApp>
                                    <XMTok meaning="list"/>
                                    <XMRef idref="S2.E1.m1.2"/>
                                    <XMRef idref="S2.E1.m1.3"/>
                                  </XMApp>
                                  <XMWrap>
                                    <XMTok font="bold" role="UNKNOWN" xml:id="S2.E1.m1.2">X</XMTok>
                                    <XMTok role="PUNCT">,</XMTok>
                                    <XMTok font="bold" role="UNKNOWN" xml:id="S2.E1.m1.3">W</XMTok>
                                  </XMWrap>
                                </XMDual>
                              </XMApp>
                              <XMTok role="CLOSE" stretchy="false">)</XMTok>
                            </XMWrap>
                          </XMDual>
                        </XMApp>
                      </XMApp>
                      <XMText><text fontsize="70%">likelihood</text></XMText>
                    </XMApp>
                    <XMApp scriptpos="mid">
                      <XMTok role="SUPERSCRIPTOP" scriptpos="mid2"/>
                      <XMApp scriptpos="mid">
                        <XMTok name="overbrace" role="OVERACCENT">⏞</XMTok>
                        <XMApp>
                          <XMTok meaning="times" role="MULOP">⁢</XMTok>
                          <XMTok font="italic" role="UNKNOWN">p</XMTok>
                          <XMDual>
                            <XMRef idref="S2.E1.m1.5"/>
                            <XMWrap>
                              <XMTok role="OPEN" stretchy="false">(</XMTok>
                              <XMTok font="bold" role="UNKNOWN" xml:id="S2.E1.m1.5">W</XMTok>
                              <XMTok role="CLOSE" stretchy="false">)</XMTok>
                            </XMWrap>
                          </XMDual>
                        </XMApp>
                      </XMApp>
                      <XMText><text fontsize="70%">prior</text></XMText>
                    </XMApp>
                  </XMApp>
                  <XMApp scriptpos="mid">
                    <XMTok role="SUBSCRIPTOP" scriptpos="mid2"/>
                    <XMApp scriptpos="mid">
                      <XMTok name="underbrace" role="UNDERACCENT">⏟</XMTok>
                      <XMApp>
                        <XMTok meaning="times" role="MULOP">⁢</XMTok>
                        <XMTok font="italic" role="UNKNOWN">p</XMTok>
                        <XMDual>
                          <XMRef idref="S2.E1.m1.6"/>
                          <XMWrap>
                            <XMTok role="OPEN" stretchy="false">(</XMTok>
                            <XMApp xml:id="S2.E1.m1.6">
                              <XMTok meaning="conditional" role="MODIFIEROP" stretchy="false">|</XMTok>
                              <XMTok font="bold" role="UNKNOWN">y</XMTok>
                              <XMTok font="bold" role="UNKNOWN">X</XMTok>
                            </XMApp>
                            <XMTok role="CLOSE" stretchy="false">)</XMTok>
                          </XMWrap>
                        </XMDual>
                      </XMApp>
                    </XMApp>
                    <XMText><text fontsize="70%">evidence</text></XMText>
                  </XMApp>
                </XMApp>
              </XMApp>
            </XMath>
          </Math>
        </equation>
      </para>
      <para xml:id="S2.SS2.p2">
        <p>for a given data set <Math mode="inline" tex="\mathcal{D}=\{(\mathbf{x}_{i},y_{i})\}_{i=1}^{N}" text="D = ((set@(open-interval@(x _ i, y _ i))) _ (i = 1)) ^ N" xml:id="S2.SS2.p2.m1">
            <XMath>
              <XMApp>
                <XMTok meaning="equals" role="RELOP">=</XMTok>
                <XMTok font="caligraphic" role="UNKNOWN">D</XMTok>
                <XMApp>
                  <XMTok role="SUPERSCRIPTOP" scriptpos="post1"/>
                  <XMApp>
                    <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                    <XMDual>
                      <XMApp>
                        <XMTok meaning="set"/>
                        <XMRef idref="S2.SS2.p2.m1.1"/>
                      </XMApp>
                      <XMWrap>
                        <XMTok role="OPEN" stretchy="false">{</XMTok>
                        <XMDual xml:id="S2.SS2.p2.m1.1">
                          <XMApp>
                            <XMTok meaning="open-interval"/>
                            <XMRef idref="S2.SS2.p2.m1.1.1"/>
                            <XMRef idref="S2.SS2.p2.m1.1.2"/>
                          </XMApp>
                          <XMWrap>
                            <XMTok role="OPEN" stretchy="false">(</XMTok>
                            <XMApp xml:id="S2.SS2.p2.m1.1.1">
                              <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                              <XMTok font="bold" role="UNKNOWN">x</XMTok>
                              <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                            </XMApp>
                            <XMTok role="PUNCT">,</XMTok>
                            <XMApp xml:id="S2.SS2.p2.m1.1.2">
                              <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                              <XMTok font="italic" role="UNKNOWN">y</XMTok>
                              <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                            </XMApp>
                            <XMTok role="CLOSE" stretchy="false">)</XMTok>
                          </XMWrap>
                        </XMDual>
                        <XMTok role="CLOSE" stretchy="false">}</XMTok>
                      </XMWrap>
                    </XMDual>
                    <XMApp>
                      <XMTok fontsize="70%" meaning="equals" role="RELOP">=</XMTok>
                      <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                      <XMTok fontsize="70%" meaning="1" role="NUMBER">1</XMTok>
                    </XMApp>
                  </XMApp>
                  <XMTok font="italic" fontsize="70%" role="UNKNOWN">N</XMTok>
                </XMApp>
              </XMApp>
            </XMath>
          </Math>, with inputs <Math mode="inline" tex="\mathbf{X}=\{\mathbf{x}_{i}\}_{i=1}^{N}" text="X = ((set@(x _ i)) _ (i = 1)) ^ N" xml:id="S2.SS2.p2.m2">
            <XMath>
              <XMApp>
                <XMTok meaning="equals" role="RELOP">=</XMTok>
                <XMTok font="bold" role="UNKNOWN">X</XMTok>
                <XMApp>
                  <XMTok role="SUPERSCRIPTOP" scriptpos="post1"/>
                  <XMApp>
                    <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                    <XMDual>
                      <XMApp>
                        <XMTok meaning="set"/>
                        <XMRef idref="S2.SS2.p2.m2.1"/>
                      </XMApp>
                      <XMWrap>
                        <XMTok role="OPEN" stretchy="false">{</XMTok>
                        <XMApp xml:id="S2.SS2.p2.m2.1">
                          <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                          <XMTok font="bold" role="UNKNOWN">x</XMTok>
                          <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                        </XMApp>
                        <XMTok role="CLOSE" stretchy="false">}</XMTok>
                      </XMWrap>
                    </XMDual>
                    <XMApp>
                      <XMTok fontsize="70%" meaning="equals" role="RELOP">=</XMTok>
                      <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                      <XMTok fontsize="70%" meaning="1" role="NUMBER">1</XMTok>
                    </XMApp>
                  </XMApp>
                  <XMTok font="italic" fontsize="70%" role="UNKNOWN">N</XMTok>
                </XMApp>
              </XMApp>
            </XMath>
          </Math> and targets <Math mode="inline" tex="\mathbf{y}=\{y_{i}\}_{i=1}^{N}" text="y = ((set@(y _ i)) _ (i = 1)) ^ N" xml:id="S2.SS2.p2.m3">
            <XMath>
              <XMApp>
                <XMTok meaning="equals" role="RELOP">=</XMTok>
                <XMTok font="bold" role="UNKNOWN">y</XMTok>
                <XMApp>
                  <XMTok role="SUPERSCRIPTOP" scriptpos="post1"/>
                  <XMApp>
                    <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                    <XMDual>
                      <XMApp>
                        <XMTok meaning="set"/>
                        <XMRef idref="S2.SS2.p2.m3.1"/>
                      </XMApp>
                      <XMWrap>
                        <XMTok role="OPEN" stretchy="false">{</XMTok>
                        <XMApp xml:id="S2.SS2.p2.m3.1">
                          <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                          <XMTok font="italic" role="UNKNOWN">y</XMTok>
                          <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                        </XMApp>
                        <XMTok role="CLOSE" stretchy="false">}</XMTok>
                      </XMWrap>
                    </XMDual>
                    <XMApp>
                      <XMTok fontsize="70%" meaning="equals" role="RELOP">=</XMTok>
                      <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                      <XMTok fontsize="70%" meaning="1" role="NUMBER">1</XMTok>
                    </XMApp>
                  </XMApp>
                  <XMTok font="italic" fontsize="70%" role="UNKNOWN">N</XMTok>
                </XMApp>
              </XMApp>
            </XMath>
          </Math>. Bayes’ theorem defines how we should update our beliefs as more information becomes available. As <Math mode="inline" tex="N\rightarrow\infty" text="N rightarrow infinity" xml:id="S2.SS2.p2.m4">
            <XMath>
              <XMApp>
                <XMTok name="rightarrow" role="ARROW">→</XMTok>
                <XMTok font="italic" role="UNKNOWN">N</XMTok>
                <XMTok meaning="infinity" name="infty" role="ID">∞</XMTok>
              </XMApp>
            </XMath>
          </Math> the influence of the prior vanishes, while for <Math mode="inline" tex="N\rightarrow 0" text="N rightarrow 0" xml:id="S2.SS2.p2.m5">
            <XMath>
              <XMApp>
                <XMTok name="rightarrow" role="ARROW">→</XMTok>
                <XMTok font="italic" role="UNKNOWN">N</XMTok>
                <XMTok meaning="0" role="NUMBER">0</XMTok>
              </XMApp>
            </XMath>
          </Math> we can only rely on prior assumptions. In our context we assume that experience (i.e. <Math mode="inline" tex="N" text="N" xml:id="S2.SS2.p2.m6">
            <XMath>
              <XMTok font="italic" role="UNKNOWN">N</XMTok>
            </XMath>
          </Math>) increases with age and hence we use an approximation to Equation <ref labelref="LABEL:eq:Bayes"/> with data-sets of varying size to represent agents of different age. <break/></p>
      </para>
      <para xml:id="S2.SS2.p3">
        <p>Equation <ref labelref="LABEL:eq:Bayes"/> is in general hard to compute for models of useful complexity. Variational inference offers a tractable approximation to Equation <ref labelref="LABEL:eq:Bayes"/> <cite class="ltx_citemacro_cite"><bibref bibrefs="hinton1993keeping" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
              <bibrefphrase>(</bibrefphrase>
              <bibrefphrase>)</bibrefphrase>
            </bibref></cite>. Let <Math mode="inline" tex="q_{\phi}(\mathbf{W})" text="q _ phi * W" xml:id="S2.SS2.p3.m1">
            <XMath>
              <XMApp>
                <XMTok meaning="times" role="MULOP">⁢</XMTok>
                <XMApp>
                  <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                  <XMTok font="italic" role="UNKNOWN">q</XMTok>
                  <XMTok font="italic" fontsize="70%" name="phi" role="UNKNOWN">ϕ</XMTok>
                </XMApp>
                <XMDual>
                  <XMRef idref="S2.SS2.p3.m1.1"/>
                  <XMWrap>
                    <XMTok role="OPEN" stretchy="false">(</XMTok>
                    <XMTok font="bold" role="UNKNOWN" xml:id="S2.SS2.p3.m1.1">W</XMTok>
                    <XMTok role="CLOSE" stretchy="false">)</XMTok>
                  </XMWrap>
                </XMDual>
              </XMApp>
            </XMath>
          </Math> be a distribution with parameters <Math mode="inline" tex="\phi" text="phi" xml:id="S2.SS2.p3.m2">
            <XMath>
              <XMTok font="italic" name="phi" role="UNKNOWN">ϕ</XMTok>
            </XMath>
          </Math>, that approximates the true posterior <Math mode="inline" tex="p(\mathbf{W}|\mathcal{D})" text="p * conditional@(W, D)" xml:id="S2.SS2.p3.m3">
            <XMath>
              <XMApp>
                <XMTok meaning="times" role="MULOP">⁢</XMTok>
                <XMTok font="italic" role="UNKNOWN">p</XMTok>
                <XMDual>
                  <XMRef idref="S2.SS2.p3.m3.1"/>
                  <XMWrap>
                    <XMTok role="OPEN" stretchy="false">(</XMTok>
                    <XMApp xml:id="S2.SS2.p3.m3.1">
                      <XMTok meaning="conditional" role="MODIFIEROP" stretchy="false">|</XMTok>
                      <XMTok font="bold" role="UNKNOWN">W</XMTok>
                      <XMTok font="caligraphic" role="UNKNOWN">D</XMTok>
                    </XMApp>
                    <XMTok role="CLOSE" stretchy="false">)</XMTok>
                  </XMWrap>
                </XMDual>
              </XMApp>
            </XMath>
          </Math>. Formulating the problem as a minimization of the Kullback-Leibler (KL) divergence between <Math mode="inline" tex="q_{\phi}(\mathbf{W})" text="q _ phi * W" xml:id="S2.SS2.p3.m4">
            <XMath>
              <XMApp>
                <XMTok meaning="times" role="MULOP">⁢</XMTok>
                <XMApp>
                  <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                  <XMTok font="italic" role="UNKNOWN">q</XMTok>
                  <XMTok font="italic" fontsize="70%" name="phi" role="UNKNOWN">ϕ</XMTok>
                </XMApp>
                <XMDual>
                  <XMRef idref="S2.SS2.p3.m4.1"/>
                  <XMWrap>
                    <XMTok role="OPEN" stretchy="false">(</XMTok>
                    <XMTok font="bold" role="UNKNOWN" xml:id="S2.SS2.p3.m4.1">W</XMTok>
                    <XMTok role="CLOSE" stretchy="false">)</XMTok>
                  </XMWrap>
                </XMDual>
              </XMApp>
            </XMath>
          </Math> and <Math mode="inline" tex="p(\mathbf{W}|\mathcal{D})" text="p * conditional@(W, D)" xml:id="S2.SS2.p3.m5">
            <XMath>
              <XMApp>
                <XMTok meaning="times" role="MULOP">⁢</XMTok>
                <XMTok font="italic" role="UNKNOWN">p</XMTok>
                <XMDual>
                  <XMRef idref="S2.SS2.p3.m5.1"/>
                  <XMWrap>
                    <XMTok role="OPEN" stretchy="false">(</XMTok>
                    <XMApp xml:id="S2.SS2.p3.m5.1">
                      <XMTok meaning="conditional" role="MODIFIEROP" stretchy="false">|</XMTok>
                      <XMTok font="bold" role="UNKNOWN">W</XMTok>
                      <XMTok font="caligraphic" role="UNKNOWN">D</XMTok>
                    </XMApp>
                    <XMTok role="CLOSE" stretchy="false">)</XMTok>
                  </XMWrap>
                </XMDual>
              </XMApp>
            </XMath>
          </Math> leads to the evidence lower bound (ELBO):</p>
        <equation labels="LABEL:eq:elbo" xml:id="S2.E2">
          <tags>
            <tag>(2)</tag>
            <tag role="refnum">2</tag>
          </tags>
          <Math mode="display" tex="\mathcal{L}(\phi)=\mathbb{E}_{q_{\phi}(\mathbf{W})}\left[\log p(\mathbf{y}|%&#10;\mathbf{X},\mathbf{W})\right]-\text{KL}(q_{\phi}(\mathbf{W})||p(\mathbf{W}))" xml:id="S2.E2.m1">
            <XMath>
              <XMTok font="caligraphic" role="UNKNOWN">L</XMTok>
              <XMWrap>
                <XMTok role="OPEN" stretchy="false">(</XMTok>
                <XMTok font="italic" name="phi" role="UNKNOWN" xml:id="S2.E2.m1.2">ϕ</XMTok>
                <XMTok role="CLOSE" stretchy="false">)</XMTok>
              </XMWrap>
              <XMTok meaning="equals" role="RELOP">=</XMTok>
              <XMApp>
                <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                <XMTok font="blackboard" role="UNKNOWN">E</XMTok>
                <XMApp>
                  <XMTok meaning="times" role="MULOP">⁢</XMTok>
                  <XMApp>
                    <XMTok role="SUBSCRIPTOP" scriptpos="post2"/>
                    <XMTok font="italic" fontsize="70%" role="UNKNOWN">q</XMTok>
                    <XMTok font="italic" fontsize="50%" name="phi" role="UNKNOWN">ϕ</XMTok>
                  </XMApp>
                  <XMDual>
                    <XMRef idref="S2.E2.m1.1"/>
                    <XMWrap>
                      <XMTok fontsize="70%" role="OPEN" stretchy="false">(</XMTok>
                      <XMTok font="bold" fontsize="70%" role="UNKNOWN" xml:id="S2.E2.m1.1">W</XMTok>
                      <XMTok fontsize="70%" role="CLOSE" stretchy="false">)</XMTok>
                    </XMWrap>
                  </XMDual>
                </XMApp>
              </XMApp>
              <XMWrap>
                <XMTok role="OPEN" stretchy="true">[</XMTok>
                <XMTok meaning="logarithm" role="OPFUNCTION">log</XMTok>
                <XMTok font="italic" role="UNKNOWN">p</XMTok>
                <XMWrap>
                  <XMTok role="OPEN" stretchy="false">(</XMTok>
                  <XMTok font="bold" role="UNKNOWN">y</XMTok>
                  <XMTok role="VERTBAR" stretchy="false">|</XMTok>
                  <XMTok font="bold" role="UNKNOWN" xml:id="S2.E2.m1.3">X</XMTok>
                  <XMTok role="PUNCT">,</XMTok>
                  <XMTok font="bold" role="UNKNOWN" xml:id="S2.E2.m1.4">W</XMTok>
                  <XMTok role="CLOSE" stretchy="false">)</XMTok>
                </XMWrap>
                <XMTok role="CLOSE" stretchy="true">]</XMTok>
              </XMWrap>
              <XMTok meaning="minus" role="ADDOP">-</XMTok>
              <XMText>KL</XMText>
              <XMWrap>
                <XMTok role="OPEN" stretchy="false">(</XMTok>
                <XMApp>
                  <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                  <XMTok font="italic" role="UNKNOWN">q</XMTok>
                  <XMTok font="italic" fontsize="70%" name="phi" role="UNKNOWN">ϕ</XMTok>
                </XMApp>
                <XMWrap>
                  <XMTok role="OPEN" stretchy="false">(</XMTok>
                  <XMTok font="bold" role="UNKNOWN" xml:id="S2.E2.m1.5">W</XMTok>
                  <XMTok role="CLOSE" stretchy="false">)</XMTok>
                </XMWrap>
                <XMTok role="VERTBAR" stretchy="false">|</XMTok>
                <XMTok role="VERTBAR" stretchy="false">|</XMTok>
                <XMTok font="italic" role="UNKNOWN">p</XMTok>
                <XMWrap>
                  <XMTok role="OPEN" stretchy="false">(</XMTok>
                  <XMTok font="bold" role="UNKNOWN" xml:id="S2.E2.m1.6">W</XMTok>
                  <XMTok role="CLOSE" stretchy="false">)</XMTok>
                </XMWrap>
                <XMTok role="CLOSE" stretchy="false">)</XMTok>
              </XMWrap>
            </XMath>
          </Math>
        </equation>
      </para>
      <para xml:id="S2.SS2.p4">
        <p>which can be maximized with respect to <Math mode="inline" tex="\phi" text="phi" xml:id="S2.SS2.p4.m1">
            <XMath>
              <XMTok font="italic" name="phi" role="UNKNOWN">ϕ</XMTok>
            </XMath>
          </Math> using standard optimization techniques. <break/></p>
      </para>
      <para xml:id="S2.SS2.p5">
        <p>In order to be able to scale to large data-sets Equation <ref labelref="LABEL:eq:elbo"/> is often approximated using batches <Math mode="inline" tex="\mathcal{B}\subseteq\mathcal{D}" text="B subset-of-or-equals D" xml:id="S2.SS2.p5.m1">
            <XMath>
              <XMApp>
                <XMTok meaning="subset-of-or-equals" name="subseteq" role="RELOP">⊆</XMTok>
                <XMTok font="caligraphic" role="UNKNOWN">B</XMTok>
                <XMTok font="caligraphic" role="UNKNOWN">D</XMTok>
              </XMApp>
            </XMath>
          </Math> of size M, with the log-likelihood term being scaled appropriately:</p>
        <equation labels="LABEL:eq:N" xml:id="S2.E3">
          <tags>
            <tag>(3)</tag>
            <tag role="refnum">3</tag>
          </tags>
          <Math mode="display" tex="\log p(\mathbf{y}|\mathbf{X},\mathbf{W})\approx\frac{N}{M}\sum_{i\in\mathcal{B%&#10;}}\log p(y_{i}|X_{i},\mathbf{W})" text="logarithm@(p) * conditional@(y, list@(X, W)) approximately-equals (N / M) * (sum _ (i element-of B))@(logarithm@(p) * conditional@(y _ i, list@(X _ i, W)))" xml:id="S2.E3.m1">
            <XMath>
              <XMApp>
                <XMTok meaning="approximately-equals" name="approx" role="RELOP">≈</XMTok>
                <XMApp>
                  <XMTok meaning="times" role="MULOP">⁢</XMTok>
                  <XMApp>
                    <XMTok meaning="logarithm" role="OPFUNCTION">log</XMTok>
                    <XMTok font="italic" role="UNKNOWN">p</XMTok>
                  </XMApp>
                  <XMDual>
                    <XMRef idref="S2.E3.m1.4"/>
                    <XMWrap>
                      <XMTok role="OPEN" stretchy="false">(</XMTok>
                      <XMApp xml:id="S2.E3.m1.4">
                        <XMTok meaning="conditional" role="MODIFIEROP" stretchy="false">|</XMTok>
                        <XMTok font="bold" role="UNKNOWN">y</XMTok>
                        <XMDual>
                          <XMApp>
                            <XMTok meaning="list"/>
                            <XMRef idref="S2.E3.m1.1"/>
                            <XMRef idref="S2.E3.m1.2"/>
                          </XMApp>
                          <XMWrap>
                            <XMTok font="bold" role="UNKNOWN" xml:id="S2.E3.m1.1">X</XMTok>
                            <XMTok role="PUNCT">,</XMTok>
                            <XMTok font="bold" role="UNKNOWN" xml:id="S2.E3.m1.2">W</XMTok>
                          </XMWrap>
                        </XMDual>
                      </XMApp>
                      <XMTok role="CLOSE" stretchy="false">)</XMTok>
                    </XMWrap>
                  </XMDual>
                </XMApp>
                <XMApp>
                  <XMTok meaning="times" role="MULOP">⁢</XMTok>
                  <XMApp>
                    <XMTok mathstyle="display" meaning="divide" role="FRACOP"/>
                    <XMTok font="italic" role="UNKNOWN">N</XMTok>
                    <XMTok font="italic" role="UNKNOWN">M</XMTok>
                  </XMApp>
                  <XMApp>
                    <XMApp scriptpos="mid">
                      <XMTok role="SUBSCRIPTOP" scriptpos="mid1"/>
                      <XMTok mathstyle="display" meaning="sum" role="SUMOP" scriptpos="mid">∑</XMTok>
                      <XMApp>
                        <XMTok fontsize="70%" meaning="element-of" name="in" role="RELOP">∈</XMTok>
                        <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                        <XMTok font="caligraphic" fontsize="70%" role="UNKNOWN">B</XMTok>
                      </XMApp>
                    </XMApp>
                    <XMApp>
                      <XMTok meaning="times" role="MULOP">⁢</XMTok>
                      <XMApp>
                        <XMTok meaning="logarithm" role="OPFUNCTION">log</XMTok>
                        <XMTok font="italic" role="UNKNOWN">p</XMTok>
                      </XMApp>
                      <XMDual>
                        <XMRef idref="S2.E3.m1.5"/>
                        <XMWrap>
                          <XMTok role="OPEN" stretchy="false">(</XMTok>
                          <XMApp xml:id="S2.E3.m1.5">
                            <XMTok meaning="conditional" role="MODIFIEROP" stretchy="false">|</XMTok>
                            <XMApp>
                              <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                              <XMTok font="italic" role="UNKNOWN">y</XMTok>
                              <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                            </XMApp>
                            <XMDual>
                              <XMApp>
                                <XMTok meaning="list"/>
                                <XMRef idref="S2.E3.m1.5.1"/>
                                <XMRef idref="S2.E3.m1.3"/>
                              </XMApp>
                              <XMWrap>
                                <XMApp xml:id="S2.E3.m1.5.1">
                                  <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                                  <XMTok font="italic" role="UNKNOWN">X</XMTok>
                                  <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                                </XMApp>
                                <XMTok role="PUNCT">,</XMTok>
                                <XMTok font="bold" role="UNKNOWN" xml:id="S2.E3.m1.3">W</XMTok>
                              </XMWrap>
                            </XMDual>
                          </XMApp>
                          <XMTok role="CLOSE" stretchy="false">)</XMTok>
<!--  %**** cogsci_template.tex Line 100 **** -->                        </XMWrap>
                      </XMDual>
                    </XMApp>
                  </XMApp>
                </XMApp>
              </XMApp>
            </XMath>
          </Math>
        </equation>
      </para>
      <para xml:id="S2.SS2.p6">
        <p>Note, that only the first term of the ELBO depends on the data-set size <Math mode="inline" tex="N" text="N" xml:id="S2.SS2.p6.m1">
            <XMath>
              <XMTok font="italic" role="UNKNOWN">N</XMTok>
            </XMath>
          </Math>, while the second term is independent of it. Hence the divergence term will dominate for small data-sets, leading to models that closely reflect our prior assumptions. In this work we employ priors, that promote simple functions. Therefore, our models are able to capture successively more complex functions with increasing data-set size.</p>
      </para>
    </subsection>
    <subsection inlist="toc" xml:id="S2.SS3">
      <tags>
        <tag>2.3</tag>
        <tag role="refnum">2.3</tag>
        <tag role="typerefnum">§2.3</tag>
      </tags>
      <title><tag close=" ">2.3</tag>Implementational Details</title>
      <para xml:id="S2.SS3.p1">
        <p>All models consist of <Math mode="inline" tex="L=3" text="L = 3" xml:id="S2.SS3.p1.m1">
            <XMath>
              <XMApp>
                <XMTok meaning="equals" role="RELOP">=</XMTok>
                <XMTok font="italic" role="UNKNOWN">L</XMTok>
                <XMTok meaning="3" role="NUMBER">3</XMTok>
              </XMApp>
            </XMath>
          </Math> layers with hidden layer sizes <Math mode="inline" tex="|h_{l}|=256" text="absolute-value@(h _ l) = 256" xml:id="S2.SS3.p1.m2">
            <XMath>
              <XMApp>
                <XMTok meaning="equals" role="RELOP">=</XMTok>
                <XMDual>
                  <XMApp>
                    <XMTok meaning="absolute-value"/>
                    <XMRef idref="S2.SS3.p1.m2.1"/>
                  </XMApp>
                  <XMWrap>
                    <XMTok role="VERTBAR" stretchy="false">|</XMTok>
                    <XMApp xml:id="S2.SS3.p1.m2.1">
                      <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                      <XMTok font="italic" role="UNKNOWN">h</XMTok>
                      <XMTok font="italic" fontsize="70%" role="UNKNOWN">l</XMTok>
                    </XMApp>
                    <XMTok role="VERTBAR" stretchy="false">|</XMTok>
                  </XMWrap>
                </XMDual>
                <XMTok meaning="256" role="NUMBER">256</XMTok>
              </XMApp>
            </XMath>
          </Math> and ELU activation functions <cite class="ltx_citemacro_cite"><bibref bibrefs="clevert2015fast" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
              <bibrefphrase>(</bibrefphrase>
              <bibrefphrase>)</bibrefphrase>
            </bibref></cite>, unless otherwise mentioned. Inputs <Math mode="inline" tex="\mathbf{x}" text="x" xml:id="S2.SS3.p1.m3">
            <XMath>
              <XMTok font="bold" role="UNKNOWN">x</XMTok>
            </XMath>
          </Math> correspond to flattened images of the scene and targets <Math mode="inline" tex="y" text="y" xml:id="S2.SS3.p1.m4">
            <XMath>
              <XMTok font="italic" role="UNKNOWN">y</XMTok>
            </XMath>
          </Math> are dependent on the current task. We place a group horseshoe prior, which can be viewed as a continuous relaxation of a spike-and-slab prior <cite class="ltx_citemacro_cite"><bibref bibrefs="mitchell1988bayesian" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
              <bibrefphrase>(</bibrefphrase>
              <bibrefphrase>)</bibrefphrase>
            </bibref></cite>, over all parameters:</p>
        <equationgroup class="ltx_eqn_align" xml:id="S5.EGx1">
          <equation xml:id="S2.Ex2">
            <MathFork>
              <Math tex="\displaystyle s\sim\mathcal{C}^{+}(0,\tau_{0});~{}~{}~{}\tilde{z}_{i}\sim%&#10;\mathcal{C}^{+}(0,1);" text="formulae@(s similar-to C ^ + * open-interval@(0, tau _ 0), (tilde@(z)) _ i similar-to C ^ + * open-interval@(0, 1))" xml:id="S2.Ex2.m2">
                <XMath>
                  <XMDual>
                    <XMRef idref="S2.Ex2.m2.4"/>
                    <XMWrap>
                      <XMDual xml:id="S2.Ex2.m2.4">
                        <XMApp>
                          <XMTok meaning="formulae"/>
                          <XMRef idref="S2.Ex2.m2.4.1"/>
                          <XMRef idref="S2.Ex2.m2.4.2"/>
                        </XMApp>
                        <XMWrap>
                          <XMApp xml:id="S2.Ex2.m2.4.1">
                            <XMTok meaning="similar-to" name="sim" role="RELOP">∼</XMTok>
                            <XMTok font="italic" role="UNKNOWN">s</XMTok>
                            <XMApp>
                              <XMTok meaning="times" role="MULOP">⁢</XMTok>
                              <XMApp>
                                <XMTok role="SUPERSCRIPTOP" scriptpos="post1"/>
                                <XMTok font="caligraphic" role="UNKNOWN">C</XMTok>
                                <XMTok fontsize="70%" meaning="plus" role="ADDOP">+</XMTok>
                              </XMApp>
                              <XMDual>
                                <XMApp>
                                  <XMTok meaning="open-interval"/>
                                  <XMRef idref="S2.Ex2.m2.1"/>
                                  <XMRef idref="S2.Ex2.m2.4.1.1"/>
                                </XMApp>
                                <XMWrap>
                                  <XMTok role="OPEN" stretchy="false">(</XMTok>
                                  <XMTok meaning="0" role="NUMBER" xml:id="S2.Ex2.m2.1">0</XMTok>
                                  <XMTok role="PUNCT">,</XMTok>
                                  <XMApp xml:id="S2.Ex2.m2.4.1.1">
                                    <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                                    <XMTok font="italic" name="tau" role="UNKNOWN">τ</XMTok>
                                    <XMTok fontsize="70%" meaning="0" role="NUMBER">0</XMTok>
                                  </XMApp>
                                  <XMTok role="CLOSE" stretchy="false">)</XMTok>
                                </XMWrap>
                              </XMDual>
                            </XMApp>
                          </XMApp>
                          <XMTok role="PUNCT" rpadding="9.9pt">;</XMTok>
                          <XMApp xml:id="S2.Ex2.m2.4.2">
                            <XMTok meaning="similar-to" name="sim" role="RELOP">∼</XMTok>
                            <XMApp>
                              <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                              <XMApp>
                                <XMTok name="tilde" role="OVERACCENT" stretchy="false">~</XMTok>
                                <XMTok font="italic" role="UNKNOWN">z</XMTok>
                              </XMApp>
                              <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                            </XMApp>
                            <XMApp>
                              <XMTok meaning="times" role="MULOP">⁢</XMTok>
                              <XMApp>
                                <XMTok role="SUPERSCRIPTOP" scriptpos="post1"/>
                                <XMTok font="caligraphic" role="UNKNOWN">C</XMTok>
                                <XMTok fontsize="70%" meaning="plus" role="ADDOP">+</XMTok>
                              </XMApp>
                              <XMDual>
                                <XMApp>
                                  <XMTok meaning="open-interval"/>
                                  <XMRef idref="S2.Ex2.m2.2"/>
                                  <XMRef idref="S2.Ex2.m2.3"/>
                                </XMApp>
                                <XMWrap>
                                  <XMTok role="OPEN" stretchy="false">(</XMTok>
                                  <XMTok meaning="0" role="NUMBER" xml:id="S2.Ex2.m2.2">0</XMTok>
                                  <XMTok role="PUNCT">,</XMTok>
                                  <XMTok meaning="1" role="NUMBER" xml:id="S2.Ex2.m2.3">1</XMTok>
                                  <XMTok role="CLOSE" stretchy="false">)</XMTok>
                                </XMWrap>
                              </XMDual>
                            </XMApp>
                          </XMApp>
                        </XMWrap>
                      </XMDual>
                      <XMTok role="PUNCT">;</XMTok>
                    </XMWrap>
                  </XMDual>
                </XMath>
              </Math>
              <MathBranch>
                <td align="left"><Math mode="inline" tex="\displaystyle s\sim\mathcal{C}^{+}(0,\tau_{0});~{}~{}~{}\tilde{z}_{i}\sim%&#10;\mathcal{C}^{+}(0,1);" text="formulae@(s similar-to C ^ + * open-interval@(0, tau _ 0), (tilde@(z)) _ i similar-to C ^ + * open-interval@(0, 1))" xml:id="S2.Ex2.m1">
                    <XMath>
                      <XMDual>
                        <XMRef idref="S2.Ex2.m1.4"/>
                        <XMWrap>
                          <XMDual xml:id="S2.Ex2.m1.4">
                            <XMApp>
                              <XMTok meaning="formulae"/>
                              <XMRef idref="S2.Ex2.m1.4.1"/>
                              <XMRef idref="S2.Ex2.m1.4.2"/>
                            </XMApp>
                            <XMWrap>
                              <XMApp xml:id="S2.Ex2.m1.4.1">
                                <XMTok meaning="similar-to" name="sim" role="RELOP">∼</XMTok>
                                <XMTok font="italic" role="UNKNOWN">s</XMTok>
                                <XMApp>
                                  <XMTok meaning="times" role="MULOP">⁢</XMTok>
                                  <XMApp>
                                    <XMTok role="SUPERSCRIPTOP" scriptpos="post1"/>
                                    <XMTok font="caligraphic" role="UNKNOWN">C</XMTok>
                                    <XMTok fontsize="70%" meaning="plus" role="ADDOP">+</XMTok>
                                  </XMApp>
                                  <XMDual>
                                    <XMApp>
                                      <XMTok meaning="open-interval"/>
                                      <XMRef idref="S2.Ex2.m1.1"/>
                                      <XMRef idref="S2.Ex2.m1.4.1.1"/>
                                    </XMApp>
                                    <XMWrap>
                                      <XMTok role="OPEN" stretchy="false">(</XMTok>
                                      <XMTok meaning="0" role="NUMBER" xml:id="S2.Ex2.m1.1">0</XMTok>
                                      <XMTok role="PUNCT">,</XMTok>
                                      <XMApp xml:id="S2.Ex2.m1.4.1.1">
                                        <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                                        <XMTok font="italic" name="tau" role="UNKNOWN">τ</XMTok>
                                        <XMTok fontsize="70%" meaning="0" role="NUMBER">0</XMTok>
                                      </XMApp>
                                      <XMTok role="CLOSE" stretchy="false">)</XMTok>
                                    </XMWrap>
                                  </XMDual>
                                </XMApp>
                              </XMApp>
                              <XMTok role="PUNCT" rpadding="9.9pt">;</XMTok>
                              <XMApp xml:id="S2.Ex2.m1.4.2">
                                <XMTok meaning="similar-to" name="sim" role="RELOP">∼</XMTok>
                                <XMApp>
                                  <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                                  <XMApp>
                                    <XMTok name="tilde" role="OVERACCENT" stretchy="false">~</XMTok>
                                    <XMTok font="italic" role="UNKNOWN">z</XMTok>
                                  </XMApp>
                                  <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                                </XMApp>
                                <XMApp>
                                  <XMTok meaning="times" role="MULOP">⁢</XMTok>
                                  <XMApp>
                                    <XMTok role="SUPERSCRIPTOP" scriptpos="post1"/>
                                    <XMTok font="caligraphic" role="UNKNOWN">C</XMTok>
                                    <XMTok fontsize="70%" meaning="plus" role="ADDOP">+</XMTok>
                                  </XMApp>
                                  <XMDual>
                                    <XMApp>
                                      <XMTok meaning="open-interval"/>
                                      <XMRef idref="S2.Ex2.m1.2"/>
                                      <XMRef idref="S2.Ex2.m1.3"/>
                                    </XMApp>
                                    <XMWrap>
                                      <XMTok role="OPEN" stretchy="false">(</XMTok>
                                      <XMTok meaning="0" role="NUMBER" xml:id="S2.Ex2.m1.2">0</XMTok>
                                      <XMTok role="PUNCT">,</XMTok>
                                      <XMTok meaning="1" role="NUMBER" xml:id="S2.Ex2.m1.3">1</XMTok>
                                      <XMTok role="CLOSE" stretchy="false">)</XMTok>
                                    </XMWrap>
                                  </XMDual>
                                </XMApp>
                              </XMApp>
                            </XMWrap>
                          </XMDual>
                          <XMTok role="PUNCT">;</XMTok>
                        </XMWrap>
                      </XMDual>
                    </XMath>
                  </Math></td>
              </MathBranch>
            </MathFork>
          </equation>
          <equation xml:id="S2.Ex3">
            <MathFork>
              <Math tex="\displaystyle\tilde{w}_{ij}\sim\mathcal{N}(0,1);~{}~{}~{}w_{ij}=\tilde{w}_{ij}%&#10;\tilde{z}_{i}s" text="formulae@((tilde@(w)) _ (i * j) similar-to N * open-interval@(0, 1), w _ (i * j) = (tilde@(w)) _ (i * j) * (tilde@(z)) _ i * s)" xml:id="S2.Ex3.m2">
                <XMath>
                  <XMDual>
                    <XMApp>
                      <XMTok meaning="formulae"/>
                      <XMRef idref="S2.Ex3.m2.3"/>
                      <XMRef idref="S2.Ex3.m2.4"/>
                    </XMApp>
                    <XMWrap>
                      <XMApp xml:id="S2.Ex3.m2.3">
                        <XMTok meaning="similar-to" name="sim" role="RELOP">∼</XMTok>
                        <XMApp>
                          <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                          <XMApp>
                            <XMTok name="tilde" role="OVERACCENT" stretchy="false">~</XMTok>
                            <XMTok font="italic" role="UNKNOWN">w</XMTok>
                          </XMApp>
                          <XMApp>
                            <XMTok meaning="times" role="MULOP">⁢</XMTok>
                            <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                            <XMTok font="italic" fontsize="70%" role="UNKNOWN">j</XMTok>
                          </XMApp>
                        </XMApp>
                        <XMApp>
                          <XMTok meaning="times" role="MULOP">⁢</XMTok>
                          <XMTok font="caligraphic" role="UNKNOWN">N</XMTok>
                          <XMDual>
                            <XMApp>
                              <XMTok meaning="open-interval"/>
                              <XMRef idref="S2.Ex3.m2.1"/>
                              <XMRef idref="S2.Ex3.m2.2"/>
                            </XMApp>
                            <XMWrap>
                              <XMTok role="OPEN" stretchy="false">(</XMTok>
                              <XMTok meaning="0" role="NUMBER" xml:id="S2.Ex3.m2.1">0</XMTok>
                              <XMTok role="PUNCT">,</XMTok>
                              <XMTok meaning="1" role="NUMBER" xml:id="S2.Ex3.m2.2">1</XMTok>
                              <XMTok role="CLOSE" stretchy="false">)</XMTok>
                            </XMWrap>
                          </XMDual>
                        </XMApp>
                      </XMApp>
                      <XMTok role="PUNCT" rpadding="9.9pt">;</XMTok>
                      <XMApp xml:id="S2.Ex3.m2.4">
                        <XMTok meaning="equals" role="RELOP">=</XMTok>
                        <XMApp>
                          <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                          <XMTok font="italic" role="UNKNOWN">w</XMTok>
                          <XMApp>
                            <XMTok meaning="times" role="MULOP">⁢</XMTok>
                            <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                            <XMTok font="italic" fontsize="70%" role="UNKNOWN">j</XMTok>
                          </XMApp>
                        </XMApp>
                        <XMApp>
                          <XMTok meaning="times" role="MULOP">⁢</XMTok>
                          <XMApp>
                            <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                            <XMApp>
                              <XMTok name="tilde" role="OVERACCENT" stretchy="false">~</XMTok>
                              <XMTok font="italic" role="UNKNOWN">w</XMTok>
                            </XMApp>
                            <XMApp>
                              <XMTok meaning="times" role="MULOP">⁢</XMTok>
                              <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                              <XMTok font="italic" fontsize="70%" role="UNKNOWN">j</XMTok>
                            </XMApp>
                          </XMApp>
                          <XMApp>
                            <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                            <XMApp>
                              <XMTok name="tilde" role="OVERACCENT" stretchy="false">~</XMTok>
                              <XMTok font="italic" role="UNKNOWN">z</XMTok>
                            </XMApp>
                            <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                          </XMApp>
                          <XMTok font="italic" role="UNKNOWN">s</XMTok>
                        </XMApp>
                      </XMApp>
                    </XMWrap>
                  </XMDual>
                </XMath>
              </Math>
              <MathBranch>
                <td align="left"><Math mode="inline" tex="\displaystyle\tilde{w}_{ij}\sim\mathcal{N}(0,1);~{}~{}~{}w_{ij}=\tilde{w}_{ij}%&#10;\tilde{z}_{i}s" text="formulae@((tilde@(w)) _ (i * j) similar-to N * open-interval@(0, 1), w _ (i * j) = (tilde@(w)) _ (i * j) * (tilde@(z)) _ i * s)" xml:id="S2.Ex3.m1">
                    <XMath>
                      <XMDual>
                        <XMApp>
                          <XMTok meaning="formulae"/>
                          <XMRef idref="S2.Ex3.m1.3"/>
                          <XMRef idref="S2.Ex3.m1.4"/>
                        </XMApp>
                        <XMWrap>
                          <XMApp xml:id="S2.Ex3.m1.3">
                            <XMTok meaning="similar-to" name="sim" role="RELOP">∼</XMTok>
                            <XMApp>
                              <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                              <XMApp>
                                <XMTok name="tilde" role="OVERACCENT" stretchy="false">~</XMTok>
                                <XMTok font="italic" role="UNKNOWN">w</XMTok>
                              </XMApp>
                              <XMApp>
                                <XMTok meaning="times" role="MULOP">⁢</XMTok>
                                <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                                <XMTok font="italic" fontsize="70%" role="UNKNOWN">j</XMTok>
                              </XMApp>
                            </XMApp>
                            <XMApp>
                              <XMTok meaning="times" role="MULOP">⁢</XMTok>
                              <XMTok font="caligraphic" role="UNKNOWN">N</XMTok>
                              <XMDual>
                                <XMApp>
                                  <XMTok meaning="open-interval"/>
                                  <XMRef idref="S2.Ex3.m1.1"/>
                                  <XMRef idref="S2.Ex3.m1.2"/>
                                </XMApp>
                                <XMWrap>
                                  <XMTok role="OPEN" stretchy="false">(</XMTok>
                                  <XMTok meaning="0" role="NUMBER" xml:id="S2.Ex3.m1.1">0</XMTok>
                                  <XMTok role="PUNCT">,</XMTok>
                                  <XMTok meaning="1" role="NUMBER" xml:id="S2.Ex3.m1.2">1</XMTok>
                                  <XMTok role="CLOSE" stretchy="false">)</XMTok>
                                </XMWrap>
                              </XMDual>
                            </XMApp>
                          </XMApp>
                          <XMTok role="PUNCT" rpadding="9.9pt">;</XMTok>
                          <XMApp xml:id="S2.Ex3.m1.4">
                            <XMTok meaning="equals" role="RELOP">=</XMTok>
                            <XMApp>
                              <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                              <XMTok font="italic" role="UNKNOWN">w</XMTok>
                              <XMApp>
                                <XMTok meaning="times" role="MULOP">⁢</XMTok>
                                <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                                <XMTok font="italic" fontsize="70%" role="UNKNOWN">j</XMTok>
                              </XMApp>
                            </XMApp>
                            <XMApp>
                              <XMTok meaning="times" role="MULOP">⁢</XMTok>
                              <XMApp>
                                <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                                <XMApp>
                                  <XMTok name="tilde" role="OVERACCENT" stretchy="false">~</XMTok>
                                  <XMTok font="italic" role="UNKNOWN">w</XMTok>
                                </XMApp>
                                <XMApp>
                                  <XMTok meaning="times" role="MULOP">⁢</XMTok>
                                  <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                                  <XMTok font="italic" fontsize="70%" role="UNKNOWN">j</XMTok>
                                </XMApp>
                              </XMApp>
                              <XMApp>
                                <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                                <XMApp>
                                  <XMTok name="tilde" role="OVERACCENT" stretchy="false">~</XMTok>
                                  <XMTok font="italic" role="UNKNOWN">z</XMTok>
                                </XMApp>
                                <XMTok font="italic" fontsize="70%" role="UNKNOWN">i</XMTok>
                              </XMApp>
                              <XMTok font="italic" role="UNKNOWN">s</XMTok>
                            </XMApp>
                          </XMApp>
                        </XMWrap>
                      </XMDual>
                    </XMath>
                  </Math></td>
              </MathBranch>
            </MathFork>
          </equation>
        </equationgroup>
      </para>
      <para xml:id="S2.SS3.p2">
        <p>and represent the approximate posterior <Math mode="inline" tex="q_{\phi}(\mathbf{W})" text="q _ phi * W" xml:id="S2.SS3.p2.m1">
            <XMath>
              <XMApp>
                <XMTok meaning="times" role="MULOP">⁢</XMTok>
                <XMApp>
                  <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                  <XMTok font="italic" role="UNKNOWN">q</XMTok>
                  <XMTok font="italic" fontsize="70%" name="phi" role="UNKNOWN">ϕ</XMTok>
                </XMApp>
                <XMDual>
                  <XMRef idref="S2.SS3.p2.m1.1"/>
                  <XMWrap>
                    <XMTok role="OPEN" stretchy="false">(</XMTok>
                    <XMTok font="bold" role="UNKNOWN" xml:id="S2.SS3.p2.m1.1">W</XMTok>
                    <XMTok role="CLOSE" stretchy="false">)</XMTok>
                  </XMWrap>
                </XMDual>
              </XMApp>
            </XMath>
          </Math> through a fully factorized distribution as proposed in <cite class="ltx_citemacro_cite"><bibref bibrefs="louizos2017bayesian" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
              <bibrefphrase>(</bibrefphrase>
              <bibrefphrase>)</bibrefphrase>
            </bibref></cite>. The sparsity hyperparameter of the horseshoe prior is fixed to <Math mode="inline" tex="\tau_{0}=10^{-5}" text="tau _ 0 = 10 ^ (- 5)" xml:id="S2.SS3.p2.m2">
            <XMath>
              <XMApp>
                <XMTok meaning="equals" role="RELOP">=</XMTok>
                <XMApp>
                  <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                  <XMTok font="italic" name="tau" role="UNKNOWN">τ</XMTok>
                  <XMTok fontsize="70%" meaning="0" role="NUMBER">0</XMTok>
                </XMApp>
                <XMApp>
                  <XMTok role="SUPERSCRIPTOP" scriptpos="post1"/>
                  <XMTok meaning="10" role="NUMBER">10</XMTok>
                  <XMApp>
                    <XMTok fontsize="70%" meaning="minus" role="ADDOP">-</XMTok>
                    <XMTok fontsize="70%" meaning="5" role="NUMBER">5</XMTok>
                  </XMApp>
                </XMApp>
              </XMApp>
            </XMath>
          </Math>. During training we approximate the expectation of the log-likelihood term with a single sample from <Math mode="inline" tex="q_{\phi}(\mathbf{W})" text="q _ phi * W" xml:id="S2.SS3.p2.m3">
            <XMath>
              <XMApp>
                <XMTok meaning="times" role="MULOP">⁢</XMTok>
                <XMApp>
                  <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                  <XMTok font="italic" role="UNKNOWN">q</XMTok>
                  <XMTok font="italic" fontsize="70%" name="phi" role="UNKNOWN">ϕ</XMTok>
                </XMApp>
                <XMDual>
                  <XMRef idref="S2.SS3.p2.m3.1"/>
                  <XMWrap>
                    <XMTok role="OPEN" stretchy="false">(</XMTok>
                    <XMTok font="bold" role="UNKNOWN" xml:id="S2.SS3.p2.m3.1">W</XMTok>
                    <XMTok role="CLOSE" stretchy="false">)</XMTok>
                  </XMWrap>
                </XMDual>
              </XMApp>
            </XMath>
          </Math> and make use of the local reparametrization trick <cite class="ltx_citemacro_cite"><bibref bibrefs="kingma2015variational" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
              <bibrefphrase>(</bibrefphrase>
              <bibrefphrase>)</bibrefphrase>
            </bibref></cite>. Gradient-based optimization is performed using Adam <cite class="ltx_citemacro_cite"><bibref bibrefs="kingma2014adam" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
              <bibrefphrase>(</bibrefphrase>
              <bibrefphrase>)</bibrefphrase>
            </bibref></cite> with batches consisting of <Math mode="inline" tex="64" text="64" xml:id="S2.SS3.p2.m4">
            <XMath>
              <XMTok meaning="64" role="NUMBER">64</XMTok>
            </XMath>
          </Math> samples. Results reported after training correspond to a Monte-Carlo estimate using 100 samples from <Math mode="inline" tex="q_{\phi}(\mathbf{W})" text="q _ phi * W" xml:id="S2.SS3.p2.m5">
            <XMath>
              <XMApp>
                <XMTok meaning="times" role="MULOP">⁢</XMTok>
                <XMApp>
                  <XMTok role="SUBSCRIPTOP" scriptpos="post1"/>
                  <XMTok font="italic" role="UNKNOWN">q</XMTok>
                  <XMTok font="italic" fontsize="70%" name="phi" role="UNKNOWN">ϕ</XMTok>
                </XMApp>
                <XMDual>
                  <XMRef idref="S2.SS3.p2.m5.1"/>
                  <XMWrap>
                    <XMTok role="OPEN" stretchy="false">(</XMTok>
                    <XMTok font="bold" role="UNKNOWN" xml:id="S2.SS3.p2.m5.1">W</XMTok>
                    <XMTok role="CLOSE" stretchy="false">)</XMTok>
                  </XMWrap>
                </XMDual>
              </XMApp>
            </XMath>
          </Math>.</p>
      </para>
    </subsection>
  </section>
  <section inlist="toc" xml:id="S3">
    <tags>
      <tag>3</tag>
      <tag role="refnum">3</tag>
      <tag role="typerefnum">§3</tag>
    </tags>
    <title><tag close=" ">3</tag>Experiments</title>
    <para xml:id="S3.p1">
      <p>In this section we present an analysis of the proposed model on three different tasks adopted from the developmental psychology literature. The first two tasks involve reasoning about physical events (occlusion and support), while the last is concerned with the intuitive representation of quantities. For each task we include a summary of empirical observations made in children, alongside a comparison between these results and our models. Code for performing all experiments and generating artificial data-sets is publicly available<note mark="1" role="footnote" xml:id="footnote1"><tags>
            <tag>1</tag>
            <tag role="refnum">1</tag>
            <tag role="typerefnum">footnote 1</tag>
          </tags><ref class="ltx_url" font="typewriter" href="https://github.com/marcelbinz/Developmental-Stages-of-BNNs">https://github.com/marcelbinz/Developmental-Stages-of-BNNs</ref></note>.</p>
    </para>
    <subsection inlist="toc" xml:id="S3.SS1">
      <tags>
        <tag>3.1</tag>
        <tag role="refnum">3.1</tag>
        <tag role="typerefnum">§3.1</tag>
      </tags>
      <title><tag close=" ">3.1</tag>Occlusion Events</title>
      <figure inlist="lof" labels="LABEL:fig:occu" placement="t" xml:id="S3.F1">
        <tags>
          <tag>Figure 1</tag>
          <tag role="refnum">1</tag>
          <tag role="typerefnum">Figure 1</tag>
        </tags>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="125.7pt">
          <para align="center" xml:id="S3.F1.p1">
            <p><text font="bold">Original Task</text>
<!--  %**** cogsci˙template.tex Line 125 **** --></p>
          </para>
        </inline-para>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="108.4pt">
          <para align="center" xml:id="S3.F1.p2">
            <p><text font="bold">Artificial Data</text></p>
          </para>
        </inline-para>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="182.1pt">
          <para align="center" xml:id="S3.F1.p3">
            <p><text font="bold">Results</text></p>
          </para>
        </inline-para>
        <break class="ltx_centering"/>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="125.7pt">
          <para align="center" xml:id="S3.F1.p4">
            <graphics candidates="exp1.png" graphic="exp1.png" options="width=433.62pt" xml:id="S3.F1.p4.g1"/>
          </para>
        </inline-para>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="108.4pt">
          <para align="center" xml:id="S3.F1.p5">
            <graphics candidates="3.pdf" graphic="3.pdf" options="width=195.129pt" xml:id="S3.F1.p5.g1"/>
            <graphics candidates="2.pdf" graphic="2.pdf" options="width=195.129pt" xml:id="S3.F1.p5.g2"/>
            <graphics candidates="1.pdf" graphic="1.pdf" options="width=195.129pt" xml:id="S3.F1.p5.g3"/>
            <graphics candidates="1_2.pdf" graphic="1_2.pdf" options="width=195.129pt" xml:id="S3.F1.p5.g4"/>
          </para>
        </inline-para>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="182.1pt">
          <para align="center" xml:id="S3.F1.p6">
            <graphics candidates="occlusion.pdf" graphic="occlusion.pdf" options="width=390.258pt" xml:id="S3.F1.p6.g1"/>
          </para>
        </inline-para>
        <toccaption class="ltx_centering"><tag close=" ">1</tag>Visualization of the occlusion event experiment. <text font="bold">Left:</text> Schematic illustration of the original setup. Figure adopted from <cite class="ltx_citemacro_cite"><bibref bibrefs="baillargeon2002acquisition" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
              <bibrefphrase>(</bibrefphrase>
              <bibrefphrase>)</bibrefphrase>
            </bibref></cite>. <text font="bold">Center:</text> Examples from the artificial data-set. <text font="bold">Right:</text> Performance for models with access to different amounts of data in the different conditions. Baseline indicates the chance for guessing randomly. Models discover solutions in order of their difficulty (green <Math mode="inline" tex="\rightarrow" text="rightarrow" xml:id="S3.F1.m1">
            <XMath>
              <XMTok name="rightarrow" role="ARROW">→</XMTok>
            </XMath>
          </Math> blue <Math mode="inline" tex="\rightarrow" text="rightarrow" xml:id="S3.F1.m2">
            <XMath>
              <XMTok name="rightarrow" role="ARROW">→</XMTok>
            </XMath>
          </Math> red), which is in accordance with observations made in the developmental psychology literature. </toccaption>
        <caption class="ltx_centering"><tag close=": ">Figure 1</tag>Visualization of the occlusion event experiment. <text font="bold">Left:</text> Schematic illustration of the original setup. Figure adopted from <cite class="ltx_citemacro_cite"><bibref bibrefs="baillargeon2002acquisition" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
              <bibrefphrase>(</bibrefphrase>
              <bibrefphrase>)</bibrefphrase>
            </bibref></cite>. <text font="bold">Center:</text> Examples from the artificial data-set. <text font="bold">Right:</text> Performance for models with access to different amounts of data in the different conditions. Baseline indicates the chance for guessing randomly. Models discover solutions in order of their difficulty (green <Math mode="inline" tex="\rightarrow" text="rightarrow" xml:id="S3.F1.m3">
            <XMath>
              <XMTok name="rightarrow" role="ARROW">→</XMTok>
            </XMath>
          </Math> blue <Math mode="inline" tex="\rightarrow" text="rightarrow" xml:id="S3.F1.m4">
            <XMath>
              <XMTok name="rightarrow" role="ARROW">→</XMTok>
            </XMath>
          </Math> red), which is in accordance with observations made in the developmental psychology literature. </caption>
      </figure>
      <para xml:id="S3.SS1.p1">
        <p>The first task under investigation is concerned with occlusion events. It is based on an experiment conducted by <ERROR class="undefined">\citeA</ERROR>baillargeon2002acquisition. Each scene consists of a cylinder and a screen in form of a rectangular plane. During experimental manipulation the cylinder is moved back and forth behind the screen, while parts of the screen are removed. There are three different experimental conditions, each differing in which part of the screen is removed (top, bottom or everything removed). A depiction of the setup is shown in Figure <ref labelref="LABEL:fig:occu"/> (left). We are interested in infants’ ability to judge, whether the cylinder remains visible as it moves behind the screen, which is measured via violation-of-expectation methods <cite class="ltx_citemacro_cite"><bibref bibrefs="baillargeon1985object" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
              <bibrefphrase>(</bibrefphrase>
              <bibrefphrase>)</bibrefphrase>
            </bibref></cite>. In violation-of-expectation methods gazing times for physically implausible events are measured. High gazing times for such events indicate, that children are surprised by the observation, which is interpreted as a violation of their expectation of what should have happened. <break/></p>
      </para>
      <para xml:id="S3.SS1.p2">
        <p>Empirical evidence indicates, that infants form an initial concept based on a behind/not-behind distinction starting from the age of 2.5 months. At this stage they have learned, that the cylinder remains visible while moving past the screen, if the entire middle section of the screen is removed. They do not expect the cylinder to be visible in the other two conditions. This initial concept is refined during later stages of development. At the age of three months they also predict to see parts of the cylinder, if only the bottom part of the screen is removed. The knowledge of 3.5 months olds additionally extends to screens with removed top fractions. Note, that the last condition is more challenging compared to the other two, as it involves a comparison of heights between the cylinder and the lower connection of the screen (if the connection height is lower than the cylinder, the cylinder will be visible, otherwise it will not be visible). Baillargeon <emph font="italic">et al.</emph> conclude from these observations, that infants start with initially simple rules about the laws of physics (behind/not-behind distinction), which become successively more sophisticated over time (reasoning about relative heights). <break/></p>
      </para>
      <figure inlist="lof" labels="LABEL:fig:support" placement="t" xml:id="S3.F2">
        <tags>
          <tag>Figure 2</tag>
          <tag role="refnum">2</tag>
          <tag role="typerefnum">Figure 2</tag>
        </tags>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="108.4pt">
          <para align="center" xml:id="S3.F2.p1">
            <p><text font="bold">Original Task</text></p>
          </para>
        </inline-para>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="108.4pt">
          <para align="center" xml:id="S3.F2.p2">
            <p><text font="bold">Artificial Data</text></p>
          </para>
        </inline-para>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="186.5pt">
          <para align="center" xml:id="S3.F2.p3">
            <p><text font="bold">Results</text></p>
          </para>
        </inline-para>
        <break class="ltx_centering"/>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="108.4pt">
<!--  %**** cogsci˙template.tex Line 175 **** -->          <para align="center" xml:id="S3.F2.p4">
            <graphics candidates="exp2.png" graphic="exp2.png" options="width=455.301pt" xml:id="S3.F2.p4.g1"/>
          </para>
        </inline-para>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="108.4pt">
          <para align="center" xml:id="S3.F2.p5">
            <graphics candidates="0_s.pdf" graphic="0_s.pdf" options="width=169.1118pt" xml:id="S3.F2.p5.g1"/>
            <graphics candidates="0_2_s.pdf" graphic="0_2_s.pdf" options="width=169.1118pt" xml:id="S3.F2.p5.g2"/>
            <graphics candidates="1_s.pdf" graphic="1_s.pdf" options="width=169.1118pt" xml:id="S3.F2.p5.g3"/>
            <graphics candidates="1_2_s.pdf" graphic="1_2_s.pdf" options="width=169.1118pt" xml:id="S3.F2.p5.g4"/>
            <graphics candidates="2_s.pdf" graphic="2_s.pdf" options="width=169.1118pt" xml:id="S3.F2.p5.g5"/>
            <graphics candidates="2_2_s.pdf" graphic="2_2_s.pdf" options="width=169.1118pt" xml:id="S3.F2.p5.g6"/>
            <graphics candidates="3_s.pdf" graphic="3_s.pdf" options="width=169.1118pt" xml:id="S3.F2.p5.g7"/>
            <graphics candidates="3_2_s.pdf" graphic="3_2_s.pdf" options="width=169.1118pt" xml:id="S3.F2.p5.g8"/>
          </para>
        </inline-para>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="21.7pt">
          <para align="center" xml:id="S3.F2.p6">
            <p><text fontsize="90%">top</text></p>
            <p><text fontsize="90%">side</text></p>
            <p><text fontsize="90%">amount</text></p>
            <p><text fontsize="90%">prop.</text></p>
          </para>
        </inline-para>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="186.5pt">
          <para align="center" xml:id="S3.F2.p7">
            <graphics candidates="support.pdf" graphic="support.pdf" options="width=433.62pt" xml:id="S3.F2.p7.g1"/>
          </para>
        </inline-para>
<!--  %**** cogsci˙template.tex Line 200 **** -->        <toccaption class="ltx_centering"><tag close=" ">2</tag>Visualization of the support event experiment. <text font="bold">Left:</text> Schematic illustration of the original setup. Figure adopted from <cite class="ltx_citemacro_cite"><bibref bibrefs="baillargeon2002acquisition" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
              <bibrefphrase>(</bibrefphrase>
              <bibrefphrase>)</bibrefphrase>
            </bibref></cite>. <text font="bold">Center:</text> Examples from the artificial data-set alongside the name of the corresponding condition. The left column shows stable configurations, while those in the right column are unstable. <text font="bold">Right:</text> Result from training models with access to different amounts of data. Baseline indicates the performance for guessing randomly. Models discover solutions to the easier condition (two upper rows in the left figure) first, and solutions to the harder conditions (two bottom rows in the left figure) later. Note the slight increase in number of samples required for learning between the third and forth condition.
</toccaption>
        <caption class="ltx_centering"><tag close=": ">Figure 2</tag>Visualization of the support event experiment. <text font="bold">Left:</text> Schematic illustration of the original setup. Figure adopted from <cite class="ltx_citemacro_cite"><bibref bibrefs="baillargeon2002acquisition" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
              <bibrefphrase>(</bibrefphrase>
              <bibrefphrase>)</bibrefphrase>
            </bibref></cite>. <text font="bold">Center:</text> Examples from the artificial data-set alongside the name of the corresponding condition. The left column shows stable configurations, while those in the right column are unstable. <text font="bold">Right:</text> Result from training models with access to different amounts of data. Baseline indicates the performance for guessing randomly. Models discover solutions to the easier condition (two upper rows in the left figure) first, and solutions to the harder conditions (two bottom rows in the left figure) later. Note the slight increase in number of samples required for learning between the third and forth condition.
</caption>
      </figure>
      <para xml:id="S3.SS1.p3">
        <p>We construct an artificial version of this task as follows. Each input <Math mode="inline" tex="\mathbf{x}" text="x" xml:id="S3.SS1.p3.m1">
            <XMath>
              <XMTok font="bold" role="UNKNOWN">x</XMTok>
            </XMath>
          </Math> corresponds to a <Math mode="inline" tex="24\times 24" text="24 * 24" xml:id="S3.SS1.p3.m2">
            <XMath>
              <XMApp>
                <XMTok meaning="times" role="MULOP">×</XMTok>
                <XMTok meaning="24" role="NUMBER">24</XMTok>
                <XMTok meaning="24" role="NUMBER">24</XMTok>
              </XMApp>
            </XMath>
          </Math> image with three channels, containing segregated information about the floor, the cylinder and the screen respectively. Input images show an initial configuration of the scene, in which the cylinder is located on the left side of the screen. Each target <Math mode="inline" tex="y" text="y" xml:id="S3.SS1.p3.m3">
            <XMath>
              <XMTok font="italic" role="UNKNOWN">y</XMTok>
            </XMath>
          </Math> indicates the visibility of the cylinder when passing behind the screen, as it is moved towards the right side of the image. Cylinder height and position, screen position and size and floor level are determined randomly. We generate a training and test set, each consisting of 10000 data-points. Figure <ref labelref="LABEL:fig:occu"/> (center) shows examples for each of the conditions. Both sets include 2000 images for each of the three condition, as well as 4000 baseline images (nothing of the screen is removed). This is to ensure, that both <Math mode="inline" tex="y=0" text="y = 0" xml:id="S3.SS1.p3.m4">
            <XMath>
              <XMApp>
                <XMTok meaning="equals" role="RELOP">=</XMTok>
                <XMTok font="italic" role="UNKNOWN">y</XMTok>
                <XMTok meaning="0" role="NUMBER">0</XMTok>
              </XMApp>
            </XMath>
          </Math> (cylinder is visible) and <Math mode="inline" tex="y=1" text="y = 1" xml:id="S3.SS1.p3.m5">
            <XMath>
              <XMApp>
                <XMTok meaning="equals" role="RELOP">=</XMTok>
                <XMTok font="italic" role="UNKNOWN">y</XMTok>
                <XMTok meaning="1" role="NUMBER">1</XMTok>
              </XMApp>
            </XMath>
          </Math> (cylinder is not visible) are represented in equal fractions, i.e. the chance of guessing correctly is 50 percent. <break/></p>
      </para>
      <para xml:id="S3.SS1.p4">
        <p>We train otherwise identical models for <Math mode="inline" tex="N\in\{256,512,1024,2048,8192\}" text="N element-of set@(256, 512, 1024, 2048, 8192)" xml:id="S3.SS1.p4.m1">
            <XMath>
              <XMApp>
                <XMTok meaning="element-of" name="in" role="RELOP">∈</XMTok>
                <XMTok font="italic" role="UNKNOWN">N</XMTok>
                <XMDual>
                  <XMApp>
                    <XMTok meaning="set"/>
                    <XMRef idref="S3.SS1.p4.m1.1"/>
                    <XMRef idref="S3.SS1.p4.m1.2"/>
                    <XMRef idref="S3.SS1.p4.m1.3"/>
                    <XMRef idref="S3.SS1.p4.m1.4"/>
                    <XMRef idref="S3.SS1.p4.m1.5"/>
                  </XMApp>
                  <XMWrap>
                    <XMTok role="OPEN" stretchy="false">{</XMTok>
                    <XMTok meaning="256" role="NUMBER" xml:id="S3.SS1.p4.m1.1">256</XMTok>
                    <XMTok role="PUNCT">,</XMTok>
                    <XMTok meaning="512" role="NUMBER" xml:id="S3.SS1.p4.m1.2">512</XMTok>
                    <XMTok role="PUNCT">,</XMTok>
                    <XMTok meaning="1024" role="NUMBER" xml:id="S3.SS1.p4.m1.3">1024</XMTok>
                    <XMTok role="PUNCT">,</XMTok>
                    <XMTok meaning="2048" role="NUMBER" xml:id="S3.SS1.p4.m1.4">2048</XMTok>
                    <XMTok role="PUNCT">,</XMTok>
                    <XMTok meaning="8192" role="NUMBER" xml:id="S3.SS1.p4.m1.5">8192</XMTok>
                    <XMTok role="CLOSE" stretchy="false">}</XMTok>
                  </XMWrap>
                </XMDual>
              </XMApp>
            </XMath>
          </Math> until convergence and report results averaged over ten random seeds. The resulting performances on the artificial data-set are visualized in Figure <ref labelref="LABEL:fig:occu"/> (right). We observe, that the percentage of incorrect predictions, similar to the developmental progress in infants, decreases first in the easier conditions. The network is able to predict visibility of the cylinder reliably (with less than ten percent errors), if the entire middle section of the screen is removed, for <Math mode="inline" tex="N" text="N" xml:id="S3.SS1.p4.m2">
            <XMath>
              <XMTok font="italic" role="UNKNOWN">N</XMTok>
            </XMath>
          </Math> larger than 512. For <Math mode="inline" tex="N" text="N" xml:id="S3.SS1.p4.m3">
            <XMath>
              <XMTok font="italic" role="UNKNOWN">N</XMTok>
            </XMath>
          </Math> larger than 4096 it is additionally able to predict the correct targets, if the bottom part is removed (corresponding to knowledge of a three months old). This extends to the condition, in which the top part is removed for <Math mode="inline" tex="N=8192" text="N = 8192" xml:id="S3.SS1.p4.m4">
            <XMath>
              <XMApp>
                <XMTok meaning="equals" role="RELOP">=</XMTok>
                <XMTok font="italic" role="UNKNOWN">N</XMTok>
                <XMTok meaning="8192" role="NUMBER">8192</XMTok>
              </XMApp>
            </XMath>
          </Math> (corresponding to knowledge of a 3.5 months old). Hence we conclude, that in this task the family of BNNs recovers the order of developmental stages observed in infants.</p>
      </para>
    </subsection>
    <subsection inlist="toc" xml:id="S3.SS2">
      <tags>
        <tag>3.2</tag>
        <tag role="refnum">3.2</tag>
        <tag role="typerefnum">§3.2</tag>
      </tags>
      <title><tag close=" ">3.2</tag>Support Events</title>
      <para xml:id="S3.SS2.p1">
        <p>Next we take a closer look at infants’ knowledge of support events, for which we adopt another experiment of <ERROR class="undefined">\citeA</ERROR>baillargeon2002acquisition. In this task a scene consists of a box and a platform. The box is presented in different positions relative to the platform and the experimenter measures (via violation-of-expectation methods), if infants are able to predict, whether the given configuration is stable or not. Four different conditions are investigated. In the first the box is positioned either on top of the platform or some distance away from it. This condition requires a simple contact/no-contact distinction to make reliable predictions about the stability of the configuration. The second condition involves a distinction between different types of contact. Here the box connects with the platform either on the top (as before) or on the side. The third condition requires judgments based on the amount of contact, i.e. the box is only partially positioned on the platform. The final condition adds an additional layer of complexity, as it involves reasoning about non-rectangular shapes. The different conditions are summarized in Figure <ref labelref="LABEL:fig:support"/> (left). <break/></p>
      </para>
      <para xml:id="S3.SS2.p2">
        <p>According to <ERROR class="undefined">\citeA</ERROR>baillargeon2002acquisition, from three months onward infants knowledge about the stability of a configuration is captured through a contact/no-contact distinction. At this stage they expect the box to be stable if and only if it touches the platform in some way. This initial hypothesis is than refined, as they grow older. At the age of around five months infants begin to distinguish between different types of contacts. They realize, that the box will only be stable, if it positioned above the platform, but not if it touches it on the side. Starting with an age of 6.5 months they are able to take into account the center of mass of simple objects (rectangular boxes) when reasoning about stability. This is extended to more complex, asymmetrical shapes at an age of roughly twelve months. As in the occlusion task, infants start with an initially simple hypothesis of how the laws of physics work, which is subsequently refined to better fit the observed data. <break/></p>
      </para>
      <para xml:id="S3.SS2.p3">
        <p>In our artificial version of this task inputs <Math mode="inline" tex="\mathbf{x}" text="x" xml:id="S3.SS2.p3.m1">
            <XMath>
              <XMTok font="bold" role="UNKNOWN">x</XMTok>
            </XMath>
          </Math> are represented as <Math mode="inline" tex="24\times 24" text="24 * 24" xml:id="S3.SS2.p3.m2">
            <XMath>
              <XMApp>
                <XMTok meaning="times" role="MULOP">×</XMTok>
                <XMTok meaning="24" role="NUMBER">24</XMTok>
                <XMTok meaning="24" role="NUMBER">24</XMTok>
              </XMApp>
            </XMath>
          </Math> images with three channels (one for platform, box and floor) and targets <Math mode="inline" tex="y" text="y" xml:id="S3.SS2.p3.m3">
            <XMath>
              <XMTok font="italic" role="UNKNOWN">y</XMTok>
            </XMath>
          </Math> are a binary indicator of the stability of the given configuration. Floor level as well as the size and position of the platform and the box are randomized. Again we generate a training set of 10000 samples and an equally large test set. In both sets 2500 data-points belong to each of the described conditions. Within each condition the amount of stable and unstable configurations is balanced, leading to a chance of 50 percent for guessing correctly. Example configurations are shown in Figure <ref labelref="LABEL:fig:support"/> (center). <break/></p>
      </para>
      <figure inlist="lof" labels="LABEL:fig:ans" placement="t" xml:id="S3.F3">
        <tags>
          <tag>Figure 3</tag>
          <tag role="refnum">3</tag>
          <tag role="typerefnum">Figure 3</tag>
        </tags>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="112.7pt">
          <para align="center" xml:id="S3.F3.p1">
            <p><text font="bold">Original Task</text></p>
          </para>
        </inline-para>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="169.1pt">
<!--  %**** cogsci˙template.tex Line 225 **** -->          <para align="center" xml:id="S3.F3.p2">
            <p><text font="bold">Artificial Data</text></p>
          </para>
        </inline-para>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="134.4pt">
          <para align="center" xml:id="S3.F3.p3">
            <p><text font="bold">Results</text></p>
          </para>
        </inline-para>
        <break class="ltx_centering"/>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="112.7pt">
          <para align="center" xml:id="S3.F3.p4">
            <graphics candidates="ans.png" graphic="ans.png" options="width=411.939pt" xml:id="S3.F3.p4.g1"/>
          </para>
        </inline-para>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="169.1pt">
          <para align="center" xml:id="S3.F3.p5">
            <graphics candidates="anspng.png" graphic="anspng.png" options="width=411.939pt" xml:id="S3.F3.p5.g1"/>
          </para>
        </inline-para>
        <inline-para align="center" class="ltx_minipage" vattach="middle" width="134.4pt">
          <para align="center" xml:id="S3.F3.p6">
            <graphics candidates="ansresults.pdf" graphic="ansresults.pdf" options="width=411.939pt" xml:id="S3.F3.p6.g1"/>
          </para>
        </inline-para>
        <toccaption class="ltx_centering"><tag close=" ">3</tag>Visualization of the approximate number system experiment. <text font="bold">Left:</text> Schematic illustration of the original setup. Figure adopted from <cite class="ltx_citemacro_cite"><bibref bibrefs="halberda2008developmental" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
              <bibrefphrase>(</bibrefphrase>
              <bibrefphrase>)</bibrefphrase>
            </bibref></cite>. <text font="bold">Center:</text> Examples from the artificial data-set. This pair corresponds to a ratio of 6:5. <text font="bold">Right:</text> Weber ratios of the experimental data. Red coloring corresponds to estimates from human participants, while blue coloring corresponds to estimates from optimized BNNs. The progression of Weber ratios follows a Weber-Fechner Law and is well described through exponential functions in both cases.</toccaption>
        <caption class="ltx_centering"><tag close=": ">Figure 3</tag>Visualization of the approximate number system experiment. <text font="bold">Left:</text> Schematic illustration of the original setup. Figure adopted from <cite class="ltx_citemacro_cite"><bibref bibrefs="halberda2008developmental" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
              <bibrefphrase>(</bibrefphrase>
              <bibrefphrase>)</bibrefphrase>
            </bibref></cite>. <text font="bold">Center:</text> Examples from the artificial data-set. This pair corresponds to a ratio of 6:5. <text font="bold">Right:</text> Weber ratios of the experimental data. Red coloring corresponds to estimates from human participants, while blue coloring corresponds to estimates from optimized BNNs. The progression of Weber ratios follows a Weber-Fechner Law and is well described through exponential functions in both cases.</caption>
      </figure>
      <para xml:id="S3.SS2.p4">
        <p>As before we train otherwise identical models for <Math mode="inline" tex="N\in\{256,512,1024,2048,8192\}" text="N element-of set@(256, 512, 1024, 2048, 8192)" xml:id="S3.SS2.p4.m1">
            <XMath>
              <XMApp>
                <XMTok meaning="element-of" name="in" role="RELOP">∈</XMTok>
                <XMTok font="italic" role="UNKNOWN">N</XMTok>
                <XMDual>
                  <XMApp>
                    <XMTok meaning="set"/>
                    <XMRef idref="S3.SS2.p4.m1.1"/>
                    <XMRef idref="S3.SS2.p4.m1.2"/>
                    <XMRef idref="S3.SS2.p4.m1.3"/>
                    <XMRef idref="S3.SS2.p4.m1.4"/>
                    <XMRef idref="S3.SS2.p4.m1.5"/>
                  </XMApp>
                  <XMWrap>
                    <XMTok role="OPEN" stretchy="false">{</XMTok>
                    <XMTok meaning="256" role="NUMBER" xml:id="S3.SS2.p4.m1.1">256</XMTok>
                    <XMTok role="PUNCT">,</XMTok>
                    <XMTok meaning="512" role="NUMBER" xml:id="S3.SS2.p4.m1.2">512</XMTok>
                    <XMTok role="PUNCT">,</XMTok>
                    <XMTok meaning="1024" role="NUMBER" xml:id="S3.SS2.p4.m1.3">1024</XMTok>
                    <XMTok role="PUNCT">,</XMTok>
                    <XMTok meaning="2048" role="NUMBER" xml:id="S3.SS2.p4.m1.4">2048</XMTok>
                    <XMTok role="PUNCT">,</XMTok>
                    <XMTok meaning="8192" role="NUMBER" xml:id="S3.SS2.p4.m1.5">8192</XMTok>
                    <XMTok role="CLOSE" stretchy="false">}</XMTok>
                  </XMWrap>
                </XMDual>
              </XMApp>
            </XMath>
          </Math> until convergence and report result averaged over ten random seeds. Inspecting Figure <ref labelref="LABEL:fig:support"/> (right) we observe, that the family of BNNs first discovers solutions to the easier conditions (those where the box is positioned fully on the platform or on the side of it). If we increase <Math mode="inline" tex="N" text="N" xml:id="S3.SS2.p4.m2">
            <XMath>
              <XMTok font="italic" role="UNKNOWN">N</XMTok>
            </XMath>
          </Math> to <Math mode="inline" tex="4096" text="4096" xml:id="S3.SS2.p4.m3">
            <XMath>
              <XMTok meaning="4096" role="NUMBER">4096</XMTok>
            </XMath>
          </Math> or more, they are also able to reason reliably about stability in both of the center of mass conditions. Note, that the error rate decreases slower, when being exposed to the more complex, L-shaped objects, although this difference is only marginal. We conclude, that the models show pattern similar to the developmental progress of infants, as we increase the data-set size, akin to the observation from experiment 1, although not as pronounced.
<!--  %**** cogsci˙template.tex Line 250 **** --></p>
      </para>
    </subsection>
    <subsection inlist="toc" xml:id="S3.SS3">
      <tags>
        <tag>3.3</tag>
        <tag role="refnum">3.3</tag>
        <tag role="typerefnum">§3.3</tag>
      </tags>
      <title><tag close=" ">3.3</tag>Approximate Number System</title>
      <para xml:id="S3.SS3.p1">
        <p>Moving away from probing knowledge about physical laws, we next inspect a different domain: children’s intuitive counting abilities <cite class="ltx_citemacro_cite"><bibref bibrefs="halberda2008developmental" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
              <bibrefphrase>(</bibrefphrase>
              <bibrefphrase>)</bibrefphrase>
            </bibref></cite>. A single trial consists of two images, each containing between 1 and 10 items, and the goal is to determine quickly, i.e. without to much deliberation time, which of the two images contains the larger quantity of items. The display time is adjusted depending on age, such that it is short enough to prevent serial counting. Objects within a trial are identical, but are selected from a set of different objects across trials. Two conditions either control for average item size or the summed continuous area. An example trial from the original task is shown in Figure <ref labelref="LABEL:fig:ans"/> (left). <break/></p>
      </para>
      <para xml:id="S3.SS3.p2">
        <p>Experimental data for this task has been obtained for three to six years olds, as well as for adults <cite class="ltx_citemacro_cite"><bibref bibrefs="halberda2008developmental" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
              <bibrefphrase>(</bibrefphrase>
              <bibrefphrase>)</bibrefphrase>
            </bibref></cite>. Human perception is sensitive to the ratio between amounts of objects in the two images and not their difference, i.e. it follows a Weber-Fechner law. Levels of accuracy are measured for different ratios of objects, ranging from 1.11 (10:9) to 2.0 (2:1), from which Weber ratios are estimated. The Weber ratio is the smallest ratio, where a participant is able to identify the correct stimulus in more than 75 percent of the cases. Empirical results indicate, that the Weber ratio decreases during childhood and our ability to distinguish similar stimuli improves over time. Prior work <cite class="ltx_citemacro_cite"><bibref bibrefs="halberda2008developmental" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
              <bibrefphrase>(</bibrefphrase>
              <bibrefphrase>)</bibrefphrase>
            </bibref></cite> estimates Weber ratios of around 1.53 in three years olds, which improves to 1.38 in four years olds, to 1.23 in five years olds, to 1.18 in six years olds and to 1.11 in adults. Overall the decrease of Weber ratios is well described through an exponential function of age (see Figure <ref labelref="LABEL:fig:ans"/>, right). <break/></p>
      </para>
      <para xml:id="S3.SS3.p3">
        <p>We created an artificial version of this task, with inputs <Math mode="inline" tex="\mathbf{x}" text="x" xml:id="S3.SS3.p3.m1">
            <XMath>
              <XMTok font="bold" role="UNKNOWN">x</XMTok>
            </XMath>
          </Math> corresponding to two <Math mode="inline" tex="24\times 24" text="24 * 24" xml:id="S3.SS3.p3.m2">
            <XMath>
              <XMApp>
                <XMTok meaning="times" role="MULOP">×</XMTok>
                <XMTok meaning="24" role="NUMBER">24</XMTok>
                <XMTok meaning="24" role="NUMBER">24</XMTok>
              </XMApp>
            </XMath>
          </Math> images. For simplicity images contain only rectangular objects and the difference in quantity within a pair of images is always one. Targets <Math mode="inline" tex="y" text="y" xml:id="S3.SS3.p3.m3">
            <XMath>
              <XMTok font="italic" role="UNKNOWN">y</XMTok>
            </XMath>
          </Math> are the number of objects in each images and the final prediction is obtained by comparing expected values between the two estimates. Object sizes are randomly drawn from <Math mode="inline" tex="\{1,2,3\}" text="set@(1, 2, 3)" xml:id="S3.SS3.p3.m4">
            <XMath>
              <XMDual>
                <XMApp>
                  <XMTok meaning="set"/>
                  <XMRef idref="S3.SS3.p3.m4.1"/>
                  <XMRef idref="S3.SS3.p3.m4.2"/>
                  <XMRef idref="S3.SS3.p3.m4.3"/>
                </XMApp>
                <XMWrap>
                  <XMTok role="OPEN" stretchy="false">{</XMTok>
                  <XMTok meaning="1" role="NUMBER" xml:id="S3.SS3.p3.m4.1">1</XMTok>
                  <XMTok role="PUNCT">,</XMTok>
                  <XMTok meaning="2" role="NUMBER" xml:id="S3.SS3.p3.m4.2">2</XMTok>
                  <XMTok role="PUNCT">,</XMTok>
                  <XMTok meaning="3" role="NUMBER" xml:id="S3.SS3.p3.m4.3">3</XMTok>
                  <XMTok role="CLOSE" stretchy="false">}</XMTok>
                </XMWrap>
              </XMDual>
            </XMath>
          </Math> and their position is randomized, such that neighbouring objects do not overlap. We controlled for an equal expected, total area between both images as done in the second condition of the original task. Both training and test set contain 6000 samples for each ratio from <Math mode="inline" tex="\{" text="{" xml:id="S3.SS3.p3.m5">
            <XMath>
              <XMTok role="OPEN" stretchy="false">{</XMTok>
            </XMath>
          </Math>10:9, 9:8, 8:7, 7:6, 6:5, 5:4, 4:3, 3:2, 2:1<Math mode="inline" tex="\}" text="}" xml:id="S3.SS3.p3.m6">
            <XMath>
              <XMTok role="CLOSE" stretchy="false">}</XMTok>
            </XMath>
          </Math>. Examples are provided in Figure <ref labelref="LABEL:fig:ans"/> (center). <break/></p>
      </para>
      <para xml:id="S3.SS3.p4">
        <p>We train one model for each value from <Math mode="inline" tex="N\in\{4,\ldots,15\}\cdot 2048" text="N element-of set@(4, ldots, 15) cdot 2048" xml:id="S3.SS3.p4.m1">
            <XMath>
              <XMApp>
                <XMTok meaning="element-of" name="in" role="RELOP">∈</XMTok>
                <XMTok font="italic" role="UNKNOWN">N</XMTok>
                <XMApp>
                  <XMTok name="cdot" role="MULOP">⋅</XMTok>
                  <XMDual>
                    <XMApp>
                      <XMTok meaning="set"/>
                      <XMRef idref="S3.SS3.p4.m1.1"/>
                      <XMRef idref="S3.SS3.p4.m1.2"/>
                      <XMRef idref="S3.SS3.p4.m1.3"/>
                    </XMApp>
                    <XMWrap>
                      <XMTok role="OPEN" stretchy="false">{</XMTok>
                      <XMTok meaning="4" role="NUMBER" xml:id="S3.SS3.p4.m1.1">4</XMTok>
                      <XMTok role="PUNCT">,</XMTok>
                      <XMTok name="ldots" role="ID" xml:id="S3.SS3.p4.m1.2">…</XMTok>
                      <XMTok role="PUNCT">,</XMTok>
                      <XMTok meaning="15" role="NUMBER" xml:id="S3.SS3.p4.m1.3">15</XMTok>
                      <XMTok role="CLOSE" stretchy="false">}</XMTok>
                    </XMWrap>
                  </XMDual>
                  <XMTok meaning="2048" role="NUMBER">2048</XMTok>
                </XMApp>
              </XMApp>
            </XMath>
          </Math>. Hidden layers have 512 units in this experiment and we found it helpful to initialize weights for all models from a pretrained network. For the resulting models we calculate Weber ratios after estimating accuracy levels of 75 percent via linear interpolation and visualize the results in Figure <ref labelref="LABEL:fig:ans"/> (right). We observe an improvement of Weber ratios as the data-set size <Math mode="inline" tex="N" text="N" xml:id="S3.SS3.p4.m2">
            <XMath>
              <XMTok font="italic" role="UNKNOWN">N</XMTok>
            </XMath>
          </Math> increases and conclude, that our models also follow a Weber-Fechner law. The resulting progression is well described through an exponential function<note mark="2" role="footnote" xml:id="footnote2"><tags>
              <tag>2</tag>
              <tag role="refnum">2</tag>
              <tag role="typerefnum">footnote 2</tag>
            </tags><Math mode="inline" tex="y=ae^{-bx+c}+d" text="y = a * e ^ ((- b * x) + c) + d" xml:id="footnote2.m1">
              <XMath>
                <XMApp>
                  <XMTok meaning="equals" role="RELOP">=</XMTok>
                  <XMTok font="italic" role="UNKNOWN">y</XMTok>
                  <XMApp>
                    <XMTok meaning="plus" role="ADDOP">+</XMTok>
                    <XMApp>
                      <XMTok meaning="times" role="MULOP">⁢</XMTok>
                      <XMTok font="italic" role="UNKNOWN">a</XMTok>
                      <XMApp>
                        <XMTok role="SUPERSCRIPTOP" scriptpos="post1"/>
                        <XMTok font="italic" role="UNKNOWN">e</XMTok>
                        <XMApp>
                          <XMTok fontsize="70%" meaning="plus" role="ADDOP">+</XMTok>
                          <XMApp>
                            <XMTok fontsize="70%" meaning="minus" role="ADDOP">-</XMTok>
                            <XMApp>
                              <XMTok meaning="times" role="MULOP">⁢</XMTok>
                              <XMTok font="italic" fontsize="70%" role="UNKNOWN">b</XMTok>
                              <XMTok font="italic" fontsize="70%" role="UNKNOWN">x</XMTok>
                            </XMApp>
                          </XMApp>
                          <XMTok font="italic" fontsize="70%" role="UNKNOWN">c</XMTok>
                        </XMApp>
                      </XMApp>
                    </XMApp>
                    <XMTok font="italic" role="UNKNOWN">d</XMTok>
                  </XMApp>
                </XMApp>
              </XMath>
            </Math>, with <Math mode="inline" tex="x" text="x" xml:id="footnote2.m2">
              <XMath>
                <XMTok font="italic" role="UNKNOWN">x</XMTok>
              </XMath>
            </Math> corresponding to data-set size or age.</note> of data-set size, a characteristic shared with the curve obtained from human participants. There is however a small gap between the overall model performance in the artificial task and that of human participants in the original one, even for large data-sets.</p>
      </para>
    </subsection>
  </section>
  <section inlist="toc" xml:id="S4">
    <tags>
      <tag>4</tag>
      <tag role="refnum">4</tag>
      <tag role="typerefnum">§4</tag>
    </tags>
    <title><tag close=" ">4</tag>Discussion</title>
    <para xml:id="S4.p1">
      <p>We investigated the progress of Bayesian Neural Networks with access to increasingly large data-sets on three different tasks. In all three examples we find an at least partial agreement between the development of our artificial learning systems and findings from the developmental psychology literature. However we also observe some considerable differences. The best performing BNNs in the quantity comparison task, for example, do not reach the level of human adults. We attribute this effect to difficulties for standard neural networks architectures on relational reasoning tasks and hypothesize that recent advances in visual relational reasoning <cite class="ltx_citemacro_cite"><bibref bibrefs="santoro2017simple" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
            <bibrefphrase>(</bibrefphrase>
            <bibrefphrase>)</bibrefphrase>
          </bibref></cite> could close this gap. In general we interpret our results as additional evidence for Bayesian theories of cognition and learning <cite class="ltx_citemacro_cite"><bibref bibrefs="griffiths2008bayesian" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
            <bibrefphrase>(</bibrefphrase>
            <bibrefphrase>)</bibrefphrase>
          </bibref></cite>. <break/></p>
    </para>
    <para xml:id="S4.p2">
      <p>The discriminative models employed in this work require large amounts of input-target pairs to obtain the desired result. Children on the other hand have to operate in a much more data-efficient manner, as they do not have constant access to a teacher providing correct targets. One approach to resolve the question of sources, that children use for learning, are generative models, which are able to discover underlying structures without explicitly provided targets. Whether our results transfer to generative models remains to be seen. Indeed applying generative models in this context would enable us to measure performance in artificial systems directly via violation-of-expectation methods, as was done with infants in two of our examples. In the future it would be natural to extend our work to more realistic settings and apply different architectures, such as recurrent or convolutional networks. <break/></p>
    </para>
    <para xml:id="S4.p3">
      <p>We believe there are exciting opportunities for research on the intersection of machine learning and developmental psychology. On one hand insights from developmental psychology can provide guidelines of how to build more intelligent, human-like systems. The machine learning framework on the other hand enables researchers to formulate normative theories, that can be empirically verified. We already see some progress in these areas. Examples include the usage of violation-of-expectation methods for probing the knowledge of deep networks <cite class="ltx_citemacro_cite"><bibref bibrefs="DBLP:journals/corr/abs-1804-01128" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
            <bibrefphrase>(</bibrefphrase>
            <bibrefphrase>)</bibrefphrase>
          </bibref></cite> or the proposal to select training curricula for machine learning systems, based on how children obtain samples <cite class="ltx_citemacro_cite"><bibref bibrefs="10.3389/fpsyg.2017.02124" separator=";" show="Authors Phrase1YearPhrase2" yyseparator=",">
            <bibrefphrase>(</bibrefphrase>
            <bibrefphrase>)</bibrefphrase>
          </bibref></cite>.</p>
    </para>
  </section>
  <section inlist="toc" xml:id="S5">
    <tags>
      <tag>5</tag>
      <tag role="refnum">5</tag>
      <tag role="typerefnum">§5</tag>
    </tags>
    <title><tag close=" ">5</tag>Acknowledgments</title>
    <para xml:id="S5.p1">
      <p>This work was supported by the DFG GRK-RTG 2271 ’Breaking Expectations’.</p>
    </para>
<!--  %**** cogsci˙template.tex Line 275 **** -->  </section>
  <bibliography xml:id="bib">
    <title>References</title>
    <biblist>
      <bibitem key="baillargeon2002acquisition" xml:id="bib.bib1">
        <tags>
          <tag role="number">1</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>2002</tag>
          <tag role="authors">Baillargeon</tag>
          <tag role="fullauthors">Baillargeon</tag>
          <tag role="refnum">Baillargeon (<ERROR class="undefined">\APACyear</ERROR>2002)</tag>
          <tag role="key">baillargeon2002acquisition</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>baillargeon2002acquisition<ERROR class="undefined">{APACrefauthors}</ERROR>Baillargeon, R. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>2002.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>The acquisition of physical knowledge in infancy: A
summary in eight lessons The acquisition of physical knowledge in infancy:
A summary in eight lessons.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\APACjournalVolNumPages</ERROR>Blackwell handbook of childhood cognitive
<!--  %**** cogsci˙template.bbl Line 25 **** -->development146-831.
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="baillargeon2009account" xml:id="bib.bib2">
        <tags>
          <tag role="number">2</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>2009</tag>
          <tag role="authors">Baillargeon <ERROR class="undefined">\BOthers</ERROR>.</tag>
          <tag role="fullauthors">Baillargeon, Li, Ng<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Yuan</tag>
          <tag role="refnum">Baillargeon <ERROR class="undefined">\BOthers</ERROR>. (<ERROR class="undefined">\APACyear</ERROR>2009)</tag>
          <tag role="key">baillargeon2009account</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>baillargeon2009account<ERROR class="undefined">{APACrefauthors}</ERROR>Baillargeon, R., Li, J., Ng, W.<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Yuan, S. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>2009.
<!--  %**** cogsci˙template.bbl Line 50 **** --></bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>An account of infants’ physical reasoning An account
of infants’ physical reasoning.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\APACjournalVolNumPages</ERROR>Learning and the infant mind66–116.
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="baillargeon1985object" xml:id="bib.bib3">
        <tags>
          <tag role="number">3</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>1985</tag>
          <tag role="authors">Baillargeon <ERROR class="undefined">\BOthers</ERROR>.</tag>
          <tag role="fullauthors">Baillargeon, Spelke<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Wasserman</tag>
          <tag role="refnum">Baillargeon <ERROR class="undefined">\BOthers</ERROR>. (<ERROR class="undefined">\APACyear</ERROR>1985)</tag>
          <tag role="key">baillargeon1985object</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>baillargeon1985object<ERROR class="undefined">{APACrefauthors}</ERROR>Baillargeon, R., Spelke, E<ERROR class="undefined">\BPBI</ERROR>S.<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Wasserman, S. <!--  %**** cogsci˙template.bbl Line 75 **** --></bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>1985.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>Object permanence in five-month-old infants Object
permanence in five-month-old infants.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\APACjournalVolNumPages</ERROR>Cognition203191–208.
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="baker2009action" xml:id="bib.bib4">
        <tags>
          <tag role="number">4</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>2009</tag>
          <tag role="authors">Baker <ERROR class="undefined">\BOthers</ERROR>.</tag>
          <tag role="fullauthors">Baker, Saxe<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Tenenbaum</tag>
          <tag role="refnum">Baker <ERROR class="undefined">\BOthers</ERROR>. (<ERROR class="undefined">\APACyear</ERROR>2009)</tag>
          <tag role="key">baker2009action</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>baker2009action<ERROR class="undefined">{APACrefauthors}</ERROR>Baker, C<ERROR class="undefined">\BPBI</ERROR>L., Saxe, R.<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Tenenbaum, J<ERROR class="undefined">\BPBI</ERROR>B. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>2009.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>Action understanding as inverse planning Action
understanding as inverse planning.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\APACjournalVolNumPages</ERROR>Cognition1133329–349.
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="battaglia2013simulation" xml:id="bib.bib5">
        <tags>
          <tag role="number">5</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>2013</tag>
          <tag role="authors">Battaglia <ERROR class="undefined">\BOthers</ERROR>.</tag>
          <tag role="fullauthors">Battaglia, Hamrick<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Tenenbaum</tag>
          <tag role="refnum">Battaglia <ERROR class="undefined">\BOthers</ERROR>. (<ERROR class="undefined">\APACyear</ERROR>2013)</tag>
          <tag role="key">battaglia2013simulation</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>battaglia2013simulation<ERROR class="undefined">{APACrefauthors}</ERROR>Battaglia, P<ERROR class="undefined">\BPBI</ERROR>W.<!--  %**** cogsci˙template.bbl Line 125 **** -->, Hamrick, J<ERROR class="undefined">\BPBI</ERROR>B.<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Tenenbaum, J<ERROR class="undefined">\BPBI</ERROR>B. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>2013.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>Simulation as an engine of physical scene understanding
Simulation as an engine of physical scene understanding.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\APACjournalVolNumPages</ERROR>Proceedings of the National Academy of
Sciences201306572.
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="chang2016compositional" xml:id="bib.bib6">
        <tags>
          <tag role="number">6</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>2016</tag>
          <tag role="authors">Chang <ERROR class="undefined">\BOthers</ERROR>.</tag>
          <tag role="fullauthors">Chang, Ullman, Torralba<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Tenenbaum</tag>
          <tag role="refnum">Chang <ERROR class="undefined">\BOthers</ERROR>. (<ERROR class="undefined">\APACyear</ERROR>2016)</tag>
          <tag role="key">chang2016compositional</tag>
        </tags>
        <bibblock>
<!--  %**** cogsci˙template.bbl Line 150 **** --><ERROR class="undefined">\APACinsertmetastar</ERROR>chang2016compositional<ERROR class="undefined">{APACrefauthors}</ERROR>Chang, M<ERROR class="undefined">\BPBI</ERROR>B., Ullman, T., Torralba, A.<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Tenenbaum, J<ERROR class="undefined">\BPBI</ERROR>B. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>2016.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>A compositional object-based approach to learning
physical dynamics A compositional object-based approach to learning
physical dynamics.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\APACjournalVolNumPages</ERROR>arXiv preprint arXiv:1612.00341.
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="clevert2015fast" xml:id="bib.bib7">
        <tags>
          <tag role="number">7</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>2015</tag>
          <tag role="authors">Clevert <ERROR class="undefined">\BOthers</ERROR>.</tag>
          <tag role="fullauthors">Clevert, Unterthiner<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Hochreiter</tag>
          <tag role="refnum">Clevert <ERROR class="undefined">\BOthers</ERROR>. (<ERROR class="undefined">\APACyear</ERROR>2015)</tag>
          <tag role="key">clevert2015fast</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>clevert2015fast<ERROR class="undefined">{APACrefauthors}</ERROR>Clevert, D<ERROR class="undefined">\BHBI</ERROR>A., Unterthiner, T.<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Hochreiter, S. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>2015.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>Fast and accurate deep network learning by exponential
linear units (elus) Fast and accurate deep network learning by exponential
linear units (elus).<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\APACjournalVolNumPages</ERROR>arXiv preprint arXiv:1511.07289.
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="griffiths2008bayesian" xml:id="bib.bib8">
        <tags>
          <tag role="number">8</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>2008</tag>
          <tag role="authors">Griffiths <ERROR class="undefined">\BOthers</ERROR>.</tag>
          <tag role="fullauthors">Griffiths, Kemp<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Tenenbaum</tag>
          <tag role="refnum">Griffiths <ERROR class="undefined">\BOthers</ERROR>. (<ERROR class="undefined">\APACyear</ERROR>2008)</tag>
          <tag role="key">griffiths2008bayesian</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>griffiths2008bayesian<ERROR class="undefined">{APACrefauthors}</ERROR>Griffiths, T<ERROR class="undefined">\BPBI</ERROR>L., Kemp, C.<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Tenenbaum, J<ERROR class="undefined">\BPBI</ERROR>B. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>2008.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>Bayesian models of cognition Bayesian models of
cognition.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="halberda2008developmental" xml:id="bib.bib9">
        <tags>
          <tag role="number">9</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>2008</tag>
          <tag role="authors">Halberda <ERROR class="undefined">\BBA</ERROR> Feigenson</tag>
          <tag role="fullauthors">Halberda <ERROR class="undefined">\BBA</ERROR> Feigenson</tag>
          <tag role="refnum">Halberda <ERROR class="undefined">\BBA</ERROR> Feigenson (<ERROR class="undefined">\APACyear</ERROR>2008)</tag>
          <tag role="key">halberda2008developmental</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>halberda2008developmental<ERROR class="undefined">{APACrefauthors}</ERROR>Halberda, J.<ERROR class="undefined">\BCBT</ERROR> <ERROR class="undefined">\BBA</ERROR> Feigenson, L. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>2008.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>Developmental change in the acuity of the” Number
Sense”: The Approximate Number System in 3-, 4-, 5-, and 6-year-olds and
adults. Developmental change in the acuity of the” number sense”: The
approximate number system in 3-, 4-, 5-, and 6-year-olds and adults.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\APACjournalVolNumPages</ERROR>Developmental psychology4451457.
<!--  %**** cogsci˙template.bbl Line 250 **** --><ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="hinton1993keeping" xml:id="bib.bib10">
        <tags>
          <tag role="number">10</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>1993</tag>
          <tag role="authors">Hinton <ERROR class="undefined">\BBA</ERROR> Van Camp</tag>
          <tag role="fullauthors">Hinton <ERROR class="undefined">\BBA</ERROR> Van Camp</tag>
          <tag role="refnum">Hinton <ERROR class="undefined">\BBA</ERROR> Van Camp (<ERROR class="undefined">\APACyear</ERROR>1993)</tag>
          <tag role="key">hinton1993keeping</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>hinton1993keeping<ERROR class="undefined">{APACrefauthors}</ERROR>Hinton, G<ERROR class="undefined">\BPBI</ERROR>E.<ERROR class="undefined">\BCBT</ERROR> <ERROR class="undefined">\BBA</ERROR> Van Camp, D. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>1993.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>Keeping the neural networks simple by minimizing the
description length of the weights Keeping the neural networks simple by
minimizing the description length of the weights.<ERROR class="undefined">\BBCQ</ERROR>
<!--  %**** cogsci˙template.bbl Line 275 **** --></bibblock>
        <bibblock><ERROR class="undefined">\BIn</ERROR> <ERROR class="undefined">\APACrefbtitle</ERROR>Proceedings of the sixth annual conference on
Computational learning theory Proceedings of the sixth annual conference on
computational learning theory (<ERROR class="undefined">\BPGS</ERROR> 5–13).
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="kingma2014adam" xml:id="bib.bib11">
        <tags>
          <tag role="number">11</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>2014</tag>
          <tag role="authors">Kingma <ERROR class="undefined">\BBA</ERROR> Ba</tag>
          <tag role="fullauthors">Kingma <ERROR class="undefined">\BBA</ERROR> Ba</tag>
          <tag role="refnum">Kingma <ERROR class="undefined">\BBA</ERROR> Ba (<ERROR class="undefined">\APACyear</ERROR>2014)</tag>
          <tag role="key">kingma2014adam</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>kingma2014adam<ERROR class="undefined">{APACrefauthors}</ERROR>Kingma, D<ERROR class="undefined">\BPBI</ERROR>P.<ERROR class="undefined">\BCBT</ERROR> <ERROR class="undefined">\BBA</ERROR> Ba, J. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>2014.
<!--  %**** cogsci˙template.bbl Line 300 **** --></bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>Adam: A method for stochastic optimization Adam: A
method for stochastic optimization.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\APACjournalVolNumPages</ERROR>arXiv preprint arXiv:1412.6980.
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="kingma2015variational" xml:id="bib.bib12">
        <tags>
          <tag role="number">12</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>2015</tag>
          <tag role="authors">Kingma <ERROR class="undefined">\BOthers</ERROR>.</tag>
          <tag role="fullauthors">Kingma, Salimans<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Welling</tag>
          <tag role="refnum">Kingma <ERROR class="undefined">\BOthers</ERROR>. (<ERROR class="undefined">\APACyear</ERROR>2015)</tag>
          <tag role="key">kingma2015variational</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>kingma2015variational<ERROR class="undefined">{APACrefauthors}</ERROR>Kingma, D<ERROR class="undefined">\BPBI</ERROR>P., Salimans, T.<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Welling, M. <!--  %**** cogsci˙template.bbl Line 325 **** --></bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>2015.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>Variational dropout and the local reparameterization
trick Variational dropout and the local reparameterization trick.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\BIn</ERROR> <ERROR class="undefined">\APACrefbtitle</ERROR>Advances in Neural Information Processing Systems
Advances in neural information processing systems (<ERROR class="undefined">\BPGS</ERROR> 2575–2583).
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="lake2017building" xml:id="bib.bib13">
        <tags>
          <tag role="number">13</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>2017</tag>
          <tag role="authors">Lake <ERROR class="undefined">\BOthers</ERROR>.</tag>
          <tag role="fullauthors">Lake, Ullman, Tenenbaum<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Gershman</tag>
          <tag role="refnum">Lake <ERROR class="undefined">\BOthers</ERROR>. (<ERROR class="undefined">\APACyear</ERROR>2017)</tag>
          <tag role="key">lake2017building</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>lake2017building<ERROR class="undefined">{APACrefauthors}</ERROR>Lake, B<ERROR class="undefined">\BPBI</ERROR>M.<!--  %**** cogsci˙template.bbl Line 350 **** -->, Ullman, T<ERROR class="undefined">\BPBI</ERROR>D., Tenenbaum, J<ERROR class="undefined">\BPBI</ERROR>B.<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Gershman, S<ERROR class="undefined">\BPBI</ERROR>J. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>2017.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>Building machines that learn and think like people
Building machines that learn and think like people.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\APACjournalVolNumPages</ERROR>Behavioral and Brain Sciences40.
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="lerer2016learning" xml:id="bib.bib14">
        <tags>
          <tag role="number">14</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>2016</tag>
          <tag role="authors">Lerer <ERROR class="undefined">\BOthers</ERROR>.</tag>
          <tag role="fullauthors">Lerer, Gross<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Fergus</tag>
          <tag role="refnum">Lerer <ERROR class="undefined">\BOthers</ERROR>. (<ERROR class="undefined">\APACyear</ERROR>2016)</tag>
          <tag role="key">lerer2016learning</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR><!--  %**** cogsci˙template.bbl Line 375 **** -->lerer2016learning<ERROR class="undefined">{APACrefauthors}</ERROR>Lerer, A., Gross, S.<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Fergus, R. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>2016.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>Learning physical intuition of block towers by example
Learning physical intuition of block towers by example.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\APACjournalVolNumPages</ERROR>arXiv preprint arXiv:1603.01312.
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="louizos2017bayesian" xml:id="bib.bib15">
        <tags>
          <tag role="number">15</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>2017</tag>
          <tag role="authors">Louizos <ERROR class="undefined">\BOthers</ERROR>.</tag>
          <tag role="fullauthors">Louizos, Ullrich<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Welling</tag>
          <tag role="refnum">Louizos <ERROR class="undefined">\BOthers</ERROR>. (<ERROR class="undefined">\APACyear</ERROR>2017)</tag>
          <tag role="key">louizos2017bayesian</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>louizos2017bayesian<ERROR class="undefined">{APACrefauthors}</ERROR>Louizos, C., Ullrich, K.<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Welling, M. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>2017.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>Bayesian compression for deep learning Bayesian
compression for deep learning.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\BIn</ERROR> <ERROR class="undefined">\APACrefbtitle</ERROR>Advances in Neural Information Processing Systems
Advances in neural information processing systems (<ERROR class="undefined">\BPGS</ERROR> 3288–3298).
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="mitchell1988bayesian" xml:id="bib.bib16">
        <tags>
          <tag role="number">16</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>1988</tag>
          <tag role="authors">Mitchell <ERROR class="undefined">\BBA</ERROR> Beauchamp</tag>
          <tag role="fullauthors">Mitchell <ERROR class="undefined">\BBA</ERROR> Beauchamp</tag>
          <tag role="refnum">Mitchell <ERROR class="undefined">\BBA</ERROR> Beauchamp (<ERROR class="undefined">\APACyear</ERROR>1988)</tag>
          <tag role="key">mitchell1988bayesian</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>mitchell1988bayesian<ERROR class="undefined">{APACrefauthors}</ERROR>Mitchell, T<ERROR class="undefined">\BPBI</ERROR>J.<ERROR class="undefined">\BCBT</ERROR> <ERROR class="undefined">\BBA</ERROR> Beauchamp, J<ERROR class="undefined">\BPBI</ERROR>J. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>1988.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>Bayesian variable selection in linear regression
Bayesian variable selection in linear regression.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\APACjournalVolNumPages</ERROR>Journal of the American Statistical
Association834041023–1032.
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="DBLP:journals/corr/abs-1804-01128" xml:id="bib.bib17">
        <tags>
          <tag role="number">17</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>2018</tag>
          <tag role="authors">Piloto <ERROR class="undefined">\BOthers</ERROR>.</tag>
          <tag role="fullauthors">Piloto <ERROR class="undefined">\BOthers</ERROR>.</tag>
          <tag role="refnum">Piloto <ERROR class="undefined">\BOthers</ERROR>. (<ERROR class="undefined">\APACyear</ERROR>2018)</tag>
          <tag role="key">DBLP:journals/corr/abs-1804-01128</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>DBLP:journals/corr/abs-1804-01128<ERROR class="undefined">{APACrefauthors}</ERROR>Piloto, L., Weinstein, A., TB, D., Ahuja, A., Mirza, M., Wayne, G.<ERROR class="undefined">\BDBL</ERROR>Botvinick, M<ERROR class="undefined">\BPBI</ERROR>M. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>2018.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>Probing Physics Knowledge Using Tools from Developmental
Psychology Probing physics knowledge using tools from developmental
psychology.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
<!--  %**** cogsci˙template.bbl Line 475 **** --></bibblock>
      </bibitem>
      <bibitem key="santoro2017simple" xml:id="bib.bib18">
        <tags>
          <tag role="number">18</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>2017</tag>
          <tag role="authors">Santoro <ERROR class="undefined">\BOthers</ERROR>.</tag>
          <tag role="fullauthors">Santoro <ERROR class="undefined">\BOthers</ERROR>.</tag>
          <tag role="refnum">Santoro <ERROR class="undefined">\BOthers</ERROR>. (<ERROR class="undefined">\APACyear</ERROR>2017)</tag>
          <tag role="key">santoro2017simple</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>santoro2017simple<ERROR class="undefined">{APACrefauthors}</ERROR>Santoro, A., Raposo, D., Barrett, D<ERROR class="undefined">\BPBI</ERROR>G., Malinowski, M., Pascanu, R., Battaglia, P.<ERROR class="undefined">\BCBL</ERROR> <ERROR class="undefined">\BBA</ERROR> Lillicrap, T. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>2017.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>A simple neural network module for relational reasoning
<!--  %**** cogsci˙template.bbl Line 500 **** -->A simple neural network module for relational reasoning.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\BIn</ERROR> <ERROR class="undefined">\APACrefbtitle</ERROR>Advances in neural information processing systems
Advances in neural information processing systems (<ERROR class="undefined">\BPGS</ERROR> 4967–4976).
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="siegler1998developmental" xml:id="bib.bib19">
        <tags>
          <tag role="number">19</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>1998</tag>
          <tag role="authors">Siegler <ERROR class="undefined">\BBA</ERROR> Chen</tag>
          <tag role="fullauthors">Siegler <ERROR class="undefined">\BBA</ERROR> Chen</tag>
          <tag role="refnum">Siegler <ERROR class="undefined">\BBA</ERROR> Chen (<ERROR class="undefined">\APACyear</ERROR>1998)</tag>
          <tag role="key">siegler1998developmental</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>siegler1998developmental<ERROR class="undefined">{APACrefauthors}</ERROR>Siegler, R<ERROR class="undefined">\BPBI</ERROR>S.<ERROR class="undefined">\BCBT</ERROR> <ERROR class="undefined">\BBA</ERROR> Chen, Z. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>1998.
<!--  %**** cogsci˙template.bbl Line 525 **** --></bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>Developmental differences in rule learning: A
microgenetic analysis Developmental differences in rule learning: A
microgenetic analysis.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\APACjournalVolNumPages</ERROR>Cognitive psychology363273–310.
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="10.3389/fpsyg.2017.02124" xml:id="bib.bib20">
        <tags>
          <tag role="number">20</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>2017</tag>
          <tag role="authors">Smith <ERROR class="undefined">\BBA</ERROR> Slone</tag>
          <tag role="fullauthors">Smith <ERROR class="undefined">\BBA</ERROR> Slone</tag>
          <tag role="refnum">Smith <ERROR class="undefined">\BBA</ERROR> Slone (<ERROR class="undefined">\APACyear</ERROR>2017)</tag>
          <tag role="key">10.3389/fpsyg.2017.02124</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>10.3389/fpsyg.2017.02124<ERROR class="undefined">{APACrefauthors}</ERROR>Smith, L<ERROR class="undefined">\BPBI</ERROR>B.<ERROR class="undefined">\BCBT</ERROR> <ERROR class="undefined">\BBA</ERROR> Slone, L<ERROR class="undefined">\BPBI</ERROR>K. <!--  %**** cogsci˙template.bbl Line 550 **** --></bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>2017.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>A Developmental Approach to Machine Learning? A
developmental approach to machine learning?<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\APACjournalVolNumPages</ERROR>Frontiers in Psychology82124.
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
      <bibitem key="spelke2007core" xml:id="bib.bib21">
        <tags>
          <tag role="number">21</tag>
          <tag role="year"><ERROR class="undefined">\APACyear</ERROR>2007</tag>
          <tag role="authors">Spelke <ERROR class="undefined">\BBA</ERROR> Kinzler</tag>
          <tag role="fullauthors">Spelke <ERROR class="undefined">\BBA</ERROR> Kinzler</tag>
          <tag role="refnum">Spelke <ERROR class="undefined">\BBA</ERROR> Kinzler (<ERROR class="undefined">\APACyear</ERROR>2007)</tag>
          <tag role="key">spelke2007core</tag>
        </tags>
        <bibblock>
<ERROR class="undefined">\APACinsertmetastar</ERROR>spelke2007core<ERROR class="undefined">{APACrefauthors}</ERROR>Spelke, E<ERROR class="undefined">\BPBI</ERROR>S.<ERROR class="undefined">\BCBT</ERROR> <ERROR class="undefined">\BBA</ERROR> Kinzler, K<ERROR class="undefined">\BPBI</ERROR>D. </bibblock>
        <bibblock><ERROR class="undefined">\APACrefYearMonthDay</ERROR>2007.
</bibblock>
        <bibblock><ERROR class="undefined">\BBOQ</ERROR><ERROR class="undefined">\APACrefatitle</ERROR>Core knowledge Core knowledge.<ERROR class="undefined">\BBCQ</ERROR>
</bibblock>
        <bibblock><ERROR class="undefined">\APACjournalVolNumPages</ERROR>Developmental science10189–96.
<ERROR class="undefined">\PrintBackRefs</ERROR><ERROR class="undefined">\CurrentBib</ERROR>
</bibblock>
      </bibitem>
    </biblist>
  </bibliography>
</document>
