Research Reports

Parallel Individuation Supports Numerical Comparisons in Preschoolers

Pierina Cheung^*^a ^b, Mathieu Le Corre^c

[a] Department of Psychology, University of Waterloo, Waterloo, Canada. [b] National Institute of Education, Nanyang Technological University, Singapore. [c] Centro de Investigación en Ciencias Cognitivas, Universidad Autónoma del Estado de Morelos, Cuernavaca, Morelos, México.

Journal of Numerical Cognition, 2018, Vol. 4(2), 380–409, https://doi.org/10.5964/jnc.v4i2.110

Received: 2016-12-12. Accepted: 2017-09-20. Published (VoR): 2018-09-07.

*Corresponding author at: National Institute of Education, Nanyang Technological University, 1 Nanyang Walk, Singapore, 637616. E-mail: cheung.pierina@gmail.com

This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

While the approximate number system (ANS) has been shown to represent relations between numerosities starting in infancy, little is known about whether parallel individuation – a system dedicated to representing objects in small collections – can also be used to represent numerical relations between collections. To test this, we asked preschoolers between the ages of 2 ½ and 4 ½ to compare two arrays of figures that either included exclusively small numerosities (< 4) or exclusively large numerosities (> 4). The ratios of the comparisons were the same in both small and large numerosity conditions. Experiment 1 used a between-subject design, with different groups of preschoolers comparing small and large numerosities, and found that small numerosities are easier to compare than large ones. Experiment 2 replicated this finding with a wider range of set sizes. Experiment 3 further replicated the small-large difference in a within-subject design. We also report tentative evidence that non- and 1-knowers perform better on comparing small numerosities than large numerosities. These results suggest that preschoolers can use parallel individuation to compare numerosities, possibly prior to the onset of number word learning, and thus support previous proposals that there are numerical operations defined over parallel individuation (e.g., Feigenson & Carey, 2003; https://doi.org/10.1111/1467-7687.00313).

Keywords: parallel individuation, analog magnitudes, numerical comparison, small, large, preschoolers

According to structuralism, an influential theory of the nature of mathematical objects, the natural numbers are nothing but a system of relations (e.g., Shapiro, 2000). Therefore, accounts of our knowledge of the natural numbers and of its developmental origins cannot be limited to explaining how we represent individual numbers and how we acquire these representations. They must also explain how we represent relations between them and how these representations develop. Randy Gallistel (1989) put the point quite forcefully:

“If the only sense in which the brain represents number is that there is a sensory/perceptual mapping from numerosity to brain states (the activities of detectors for specific numerosities), which make possible simple numerical discriminations, then the brain’s representation of number is a representation in name only. Only if the brain brings combinatorial processes to bear on the neural entities that represent numerosities may we say that the brain represents number in an interesting sense of the term representation.” (p. 159).

One of the possible developmental roots of our representations of relations between numbers is the Approximate Number System (ANS). The ANS represents number approximately – i.e., it does not discriminate all pairs of numerosities with equal precision (Dehaene, 1997; Gallistel & Gelman, 2000; Meck & Church, 1983). Rather, the discrimination precision of the ANS is a function of the ratio of numerosities (e.g., Coubart, Izard, Spelke, Marie, & Streri, 2014; Lipton & Spelke, 2004; Moyer & Landauer, 1967; Xu & Arriaga, 2007; Xu & Spelke, 2000; Xu, Spelke, & Goddard, 2005). For example, 9-month-olds can discriminate numerosities that differ by a 2:3 ratio (e.g., 6 vs. 9), but not a 3:4 ratio (e.g., 12 vs. 16) (Xu, 2003; Xu & Spelke, 2000; Xu et al., 2005). Starting in infancy, the ANS also represents relations between numerosities such as relative numerosity (Brannon, 2002; Suanda, Tompson, & Brannon, 2008; see Barth et al., 2005, 2006 for evidence from preschoolers), addition and subtraction (McCrink & Wynn, 2004; see also Barth et al., 2005, 2006), and proportions of discrete quantities (McCrink & Wynn, 2007). Although this is controversial, some studies suggest that later numerical reasoning about numerical relations is at least partially rooted in the ANS, even when the reasoning involves manipulating mathematical symbols (e.g., Halberda, Mazzocco, & Feigenson, 2008; Libertus, Feigenson, & Halberda, 2011; Starr, Libertus, & Brannon, 2013a; but see Gilmore et al., 2013; Szűcs, Nobes, Devine, Gabriel, & Gebuis, 2013; for reviews, see De Smedt, Noël, Gilmore, & Ansari, 2013).

However, multiple studies have shown that, in various contexts, infants and adults often do not use the ANS to represent the numerosity of collections when they include 4 or fewer objects. Instead, they use a distinct system whose capacity is limited to collections of up to 3 or 4 objects (Choo & Franconeri, 2014; Feigenson & Carey, 2003, 2005; Feigenson, Carey, & Hauser, 2002; Feigenson, Dehaene, & Spelke, 2004; Lipton & Spelke, 2004; Revkin, Piazza, Izard, Cohen, & Dehaene, 2008; Trick & Pylyshyn, 1994; Xu, 2003; see also Van Herwegen, Ansari, Xu, & Karmiloff-Smith, 2008; Xu, Spelke, & Goddard, 2005). Most agree that this system represents individual objects and that the limit on its capacity comes from the number of objects it can represent in parallel. It is thus often referred to as “parallel individuation.” There is also growing agreement that this system develops early in infancy (e.g., Coubart et al., 2014; Feigenson, Carey, & Hauser, 2002; Hyde & Spelke, 2011).

Some have proposed that, in addition to representing objects, parallel individuation supports numerical operations (Carey, 2009; Feigenson & Carey, 2003, 2005; Le Corre & Carey, 2007, 2008). On this view, representations of objects created by the parallel individuation system can enter into computations of one-to-one correspondence from infancy on (Carey, 2009; Feigenson & Carey, 2003, 2005; see also Uller, Carey, Huntley-Fenner, & Klatt, 1999). For example, to explain how infants can keep track of up to three objects hidden in an opaque box, Feigenson and Carey (2003) suggested that infants represent each hidden object with a unique symbol that is held in working memory, where each symbol functions as a mental tally mark of sorts. Every time they retrieve an object, they match it to an active tally mark in working memory. When each tally mark has been matched to an object, infants stop reaching. Second, infants can use parallel individuation to represent more than one collection at a time. That is, under some conditions, representations of objects can be grouped into up to two or three “chunks” of up to three objects (Moher, Tuerk, & Feigenson, 2012; Zosh, Halberda, & Feigenson, 2011; see Frank & Barner, 2012 for evidence of object-chunking in school-age children).

If parallel individuation does indeed support numerical operations, one might expect that the system could be used to compare distinct collections of objects on the basis of their numerosity. Some studies have shown that infants can use parallel individuation to compare collections. However, these studies found that infants’ comparisons were based on physical attributes of the collections (e.g., their total physical size) rather than on their numerosity (Clearfield & Mix, 1999, 2001; Feigenson, Carey, & Hauser, 2002; Feigenson, Carey, & Spelke, 2002; Xu, et al., 2005). For example, in a quantity choice task, 10- to 12-month-olds were shown small collections of crackers put into two buckets. Feigenson, Carey, and Hauser (2002) found that infants chose the bucket with the larger number of crackers only when there were no more than three in each bucket; when one or both of the buckets contained more than three (e.g., 1 vs. 4, 2 vs. 4, 3 vs. 6), infants chose both buckets equally frequently. The upper limit on infants’ performance suggests that parallel individuation was recruited. Nevertheless, it was also found that for comparisons that involved less than four crackers, infants reliably chose the bucket with more cracker stuff rather than more pieces of crackers (e.g., they chose one big cracker with a larger surface area over two pieces of cracker with a smaller combined surface area). Studies requiring that infants retrieve hidden objects from a box suggest that infants can compare a set against another held in working memory (e.g., Feigenson & Carey, 2003), but they do not test whether infants can perform comparisons on two physically distinct collections under parallel individuation because they never require infants to do this. Thus, these studies have not shown that parallel individuation can be used to determine which of two collections contains more elements on the basis of number.

To test whether parallel individuation can support numerical comparisons, a study must include numerosities that are small enough to be compared with parallel individuation. But this is not enough, since, in principle, participants can also use the ANS to compare small numerosities (Cordes et al., 2001; Cordes & Brannon, 2009; Starr, Libertus, & Brannon, 2013b). Therefore, the study must also include a way to determine which of the two systems is used to compare the small numerosities. One way to do so is to include pairs of numerosities that can be compared with the ANS but not with parallel individuation, and whose ratio is the same as the ratio of the pairs of small numerosities. Evidence that, despite the fact that all comparisons have the same ratio, performance on comparisons of collections that can be represented with parallel individuation (henceforth, “small numerosities”) is significantly different from performance on comparisons of numerosities that can only be represented with the ANS (henceforth, “large numerosities”) would suggest that parallel individuation can support numerical comparisons.

Pairs of collections that straddle the boundary between parallel individuation and the ANS must be avoided because the pattern of performance to be expected in such situations is not well known. Unfortunately, the boundary between the two systems is somewhat unclear. Some studies suggest that parallel individuation can hold up to 4 objects, but many others suggest that it cannot hold more than 3 (e.g., Feigenson & Carey, 2005; Feigenson, Carey, & Hauser, 2002). The bulk of the evidence that suggests it can hold up to 4 objects comes from studies of human adults or rhesus monkeys (e.g., Hauser, Carey, & Hauser, 2000; Luck & Vogel, 1997; Pylyshyn & Storm, 1988); only one out of many infant studies suggests that it can hold up to 4 objects (Ross-Sheehy, Oakes, & Luck, 2003). Moreover, a study of non-verbal numerical comparisons in human adults has shown that they use parallel individuation when collections are comprised of 3 or fewer objects, and that they rely on the ANS when they are comprised of 4 or more objects (Choo & Franconeri, 2014). Therefore, we suggest that comparisons should not include collections of 4 objects because the way such collections are represented is unclear. In other words, small numerosities should be limited to collections of up to 3 objects and large numerosities should consist of collections of at least 5 objects.

Previous studies with preschoolers have shown that children can compare two collections on the basis of number, but these studies do not meet the conditions outlined above (Abreu-Mendoza, Soto-Alba, & Arias-Trejo, 2013; Odic, Libertus, Feigenson, & Halberda, 2013; Odic, Pietroski, Hunter, Lidz, & Halberda, 2013; Rousselle & Noël, 2008; Rousselle, Palmers, & Noël, 2004; Wagner & Johnson, 2011). Some showed clear evidence of comparisons based on numerosity but the pairs of small collections also included collections of more than 3 objects (Abreu-Mendoza, Soto-Alba, & Arias-Trejo, 2013; Cantrell, Kuwubara, & Smith, 2015; Odic, Libertus, et al., 2013; Odic, Pietroski, et al., 2013; Rousselle, Palmers, & Noël, 2004; Wagner & Johnson, 2011). Other studies did include a condition where none of the pairs of small collections contained more than 3 objects but did not include a condition where the two collections in each pair contained at least 5 objects (Barner & Snedeker, 2005, 2006; Brannon & Van de Walle, 2001; Feigenson, 2005; Feigenson, Carey, & Hauser, 2002; Feigenson, Carey, & Spelke, 2002). It is thus not clear whether preschoolers were recruiting the ANS or parallel individuation to compare small collections on the basis of their numerosity in these studies.

To our knowledge, only two studies met all of the demands laid out above. One of these studies tested adults (Choo & Franconeri, 2014). While it provided evidence that parallel individuation can be used to compare numerosities (i.e., at equal ratios, adults were faster at comparing small numerosities than large numerosities), it did not provide evidence that is relevant to the development of this capacity. Cantlon, Safford, and Brannon (2010) tested preschoolers. However, their results are difficult to interpret. Their study tested comparisons with a numerical matching task – i.e., participants were shown a target collection and then had to match it to one of two other collections on the basis of numerosity. The task included only one pair of comparisons with the right design features – namely, a pair comprised of comparisons of small collections and of large collections at the same ratio – 1 vs. 2 and 6 vs. 12. They found that children were better at finding the numerical match for 1 object (out of 1 and 2 objects) than at finding the match for 6 or 12 objects (out of 6 and 12). However, they also found that children were not significantly better at finding a match for 2 (out of 1 and 2) than at finding a match for 6 or 12 (out of 6 and 12). Therefore, their results are inconclusive.

The present study aims to contribute to research on the development of parallel individuation as a system for numerical reasoning by applying a numerical comparison task with the right design features to preschoolers, namely two- to four-year-olds. Since, other than one study with inconclusive results (Cantlon et al., 2010), no previous study of numerical comparison in preschoolers had all of the right design features, our study provides the clearest test thus far of whether preschoolers can use parallel individuation to compare numerosities. Moreover, our study included children who were young enough to be at the earliest stages of number word learning, namely children who had not learned the meaning of any of the number words beyond “one.” Thus, it also provides evidence that bears on whether the development of the capacity to use parallel individuation to compare numerosities depends on number word learning.

Experiment 1

Method

Participants

A total of 99 2½ - to 4½-year-olds participated in this study. Fifty of them were tested on comparisons of small collections only (average age of 3 years 7 months; range: 2 years 7 months – 4 years 7 months; 22 males), and 49 were tested on comparisons of large collections only (average age of 3 years 7 months; range: 2 years 6 months – 4 years 7 months; 30 males). All of the children were recruited in Southwestern Ontario, Canada, and were predominantly monolingual speakers of English. An additional two children were excluded for always choosing the same side.

Design and Procedure

Children were tested on a numerical comparison task and on Give-N, a standard assessment of number word knowledge (Wynn, 1990). The Give-N task was always administered at the end of the testing session.

Numerical comparison

The numerical comparison task always started with an experimenter introducing two puppets to the children – a frog and a duck. Then, children were shown a picture of a rectangle and asked to name it, e.g., “Do you know what this is?” If children failed to provide a label, the experimenter suggested one (e.g., block, rectangle), and encouraged children to repeat it. After familiarization with the stimuli, the experimenter showed children two pictures of blocks, placed one in front of each of the two puppets, and said, “Froggie has some [blocks], duckie has some [blocks], who has more [blocks]?” This instruction was repeated for each trial.

The numerical comparison task had two between-subject conditions that differed on the range of numerosities tested: small (< 4) and large (> 4). In each condition, collections that differed by a 1:3 and 2:3 ratio were shown. Children in the small numerosity condition were asked to compare 1 vs. 3 (4 trials) and 2 vs. 3 (4 trials). Children in the large numerosity condition were asked to compare 6 vs. 18 (4 trials) and 6 vs. 9 (4 trials). All collections consisted of red rectangles printed on letter-sized paper. The rectangles in each individual collection were all of the same physical size. However, the relation between the physical size of individual rectangles, their total perimeter and area, and numerosity varied across trials (see Figures 1a, 1b, 1c and 1d). As in previous studies (e.g., Halberda & Feigenson, 2008; Rousselle & Noël, 2008), on half of the trials, the two collections had the same cumulative perimeter and surface area, so that the physical size of the individual rectangles in the collections conflicted with numerosity (i.e., the rectangles in the numerically smaller collection were physically larger). We refer to these trials as “size-number incongruent.” (Figures 1b and 1d). On the other half of the trials, the cumulative perimeter and surface area of the collections was confounded with numerosity (i.e., the collection with a greater number of objects also had a larger cumulative perimeter and a larger surface area; Figures 1a and 1c). We refer to these trials as “size-number congruent.”

In the small numerosity condition, for size-number congruent trials, the cumulative surface area of the collections ranged from 1.5 cm² to 12 cm², and their cumulative perimeter ranged from 5.32 cm to 30 cm. For size-number incongruent trials, the cumulative surface area was always 6 cm², and cumulative perimeter was 18 cm. In the large numerosity condition, for size-number congruent trials, the cumulative surface area of the collections ranged from 18 cm² to 72 cm², and their cumulative perimeter ranged from 31.92 cm to 144 cm. For size-number incongruent trials, cumulative surface area ranged from 18 cm² to 36 cm², and cumulative perimeter ranged from 54 cm to 108 cm.

Pairs of collections of rectangles were presented in one of two item orders. The orderings of correct side (left or right), ratio (1:3 or 2:3), numerosity pair (small condition: 2 vs. 3, 1 vs. 3; large condition: 6 vs. 9, 6 vs. 18), and trial type (size-number congruent or size-number incongruent) were randomized across orders. No two consecutive trials were of the same type or pair of numerosity. No feedback was given.

Click to enlarge

Figure 1a

A size-number congruent trial in the small numerosity condition in Experiment 1.

Click to enlarge

Figure 1b

A size-number incongruent trial in the small numerosity condition in Experiment 1.

Click to enlarge

Figure 1c

A size-number congruent trial in the large numerosity condition in Experiment 1.

Click to enlarge

Figure 1d

A size-number incongruent trial in the large numerosity condition in Experiment 1.

Give-N

The purpose of this task was to assess children’s number word knowledge. Children were first introduced to a puppet, a tub of 10 fish, and a plate. They were then told that the puppet wanted to eat some fish, and the experimenter asked the child to give the puppet N fish (e.g., “Can you put one fish on the plate?”). After the child gave the puppet some fish, the experimenter asked whether it is N; if the child said ‘no’, s/he was asked to fix it and if the child said ‘yes’, the experimenter moved onto the next trial. Children were asked to give 1 and then 3 fish on the first two trials. If children succeeded on both, they were asked to give 5 fish. If children failed to correctly give 1 for “one” or 3 for “three”, the experimenter asked for two fish. At this point, if children succeeded in response to a request for N, the next request was N+1; if they incorrectly responded to the request for N, the next request was for N-1. The highest numeral requested was “six”.

Children were called ‘N-knowers’ (e.g., ‘1-knowers’) if they correctly gave N fish two out of three times when asked for N, but failed to give the correct number two out of three times for N+1. Children who failed to give one fish when asked for “one” were classified as ‘non-knowers’. Children who only knew a subset of the number words – i.e., ‘1-knowers’, ‘2-knowers’, ‘3-knowers’, and ‘4-knowers’ – were called ‘subset-knowers’. Children who gave the correct number of fish for all numerals asked for (up to six) were called ‘Cardinal Principle-knowers’, ‘CP-knowers’ for short. In the analyses throughout this paper, we divided children into three knower-level groups: non- and 1-knowers vs. 2- and 3-knowers vs. CP-knowers.

Results

Across all three experiments, whenever multiple comparisons were performed, the alpha level was adjusted using the Holm-Bonferroni method.

Give-N

The number of children and the mean age in each knower-level group are presented in Table 1. Age was significantly correlated with children’s knower-level, Pearson’s r = .51, p < .001.

Table 1

Age and Number of Children in Each Knower-Level Group in Experiment 1

Knower-Level	Small			Large
Knower-Level	n	Mean Age	Range	n	Mean Age	Range
Non-knowers	8	3;1	2;9 - 3;8	7	3;5	3;2 - 3;10
1-knowers	12	3;5	2;7 - 4;0	10	3;0	2;6 - 3;7
2-knowers	10	3;5	2;8 - 3;11	12	3;9	3;2 - 4;7
3-knowers	12	3;10	3;1 - 4;7	11	3;8	2;11 - 4;2
CP-knowers	8	3;11	3;2 - 4;7	9	3;11	2;11 - 4;7

Numerical Comparisons

Our first analysis found no effect of order or gender (all ts < -1.26, ps > .21), so these variables were not included in subsequent analyses. A logistic mixed effects model was used to analyze the effect of our independent variables on proportion correct responses on the numerical comparison task. We began with a maximal model that included random slopes of Congruence and Ratio by subjects but this model did not converge. We then sequentially removed random effects by Congruence and Ratio and none of the models converged; thus, in our main model, we only included random intercept by subject.

We included 2-way interaction terms involving Number Range (Number Range x Congruence, Number Range x Ratio, Number Range x Knower-Level, and Number Range x Age) because we were primarily interested in the effects of that factor. In particular, the model included Number Range (small vs. large), Size-Number Congruence (congruent vs. incongruent), Ratio (1:3 vs. 2:3), Knower-Level (non- and 1-knowers vs. 2- and 3-knowers vs. CP-knowers; CP-knowers as the reference category) and centered Age as fixed effects, with by-subject random intercept. This model did not increase the fit over a model with main effects only, χ²(5) = 4.82, p = .44.ⁱ A main-effects-only model revealed a main effect of Number Range, β = -.87, SE = .29, z = -3.01, p = .0026. Children were better at comparing small numerosities (M = .79, SD = .24) than large numerosities (M = .68, SD = .26). There was a main effect of Size-Number Congruence, β = -.99, SE = .20, z = -5.02, p < .001, with better performance on congruent (M = .81, SD = .27) than on incongruent trials (M = .67, SD = .32). We also found an effect of Age, β = 1.03 SE = .36, z = 2.89, p = .0039, with older children performing better overall. Finally, we found a main effect of Knower-Level, with CP-knowers (M = .93, SD = .16) performing better than both non- and 1-knowers (M = .58, SD = .25; β = -2.26, SE = .57, z = -3.96, p < .001) and 2- and 3-knowers (M = .79, SD = .22; β = -1.36, SE = .53, z = -2.57, p = .010). Non- and 1-knowers also differed significantly from 2- and 3-knowers (t(80) = -4.08, p < .001). No other effects were significant. Figure 2 displays children’s performance on small and large comparisons by knower-levels and by size-number congruence.

Click to enlarge

Figure 2

Overall proportion correct on small and large numerosity comparisons on congruent trials (top panel) and incongruent trials (bottom panel), separated by knower-levels in Experiment 1. Violin plots are used to depict the distribution and probability density of the data. Error bars represent standard errors of the mean.

Finally, we asked whether the ability to use parallel individuation to compare numerosities is available even at the earliest stages of number word learning. To test this, we analyzed whether children with minimal number word knowledge – non-knowers and 1-knowers – performed significantly above chance on size-number incongruent trials for small numerosities, and whether their performance for small numerosities was better than that for large numerosities. We combined 2 vs. 3 and 1 vs. 3 comparisons and found that non-knowers and 1-knowers as a group performed significantly above chance on small comparisons, M = .63, SD = .28, t(19) = 2.03, p = .028 (1-tailed).ⁱⁱ We also found that they performed significantly better on comparisons of small numerosities (1 vs. 3 and 2 vs. 3) than of large numerosities (6 vs. 18 and 6 vs. 9; M = .41, SD = .28; t(35) = 2.34, p = .025 (2-tailed), d = .77, 95% CI [0.08, 1.46]).ⁱⁱⁱ These results suggest that parallel individuation can support numerical comparisons even in children with little or no knowledge of number word meanings. They also suggest that children’s performance on the numerical comparisons task cannot be explained by counting, because these children have not acquired the cardinal principle, and thus cannot have used counting to solve the task (Le Corre, Van de Walle, Brannon, & Carey, 2006; Sarnecka & Carey, 2008; Wynn, 1990, 1992).

Discussion

Two- to four-year-old children were tested on a non-verbal numerical comparison task in one of two conditions: comparing collections of three or fewer objects or collections of six or more objects. The ratios of the numerosities in both the small and large numerosity comparisons were the same. Despite that, children did not perform equally well on all comparisons. Rather, they were more accurate on small than on large comparisons. This difference also held in children at the earliest stages of number word learning – i.e., non-knowers and 1-knowers. These children performed significantly above chance when they compared small numerosities but not when they compared large ones. Given that the number range effect was observed in children at the earliest stages of number word learning and that these children performed above chance on comparisons of small numerosities even when area and numerosity were not congruent, our results also suggest that the development of the ability to use parallel individuation to compare numerosities does not depend on number word learning.

Two aspects of preschoolers’ performance on large numerosity comparisons warrant discussion. First, children at the earliest stages of number word learning (i.e., non- and 1-knowers), and to some extent, 2- and 3-knowers, performed poorly on comparisons of large numerosities (see Figure 2). This seems surprising given that by 9 months, infants can discriminate between large collections that differ by a 2:3 ratio (e.g., Xu & Spelke, 2000). We speculate that the discrepancy between infants and two- to four-year-olds is that the habituation paradigm used with infants cues the fact that object size and summed area are not relevant to the task to a greater extent than our numerical comparison task. In the habituation experiments (e.g., Xu & Spelke, 2000), number stays constant in all habituation arrays but the size of individual objects and their total area vary from array to array. This may signal to infants that numerosity is more relevant to the task than individual object size or total area such that by the time the test arrays are presented, infants can discriminate the habituation arrays from the test arrays on the basis of numerosity. In contrast, aside from the count noun in the question (“Who has more blocks?”), no feature of our design cues children to ignore the size of the objects in the collections on size-number incongruent trials. Thus, although the acuity of children’s ANS is sufficiently high to compare numerosities at the ratios we presented, they fail to do so because nothing helps them overcome the interference between object size and numerosity. Negen and Sarnecka (2015) provide evidence that is consistent with this explanation. They presented subset-knowers with numerical comparisons where numerosity conflicted with total area and individual object size. Like us, they found that subset-knowers performed at chance on these comparisons, despite the fact that the ratios of the comparisons were well within the capacity of young infants. However, they also found that after training subset-knowers to focus on numerosity instead of total area and individual object size, they were able to choose the numerically larger collection instead of the collection with larger individual objects. These findings thus show that their initial failure was due to the interference between object size and numerosity, and that they can overcome this interference when cues (in this case, explicit feedback) are provided.

Second, none of the effects involving ratio were significant. Since this suggests that children were not more accurate on the large comparisons with a large ratio (6 vs. 18) than on the large comparisons with a smaller ratio (6 vs. 9), it may seem to pose problems for our suggestion that children used the ANS to compare large numerosities. That is, one may be concerned that our task was not appropriately designed to test our question of interest because it may not have engaged the ANS. We believe that this concern is unwarranted for two reasons. First, three other studies of non-verbal numerical comparisons in preschoolers with designs similar to ours also failed to find differences of accuracy between ratios similar to ours – i.e., 1:2 and 2:3 (Abreu-Mendoza, Soto-Alba, & Arias-Trejo, 2013; Halberda & Feigenson, 2008; Rousselle & Noël, 2008). Importantly, two of these studies provided positive evidence that their task engaged the ANS – i.e., they found that children were better on comparisons with 1:2 and 2:3 ratios than on comparisons with harder-to-discriminate ratios (Abreu-Mendoza et al., 2013; Halberda & Feigenson, 2008). Moreover, children’s average accuracy on the large numerosity comparisons in the present study was similar to that which has been reported in previous studies for children of the same age – i.e., near 70% for comparisons with ratios between 2:3 and 1:2 (Halberda & Feigenson, 2008; Rousselle & Noël, 2008). Therefore, our failure to find a difference between performance on comparisons of 6 vs. 9 (a 2:3 ratio) and of 6 vs. 18 (a 1:3 ratio) does not mean that our task did not engage the ANS. Rather, it is consistent with, perhaps even predicted by, what we know about how ratio affects children’s performance when they use the ANS to compare numerosities.

There is no other plausible explanation of children’s performance on large comparisons in our study. The only other strategies that are available to adults are counting, and breaking up the large collections into smaller collections of 2 to 4 elements and adding these up. Neither of these strategies can explain why we find the same pattern of results when we restrict our analyses to subset-knowers. Subset-knowers do not know the cardinal principle. Therefore, they cannot have compared the collections by counting them. Moreover, it is highly unlikely that children who do not know the cardinal principle nonetheless know the sums required to determine the numerosity of collections of 6, 9 or 18 elements by breaking them up into small collections of 2 to 4 and then summing these. Therefore, it is unreasonable to assume that subset-knowers used this strategy. Thus, we believe that, short of postulating new representational systems for which no evidence has been provided, the only explanation left is that children compared the large numerosities with the ANS. Nevertheless, in Experiment 3, we directly address the concern of a lack of ratio effect by including a harder-to-discriminate ratio.

Another possible concern is that children controlled how long the collections were presented to them, so that, consequently, they could have used counting instead of parallel individuation and/or the ANS to compare numerosities. Since small collections are easier to count than large ones, this could explain why children were more accurate on small comparisons. However, the fact that children who did not know the cardinal principle – i.e., non-knowers and 1-knowers – showed this pattern of results makes this very unlikely. Nevertheless, in Experiment 3, we address this concern by asking children to compare the numerosities of collections that are presented too quickly to be counted.

Finally, it is possible that the better performance on comparisons of small numerosities was driven by better performance on the 1 vs. 3 comparisons only. By physical necessity, whenever one of the choices in a comparison is one object, the other choice is always the right answer. Thus, comparisons where one of the choices is a single object may be easier than all other comparisons. That is, one may predict that the effect of Number Range is specific to comparisons with a 1:3 ratio. Nevertheless, the lack of an interaction between Number Range and Ratio has ruled out this possibility.

There remain two alternatives that cannot be ruled out directly by the results of Experiment 1. First, it may be that children performed better on comparisons of small numerosities because the correct answer in this condition was always the same – i.e., 3 – whereas the correct answer in the large numerosity condition alternated between 9 and 18. Second, it could be that children were, in fact, relying on the ANS for all comparisons but that they performed more poorly on large comparisons because these comparisons make greater demands on non-numerical aspects of processing. For example, larger collections necessarily require children to divide their attention over more objects than small collections, and this could cause a decrement in performance. Although possible in theory, this “processing demands” alternative is unlikely to be the right explanation. On this alternative, the difficulty of comparisons should increase continuously as a function of the absolute size of numerosities. Contrary to this prediction, previous research provides strong evidence that when infants (Wood & Spelke, 2005) or adults (Barth, Kanwisher, & Spelke, 2003) use the ANS to compare numerosities, their level of performance for a given ratio of numerosities is the same regardless of the absolute size of the numerosities. Experiment 2 was explicitly designed to test both of these alternatives directly.

Experiment 2

Like Experiment 1, Experiment 2 included one group of children who compared small numerosities only and another group who compared large numerosities only. Unlike Experiment 1, Experiment 2 only included size-number incongruent trials. This feature was changed to increase the number of trials that test whether children use distinct systems to compare collections on the basis of numerosity.

Experiment 2 had two main goals. First, we sought to replicate the effect of Number Range observed in Experiment 1. Second, we aimed to address the alternative explanations raised in Experiment 1. To address the processing demands alternative the large comparisons included a wider range of pairs of numerosities of the same ratio. To ensure that the number of trials was reasonable for young preschoolers, we only included comparisons with a ratio of 2:3, namely, 6 vs. 9, 10 vs. 15 and 12 vs. 18. Evidence that children’s accuracy decreases as numerosity increases would support this alternative. On the other hand, evidence that (1) children perform better when they compare small numerosities than when they compare large numerosities at the same ratio, and that (2) they perform equally well on all comparisons of large numerosities would provide strong evidence against the processing demands alternative.

We also tested whether the children in Experiment 1 who compared small numerosities were more accurate than those who compared large ones because the correct answer was always the same numerosity in the former condition but not in the latter. To test this, the small numerosity condition of Experiment 2 included comparisons of 1 vs. 2 and 3 vs. 4 in addition to comparisons of 2 vs. 3. Evidence that children were nonetheless more accurate on 2 vs. 3 than on the large comparisons would show that this alternative is false.