Empirical Research

Enhancing Cognitive Flexibility Through a Training Based on Multiple Categorization: Developing Proportional Reasoning in Primary School

Calliste Scheibling-Sève*¹, Katarina Gvozdic¹, Elena Pasquinelli², Emmanuel Sander¹

[1] IDEA Lab, Faculty of Psychology and Science Education, University of Geneva, Geneva, Switzerland. [2] La main à la pâte Foundation, Paris, France.

Journal of Numerical Cognition, 2022, Vol. 8(3), 443–472, https://doi.org/10.5964/jnc.7661

Received: 2021-10-15. Accepted: 2022-05-02. Published (VoR): 2022-11-16.

Handling Editor: Lieven Verschaffel, University of Leuven, Leuven, Belgium

*Corresponding author at: bd du Pont-d’Arve 40, 1205 Geneva, Switzerland. E-mail: calliste.scheibling.seve@gmail.com

This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Proportional reasoning is a key topic both at school and in everyday life. However, students are often misled by their preconceptions regarding proportions. Our hypothesis is that these limitations can be mitigated by working on alternative ways of categorizing situations that enable more adequate inferences. Multiple categorization triggers flexibility, which enables reinterpreting a problem statement and adopting a more relevant point of view. The present study aims to show the improvements in proportional reasoning after an intervention focusing on such multiple categorization. Twenty-eight 4th and 5th grade classes participated in the study during one school year. Schools were classified by the SES of their neighborhood. The experimental group received 12 math lessons focusing on flexibly envisioning a situation involving proportional reasoning from different points of view. At the end of the school year, compared to a control group, the experimental group had better results on the posttest when solving proportion word problems and proposed more diverse solving strategies. The analyses also show that the performance gap linked to the school’s SES classification was reduced. This offers promising perspectives regarding multiple categorization as a path to overcome preconceptions and develop cognitive flexibility at school.

Keywords: mathematical flexibility, multiple categorization, proportional reasoning, evidence-based education, preconceptions

Flexibility in Problem Solving and Multiple Categorization

Thinking flexibly and finding solutions in a flexible manner is a highly prominent goal of teaching, and mathematics education is no exception (Baroody, 2003; Verschaffel et al., 2009). Cognitive flexibility refers to the ability to select among multiple representations or among several strategies to adjust to the demands of a situation in an adaptive manner (Cragg & Chevalier, 2012; Diamond, 2013). For example, when presented with the problem “A boy wants to buy a type of chocolate that costs 3 cruzeiros each. He wants to buy 50 chocolates. How much money does he need?”, Brazilian adolescents who were used to street trading and had never attended school failed to find the solution when attempting to perform a repeated addition 50 times (3 + 3 + … + 3) (Schliemann et al., 1998). This strategy of repeated addition can be considered to reflect a lack of flexibility, since a more adaptive strategy would be to add 50 three times (50 + 50 + 50) or even to perform the multiplication 50 × 3. Attempting to solve the problem with a 50-times repeated addition reveals that the participants stuck to their initial strategy and did not adapt to the task demands. However, adapting the strategy to perform a multiplication is constrained by the accessibility of points of view and strategies other than the first ones that come to mind.
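The difference between these strategies can be made concrete with a minimal sketch. The function name `repeated_addition` is hypothetical and not part of the cited study; the code only counts how many terms have to be added under each categorization of the chocolate problem.

```python
def repeated_addition(addend, times):
    """Add `addend` to a running total `times` times.

    Returns the total and the number of terms that were added.
    """
    total = 0
    terms_added = 0
    for _ in range(times):
        total += addend
        terms_added += 1
    return total, terms_added


# Initial categorization: 3 + 3 + ... + 3 (fifty terms)
cost_a, terms_a = repeated_addition(3, 50)

# Recategorized repeated addition: 50 + 50 + 50 (three terms)
cost_b, terms_b = repeated_addition(50, 3)

# Product situation: a single multiplication
cost_c = 50 * 3
```

All three strategies give the same cost, but the first requires fifty terms where the second requires three, which is one way to see why the initial strategy is the least adaptive here.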

Indeed, adaptive expertise in mathematics is akin to finding the solution to a problem in a flexible manner by selecting the most appropriate strategy, and not merely using multiple strategies (Verschaffel et al., 2009). The construct of adaptive expertise has been considered to integrate both conceptual and procedural knowledge (Baroody, 2003; Hatano, 1982). Conceptual knowledge as a necessity for achieving adaptive expertise is in line with insights on conceptual development. Several approaches stipulate that the conceptual system relies on mental categories (Barsalou, 1991; Malt & Johnson, 1992; Tversky & Hemenway, 1983; Vosniadou, 2012). Mental categories “embody much of our knowledge of the world, telling us what things there are and what properties they have” (Murphy, 2002, p. 2). In fact, categorization provides a maximum amount of information with the least cognitive effort (Rosch, 1978). It makes it possible to relate newly encountered situations to previous ones and to attribute the properties associated with a category to a new situation: when a situation is categorized as a member of a specific category, that situation inherits its properties (Hofstadter & Sander, 2013). In this view, assigning an object or a situation to a category provides a certain point of view on that object or that situation by making salient the properties associated with the category. For instance, assigning a tomato to the category of fruits makes salient its botanical properties, whereas assigning it to the category of vegetables makes salient its culinary properties. The most obvious forms of categories concern concrete objects, such as a category for chairs where an object that is considered a chair triggers the inference that one can sit on it (Murphy, 2002).
But categories also concern abstract mental constructs, such as a category for freedom, or proverbs, such as a category for situations that can be labeled as you can’t judge a book by its cover, which triggers the inference that appearances are often misleading. Mathematics is no exception: for example, repeated addition situations are candidate categories for multiplication (Anghileri, 1989; Mulligan & Mitchelmore, 1997). Therefore, when a mathematical problem is categorized as, for instance, a repeated addition situation, a student is able to make inferences about the solving strategy associated with it, such as adding the multiplicand as many times as the multiplier indicates to find the solution (Hofstadter & Sander, 2013; Scheibling-Sève et al., 2020). However, a problem such as “what is the area of a rectangle whose sides are 10 cm and 16 cm” would be outside the scope of the category multiplication as repeated addition.

Some studies contrast deep structures and surface features of a situation as two possible directions for categorization to take place (Chi & VanLehn, 2012; Gentner & Kurtz, 2006). Two situations share the same deep structure if they invoke the same relation, even if they are superficially dissimilar (Gentner & Kurtz, 2006). However, surface features are easily perceived, while deep structure features are hardly perceivable unless relevant knowledge has been acquired (Chi & VanLehn, 2012). Novices, unlike experts, construct their categories primarily based on superficial information (Schoenfeld & Herrmann, 1982). One of the goals of teaching is to make it possible for students to transfer knowledge learned in one situation to another. To do this, they must be able to recognize that two situations with different surface features belong to a common category in terms of their deep structure (Dupuch & Sander, 2007). This means that to go beyond one’s initial point of view, a categorical shift might be crucial (Vosniadou, 2012). In this approach, participants who failed to solve the problem of calculating the total cost of 50 chocolates, as introduced above, did so because they categorized the problem as a member of the multiplication as repeated addition category and failed to perceive the situation as part of the category of product situations, which allows commutativity (Lakoff & Núñez, 2000). In the perspective presented in the current study, adaptively selecting different strategies requires flexibly changing between different mental categories. Recognizing structural features makes this recategorization possible. For example, in the first problem regarding the cost of 50 chocolates, one would need to recategorize the situation from a repeated addition to a product situation.

To help students recognize common structural features among different situations and flexibly switch between different categories, a pedagogical approach can be based on semantic recoding (Gamo et al., 2010; Gvozdic & Sander, 2020; Scheibling-Sève et al., 2017). The principle of semantic recoding is to lead the student to recode the representation of the problem initially based on superficial features into a representation that makes the problem’s deep mathematical structure salient. Indeed, adopting different representations of an object or situation opens possibilities for new inferences, depending on the category that is solicited (Hofstadter & Sander, 2013). A pathway for triggering this kind of cognitive flexibility is to enhance multiple categorization, i.e., a mechanism by which an individual is able to perceive a given entity from different points of view (Scheibling-Sève et al., 2022). For example, first grade students who participated in a semantic recoding intervention practiced solving a subtraction problem such as “There are 11 flowers in the bouquet. Sophie takes out 9 flowers from the bouquet. How many flowers are in the bouquet now?” with strategies compatible with two different points of view (Gvozdic & Sander, 2020). The problem could be solved either by a direct subtraction, which was within the scope of the initial categorization of the problem as a looking for the remainder situation, or by an indirect addition. To use the latter strategy, a student would need to put aside the semantic features of the wording of the problem, which describe a search for the remainder, and identify an underlying structure that makes it possible to also consider it as a part-whole situation. This makes the missing addend strategy accessible, that is, searching for what needs to be added to 9 to reach 11.
Indeed, students who took part in this intervention succeeded better than the control group at solving problems which would require such a recategorization, and used the strategies consistent with this recoded representation to a greater extent. Being able to adopt multiple points of view on the same object or situation therefore makes it easier to move from one category to another and to choose the most relevant viewpoint for the situation and the most adequate solving strategy (Hofstadter & Sander, 2013). Multiple categorization thus constitutes a hallmark of cognitive flexibility.

Nevertheless, achieving cognitive flexibility can be especially difficult when a situation elicits preconceptions. In fact, children conceptualize most notions taught at school based on prior knowledge, also known as preconceptions (e.g., Ausubel, 1968; Carey, 1985). Preconceptions lead to inferences which make them useful for providing explanations and making predictions about future outcomes regarding target notions (Gopnik & Wellman, 1994; Vosniadou, 2017). Another specificity of preconceptions is that they are rarely challenged for falsification (Vosniadou, 2012) and continue to be influential even after instruction (Shtulman & Harrington, 2016). Following previously presented works on categorization, preconceptions as such can be regarded as categories (Babai et al., 2010). Indeed, they can be considered as initial categories which are used as a first categorization at the disposal of students to understand a new situation and make it possible to access specific solving strategies that are associated with the category. Furthermore, preconceptions are characterized by a set of properties that determine the perimeter for their validity (Fischbein, 1989). By assigning the properties of a preconception to a mathematical situation, these preconceptions can at times be useful for learning, but at other times they can lead to erroneous conclusions (Inagaki & Hatano, 2008; Lautrey et al., 2008). Indeed, these initial spontaneously evoked categories are often not aligned with expert categories of the target notion. Failures to solve the problem might result from being stuck on an initial point of view as determined by the preconception. Ultimately, to adaptively use a strategy on problems that are not compatible with a preconception, one would need to recategorize a situation from the preconception to a more expert category.

Thus, multiple categorization can be a lever to overcome preconceptions and adopt a more relevant perspective on the problem. A domain well suited to understanding the constraints imposed by preconceptions, and how the mechanism of multiple categorization plays a role in overcoming them, is proportional reasoning (Carey, 2000; diSessa et al., 2004; Keil, 2011; Vosniadou, 1994).

Proportional Reasoning and Misconceptions

Proportional reasoning has long been considered an activity of high expertise (Piaget & Inhelder, 1951). Its misunderstanding leads to many errors and biases (Mackie & Bruce, 2016; Noelting, 1980; Van Dooren et al., 2005). For example, it can affect decision making that relies on statistical information, such as neglecting base rates, which plays a role in risk taking (Casscells et al., 1978). But as with many other mathematical activities, proportional reasoning has cognitive foundations in the early conceptions of number, space and time. Indeed, elementary numerical cognition is delimited by the approximate sense of number (Dehaene, 1997; Spelke & Kinzler, 2007). It allows children to identify a common invariant relation between two variables, representing a first form of proportional reasoning (McCrink & Spelke, 2010). This identification of multiplicative relations also relies on a specific vocabulary, for instance, mastering “twice as many” or “more than half” (Staples & Truxaw, 2012). Some specific mathematical vocabulary concepts (“double”, “three more”) are even significant predictors of proportional reasoning at the beginning of primary school (Vanluydt et al., 2021).

However, this early proportional reasoning also relies on several preconceptions that are constraining and limit the development of expertise and flexibility in solving proportional problems in school and in daily life. Four main preconceptions, which act as initial categories used for interpreting situations regarding proportionality, can be identified.

Multiplication as Repeated Addition

Proportional reasoning relies on the identification of multiplicative relations (McCrink & Spelke, 2010), and therefore it is influenced by preconceptions regarding multiplication. Indeed, a multiplicative situation which is categorized as part of the well-identified preconception of repeated addition (Fischbein, 1989) will not be in line with the concept of ratio. Bell et al. (1981) found that when students aged 12-15 were asked to solve the problem "If petrol costs £1.2 per gallon, what would be the cost of filling a can containing 0.22 gallons?", the most common operation used to find the answer was division (1.2 ÷ 0.22), instead of the correct one, multiplication (1.2 × 0.22). When the categorization of multiplication as a repeated addition is adopted, multiplying by 0.22 is equivalent to adding 0.22 times, which is hardly meaningful. More generally, problems containing a decimal multiplier smaller than one are more difficult for students to solve (Fischbein et al., 1985): such problems are not compatible with the constraining inferences imposed by the preconception, since the multiplication leads to a result smaller than the initial value. Thus, the preconception of multiplication as repeated addition imposes constraints such as believing that the multiplier must be a whole number and that the result must be greater than the multiplied value. This also hinders the possibility of conceiving multiplication as a commutative operation.
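The petrol problem can be checked with a few lines of arithmetic. The following is only an illustrative sketch with hypothetical variable names; it uses exact decimal arithmetic to show that the correct operation yields a result smaller than the unit price, which is exactly what the repeated addition preconception rules out.

```python
from decimal import Decimal

price_per_gallon = Decimal("1.2")  # pounds per gallon, from the problem statement
volume = Decimal("0.22")           # gallons, a multiplier smaller than one

# Correct operation: multiplication gives the cost of 0.22 gallons
correct_cost = price_per_gallon * volume

# Operation most often chosen by students: division
intuitive_answer = price_per_gallon / volume

# The correct cost is smaller than the unit price, while the
# division yields a value larger than the unit price.
result_is_smaller = correct_cost < price_per_gallon
division_is_larger = intuitive_answer > price_per_gallon
```

Under the "multiplication makes bigger" constraint, only the division produces a result that feels plausible in size, which may be one reason it is chosen so often.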

Division as Sharing

A second preconception associated with proportional reasoning and the concept of ratio concerns division. Indeed, division is not intuitively categorized as the ratio between two quantities, but as sharing, where one quantity is shared into equal parts and one searches for the size of a part (Fischbein, 1989). Fischbein et al. (1985) showed that problems falling outside the scope of the inferences based on this preconception, namely problems in which the divisor is larger than the dividend, such as "15 friends together bought 5 kg of cookies. How much did each one get?", are challenging for 5th graders, since it is difficult for them to use the correct solving strategy ‘5 ÷ 15’. The mental category “dividing is partitioning (or sharing)” is thus restrictive because it precludes viewing division as a measurement (Fischbein et al., 1985). Indeed, such an alternative quotative view of division refers to the ratio between quantities of the same unit and entails fewer constraints than the partitive view. The quotative view only considers that the dividend should be larger than the divisor. Therefore, in proportional problems such as "2 baguettes cost 3€. How much do 8 baguettes cost?", the partitive view would lead one to first calculate the price of one baguette. This strategy, known as identifying the base rate, consists of first calculating the base quantity (here, 1 baguette costs 1.50€) and then multiplying the base quantity by the number of units sought (1.50€ × 8 = 12€). However, the quotative view would make it possible to solve the problem with a different strategy (8 ÷ 2 = 4, I buy 4 times more baguettes, so I will pay 4 times more, 4 × 3 = 12€).
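The two strategies for the baguette problem can be written side by side. This is a minimal sketch; the function names are hypothetical labels for the partitive (base rate) and quotative (ratio between like quantities) views, and exact fractions are used to avoid rounding.

```python
from fractions import Fraction


def base_rate_strategy(known_qty, known_price, target_qty):
    """Partitive view: find the price of one unit, then scale up."""
    unit_price = Fraction(known_price) / known_qty  # 3 / 2 = 1.50 per baguette
    return unit_price * target_qty                  # 1.50 * 8


def ratio_strategy(known_qty, known_price, target_qty):
    """Quotative view: find how many times more units are bought."""
    times_more = Fraction(target_qty, known_qty)    # 8 / 2 = 4 times more
    return times_more * known_price                 # 4 * 3


price_partitive = base_rate_strategy(2, 3, 8)
price_quotative = ratio_strategy(2, 3, 8)
```

Both functions return 12, but the second never computes the price of a single baguette, which is what distinguishes the two points of view.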

Fraction as a Bipartite Structure

Another difficulty in grasping proportional reasoning comes from not categorizing a fraction as a ratio between two quantities. One of the difficulties stems from the fact that students make an analogy between natural numbers and rational numbers and therefore categorize rational numbers as natural numbers and apply the properties of integers to fractions. This phenomenon is called whole number bias (Ni & Zhou, 2005). This can lead to inferences that rational numbers have a single successor (Siegler & Lortie-Forgues, 2015; Vamvakoussi & Vosniadou, 2010), that a larger numerator, denominator, or both, represent a larger fraction (Ni & Zhou, 2005), or that multiplying two fractions necessarily makes the result larger while dividing two fractions necessarily makes it smaller (Siegler & Lortie-Forgues, 2015). A second difficulty amounts to categorizing a fraction as a bipartite, i.e., part-whole, structure (Bonato et al., 2007; DeWolf et al., 2014), related to division (a/b is a division of a by b, with a and b integers and b bigger than a). Thus, the fraction is seen as a division of two numbers and not as a number (Sophian, 2007). A challenge in teaching fractions is therefore to build the mental category of fractions as magnitudes. This bipartite conception entails difficulties in comparing fractions, since students see the task as a comparison between number pairs. For example, in a frequently used paradigm, two fractions are presented to participants, who are asked to judge which fraction is larger. Some pairs are consistent with the whole number bias (6/8 vs. 7/9) and others are inconsistent with that bias (2/9 vs. 1/3). Van Hoof et al. (2013) showed that first and fifth graders' reaction times are longer for inconsistent pairs, thus illustrating the difficulty of perceiving fractions as magnitudes.
Furthermore, empirical findings reveal that when fractions are viewed as magnitudes and not bipartite structures, expert mathematicians use different fraction comparison strategies (Obersteiner et al., 2013). This suggests that recategorizing fractions from a bipartite point of view to a holistic, magnitude based point of view is important to favor the appropriate and flexible use of strategies.
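The distinction between pairs that are consistent and inconsistent with the whole number bias can be illustrated with a small sketch. The two functions below are hypothetical simplifications: one judges a pair component by component, as the bipartite view would suggest, and the other compares the fractions as magnitudes.

```python
from fractions import Fraction


def larger_by_components(a, b):
    """Bipartite heuristic: the fraction with the larger numerator
    and the larger denominator is judged to be the larger one.
    Returns None when the components do not point the same way."""
    (num_a, den_a), (num_b, den_b) = a, b
    if num_a > num_b and den_a > den_b:
        return a
    if num_b > num_a and den_b > den_a:
        return b
    return None


def larger_by_magnitude(a, b):
    """Magnitude view: compare the values of the two fractions."""
    return a if Fraction(*a) > Fraction(*b) else b


# Pair consistent with the bias: both views select 7/9
consistent = ((6, 8), (7, 9))
# Pair inconsistent with the bias: the heuristic selects 2/9,
# whereas 1/3 is the larger magnitude
inconsistent = ((2, 9), (1, 3))
```

For the consistent pair the heuristic and the magnitude comparison agree, while for the inconsistent pair they diverge, which is the conflict the paradigm is designed to expose.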

The Illusion of Linearity

Finally, the last preconception we will develop concerns the linear property of proportions. In fact, as young as 6, students can solve some missing value proportional problems, especially with an informal proportional reasoning that amounts to a repeated addition strategy (Kaput & West, 1994; Sophian & Wood, 1997; Van Den Brink & Streefland, 1979). For example, “For 6 m², I need 0.75 liters of paint. How many liters do I need for 18 m²?” is solved by applying additive reasoning as a principle for linearity: “18 m² is 6 + 6 + 6 m², I need 0.75 + 0.75 + 0.75 liters of paint”.

But even in contexts where the proportional strategy is not valid, students implement the principle of linearity (Van Dooren et al., 2005). Surprisingly, as they progress through school, from second grade to sixth grade, when students solve non-proportional problems, the number of proportional (linear) strategies and answers, which are incorrect, increases. One explanation of this result comes from students’ school experience: at certain points in the mathematics curriculum, significant attention is given to proportionality. The emphasis then is often on performing the procedures correctly, and students apply them consistently. Thus, students categorize a problem describing a situation with one missing value among 4 values as a proportional problem, e.g., “In his toy box, John has dice in several sizes. The smallest one has a side of 10 mm and weighs 800 mg. What would be the weight of the largest die (with a side of 30 mm)?” (Van Dooren et al., 2004). This categorization is based on surface features (3 known values and 1 missing value) and not on the mathematical structure, related to the principles at play in the situation. In this case the situation is non-linear, since the volume of a cube grows with the cube of its side length and not linearly with it.
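The gap between the linear answer and the correct answer to the dice problem can be computed directly. The sketch below uses hypothetical function names and assumes, as the problem implicitly does, that both dice are solid cubes made of the same material, so that weight is proportional to volume.

```python
def linear_prediction(small_side, small_weight, large_side):
    """Illusion of linearity: the weight scales like the side length."""
    return small_weight * (large_side / small_side)


def cubic_prediction(small_side, small_weight, large_side):
    """Assumes uniform density: the weight scales like the volume,
    that is, like the cube of the side length."""
    return small_weight * (large_side / small_side) ** 3


# Smallest die: side 10 mm, weight 800 mg; largest die: side 30 mm
linear_answer = linear_prediction(10, 800, 30)  # 3 times the weight
cubic_answer = cubic_prediction(10, 800, 30)    # 27 times the weight
```

The linear strategy gives 2400 mg, whereas the side being 3 times longer makes the volume 27 times larger, giving 21600 mg.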

A Pedagogical Intervention for Enhancing Flexibility Through Multiple Categorization

In order to attenuate the obstacles imposed by preconceptions in the domain of proportional reasoning, we created a pedagogical intervention based on principles of multiple categorization as a way of increasing flexibility. The intervention program consisted of 12 one-hour in-class math lessons. These lessons were composed of different written arithmetic word problems. Each lesson focused on one key concept of proportional reasoning in relation to the relevant preconceptions (Table 1). By studying how to solve comparison problems, students first worked on additive structures (e.g., three more and three less) and multiplicative structures (e.g., three times more and three times less). Then they learned to distinguish between additive and multiplicative structures (e.g., three more vs. three times more). After these first steps, fractions and proportions were studied. The aim of this intervention was to guide the students towards an awareness of their preconceptions and the construction of mental categories more in line with the academic notion. The latter will be qualified as an expert conception in this study, and strategies in line with expert conceptions will be considered as a reflection of flexibility, since they indicate that a more adequate perspective has been adopted instead of a more intuitive but less relevant one.

Table 1

Program of the Intervention

Session	Topic	Intuitive conception	Expert conception to be built	Objectives
1	From additive to multiplicative language	More and times more are similar	Times more as a ratio	Understand the equivalence between more and less; distinguish between more and times more
2	Multiplicative language	Times more as a repeated addition; times less as a repeated subtraction	Times more / times less as the search for a ratio	Move from the "repeated addition" view (3 + 3 + 3 + 3) to the notion of ratio: 4 × 3; adopt the "times more" and "times less" points of view; understand that multiplication and division correspond to the search for a ratio
3	Conversion problems	Multiplying to get more and dividing to get less	Multiplying and dividing to find a ratio between 2 quantities	Understand that multiplication and division are about finding a ratio
4	Distributivity	Quantizer is not taken into account	Proceed by part (expansion) or grouping (factoring)	Perceive a quantity as parts and perform an expansion, or perceive the whole and perform a factorization
5	Fraction	Fraction as a bipartite structure (parts/whole)	The fraction as a number of something	Understand the fraction as a number; identify the fraction of the whole and the fraction of each part
6	Partitive division and quotitive division	Dividing for sharing	Dividing for measuring	Understand that division addresses not only a situation of sharing but also of quotition
7	Equivalence between division and multiplying by a fraction	Fraction as a bipartite structure: “3/4, it is 3 divided by 4”	Fraction as a multiplication of a fraction and an integer: “3/4 is 3 quarters, is 3 × 1/4”	Understand that multiplying by a fraction is akin to dividing by the inverse fraction
8	Proportion	Proportion as the conservation of difference	The proportion as a ratio to be kept	Understand proportion as a ratio between two quantities
9	Proportion - 3 strategies	Proportion as the conservation of difference	Proportion as the conservation of ratio	Use 3 different strategies (proportion, fraction, times less) to find the same result
10	Proportion - 3 strategies	Proportion as the conservation of difference	Proportion as the conservation of ratio	Use 3 different reasonings (times more, times less, proportion) to solve a missing value proportional problem without computing a base unit rate
11	Proportion - 4 strategies	Proportion as the conservation of difference	Proportion as the conservation of ratio	Use 4 different reasonings (times more, times less, proportion, base unit rate) to solve a missing value proportional problem
12	Final session	Problems isomorphic to those of the previous sessions

In our intervention, the notion of multiple categorization was made explicit to the students through the notion of point of view. For example, in order to learn the reciprocity between multiplication and division, which is crucial in the construction of the concept of ratio, two points of view can be taken on the following situation: “Jena has 15 marbles and Mateo has 5 marbles”. Taking Jena's point of view, labeled “times more”, one can conclude: “Jena has three times more marbles than Mateo”. Conversely, from Mateo's point of view, labeled “times less”, one can conclude: “Mateo has three times less marbles than Jena”. This explicit approach was chosen because explicit methods increase performance compared to implicit methods (see the meta-analysis by Alfieri et al., 2011) and labelling helps students identify a deep structure (Namy & Gentner, 2002).

Furthermore, comparing and contrasting two solution methods by their efficiency can lead to greater gains in flexibility than studying the solution methods one at a time (Rittle-Johnson & Star, 2007). Such comparison also benefits understanding (Hattikudur et al., 2016; Rittle-Johnson & Star, 2007). In addition, in order to promote transfer, it is important to practice the same reasoning across a variety of contents (Bransford et al., 2000; Halpern, 2013; Perkins & Salomon, 1989). In our intervention, in line with these studies, the different strategies associated with the points of view were presented and compared side by side. Students were also prompted to transfer the same points of view to various contexts throughout the 12 lessons.

The Current Study

The current study investigated the impact of a pedagogical intervention based on multiple categorization principles as a way of achieving flexibility. It used proportional reasoning as a tool of intervention and investigation. The general rationale was that since difficulties in understanding proportionality are rooted in preconceptions, categorizing situations in alternative ways should make it possible for students to overcome the constraints induced by preconceptions and to adopt strategies aligned with the expert conception of the mathematical concepts.

We expected students in the experimental group to perform better than students in the active control group. Each group included subgroups created based on grade level and on the school’s SES. First, at pretest, we expected no difference between the groups and between the different subgroups (Prediction 1). At posttest, the experimental group and subgroups were expected to score higher than the control group and subgroups (Prediction 2). For each skill measured in the tests, at pretest, we expected no differences between groups and between subgroups (Prediction 3). At posttest, the experimental group and subgroups should score higher than the control group and subgroups for each subscore regarding the studied notions (Prediction 4).

Method

Participants

Twenty-eight French classes participated in the study. A total of 588 students (53% female, mean age 10.5 years, SD = 0.65 for the experimental group; 48% female, mean age 10.6 years, SD = 0.62 for the active control group) were present at both pre- and posttests (see Table 2 for exact distributions).

The experimental and control classes were paired according to the socio-economic status commonly associated with the context of the participating schools (low SES, middle SES, high SES). In France, most students attend non-priority education public schools. These are schools with a relatively mixed student population (Botton & Miletto, 2018). All the classes of the middle SES group belonged to such public schools. Furthermore, since 2015, priority education schools have been split between Priority Education Networks (REP) and Enhanced Priority Education Networks (REP+). 74.1% of REP+ students are children of working class or unemployed parents (Direction de l’évaluation, de la prospective et de la performance [DEPP], 2016). Only 8% of primary school pupils are enrolled in the REP+ network (DEPP, 2018). All the classes of the low SES group belonged to this REP+ network. Lastly, there are also private schools, which enroll 14.5% of pupils (DEPP, 2017). All the classes of the high SES group came from a selective private Parisian school whose admission is based on an exam and interviews.

Table 2

Number of Students per Group and Subgroup

Subgroups	Experimental Group			Control Group			Total
Subgroups	4th grade	5th grade	Subtotal	4th grade	5th grade	Subtotal	Total
Pretest
Middle SES	36	48	84	29	52	81	165
Low SES	61	42	103	46	66	112	215
High SES	54	56	110	54	52	106	216
Total	151	146	297	129	170	299	596
Posttest
Middle SES	37	48	85	30	55	85	170
Low SES	57	43	100	47	68	115	215
High SES	55	56	111	54	54	108	219
Total	149	147	296	131	177	308	604
Pre- and Posttest
Middle SES	36	48	84	29	52	81	165
Low SES	57	40	97	44	66	110	207
High SES	54	56	110	54	52	106	216
Total	147	144	291	127	170	297	588

The teachers, who participated in experimental and control groups, did so on a voluntary basis. In each of the subgroups, the selection process for teachers was similar. At the beginning of the year, several projects were presented to them, including the current project. The objective was thus to control for the teacher's "motivation" effect (Willingham, 2008): all the teachers included in the control and experimental groups were motivated to invest themselves in an optional subject project, included in their class hours.

Procedure

Pre- and Posttests

The pretest consisted of 17 items for 4th graders and 23 items for 5th graders. The pretest differed between the two grades since, at the beginning of the school year, 4th graders had not yet been taught division, fractions, and proportionality. The posttest was identical for the two grades and consisted of 35 items. The items included in the posttest all required expert conceptions of proportional reasoning to successfully solve the problem. Four items from French national evaluations and 4 items from TIMSS (2015) were integrated. At posttest, 2 items for which the threshold (75% of success) had been met at pretest were removed. The different items of the tests are detailed in the Appendix. They were classified according to the 6 different notions that are studied in these grades:

Distinguishing between additive and multiplicative structures
Solving distributivity problems
Solving multiplicative problems
Decomposing and comparing fractions
Solving fraction problems
Solving proportion problems

The control and experimental groups took the pretest at the end of the first trimester and the posttest during the last month of the school year. The test booklets were composed of a series of problems. Each problem statement was followed by a box for the calculation and a line for the answer statement. To control for order effects, 4 booklets were created. At pre- and posttest, students were informed that the test was part of a scientific study and were instructed about the importance of completing the calculation. Each item had to be solved in a limited time (2 or 3 minutes depending on the item). The timing was determined based on pilot tests, and it was introduced to limit the total duration of the test. Once the time was up, the experimenter informed the students that they should move on to the next exercise without going back to the previous ones. The pretest was administered by the first author. The posttest was divided into two testing sessions to limit the duration of each session for the students. Because of the large number of testing sessions, two additional experimenters were recruited to conduct the sessions in the classrooms. Teachers were present during the administration of the pre- and posttest but did not intervene and did not keep copies of the tests.

The Intervention Program

The control group followed the usual math curriculum. In France, each class has to follow an official mathematics curriculum specified for each grade (Eduscol, 2022a, 2022b). All the studied notions seen by the experimental group were part of the official curriculum. Thus, experimental and control classes studied the same notions.

The experimental group participated in 12 lessons of 1 hour over a 5-month period. The lessons were part of the teaching hours dedicated to math teaching. The lessons in the middle SES group were entirely conducted by the first author in the presence of the teacher. For the other two groups, half of the lessons were conducted by the first author and half by the teachers. Before the beginning of the intervention, teachers from the experimental classes participated in a 2-hour training on preconceptions and multiple categorization, given by the first and last authors. Before each lesson they had to teach, the teachers received a teacher's guide and the necessary material (student worksheets and slides) (Figure 1). The teacher’s guide started with a summary of the general objectives of the lesson (Figure 2). Then the teacher's sheet described step by step the problems and the points which needed particular attention.

Figure 1

Exemplary Student Worksheet of One Problem

Figure 2

Exemplary Summary of a Teacher's Guide Sheet

Scoring

For each problem, the expert strategy – i.e., a strategy that does not rely on preconceptions but requires categorizing the situation from the expert point of view – was defined prior to collecting the data (Appendix). Each expert strategy counted for 1 point. Calculation errors were not taken into account. For items involving more than one question, the answer to each question was given 1 point. Several scores were derived from the coding:

A global score (ranging from 0 to 18 points for 4^th graders and from 0 to 29 points for 5^th graders on the pretest and 40 points on the posttest)
A sub-score associated with each studied notion (see Appendix Tables A.1, A.2, A.3, A.4, A.5, A.6, A.7, A.8, and A.9)

To compare the pretest and posttest which did not contain the same items, a z-score per student at pre- and posttest, relative to the mean and standard deviation of the control group, was calculated (Dillon et al., 2017).
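The standardization described above can be illustrated with a minimal sketch. It assumes the sample standard deviation (n - 1 denominator) and uses made-up raw scores; the function name and the data are hypothetical, and the article does not specify whether standardization was done separately by grade.

```python
from statistics import mean, stdev

def z_scores_relative_to_control(scores, control_scores):
    """Standardize raw scores against the control group's mean and SD."""
    mu = mean(control_scores)
    sigma = stdev(control_scores)  # sample SD (n - 1); an assumption
    return [(s - mu) / sigma for s in scores]

# Hypothetical raw scores, for illustration only (not study data)
control_raw = [4, 6, 8, 10, 12]
experimental_raw = [8, 12, 16]

z_control = z_scores_relative_to_control(control_raw, control_raw)
z_experimental = z_scores_relative_to_control(experimental_raw, control_raw)
```

By construction, the control group then has a mean z-score of 0 and a standard deviation of 1 at each testing time, so an experimental-group z-score reads directly as a distance from the control group in control-group SD units.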

Results

Results at Global Level

The data regarding student performance were not independent, since classrooms, not individual students, were recruited. Along with checking the equivalence of the two groups at pretest, this also required checking the variance explained by the hierarchical organization of the data (class clustering). At pretest, the mean z-score of the experimental group was 0.02 (SD = 0.90) (Figure 3). A t-test comparing the mean scores of the two populations revealed no significant difference between them, t(587.45) = -.17, p > .5. However, a non-significant test result is not sufficient to conclude that there is no difference between two groups. Therefore, we resorted to the Bayesian approach and calculated the Bayes factor with the BayesFactor package in R. According to the classification of Kass and Raftery (1995), the BF01 = 10.79 provides substantial support for the absence of difference between the performance of the two groups at pretest. We therefore consider the assignment of the participants to the experimental and control classes to be quasi-random, which makes it possible to proceed with the inferential analysis.

Figure 3

Boxplots of z-scores at Pre- and Posttest by Experimental Conditions

At posttest, the average z-score of the experimental group was 0.66 (SD = 1.2) (Figure 3). To study whether the improvement from pretest to posttest was significantly influenced by the intervention, a multilevel analysis was applied, since it accounts for the dependency of students nested within classrooms. We first ran models using only the classroom as a random intercept, for performance at both pre- and posttest. These models made it possible to quantify the intraclass correlation coefficient (ICC). At pretest the ICC = .421 and at posttest the ICC = .387, indicating substantial intra-class homogeneity. Hence, 41.9% of the observed variance at pretest and 38.7% of the variance at posttest can be attributed to the effect of classroom clustering.
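As a reminder of how an ICC is read from a random-intercept model, the sketch below computes it as the share of total variance located between classrooms. The variance components are hypothetical, since the components of the intercept-only models are not reported in the article; the function name is also an assumption.

```python
def intraclass_correlation(var_between, var_within):
    """ICC = between-cluster variance / (between-cluster + residual variance)."""
    return var_between / (var_between + var_within)

# Hypothetical variance components, for illustration only
icc_example = intraclass_correlation(var_between=0.40, var_within=0.60)
```

With these illustrative values, 40% of the variance would lie between classrooms, which is the kind of interpretation given to the ICC values in the text.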

To study the interaction between the Time of testing (Pretest vs. Posttest) and Group (experimental vs. control), linear mixed-effects models (Bates et al., 2015) of the z-score performance were then fitted. The null model (M0) included only the participants and classroom as random effects. Starting from the null model, we constructed four new models, adding the Time of testing (M1), its interaction with Group (M2), and subsequently Grade (4^th vs. 5^th) (M3) and SES (Low vs. Middle vs. High) (M4) as fixed effects. We compared the successive models with an ANOVA. As indicated in Table 3, the Akaike Information Criterion (AIC) decreases from M0 to M4, which is consistent with an improvement of the fit at each step of the model construction; therefore, M4 was retained. The results from the M4 model revealed a significant interaction between Time of testing and Group (β = -0.66777, t = -10.361, p < .001), with an effect size for the M4 model of $R_{GLMM(c)}^{2}$ = .75 (Bartoń, 2020).

Table 3

Comparison of the Mixed-Effects Models for Performance on Pre- and Posttest

Models	AIC	χ²	df	p
M0: Performance ~ (1 | Participants) + (1 | Classroom)	3017.5	–	–	–
M1: Performance ~ Time + (1 | Participants) + (1 | Classroom)	2941.5	77.964	1	< .001
M2: Performance ~ Time * Group + (1 | Participants) + (1 | Classroom)	2844.8	100.721	2	< .001
M3: Performance ~ Time * Group + Grade + (1 | Participants) + (1 | Classroom)	2836.8	10.055	1	< .01
M4: Performance ~ Time * Group + Grade + SES + (1 | Participants) + (1 | Classroom)	2794.0	46.771	2	< .001
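The AIC and χ² columns of Table 3 are linked: for nested models fitted by maximum likelihood, AIC = -2 log L + 2k, so the drop in AIC between two successive models should equal the likelihood-ratio χ² minus twice the number of added parameters. The sketch below applies this identity to the values reported in Table 3, with the models labelled M0 to M4 in the order described in the text. It assumes the models were compared using maximum-likelihood fits, which the article does not state explicitly.

```python
# (label, AIC, chi-square vs. previous model, df added), copied from Table 3
rows = [
    ("M0", 3017.5, None, None),
    ("M1", 2941.5, 77.964, 1),
    ("M2", 2844.8, 100.721, 2),
    ("M3", 2836.8, 10.055, 1),
    ("M4", 2794.0, 46.771, 2),
]

def aic_drop_from_lrt(chi_square, df_added):
    """Expected AIC decrease implied by a likelihood-ratio test."""
    return chi_square - 2 * df_added

checks = []
for prev, cur in zip(rows, rows[1:]):
    observed = prev[1] - cur[1]                   # AIC(previous) - AIC(current)
    expected = aic_drop_from_lrt(cur[2], cur[3])  # chi-square - 2 * df
    checks.append((cur[0], observed, expected))
```

Each observed AIC drop agrees with the expected value to within rounding of the reported AICs, which supports reading the five rows as a single sequence of nested models sharing the same random-effects structure.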

To better understand the importance of the fixed factors, we then also constructed a model to investigate only the results of the posttest, which included Group, Grade and SES as fixed factors and classroom as a random factor (Table 4). For each fixed effect, the level of the variable whose effect is estimated, compared to the reference level for that predictor, is indicated in parentheses. The results confirm the highly significant influence of the three factors, in the expected directions: on the posttest, the Experimental group performed better than the Control group, the 5^th graders performed better than the 4^th graders, and students from the Low SES group performed lower than students from the Middle SES group and lower than students from the High SES group.

Table 4

Posttest Model Results

Effect	Name	Estimate	SE	df	t	p
Fixed	Intercept	0.4243	0.1439	21.6586	2.949	.0075
	Group (Experimental)	0.6893	0.1261	22.3942	5.467	< .001
	Grade (5^th)	0.5709	0.1164	40.2340	4.905	< .001
	SES (Low)	-1.2468	0.1499	20.8792	-8.316	< .001
	SES (Middle)	-1.0636	0.1634	20.4878	-6.508	< .001
		Variance	SD	N classes		N observations
Random	Classroom (Interc)	0.07096	0.2664	28		604
	Residual	0.82442	0.9080	28		604
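To make the fixed-effect estimates in Table 4 concrete, the sketch below combines them into a model-implied posttest z-score for a given profile. It assumes that the reference levels are Control, 4^th grade and High SES, as suggested by the levels shown in parentheses in the table, and it ignores the classroom random intercept. The function and dictionary names are hypothetical.

```python
# Fixed-effect estimates copied from Table 4
FIXED = {
    "intercept": 0.4243,           # assumed reference: Control, 4th grade, High SES
    "group_experimental": 0.6893,
    "grade_5th": 0.5709,
    "ses_low": -1.2468,
    "ses_middle": -1.0636,
}

def predicted_posttest_z(experimental, fifth_grade, ses):
    """Model-implied posttest z-score; ses is 'high', 'middle' or 'low'."""
    ses_effect = {
        "high": 0.0,
        "middle": FIXED["ses_middle"],
        "low": FIXED["ses_low"],
    }[ses]
    value = FIXED["intercept"] + ses_effect
    if experimental:
        value += FIXED["group_experimental"]
    if fifth_grade:
        value += FIXED["grade_5th"]
    return value

reference_profile = predicted_posttest_z(False, False, "high")
experimental_5th_low = predicted_posttest_z(True, True, "low")
```

Under these assumptions, a 5^th grader in a Low SES experimental class has a predicted z-score of about 0.44, close to the 0.42 predicted for a 4^th grader in a High SES control class. This is only a reading of the additive model and not an additional result of the study.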

Results by Grades

Furthermore, the experimental conditions were distinguished by grade level (Tables 5 and 6 and Figure 4). Pairwise comparisons with Bonferroni correction for p-values were conducted on the retained M4 model, using the lsmeans function from the lsmeans package in R. At pretest, there was no significant difference between the control and experimental groups in 4^th grade (β = -0.0365, t = -.324, p > .05) or in 5^th grade (β = -0.0365, t = -.324, p > .05). This result confirms the second part of Prediction 1.
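The Bonferroni correction used for these pairwise comparisons can be sketched as follows: each raw p-value is multiplied by the number of comparisons and capped at 1. The p-values below are hypothetical, and the function is an illustration of the general procedure, not the code used by the authors.

```python
def bonferroni_adjust(p_values):
    """Multiply each p-value by the number of comparisons, capping at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]

# Hypothetical raw p-values from four pairwise comparisons
raw_p = [0.004, 0.03, 0.2, 0.6]
adjusted_p = bonferroni_adjust(raw_p)
```

Because of the cap, adjusted p-values of exactly 1.00 can appear in tables of corrected comparisons whenever the raw p-value multiplied by the number of comparisons exceeds 1.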

Table 5

z-Scores at Pre- and Posttest by Experimental Conditions and Grades

Groups	z-Score
	Pretest		Posttest
	M	SD	M	SD
4^th Grade
Experimental group	-.07	.93	.39	1.13
Control group	.03	1.23	-.28	.94
5^th Grade
Experimental group	.11	.86	.94	1.28
Control group	-.02	.79	.21	1.00

Table 6

p-Values of Pairwise Comparisons by Grades at Posttest With Bonferroni Correction

Posttest Group*Grade	Control-4^th Grade	Control-5^th Grade	Experimental-4^th Grade
Control-5^th Grade	< .05	–	–
Experimental-4^th Grade	< .0001	.8311	–
Experimental-5^th Grade	< .0001	< .0001	< .05

Note. p-values in bold are below .05.

Figure 4

Boxplots of z-scores at Pre- and Posttest by Experimental Conditions and by Grades

At posttest, each control subgroup had a lower z-score than the corresponding experimental group (β = -0.6943, t = -6.168, p < .001 for both 4^th and 5^th grade classes). This result confirms the second part of Prediction 2. In addition, while the 4^th grade control group had a significantly lower z-score than the 5^th grade control group, the z-score of the 4^th grade experimental group did not differ significantly from that of the 5^th grade control group.

Results by SES

The performance of the different SES groups was then compared, using pairwise comparisons with Bonferroni correction for p-values based on the retained M4 model (Lenth, 2016).

At pretest, for each SES level, the experimental group had a z-score similar to that of the corresponding control group. This result confirms the last part of Prediction 1. At posttest, each experimental group had a significantly higher z-score than its corresponding control group (Tables 7 and 8 and Figure 5). This result confirms the last part of Prediction 2.

Table 7

z-Scores at Pre- and Posttest by Experimental Conditions and by SES

Groups	z-Score
	Pretest		Posttest
	M	SD	M	SD
High SES
Experimental group	.60	.74	1.45	1.05
Control group	.89	.77	.66	.89
Middle SES
Experimental group	-.11	.83	.32	1.06
Control group	-.32	.72	-.27	.82
Low SES
Experimental group	-.49	.75	.08	1.11
Control group	-.60	.75	-.43	.90

Table 8

p-Values of Pairwise Comparisons by SES With Bonferroni Correction

Group*SES	SES
Group*SES	Control-High	Control-Low	Control-Middle	Experimental-High	Experimental-Low
Pretest
Control-Low SES	< .0001	–	–	–	–
Control-Middle SES	< .0001	1.00	–	–	–
Experimental-High SES	1.00	< .0001	.0003	–	–
Experimental-Low SES	< .0001	1.00	1.00	< .0001	–
Experimental-Middle SES	.0008	1.00	1.00	< .0001	1.00
Posttest
Control-Low SES	< .0001	–	–	–	–
Control-Middle SES	< .0001	1.00	–	–	–
Experimental-High SES	.0001	< .0001	< .0001	–	–
Experimental-Low SES	.1467	.0001	1.00	< .0001	–
Experimental-Middle SES	1.00	.0006	.0001	< .0001	1.00

Note. p-values in bold are below .05.

Figure 5

Boxplots of z-scores at Pre- and Posttest by Experimental Conditions and SES.

Furthermore, the results revealed that the performance gap between the three SES levels among the experimental groups was maintained at posttest. However, the gaps between SES subgroups evolved differently when the control and experimental groups were compared. The middle SES experimental group had a lower z-score than the high SES control group at pretest, but a similar z-score at posttest. In contrast, the middle SES control group maintained lower z-scores than the high SES control group. A similar, significant trend was observed for the low SES control group and the middle SES experimental group. Interestingly, even though the high SES control group had higher performance than the low SES experimental group at pretest, the difference was not significant at posttest.

Results by Sub-Scores

Then, each proportional reasoning sub-score was analyzed at pretest and posttest (Table 9). At pretest, no significant differences were observed through Mann-Whitney-Wilcoxon tests between the two groups for 4 subscores. For the subscore "Decomposing and comparing fractions", the control group was significantly better than the experimental group at pretest. For the subscore "Solving proportion problems", the score on the missing value proportional problems – taken only by the fifth graders – and the score on the proportional graphic situation – taken by all students – were distinguished. While there was no difference between the two groups in the score of the missing value proportional problems, the experimental group did better on the proportional graphic situation. However, given the threshold achieved on the pretest for this item (0.77 for the control group and 0.81 for the experimental group), the item was not kept at posttest.
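For readers unfamiliar with the Mann-Whitney-Wilcoxon test used for the sub-scores, the sketch below shows how the U statistic is obtained from ranks, with midranks for tied values. The data are hypothetical, and the article does not state which of the two U values is reported in Table 9, so both are returned.

```python
def mann_whitney_u(sample_a, sample_b):
    """Return (U_a, U_b) computed from rank sums, using midranks for ties."""
    pooled = sorted([(v, 0) for v in sample_a] + [(v, 1) for v in sample_b])
    ranks = [0.0] * len(pooled)
    i = 0
    while i < len(pooled):
        j = i
        # Extend j to the end of the run of tied values starting at i
        while j + 1 < len(pooled) and pooled[j + 1][0] == pooled[i][0]:
            j += 1
        midrank = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[k] = midrank
        i = j + 1
    rank_sum_a = sum(r for r, (_, group) in zip(ranks, pooled) if group == 0)
    n_a, n_b = len(sample_a), len(sample_b)
    u_a = rank_sum_a - n_a * (n_a + 1) / 2
    u_b = n_a * n_b - u_a
    return u_a, u_b

# Hypothetical sub-scores (proportion of expert strategies), illustration only
control_scores = [0.25, 0.5, 0.5]
experimental_scores = [0.5, 0.75, 1.0]
u_control, u_experimental = mann_whitney_u(control_scores, experimental_scores)
```

U counts, over all pairs of students from the two groups, how often a score from one group exceeds a score from the other (ties count for one half), which makes the test suitable for bounded, non-normal sub-scores such as these.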

Table 9

Mean, Median, and Standard Deviation for Each Subscore of the Studied Notion at Pre- and Posttest by Experimental Condition and Mann-Whitney-Wilcoxon Test Results

Subscores according to studied notions	Control Group			Experimental Group			U	p
Subscores according to studied notions	M	Mdn	SD	M	Mdn	SD	U	p
Posttest
Distinguishing between additive and multiplicative structures	.58	.57	.33	.72	.86	.30	32272	< .001***
Solving distributivity problems	.31	.25	.30	.53	.50	.35	27891	< .001***
Solving multiplicative problems	.54	.50	.31	.63	.67	.30	35596	< .001***
Decomposing and comparing fractions	.54	.48	.27	.57	.22	.27	39626	.053
Solving fraction problems	.10	.00	.15	.23	.50	.22	26568	< .001***
Solving proportion problems	.14	.13	.15	.28	.25	.27	30378	< .001***
Pretest
Distinguishing between additive and multiplicative structures	.48	.43	.31	.50	.57	.30	42722	.42 ns
Solving distributivity problems^a	.23	.00	.29	.26	.00	.28	11654	.29 ns
Solving multiplicative problems	.44	.40	.30	.45	.40	.30	43768	.32 ns
Decomposing and comparing fractions	.60	.67	.29	.51	.67	.35	50750	< .01**
Solving fraction problems	.52	.60	.24	.52	.54	.22	44986	.76 ns
Solving proportion problems
Proportion problem^a	.04	.00	.10	.03	.00	.09	12532	.78 ns
Graphic proportional situation	.77	1.00	.28	.81	1.00	.29	40076	.02*

^aItems taken by 5^th graders only.

*p < .05. **p < .01. ***p < .001.

Therefore, at pretest, there was no difference between the groups on 4 sub-scores of the studied notions; the control group was superior for the skill "Decomposing and comparing fractions", and the experimental group was superior for only one item, "solve a graphical situation of proportionality". These results partially confirm the first part of Prediction 3.

At posttest, the experimental group had a significantly higher mean than the control group for 5 out of 6 sub-scores regarding the studied notions, with a marginally significant trend (p = .053) for the skill “Decomposing and comparing fractions” (Table 9 and Figure 6). Compared to the pretest, the experimental group caught up with and exceeded the control group. These results confirm the first part of Prediction 4.

Figure 6

Subscores of the Studied Notion’s Means at Posttest by Experimental Condition

Each sub-score was also analyzed by grade and SES with Mann-Whitney-Wilcoxon tests. At pretest, the results between the subgroups (by level or type of school) were similar (1242 < U < 12398, .07 < p < .96) except for the skill "Decomposing and comparing fractions". On this task, the high SES control group was better than the high SES experimental group (U = 7034, p < .01) and the low SES control group was better than the low SES experimental group (U = 7590, p < .001). There was no difference between subgroups for the item "solve a graphical situation of proportionality" (3521.5 < U < 11802, .07 < p < .86), unlike in the analysis by experimental condition. These results confirm the last part of Prediction 3, even though a small superiority for some control subgroups can be noticed.

At posttest, for each sub-score, each experimental subgroup obtained better scores than the corresponding control subgroup, except for the comparison of the middle SES groups on the sub-score "Decomposing and comparing fractions". Of the 30 comparisons, 23 were significant (1565.5 < U < 10590, 1.69 × 10^-12 < p < .02), 3 were at the significance threshold (4549 < U < 10828, p = .05), 3 were non-significant (3140 < U < 5212.5, p > .05), and 1 was in favor of the middle SES control group, although not significantly ("Decomposing and comparing fractions", U = 4015, p > .05). These results are in line with the last part of Prediction 4.

Discussion

The present study was conducted to investigate to what extent a pedagogical intervention based on multiple categorization might improve students’ mathematical flexibility. This intervention focused on proportional reasoning, for which a wide set of preconceptions might hinder students from using an appropriate strategy to find the solution. In fact, preconceptions often lead to problems being categorized based on superficial features and preclude the possibility of considering an alternative, more adequate solving strategy that would be consistent with the expert point of view. Therefore, teaching students to analyze the notions related to proportional reasoning from different points of view, each point of view being the hallmark of categorizing the problem in a different manner, was expected to put students in a position to flexibly adopt relevant strategies. Namely, it was expected that the intervention would allow students to adopt strategies that are outside the scope of the intuitive conception, but consistent with an alternative categorization in line with an expert point of view.

Fourth and fifth graders from three different social backgrounds took part in the study. The experimental classes benefited from 12 lessons based on multiple categorization to guide them in overcoming their initial point of view and building an alternative one that they could adaptively refer to when the initial one proves inadequate for finding the solution. The performance of the experimental and active control groups was compared before and after the intervention. The results revealed that the control and experimental groups had homogeneous performance at pretest. At posttest, the experimental group outperformed the control group, and this was consistent across the different grades and the different SES levels of the schools. This suggests that the pedagogical intervention based on multiple categorization had a beneficial influence on students from the experimental group when it came to building a better understanding of proportionality. In the current study these observations were made using written word problems. Yet, multiplicative thinking and proportional reasoning are crucial in real-world situations such as financial contexts or when assessing risk taking (Casscells et al., 1978; Sawatzki et al., 2019). For example, when students use additive strategies in proportional situations that require comparisons, their ability to make informed financial decisions seems to be limited (Hilton et al., 2012; Sawatzki et al., 2019). Further studies could therefore directly include real-life situations and a wider range of tasks (such as students baking from a recipe and adjusting the ingredients to a different proportion) in order to measure their ability to transfer this kind of mathematical knowledge from school to real-life contexts.

Additionally, this research supports the idea that, to develop flexibility in problem solving in school contexts, it helps to have several solving strategies at one's disposal. Indeed, students were encouraged to adopt as many strategies as possible by adopting different points of view. The wide variety of possible solutions was not simply the result of pooling together the different strategies proposed by different students; rather, all students had to propose several strategies. As a result, at posttest, more than one third of the experimental students proposed two strategies to solve distributivity problems. This was three times as many as among students from the control group. Additionally, for missing value proportional problems, no students from the control group succeeded in proposing two strategies, whereas one seventh of the students from the experimental group did. Thus, it seems that experimental students were not restricted to the first point of view induced by the problem and developed more flexible strategies. In addition, one can note that at posttest, the experimental 4^th grade group reached a level similar to that of the 5^th grade control group. Finally, although the gap between the different SES subgroups remained significant across the experimental subgroups, the performance gaps between the experimental and control groups by SES subgroups narrowed.

The process of strategy selection among several strategies has also received much attention in works about conceptual and procedural knowledge in mathematics and their relations. In this literature, flexibility has been underlined as a crucial point in Star’s (2005) reconsideration of procedural knowledge: deep procedural knowledge is introduced and defined as “knowledge of procedures that is associated with comprehension, flexibility, and critical judgment and that is distinct from (but possibly related to) knowledge of concepts.” (p. 408). This association of knowing multiple procedures and choosing the most appropriate one given a problem’s features has also been termed procedural flexibility by other scholars (Kilpatrick et al., 2001). Yet, as mentioned in the introduction, the flexible choice between different strategies is also to be considered in connection with conceptual knowledge. The latter “involves connecting concepts to specific procedures – for example, knowing why certain procedures work for certain problems or knowing the purpose of each step in a procedure” (Crooks & Alibali, 2014, p. 371). The relations between conceptual and procedural knowledge are bidirectional (Rittle-Johnson et al., 2001), and developing either one can contribute to fostering flexibility. Yet some studies that focused on instructional interventions have found that conceptual instruction even leads to greater gains in procedural knowledge than merely focusing on procedural instruction (Rittle-Johnson et al., 2016). The results from our study also contribute to highlighting the importance of conceptual knowledge for fostering procedural knowledge.

Indeed, when a problem can be solved with several strategies, it can be particularly beneficial to work on the conceptual knowledge to which each strategy is attached. This was precisely done in the current study when students were introduced to points of view reflecting the different conceptions. Only after identifying these points of view were mathematical strategies associated with each point of view. This approach is in line with other findings which stress that flexibility cannot simply refer to the smooth transition between several strategies, but that achieving flexibility mobilizes the complex relations between conceptual and procedural knowledge (Baroody, 2003; Prather & Alibali, 2009; Verschaffel et al., 2009). In our view, relying on multiple categorization might contribute to developing conceptual knowledge since it allows one to perceive the common conceptual structure between different problems whose superficial features are different. The intervention and its assessment conducted in this study highlight the usefulness of overcoming some of the limitations of an initial representation, but also provide insight into the benefits for fostering students’ flexibility in strategy use.

Funding

The work carried out by the first author has been financed by a doctoral contract from the Université Paris Lumières.

Acknowledgments

The authors have no additional (i.e., non-financial) support to report.

Competing Interests

The authors have declared that no competing interests exist.

Ethics Statement

The empirical work has been carried out in accordance with the relevant ethical principles and standards of University Paris 8.

References

Alfieri, L., Brooks, P. J., Aldrich, N. J., & Tenenbaum, H. R. (2011). Does discovery-based instruction enhance learning? Journal of Educational Psychology, 103(1), 1-18. https://doi.org/10.1037/a0021017
Anghileri, J. (1989). An investigation of young children’s understanding of multiplication. Educational Studies in Mathematics, 20(4), 367-385. https://doi.org/10.1007/BF00315607
Ausubel, D. P. (1968). Educational psychology: A cognitive view. Holt, Rinehart, & Winston.
Babai, R., Sekal, R., & Stavy, R. (2010). Persistence of the intuitive conception of living things in adolescence. Journal of Science Education and Technology, 19(1), 20-26. https://doi.org/10.1007/s10956-009-9174-2
Baroody, A. J. (2003). The development of adaptive expertise and flexibility: The integration of conceptual and procedural knowledge. In A. J. Baroody & A. Dowker (Eds.), The development of arithmetic concepts and skills: Constructive adaptive expertise (pp. 1–33). Lawrence Erlbaum Associates.
Barsalou, L. W. (1991). Deriving categories to achieve goals. In G. H. Bower (Ed.), The psychology of learning and motivation (Vol. 27, pp. 1–64). Academic Press.
Bartoń, K. (2020). MuMIn: Multi-model inference (R package, Version 1.43.17) [Computer software]. https://CRAN.R-project.org/package=MuMIn
Bates, D., Maechler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1-48. https://doi.org/10.18637/jss.v067.i01
Bell, A., Swan, M., & Taylor, G. (1981). Choice of operation in verbal problems with decimal numbers. Educational Studies in Mathematics, 12(4), 399-420. https://doi.org/10.1007/BF00308139
Bonato, M., Fabbri, S., Umiltà, C., & Zorzi, M. (2007). The mental representation of numerical fractions: Real or integer? Journal of Experimental Psychology: Human Perception and Performance, 33(6), 1410-1419. https://doi.org/10.1037/0096-1523.33.6.1410
Botton, H., & Miletto, V. (2018). Rapport Education et Territoires. CNESCO.
Bransford, J. D., Brown, A. L., & Cocking, R. R. (2000). How people learn (Vol. 11). National Academy Press.
Carey, S. (1985). Conceptual change in childhood. MIT Press.
Carey, S. (2000). The origin of concepts. Journal of Cognition and Development, 1, 37-41. https://doi.org/10.1207/S15327647JCD0101N_3
Casscells, W., Schoenberger, A., & Graboys, T. B. (1978). Interpretation by physicians of clinical laboratory results. The New England Journal of Medicine, 299(18), 999-1001. https://doi.org/10.1056/NEJM197811022991808
Chi, M. T. H., & VanLehn, K. A. (2012). Seeing deep structure from the interactions of surface features. Educational Psychologist, 47(3), 177-188. https://doi.org/10.1080/00461520.2012.695709
Cragg, L., & Chevalier, N. (2012). The processes underlying flexibility in childhood. Quarterly Journal of Experimental Psychology, 65(2), 209-232. https://doi.org/10.1080/17470210903204618
Crooks, N. M., & Alibali, M. W. (2014). Defining and measuring conceptual knowledge in mathematics. Developmental Review, 34(4), 344-377. https://doi.org/10.1016/j.dr.2014.10.001
Dehaene, S. (1997). The number sense. Oxford University Press.
DeWolf, M., Grounds, M. A., Bassok, M., & Holyoak, K. J. (2014). Magnitude comparison with different types of rational numbers. Journal of Experimental Psychology: Human Perception and Performance, 40(1), 71-82. https://doi.org/10.1037/a0032916
Diamond, A. (2013). Executive functions. Annual Review of Psychology, 64, 135-168. https://doi.org/10.1146/annurev-psych-113011-143750
Dillon, M. R., Kannan, H., Dean, J. T., Spelke, E. S., & Duflo, E. (2017). Cognitive science in the field: A preschool intervention durably enhances intuitive but not formal mathematics. Science, 357(6346), 47-55. https://doi.org/10.1126/science.aal4724
Direction de l’évaluation, de la prospective et de la performance. (2016). L’état de l’école. Ministère de l’éducation nationale et de la jeunesse.
Direction de l’évaluation, de la prospective et de la performance. (2017). L’éducation prioritaire: Etats des lieux (Note d’information n°17.25). Ministère de l’éducation nationale et de la jeunesse.
Direction de l’évaluation, de la prospective et de la performance. (2018). L’éducation prioritaire: Etats des lieux (Note d’information n°18.02). Ministère de l’éducation nationale et de la jeunesse. https://cache.media.education.gouv.fr/file/2018/68/4/depp-ni-2018-18-02-l-education-prioritaire-etat-des-lieux_896684.pdf
diSessa, A. A., Gillespie, N. M., & Esterly, J. B. (2004). Coherence versus fragmentation in the development of the concept of force. Cognitive Science, 28(6), 843-900. https://doi.org/10.1207/s15516709cog2806_1
Dupuch, L., & Sander, E. (2007). Apport pour les apprentissages de l’explicitation des relations d’inclusion de classes [The contribution to learning of the explicitness of class inclusion relationships]. L’Année Psychologique, 107(4), 565-596. https://doi.org/10.4074/S0003503307004034
Eduscol. (2022a). Attendus de fin d’année CM1. Ministère de l’éducation nationale et de la jeunesse. https://eduscol.education.fr/document/13990/download
Eduscol. (2022b). Attendus de fin d’année CM2. Ministère de l’éducation nationale et de la jeunesse. https://eduscol.education.fr/document/14002/download
Fischbein, E. (1989). Tacit models and mathematical reasoning. For the Learning of Mathematics, 9(2), 9-14. https://flm-journal.org/Articles/1128E024153DA2C5DE336DFD448BCF.pdf
Fischbein, E., Deri, M., Nello, M. S., & Marino, M. S. (1985). The role of implicit models in solving verbal problems in multiplication and division. Journal for Research in Mathematics Education, 16(1), 3-17. https://doi.org/10.2307/748969
Gamo, S., Sander, E., & Richard, J.-F. (2010). Transfer of strategy use by semantic recoding in arithmetic problem solving. Learning and Instruction, 20(5), 400-410. https://doi.org/10.1016/j.learninstruc.2009.04.001
Gentner, D., & Kurtz, K. J. (2006). Relations, objects, and the composition of analogies. Cognitive Science, 30(4), 609-642. https://doi.org/10.1207/s15516709cog0000_60
Gopnik, A., & Wellman, H. M. (1994). The theory theory. In L. A. Hirschfeld & S. A. Gelman (Eds.), Mapping the mind: Domain specificity in cognition and culture (pp. 257–293). Cambridge University Press.
Gvozdic, K., & Sander, E. (2020). Learning to be an opportunistic word problem solver: Going beyond informal solving strategies. ZDM – Mathematics Education, 52(1), 111-123. https://doi.org/10.1007/s11858-019-01114-z
Halpern, D. F. (2013). Thought and knowledge: An introduction to critical thinking. Psychology Press.
Hatano, G. (1982). Cognitive consequences of practice in culture specific procedural skills. The Quarterly Newsletter of the Laboratory of Comparative Human Cognition, 4, 15-18.
Hattikudur, S., Sidney, P. G., & Alibali, M. W. (2016). Does comparing informal and formal procedures promote mathematics learning? The benefits of bridging depend on attitudes toward mathematics. The Journal of Problem Solving, 9(1), 13-27. https://doi.org/10.7771/1932-6246.1180
Hilton, A., Hilton, G., Dole, S., Goos, M., & O’Brien, M. (2012). Evaluating middle years students’ proportional reasoning. In J. Dindyal, L. P. Chen, & S. F. Ng (Eds.), Mathematics education: Expanding horizons. Proceedings of the 35th annual conference of the Mathematics Education Research Group of Australasia (pp. 330–337). MERGA.
Hofstadter, D., & Sander, E. (2013). Surfaces and essences: Analogy as the fuel and fire of thinking. Basic Books.
Inagaki, K., & Hatano, G. (2008). Conceptual change in naïve biology. In S. Vosniadou (Ed.), International handbook of research on conceptual change (pp. 240–262). Routledge.
Kaput, J. J., & West, M. M. (1994). Missing-value proportional reasoning problems: Factors affecting informal reasoning patterns. In G. Harel & J. Confrey (Eds.), The development of multiplicative reasoning in the learning of mathematics (pp. 235–287). State University of New York Press.
Kass, R. E., & Raftery, A. E. (1995). Bayes factors. Journal of the American Statistical Association, 90(430), 773-795. https://doi.org/10.1080/01621459.1995.10476572
Keil, F. C. (2011). Science starts early. Science, 331(6020), 1022-1023. https://doi.org/10.1126/science.1195221
Kilpatrick, J., Swafford, J., & Findell, B. (Eds.). (2001). Adding it up: Helping children learn mathematics. The National Academies Press. https://doi.org/10.17226/9822
Lakoff, G., & Núñez, R. E. (2000). Where mathematics comes from: How the embodied mind brings mathematics into being. Basic Books.
Lautrey, J., Rémi-Giraud, S., Sander, E., & Tiberghien, A. (2008). Les connaissances naïves [Naïve knowledge]. Armand Colin.
Lenth, R. V. (2016). Least-squares means: The R package lsmeans. Journal of Statistical Software, 69(1), 1-33. https://doi.org/10.18637/jss.v069.i01
Mackie, J. E., & Bruce, C. D. (2016). Increasing nursing students’ understanding and accuracy with medical dose calculations: A collaborative approach. Nurse Education Today, 40, 146-153. https://doi.org/10.1016/j.nedt.2016.02.018
Malt, B. C., & Johnson, E. E. (1992). Do artifact concepts have cores? Journal of Memory and Language, 31(2), 195-217. https://doi.org/10.1016/0749-596X(92)90011-L
McCrink, K., & Spelke, E. S. (2010). Core multiplication in childhood. Cognition, 116(2), 204-216. https://doi.org/10.1016/j.cognition.2010.05.003
Mulligan, J. T., & Mitchelmore, M. C. (1997). Young children’s intuitive models of multiplication and division. Journal for Research in Mathematics Education, 28(3), 309-330. https://doi.org/10.2307/749783
Murphy, G. (2002). The big book of concepts. MIT Press.
Namy, L. L., & Gentner, D. (2002). Making a silk purse out of two sow’s ears: Young children’s use of comparison in category learning. Journal of Experimental Psychology: General, 131(1), 5-15. https://doi.org/10.1037/0096-3445.131.1.5
Ni, Y., & Zhou, Y.-D. (2005). Teaching and learning fraction and rational numbers: The origins and implications of whole number bias. Educational Psychologist, 40(1), 27-52. https://doi.org/10.1207/s15326985ep4001_3
Noelting, G. (1980). The development of proportional reasoning and the ratio concept Part I: Differentiation of stages. Educational Studies in Mathematics, 11(2), 217-253. https://doi.org/10.1007/BF00304357
Obersteiner, A., Van Dooren, W., Van Hoof, J., & Verschaffel, L. (2013). The natural number bias and magnitude representation in fraction comparison by expert mathematicians. Learning and Instruction, 28, 64-72. https://doi.org/10.1016/j.learninstruc.2013.05.003
Perkins, D. N., & Salomon, G. (1989). Are cognitive skills context-bound? Educational Researcher, 18(1), 16-25. https://doi.org/10.3102/0013189X018001016
Piaget, J., & Inhelder, B. (1951). La genèse de l’idée de hasard chez l’enfant [The genesis of the idea of chance in children]. Presses Universitaires de France.
Prather, R. W., & Alibali, M. W. (2009). The development of arithmetic principle knowledge: How do we know what learners know? Developmental Review, 29(4), 221-248. https://doi.org/10.1016/j.dr.2009.09.001
Rittle-Johnson, B., Fyfe, E. R., & Loehr, A. M. (2016). Improving conceptual and procedural knowledge: The impact of instructional content within a mathematics lesson. The British Journal of Educational Psychology, 86(4), 576-591. https://doi.org/10.1111/bjep.12124
Rittle-Johnson, B., Siegler, R. S., & Alibali, M. W. (2001). Developing conceptual understanding and procedural skill in mathematics: An iterative process. Journal of Educational Psychology, 93(2), 346-362. https://doi.org/10.1037/0022-0663.93.2.346
Rittle-Johnson, B., & Star, J. R. (2007). Does comparing solution methods facilitate conceptual and procedural knowledge? An experimental study on learning to solve equations. Journal of Educational Psychology, 99(3), 561-574. https://doi.org/10.1037/0022-0663.99.3.561
Rosch, E. (1978). Principles of categorization. In E. Margolis & S. Laurence (Eds.), Concepts: Core readings (pp. 189–206). MIT Press.
Sawatzki, C., Downton, A., & Cheeseman, J. (2019). Stimulating proportional reasoning through questions of finance and fairness. Mathematics Education Research Journal, 31(4), 465-484. https://doi.org/10.1007/s13394-019-00262-5
Scheibling-Sève, C., Pasquinelli, E., & Sander, E. (2020). Assessing conceptual knowledge through solving arithmetic word problems. Educational Studies in Mathematics, 103(3), 293-311. https://doi.org/10.1007/s10649-020-09938-3
Scheibling-Sève, C., Pasquinelli, E., & Sander, E. (2022). Critical thinking and flexibility. In E. Clément (Ed.), Cognitive flexibility: The cornerstone of learning (pp. 77–112). ISTE Ltd and Wiley. https://doi.org/10.1002/9781119902737.ch4
Scheibling-Sève, C., Sander, E., & Pasquinelli, E. (2017). Developing cognitive flexibility in solving arithmetic word problems. In CogSci 2017: Proceedings of the 39th Annual Meeting of the Cognitive Science Society London, UK (pp. 3076-3081). Computational Foundations of Cognition. https://cogsci.mindmodeling.org/2017/papers/0581/index.html
Schliemann, A. D., Araujo, C., Cassundé, M. A., Macedo, S., & Nicéas, L. (1998). Use of multiplicative commutativity by school children and street sellers. Journal for Research in Mathematics Education, 29(4), 422-435. https://doi.org/10.2307/749859
Schoenfeld, A. H., & Herrmann, D. J. (1982). Problem perception and knowledge structure in expert and novice mathematical problem solvers. Journal of Experimental Psychology: Learning, Memory, and Cognition, 8(5), 484-494. https://doi.org/10.1037/0278-7393.8.5.484
Shtulman, A., & Harrington, K. (2016). Tensions between science and intuition across the lifespan. Topics in Cognitive Science, 8(1), 118-137. https://doi.org/10.1111/tops.12174
Siegler, R. S., & Lortie-Forgues, H. (2015). Conceptual knowledge of fraction arithmetic. Journal of Educational Psychology, 107(3), 909-918. https://doi.org/10.1037/edu0000025
Sophian, C. (2007). Measuring spatial factors in comparative judgments about large numerosities. In D. D. Schmorrow & L. M. Reeves (Eds.), Foundations of augmented cognition (FAC 2007, Lecture Notes in Computer Science, Vol. 4565, pp. 157–165). Springer. https://doi.org/10.1007/978-3-540-73216-7_18
Sophian, C., & Wood, A. (1997). Proportional reasoning in young children: The parts and the whole of it. Journal of Educational Psychology, 89(2), 309-317. https://doi.org/10.1037/0022-0663.89.2.309
Spelke, E. S., & Kinzler, K. D. (2007). Core knowledge. Developmental Science, 10(1), 89-96. https://doi.org/10.1111/j.1467-7687.2007.00569.x
Staples, M. E., & Truxaw, M. P. (2012). An initial framework for the language of higher-order thinking mathematics practices. Mathematics Education Research Journal, 24(3), 257-281. https://doi.org/10.1007/s13394-012-0038-3
Star, J. R. (2005). Reconceptualizing procedural knowledge. Journal for Research in Mathematics Education, 36(5), 404-411.
TIMSS 2015 Assessment. Copyright 2015 International Association for the Evaluation of Educational Achievement (IEA). Publisher: TIMSS & PIRLS International Study Center, Lynch School of Education, Boston College.
Tversky, B., & Hemenway, K. (1983). Categories of environmental scenes. Cognitive Psychology, 15(1), 121-149. https://doi.org/10.1016/0010-0285(83)90006-3
Vamvakoussi, X., & Vosniadou, S. (2010). How many decimals are there between two fractions? Aspects of secondary school students’ understanding of rational numbers and their notation. Cognition and Instruction, 28(2), 181-209. https://doi.org/10.1080/07370001003676603
Van Den Brink, J., & Streefland, L. (1979). Young children (6–8)-ratio and proportion. Educational Studies in Mathematics, 10(4), 403-420. https://doi.org/10.1007/BF00417087
Van Dooren, W., De Bock, D., Hessels, A., Janssens, D., & Verschaffel, L. (2004). Remedying secondary school students’ illusion of linearity: A teaching experiment aiming at conceptual change. Learning and Instruction, 14(5), 485-501. https://doi.org/10.1016/j.learninstruc.2004.06.019
Van Dooren, W., De Bock, D., Hessels, A., Janssens, D., & Verschaffel, L. (2005). Not everything is proportional: Effects of age and problem type on propensities for overgeneralization. Cognition and Instruction, 23(1), 57-86. https://doi.org/10.1207/s1532690xci2301_3
Van Hoof, J., Lijnen, T., Verschaffel, L., & Van Dooren, W. V. (2013). Are secondary school students still hampered by the natural number bias? A reaction time study on fraction comparison tasks. Research in Mathematics Education, 15(2), 154-164. https://doi.org/10.1080/14794802.2013.797747
Vanluydt, E., Supply, A.-S., Verschaffel, L., & Van Dooren, W. (2021). The importance of specific mathematical language for early proportional reasoning. Early Childhood Research Quarterly, 55, 193-200. https://doi.org/10.1016/j.ecresq.2020.12.003
Verschaffel, L., Luwel, K., Torbeyns, J., & Van Dooren, W. (2009). Conceptualizing, investigating, and enhancing adaptive expertise in elementary mathematics education. European Journal of Psychology of Education, 24(3), 335-359. https://doi.org/10.1007/BF03174765
Vosniadou, S. (1994). Capturing and modeling the process of conceptual change. Learning and Instruction, 4(1), 45-69. https://doi.org/10.1016/0959-4752(94)90018-3
Vosniadou, S. (2012). Reframing the classical approach to conceptual change: Preconceptions, misconceptions and synthetic models. In B. Fraser, K. Tobin, & C. McRobbie (Eds.), Second international handbook of science education (Vol. 24, pp. 119–130). Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-9041-7_10
Vosniadou, S. (2017). Initial and scientific understandings and the problem of conceptual change. In T. G. Amin & O. Levrini (Eds.), Converging perspectives on conceptual change (pp. 17-25). Routledge.
Willingham, D. T. (2008). Critical thinking: Why is it so hard to teach? Arts Education Policy Review, 109(4), 21-32. https://doi.org/10.3200/AEPR.109.4.21-32

Appendix

Table A.1

Items in the Subscore “Distinguishing Between Additive and Multiplicative Structures”

Item type	Statement	Expert conception	Expert strategy	Pretest, 4th grade	Pretest, 5th grade	Posttest, 4th and 5th grades	Score
Comparison problem - Times more	There are 4 apples on the table and there are 5 times more oranges. How many oranges are there?	Distinguish more and times more	4 × 5 = 20 oranges	v	x	v	1
Comparison problem - Ratio	Maria's train has 3 wagons and Lucas' train has 15 wagons. Which one has more? How many times more?		15 ÷ 3 = 5 or 5 × 3 = 15. Lucas has 5 times more wagons than Maria.	v	x	v	1
Comparison problem - More	There are 6 cookies and 18 napkins on the table. But there are also apples and candies. There are 3 more apples than cookies. There are 3 times more candies than cookies. 1) How many apples are there?		6 + 3 = 9 apples	v	v	v	1
Comparison problem - Times more	There are 6 cookies and 18 napkins on the table. But there are also apples and candies. There are 3 more apples than cookies. There are 3 times more candies than cookies. 2) How many candies are there?		6 × 3 = 18 candies	v	v	v	1
Comparison problem - Ratio	There are 6 cookies and 18 napkins on the table. But there are also apples and candies. There are 3 more apples than cookies. There are 3 times more candies than cookies. 3) Are there more cookies or napkins? How many times more?		18 ÷ 6 = 3 or 3 × 6 = 18. There are 3 times more napkins than cookies.	v	v	v	1
Comparison problem - Difference	Amin has 11 marbles. Julian has 22 marbles. Julian has 7 more marbles than Leo. 1) How many more marbles does Julian have than Amin?		22 - 11 = 11. Julian has 11 more marbles than Amin.	v	v	v	1
Comparison problem - More with unknown reference set	Amin has 11 marbles. Julian has 22 marbles. Julian has 7 more marbles than Leo. 2) How many marbles does Leo have?	More indicates a difference	22 - 7 = 15 or 15 + 7 = 22. Leo has 15 marbles.	v	v	v	1

Table A.2

Items in the Subscore “Solving Distributivity Problems”

Item type	Statement	Expert conception	Expert strategy	Pretest, 4th grade	Pretest, 5th grade	Posttest, 4th and 5th grades	Score
Distributivity problem - Variable "Distance"	A team of 4 athletes participated in a rally: each athlete ran an 8 km loop, then a 2 km straight line and finally a 3 km loop. How many km did the team run in total?	Understanding multiplicative quantifiers: expansion or factoring	(8 + 2 + 3) × 4 = 52 km	x	v	v	1
Distributivity problem - Variable "Distance"			(4 × 8) + (4 × 2) + (4 × 3) = 52 km	x	v	v	1
Distributivity problem - Variable "Duration"	A school director has been keeping a list of purchases over the past 6 years. Each year, he purchased 2 computers, 4 printers, and 7 screens. How many items has the school director purchased in total?		(2 + 4 + 7) × 6 = 78 items purchased	x		v	1
Distributivity problem - Variable "Duration"			(6 × 2) + (6 × 4) + (6 × 7) = 78 items purchased	x		v	1
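
The two expert strategies listed for each item in Table A.2 (factoring and expansion) can be checked against each other numerically. The following is a minimal sketch using the quantities from the two items above; the function names are hypothetical and are not part of the study materials.

```python
# Illustrative check of the two expert strategies in Table A.2.
# Function names are hypothetical, not taken from the study.

def total_factored(parts, multiplier):
    """Factoring: add the parts first, then multiply once, (a + b + c) x n."""
    return sum(parts) * multiplier


def total_expanded(parts, multiplier):
    """Expansion: multiply each part, then add the products, n x a + n x b + n x c."""
    return sum(multiplier * part for part in parts)


# Rally item: 4 athletes each run 8 km, 2 km and 3 km.
rally = (total_factored([8, 2, 3], 4), total_expanded([8, 2, 3], 4))

# Purchases item: 2 computers, 4 printers and 7 screens per year over 6 years.
purchases = (total_factored([2, 4, 7], 6), total_expanded([2, 4, 7], 6))

print(rally)      # (52, 52)
print(purchases)  # (78, 78)
```

Both strategies return the same total for each item, which is the equivalence the items are designed to probe.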

Table A.3

Items in the Subscore “Solving Multiplicative Problems”

Item type	Statement	Expert conception	Expert strategy	Pretest, 4th grade	Pretest, 5th grade	Posttest, 4th and 5th grades	Score
French evaluation's item: Multiplicative problem	A farmer puts 12 eggs in each box. When he finishes, he counts his boxes and finds 5. How many eggs did he put away?	Product	12 × 5 = 60 eggs	v	v	v	1
Multiplicative problem	A seller puts 6 chocolates in each box. When he finishes, he counts his boxes and sees there are 13 boxes. How many chocolates did he put away?	Product	13 × 6 = 78 chocolates	v	v	v	1
Partitive division problem	There are 8 pieces of cake and there are 4 people. How many pieces of cake will each person get?	Division	8 ÷ 4 = 2 or 2 × 4 = 8. 2 pieces per person	v	v	v	1
Quotitive division problem	We have 90 envelopes. We are making piles of 15 envelopes. How many piles can we make? Other pair of possible values (80, 20)	Quotitive division	90 ÷ 15 = 6 or 6 × 15 = 90. 6 piles	v	v	v	1
Quotitive division problem	With a package of 90 pictures, we are making piles of 6 pictures. How many piles can we make? Other pair of possible values (80, 4)	Quotitive division	90 ÷ 6 = 15 or 15 × 6 = 90. 15 piles	v	v	v	1
Division problem with remainder	90 students must be transported with 40-seat buses. How many buses are needed to transport all the students?	Partitive division with remainder	90 ÷ 40 = 2.25 or 40 × 2 = 80. 3 buses are needed.	x	v	v	1

Table A.4

Items in the Subscore “Decomposing and Comparing Fractions” (1)

Item type	Statement	Expert conception	Expert strategy	Pretest, 4th grade	Pretest, 5th grade	Posttest, 4th and 5th grades	Score
Fraction comparison	I am filling small bags of flour from a large bag of flour. A. The green bag weighs 4/13 of the large bag. B. The blue bag weighs 8/9 of the large bag. C. The red bag weighs 3/7 of the large bag. Which bag is the heaviest? __________ Which bag is the lightest? __________ Another possible triplet: 3/14, 7/8, 2/5	Analysing the ratio (the magnitude of a fraction)	4/13 < 8/9	x	v	v	1
Fraction comparison	Here are three fractions. A. The fraction: 7/8 B. The fraction 3/14. C. The fraction: 2/5. Which fraction is the largest? __________ Which fraction is the smallest? __________ Another possible triplet: 8/9, 4/13, 3/7	Analysing the ratio (the magnitude of a fraction)	3/14 < 7/8	x	v	v	1
TIMSS 2015 - M041298: Geometric figures	[Item image]	Part of a whole composed of equal parts	D: 1/4	v	v	v	1

Table A.5

Items in the Subscore “Decomposing and Comparing Fractions” (2)

Item type	Statement	Expert conception	Expert strategy	Pretest, 4th grade	Pretest, 5th grade	Posttest, 4th and 5th grades	Score
TIMSS 2015 - M041065: Fraction of a geometric figure	TIMSS: Which of these circles has 3/8 of its area colored in? [Item image]	Fraction as a ratio of 2 numbers	C	v	v	x	1
TIMSS 2015 - M041065: Fraction of a geometric figure	Additional question for 5th graders: Which of these circles has 2/3 of its area colored in? __________	Fraction as a ratio of 2 numbers	B	x	v	x	1
French national evaluation item: from drawings to fractions	[Item image]	The fraction of one unit	Each glass contains 1/5 of the contents of the bottle.	x	x	v	1
French national evaluation item: Fraction decomposition	5/4 = 1 + __________	A unit is a fraction such as a/a	5/4 = 1 + 1/4	x	x	v	1
Several numerical representations for the same fraction	[Item image]	A fraction has an infinite number of representations	2/3 ; 1/3 + 1/3 ; 2 × 1/3 ; 4/6 ; 2 ÷ 3	x	x	v	1

Table A.6

Items in the Subscore “Solving Fractional Problems” (1)

Item type	Statement	Expert conception	Expert strategy	Pretest, 4th grade	Pretest, 5th grade	Posttest, 4th and 5th grades	Score
TIMSS 2011 M041299: Fraction addition problem	Tom ate 1/2 of the cake and Jane ate 1/4 of the cake. How much of the cake did they eat altogether?	Fraction as a number	1/2 = 2/4 ; 2/4 + 1/4 = 3/4	x	x	v	1
Fraction decomposition problem	How many quarter hours are there in 1 hour and 15 minutes? Justify your answer.	A unit can be written in the form of a fraction	1 h = 4/4 ; 4/4 + 1/4 = 5/4. There are 5 quarters.	x	x	v	1
Fraction decomposition problem	Tom ate 1/2 of the cake. And Jane ate 1/4 of the cake. Between them, what fraction of the cake did they eat?	A fraction can be written as a number multiplied by a fraction (a × 1/b)	3 × 1/4 = 3/4 or 1/4 + 1/4 + 1/4 = 3/4. There are 3 quarters.	x	x	v	1
Fraction multiplication problem	18 people eat a third of a pizza each. How many pizzas are there? Other pair of values: (12, one third)	Multiplication is a product between two numbers (integers and/or fractions)	18 × 1/3 = 6 or 18/3 = 6. 6 pizzas	x	x	v	1
Fraction division problem - easy version	There are 4 pieces of pizza and there are 8 people. How many slices can each person have?	The result of a division can be a fraction	4/8 = 1/2 piece	v	v	v	1
Fractional division problem - difficult version	18 people want to share 3 cakes. How much of the cake will each of them get? Other pair of values: (12, 3)	The result of a division can be a fraction	3/18 = 1/6 of the cake	x	x	v	1

Table A.7

Items in the Subscore “Solving Fractional Problems” (2)

Item type	Statement	Expert conception	Expert strategy	Pretest, 4th grade	Pretest, 5th grade	Posttest, 4th and 5th grades	Score
Fraction division problem	At snack time, there are 2 cakes: one with nuts and one with grapes. Julia, Ylies and Mylan want to share the cakes. But Julia is allergic to nuts. They all 3 want to eat the same number of slices. How can they do that?	Fraction of each part	Julia will have 2/3 of the cake with grapes.	x	x	v	1
Half of	Circle half of the stars. [Item image]	Fraction of a whole and fraction of each part	2 out of 4 stars are circled or half of each star is circled	v	v	v	1
A quarter of	Circle a quarter of the hats. [Item image]	Fraction of a whole and fraction of each part	1 hat is circled or 1/4 of each hat is circled	v	v	v	1

Table A.8

Items in the Subscore “Solving Proportion Problems” (1)

Item type	Statement	Expert conception	Expert strategy	Pretest, 4th grade	Pretest, 5th grade	Posttest, 4th and 5th grades	Score
Drawing - MCQ	[Item image]	Reasoning about ratio	The least: A; the most: C; same taste: B and D	v	x	x	1
Missing value proportion problem	16 pens cost 48€. The pens are all identical. They weigh 20 grams each. How much do 4 pens cost? Can you suggest a second way to solve the problem?	Conservation of ratio	Ratio: 16 ÷ 4 = 4; 48 ÷ 4 = 12€ Ratio: 4 × 4 = 16; 4 × 12 = 48. 12€ Fraction: 4 = 1/4 × 16; 1/4 × 48 = 12€ Base rate unit: 48 ÷ 16 = 3, 3 × 4 = 12€	x	v	v	1
Missing value proportion problem				x	v	v	1
Missing value proportion problem	12 kg is the weight of 36 chairs. The chairs are all identical. They cost 20€ each. How many chairs are there if the weight is 4 kg?		Ratio: 12 ÷ 4 = 3; 36 ÷ 3 = 12 chairs Ratio: 4 × 3 = 12; 3 × 12 = 36. 12 chairs Fraction: 4 = 1/3 × 12; 1/3 × 36 = 12 chairs Base rate unit: 36 ÷ 12 = 3, 3 × 4 = 12 chairs	x	v	v	1
Missing value proportion problem				x	v	v	1
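
The expert strategies in Table A.8 approach the same missing value from different directions: the ratio and fraction strategies scale the known value by the ratio between the two quantities, whereas the base rate unit strategy first finds the value of one unit. As a minimal sketch under that reading (the function names are hypothetical, and the distractor values such as the pens' weight are deliberately ignored), the two routes can be compared on both items:

```python
from fractions import Fraction

# Illustrative comparison of two solving routes for the items in Table A.8.
# Function names are hypothetical, not taken from the study.

def by_ratio(known_qty, known_value, target_qty):
    """Ratio route: scale the known value by target_qty / known_qty."""
    return Fraction(target_qty, known_qty) * known_value


def by_unit_rate(known_qty, known_value, target_qty):
    """Base rate unit route: value of one unit first, then multiply."""
    return Fraction(known_value, known_qty) * target_qty


# Pens item: 16 pens cost 48 euros; cost of 4 pens.
pens = (by_ratio(16, 48, 4), by_unit_rate(16, 48, 4))

# Chairs item: 12 kg corresponds to 36 chairs; number of chairs for 4 kg.
chairs = (by_ratio(12, 36, 4), by_unit_rate(12, 36, 4))

print(pens[0], pens[1])      # 12 12
print(chairs[0], chairs[1])  # 12 12
```

Exact fractions are used so that intermediate ratios such as 1/4 or 1/3 are not rounded.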

Table A.9

Items in the Subscore “Solving Proportion Problems” (2)

Item type	Statement	Expert conception	Expert strategy	Pretest, 4th grade	Pretest, 5th grade	Posttest, 4th and 5th grades	Score
French national evaluation item: missing value proportion problem	6 identical objects cost 150€. How much do 9 of these objects cost? Can you suggest a second way to solve the problem?	Conservation of ratio	Times more: 3 objects = 75€ 9 objects = 75 × 3 = 225€ Half: 9 = 6 + 3, 150 + 150/2 = 225€ Base rate unit: 150 ÷ 6 = 25, 25 × 9 = 225€	x	v	v	1
		Conservation of ratio		x	v	v	1
TIMSS 2011 - M031183: Complete a recipe	[Item image]	Division of a fraction	(1/2) ÷ 2 = 1/4 tablespoon	x	v	v	1
Proportion problem	90 students must be transported in 40-seat buses. If the first buses are all full, which proportion of the last bus will be full?	Proportion is a ratio	10/40 = 1/4 of the bus will be filled.	x	x	v	1
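
The bus situation appears twice in the appendix: once as a division with remainder (Table A.3, number of buses) and once as a proportion (Table A.9, how full the last bus is). A minimal sketch, assuming a hypothetical helper named `bus_plan`, shows how both answers follow from the same quotient and remainder:

```python
from fractions import Fraction

# Illustrative only: `bus_plan` is a hypothetical helper, not part of the study.

def bus_plan(students, seats):
    """Return (number of buses needed, fraction of the last bus that is filled)."""
    full_buses, leftover = divmod(students, seats)
    if leftover == 0:
        # Every bus, including the last one, is completely full.
        return full_buses, Fraction(1)
    # One extra, partly filled bus carries the remaining students.
    return full_buses + 1, Fraction(leftover, seats)


buses, last_bus_fill = bus_plan(90, 40)
print(buses)          # 3
print(last_bus_fill)  # 1/4
```

The same remainder of 10 students yields a whole number (3 buses) under one reading of the situation and a ratio (10/40 = 1/4) under the other.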
