Elsevier

Behavioural Processes

Volume 89, Issue 3, March 2012, Pages 244-255
Behavioural Processes

Transitive inference in pigeons: Measuring the associative values of Stimuli B and D

https://doi.org/10.1016/j.beproc.2011.12.001Get rights and content

Abstract

Several reinforcement-based models have been proposed to explain transitive-like behavior in nonverbal transitive inference tasks. These models assume that the initial training required for memorizing the premises produces an ordered series of associative values (A > B > C > D > E); these values can then be used to select the “transitively correct” stimulus in a novel pair (e.g., BD). Our study experimentally tested this assumption by using resistance-to-extinction and resistance-to-reinforcement techniques to obtain empirical measures of associative strength for Stimuli B and D. We first measured the associative strengths of these stimuli after completion of initial training with overlapping pairs of colored squares (A + B−, B + C−, C + D−, and D + E−) using resistance-to-extinction and resistance-to-reinforcement procedures. Next, we used massed presentations of Pair D + E− (termed bias reversal) that ought to increase the associative value of Stimulus D, and again measured the associative strengths of the stimuli. None of our experimental measures of associative strength correlated with pigeons’ behavior in the BD test or with BD performance predicted by associative models either before or after bias reversal (Wynne, 1995, Siemann and Delius, 1998). These results question validity of reinforcement-based models for explaining animals’ behavior in nonverbal TI tasks.

Highlights

► We trained pigeons to acquire of 5-term series in a transitive inference task. ► After acquisition, we experimentally measured associative values of Stimuli B and D. ► Experimental measures of associative values did not predict BD performance in test. ► They also did not correlate with BD performance predicted by associative models. ► Results call into question the validity of reinforcement-based model for TI tasks.

Section snippets

Subjects

Ten feral pigeons kept at 85% of their free-feeding weights by controlled daily feeding were used. One of the pigeons failed to learn the discrimination after the extensive training and was dropped from the experiment. Therefore, the final sample consisted of nine birds. Grit and water were available ad lib in the home cages. The pigeons had served in unrelated studies prior to this experiment.

Apparatus

The experiment used four operant conditioning chambers linked to four Macintosh computers detailed

Training and bias reversal

Training in the Downward group took an average of 36 sessions, with a minimum of 11 sessions and a maximum of 75 sessions. Training in the Upward group took an average of 20 sessions, ranging from 15 sessions to 26 sessions. The difference in the speed of learning in the two groups was not statistically significant.

Table 1 shows the number of correct and incorrect choices as well as the reinforcement/nonreinforcement ratios before and after bias reversal for all of the birds. In the Downward

Comparison of Upward and Downward groups

Several studies have reported that adult humans who are presented with verbal premises find it easier to construct an ordered linear series in a downward direction (i.e., starting with the “best” item and proceeding to the “worst” item) than in an upward direction (i.e., starting with the “worst” item and proceeding to the “best” item; De Soto et al., 1965, Jahn et al., 2007). It is possible that this tendency arises from linguistic properties of the premises (e.g., lexical marking, Clark, 1969

Conclusion

In our earlier study (Lazareva and Wasserman, 2006), we found that associative models were unable to predict high BD performance after bias reversal. It was possible, however, that the associative values of the stimuli after the bias reversal phase still formed an ordered series, but the associative models required some modification (e.g., the assumption that the associative value of a given stimulus is not updated during bias reversal; see Vasconcelos, 2008) to account for these data. Even

References (34)

  • M.J. Frank et al.

    Transitivity, flexibility, conjunctive representations, and the hippocampus. II. A computational analysis

    Hippocampus

    (2003)
  • D.J. Gillan

    Reasoning in the chimpanzee: II. Transitive inference

    Journal of Experimental Psychology: Animal Behavior Processes

    (1981)
  • R.C. Grace et al.

    On the relation between preference and resistance to change

    Journal of Experimental Analysis of Behavior

    (1997)
  • E. Hearst

    Resistance-to-extinction functions in the single organism

    Journal of Experimental Analysis of Behavior

    (1961)
  • G. Jahn et al.

    Preferred mental models in reasoning about spatial relations

    Memory & Cognition

    (2007)
  • O.F. Lazareva

    Transitive inference in nonhuman animals

  • O.F. Lazareva et al.

    Transitive responding in hooded crows requires linearly ordered stimuli

    Journal of Experimental Analysis of Behavior

    (2004)
  • Cited by (35)

    • What Is a Cognitive Map? Organizing Knowledge for Flexible Behavior

      2018, Neuron
      Citation Excerpt :

      When they are later asked to choose between A and a control, they choose A—the stimulus that implies a path to reward. While it is possible to solve both of these tasks by simpler associative mechanisms, in the case of transitive inference, at least, evidence strongly suggests that animals instead rely on abstract knowledge of the linear structure (Gazes et al., 2012; Jensen et al., 2015, 2017; Lazareva and Wasserman, 2012). For sensory preconditioning, the jury is still out in animals, but any reader who was able to decipher the plots of “Pulp Fiction” or “Kill Bill” from the sporadic, interleaved, and often time-reversed episodes will know the answer for humans (Figure 6E).

    • Understanding behavior under nonverbal transitive-inference procedures: Stimulus-control-topography analyses

      2017, Behavioural Processes
      Citation Excerpt :

      We analyzed accuracies on these relevant discriminations to assess their relation to probe-trial selections. Specifically, reward ratios and non-reward ratios were calculated (e.g., Lazareva and Wasserman, 2012). Reward ratio was calculated by dividing the number of reinforced responses to stimulus B by the number of reinforced responses to stimulus B plus the number of reinforced responses to stimulus G for each participant in all baseline sessions.

    • It takes two to tango: hemispheric integration in pigeons requires both hemispheres to solve a transitive inference task

      2017, Animal Behaviour
      Citation Excerpt :

      The tested hemisphere had only direct experience with B and it clearly chose this stimulus although B was reinforced in one (B+C−) and not in the other (A+B−) training pair. Owing to the overtraining of B+C−, B was presumably associated with a positive reward (Lazareva et al., 2015; Lazareva & Wasserman, 2012) and therefore was chosen. The hemispheres also successfully discriminated pairs that included colour E and, hence, the colour with the lowest reward value that was, however, not directly experienced.

    View all citing articles on Scopus
    View full text