CONCURRENT-CHAINS PROCEDURE
- Introduction to the Concurrent-Chains Procedure
- Structural Components: The Initial Links
- The Terminal Links: Commitment and Contingency
- The Role of the Primary Reinforcer
- Experimental Methodology and Setup
- Analyzing Preference and the Matching Law
- Applications in Value Assessment and Delay Discounting
- Advantages and Methodological Strengths
Introduction to the Concurrent-Chains Procedure
The Concurrent-Chains Procedure is a sophisticated experimental paradigm utilized extensively within behavioral psychology, particularly in the study of choice, preference, and the reinforcing efficacy of environmental stimuli and scheduled contingencies. It represents a complex extension of basic operant conditioning principles, moving beyond simple, single-schedule arrangements to explore situations where an organism must make a binding commitment to a future schedule of reinforcement. The fundamental structure involves two distinct phases: an initial choice phase, termed the initial links, followed by a terminal outcome phase, known as the terminal links. The core mechanism dictates that the fulfillment of the requirements of either initial schedule leads not to the immediate delivery of the primary reinforcer, but rather to the presentation of a specific, aligned terminal schedule. This crucial intermediate step differentiates the concurrent-chains procedure from simpler concurrent schedules, where primary reinforcement is delivered immediately following the response. The procedure is meticulously designed to measure the relative value or preference an organism places upon different, often delayed or probabilistic, schedules of reinforcement, thus providing invaluable insights into decision-making processes under conditions of commitment.
The primary objective underpinning the deployment of the concurrent-chains procedure is the quantification of the relative reinforcing effectiveness of various stimuli or schedules that serve as conditioned reinforcers. By requiring the subject to choose between two initial schedules, each leading irrevocably to a different subsequent schedule, the experimenter can assess which future contingency holds greater value. This value is reflected in the relative response rate allocated to the corresponding initial link key or lever. Crucially, the initial links are typically associated with distinctive stimuli—such as different colored lights or sounds—which act as discriminative stimuli signaling the upcoming terminal contingency. These stimuli themselves acquire reinforcing properties through their consistent association with the eventual primary outcome. Therefore, the choice observed during the initial phase is not merely a choice between two responses, but a choice between two separate, time-extended environments or futures, each governed by its own unique rules for obtaining the ultimate reward.
Understanding the concurrent-chains procedure necessitates recognizing that the organism’s behavior in the initial links is governed by the anticipated consequences associated with the terminal links. If the terminal link associated with Initial Link A promises a higher density of reinforcement, or reinforcement delivered more promptly, relative to Initial Link B, the organism will demonstrate a robust and measurable preference for Initial Link A. This preference demonstrates that the organism is sensitive not only to the immediate contingencies but also to the parameters governing future reinforcement availability. This experimental design allows researchers to manipulate critical variables such as delay to reinforcement, magnitude of reinforcement, and the probability of reinforcement, and observe how these manipulations systematically alter the subjective value of the future schedule. Consequently, the concurrent-chains procedure stands as a powerful tool for investigating complex psychological phenomena such as self-control, impulsivity, and the fundamental mechanisms of conditioned reinforcement.
Structural Components: The Initial Links
The initial links constitute the first and arguably the most crucial phase of the concurrent-chains procedure, serving as the choice environment where the organism makes its commitment. In a standard setup, the subject is presented with two simultaneously available response options, often two response keys illuminated by distinct stimuli. These stimuli function as discriminative signals indicating the specific terminal schedule that will become available upon successful completion of the initial link requirements. Typically, these initial link schedules are relatively lean and often involve Variable Interval (VI) schedules, designed to ensure continuous responding without allowing momentary response rates to skew the preference measure significantly. For instance, the subject might face a VI 30-second schedule on Key A and a VI 30-second schedule on Key B. The key operational difference lies not in the schedule requirements of the initial link themselves, but in the consequences that follow the first response meeting the interval requirement on either key.
Once the requirement for an initial link schedule is met—for example, a response occurring after 30 seconds have elapsed on VI 30s Key A—the initial link is terminated. The critical structural element here is the commitment: the choice is binding. Termination of Initial Link A immediately removes the alternative choice (Initial Link B) from availability, and the experimental chamber transitions into the conditions associated with the terminal link designated by the choice. This commitment phase ensures that the observed preference is a true reflection of the relative value of the subsequent schedules, uncontaminated by momentary switching behavior. The response allocation observed during this phase—the proportion of responses directed toward Key A versus Key B—is the primary dependent variable used by researchers to quantify the organism’s preference for the future reinforcing environment. If the subject spends 80% of its initial link time responding on Key A, it is inferred that the terminal schedule associated with Key A possesses approximately four times the subjective value of the terminal schedule associated with Key B.
The duration and contingency requirements of the initial links must be carefully calibrated to permit the study of preference without introducing confounding variables. If the initial links are too short, preference may be unstable or governed by immediate factors; if they are too long, the organism might experience fatigue or extinction effects before reaching the commitment point. Furthermore, the schedules used in the initial links are typically identical to one another (e.g., VI 30s vs. VI 30s) to ensure that differences in responding are solely attributable to the differences in the forthcoming terminal links, rather than differences in the effort required to initiate the terminal phase. This methodological rigor ensures that the procedure isolates the reinforcing efficacy of the future schedule itself, allowing for a clean measure of choice based on anticipated outcomes.
The Terminal Links: Commitment and Contingency
The terminal links represent the second, consequential phase of the concurrent-chains procedure. Once the organism has successfully completed the requirements of an initial link (e.g., Initial Link A), the environment changes dramatically. The visual and auditory stimuli associated with the choice phase are extinguished, and the chamber adopts the specific stimuli associated with the chosen terminal schedule (e.g., Key A might turn green, signaling Terminal Link A). This transition signifies that the organism is now locked into the selected contingency, and the only way to obtain the primary reinforcer is by fulfilling the requirements of the terminal schedule. The schedules employed in the terminal links are the variables of interest, as their characteristics are what determine the value assigned to the corresponding initial link.
Terminal links often vary along dimensions such as the density of reinforcement (e.g., VI 10s vs. VI 60s), the magnitude of the reinforcer (e.g., 1 pellet vs. 4 pellets), or the required effort (e.g., Fixed Ratio 5 vs. Fixed Ratio 20). The fulfillment of the terminal link schedule—the first response that meets the contingency requirement within that chosen schedule—instigates the delivery of the primary reinforcer. This relationship is paramount: the terminal schedule itself functions as a chain of conditioned reinforcement, where the successful completion of the chain is ultimately reinforced by the primary event (e.g., food, water). The stimuli present during the terminal link acquire powerful conditioned reinforcing properties because they signal a state of high certainty regarding the imminent availability of the primary reward.
A critical aspect of the terminal link phase is the duration and delay imposed by the schedule. By manipulating the time required to complete the terminal link, researchers can precisely study the phenomenon of delay discounting—the reduction in the subjective value of a reinforcer as the delay to its receipt increases. For instance, if Terminal Link A requires only 5 seconds of waiting (VI 5s), while Terminal Link B requires 60 seconds (VI 60s), the organism will overwhelmingly prefer Initial Link A, demonstrating that the immediate availability of reinforcement elevates the subjective value of the entire sequence leading to it. This structural arrangement provides a rigorous, objective measure of temporal sensitivity and the psychological mechanisms underlying impulsive or self-controlled choices, depending on whether the subject chooses the immediate, smaller reward or the delayed, larger reward.
The Role of the Primary Reinforcer
The primary reinforcer constitutes the final, unconditioned consequence in the concurrent-chains procedure. Typically, this refers to biologically significant stimuli such as food, water, or access to a suitable mate. While the initial choice phase and the terminal schedule phase are both critical, their reinforcing efficacy is derived entirely from their predictive relationship with the primary reinforcer. It is the delivery of this unconditioned stimulus that maintains the entire chain of behavior. Without the consistent and reliable delivery of the primary reinforcer upon completion of the terminal link, the discriminative stimuli and the terminal schedules themselves would undergo rapid extinction, and the organism would cease to show preference in the initial links.
The primary reinforcer serves two essential functions within this paradigm. Firstly, it is the ultimate motivational driver. The organism’s sustained responding across potentially effortful or lengthy schedules is attributable to the deprivation state established prior to the experiment (e.g., food deprivation) and the subsequent satiation provided by the reinforcer. Secondly, the primary reinforcer acts as the validation point for the entire chain. Each time the primary reinforcer is delivered, it strengthens the conditioned reinforcing properties of the preceding terminal link stimuli, which in turn strengthens the choice behavior observed in the initial links. This systematic process ensures that the value measured in the choice phase is dynamically tied to the true utility of the final outcome.
Variations in the primary reinforcer’s parameters are often the key manipulations in concurrent-chains experiments. For example, researchers might compare an initial link leading to Terminal Link A (which yields 1 pellet) versus an initial link leading to Terminal Link B (which yields 4 pellets). If all other temporal parameters are held constant, the subject will strongly prefer Initial Link B, demonstrating sensitivity to reinforcer magnitude. This sensitivity confirms that the organism is not merely responding to the timing of the schedule, but to the actual payoff associated with the contingency. The consistent presence and manipulation of the primary reinforcer allows the concurrent-chains procedure to function as a precise instrument for measuring the interaction between time, effort, and reward quantity in shaping choice behavior.
Experimental Methodology and Setup
The concurrent-chains procedure is typically implemented within highly controlled environments, most commonly using specialized apparatus known as operant conditioning chambers (Skinner boxes), often involving pigeons or rats as subjects due to their high responsiveness to scheduled reinforcement. The chamber is configured specifically to support the two-phase nature of the procedure. For avian subjects, the choice phase involves two distinct response keys, usually illuminated by different colors (e.g., red and green) serving as the initial link stimuli.
The experimental sequence follows a systematic progression:
- The experiment begins with the initial link phase, where both keys are illuminated (e.g., Red Key A, Green Key B), and both initial schedules (e.g., VI 60s) are running concurrently and independently.
- The subject responds on both keys. The response ratio (responses on A / total responses) is continuously measured.
- When the requirement of one schedule is met (e.g., a peck on the Red Key satisfies the VI 60s requirement), the chamber transitions immediately.
- The alternative key is extinguished, and the chosen key changes color (e.g., Red Key turns Blue), signaling the beginning of the terminal link (e.g., VI 10s for the Blue Key).
- The subject must now respond according to the terminal schedule requirements.
- Fulfillment of the terminal schedule results in the delivery of the primary reinforcer (e.g., a brief access to food).
- Following a brief time-out period, the chamber returns to the initial link phase, and the choice keys are re-illuminated, restarting the cycle.
The selection of specific schedules for the initial links is vital. Researchers generally favor Variable Interval (VI) schedules for the initial links because they generate stable, steady rates of responding. This steadiness is crucial because the goal is to measure the relative *time* allocated to each option, which is proportional to the response rate on VI schedules. Using Fixed Interval (FI) or Fixed Ratio (FR) schedules in the initial links could lead to burst-pause patterns of responding that obscure the true underlying preference. Conversely, the terminal links can employ a wider variety of schedules (VI, VR, FR, or multiple schedules) depending on the specific psychological process being investigated, such as effort tolerance or resistance to extinction.
Analyzing Preference and the Matching Law
Data analysis in the concurrent-chains procedure centers on quantifying the strength of preference for one initial link over the other. This quantification is typically achieved by calculating the relative response rate or the relative time allocation to the initial links. The established theoretical framework for interpreting these choice data is often the Generalized Matching Law (GML), developed by Richard Herrnstein. The GML posits that the relative rate of responding on an initial link matches the relative rate of reinforcement obtained from the terminal link to which it leads.
Mathematically, the fundamental relationship measured is expressed as the ratio of responses on Initial Link A relative to the total responses on both links (A + B), matched against the ratio of reinforcers obtained from Terminal Link A relative to the total reinforcers obtained from both terminal links. When the schedules in the terminal links offer equal rates of reinforcement, the subject should allocate approximately 50% of its initial link responses to each key, indicating indifference. However, when Terminal Link A offers reinforcement three times more frequently than Terminal Link B (e.g., VI 10s vs. VI 30s), the organism will typically allocate significantly more responses to Initial Link A, demonstrating a preference that reflects the better future payoff.
The concurrent-chains procedure provides a robust method for testing deviations from ideal matching behavior, which are often observed when studying complex variables like delay or magnitude. When the time delay to reinforcement is large, the organism may show a systematic bias away from the option that yields the objectively better but more delayed reward, illustrating phenomena such as undermatching or bias in the Generalized Matching Law framework. By generating comprehensive data on response allocation under various terminal link conditions, researchers can derive critical psychological parameters that describe how subjects subjectively weigh delayed versus immediate, or large versus small, rewards, providing an empirical foundation for models of rational choice.
Applications in Value Assessment and Delay Discounting
The utility of the concurrent-chains procedure extends far beyond basic schedule analysis, serving as a primary methodology for investigating phenomena of significant psychological and clinical relevance. One of its most powerful applications is the rigorous study of conditioned reinforcement. Since the initial link stimuli gain their value purely through association with the terminal schedules, the procedure allows for precise measurement of how reinforcement history, schedule density, and temporal proximity affect the strength of secondary reinforcers. This has profound implications for understanding how neutral stimuli (e.g., money, tokens, grades) acquire motivational power.
Furthermore, the procedure is unparalleled in its ability to model and measure impulsivity and self-control. By designing one terminal link to provide a small, immediate reinforcer (the impulsive choice) and the other terminal link to provide a large, delayed reinforcer (the self-controlled choice), researchers can quantify the steepness of the subject’s delay discounting curve. For instance, if Initial Link A leads to 1 pellet immediately, and Initial Link B leads to 5 pellets after a 60-second delay, the degree to which the subject chooses A over B reveals their sensitivity to temporal proximity. Subjects who consistently choose the immediate, smaller reward demonstrate a high degree of impulsivity, while those who choose the delayed, larger reward demonstrate self-control. This allows for cross-species comparisons and the assessment of pharmacological or neurological manipulations on decision-making pathology.
In clinical and applied settings, the principles derived from concurrent-chains research have informed interventions for addictive behaviors and behavioral economics. Addictive behaviors, for instance, are often characterized by a steep delay discounting function—an extreme preference for immediate, albeit harmful, consequences over delayed, beneficial ones. By utilizing the concurrent-chains structure to identify the parameters that govern preference shifts, researchers can design contingency management programs that enhance the subjective value of long-term, healthier outcomes, effectively mitigating the immediate pull of short-term, detrimental reinforcement. Thus, the concurrent-chains procedure acts as a foundational bridge between fundamental laboratory research and practical applications in understanding complex human decision-making biases.
Advantages and Methodological Strengths
The concurrent-chains procedure offers several distinct methodological advantages over simpler choice paradigms, making it the preferred tool for studying specific facets of choice behavior. The foremost strength is its capacity to measure the value of a schedule *before* the subject has been exposed to the schedule itself during that trial. In simple concurrent schedules, the subject may switch back and forth between options based on momentary changes in the schedules (e.g., responding on the key that has just expired its interval). The concurrent-chains structure eliminates this ambiguity by enforcing a binding commitment once the initial link is completed. This commitment ensures that the recorded preference truly reflects the anticipated long-term value of the terminal schedule, rather than short-term tactical switching.
Secondly, the procedure provides a clean separation between the behavior used to initiate the choice (the initial links) and the behavior required to obtain the reward (the terminal links). This allows researchers to manipulate the schedule characteristics in the terminal phase—the variable of interest—without altering the response requirements in the initial choice phase, thereby isolating the effects of terminal schedule parameters on conditioned reinforcement. This clarity is essential for establishing robust functional relationships between schedule characteristics (delay, magnitude) and subjective preference.
Finally, the concurrent-chains procedure is inherently flexible, allowing for the substitution of various terminal schedules and outcome parameters. Researchers can easily incorporate punishment, effort contingencies, or different qualities of reinforcement into the terminal links to study how these factors modulate choice. This flexibility has enabled the investigation of highly nuanced psychological questions, such as the preference for probabilistic outcomes versus certain outcomes (risk assessment), or the preference for information versus uncertainty, placing the concurrent-chains procedure at the forefront of experimental analysis of behavior related to complex economic and temporal decision-making.