d

DIFFERENTIAL REINFORCEMENT OF OTHER BEHAVIOR (DRO)



Introduction to Differential Reinforcement of Other Behavior (DRO)

Differential Reinforcement of Other Behavior, commonly abbreviated as DRO, is a foundational procedure within the field of Applied Behavior Analysis (ABA) designed explicitly to decrease the rate or frequency of a specific targeted maladaptive response. This technique operates by providing a potent reinforcer contingent upon the non-occurrence of the undesirable behavior within a defined interval of time. In essence, the individual earns reinforcement for engaging in literally any behavior other than the one targeted for reduction, hence the term “other behavior.” This approach is highly valued for its reliance on positive reinforcement strategies rather than aversive or punitive methods, aligning with modern ethical standards in behavioral intervention.

The core objective of the DRO procedure is to establish a temporal contingency where the absence of the problematic response becomes the criterion for reward. If the targeted behavior occurs at any point during the set interval, the timer is immediately reset, and the delivery of the reinforcer is withheld, thereby subjecting the undesirable behavior to a form of extinction or non-reinforcement. This systematic arrangement creates a powerful incentive for the individual to inhibit the target response, as the consequence of engaging in the behavior is the immediate loss of the forthcoming positive stimulus. The effectiveness of DRO stems from this clear, immediate, and consistent contingency.

DRO is frequently utilized in clinical, educational, and residential settings to address a wide range of challenging behaviors, including aggression, self-injurious behavior (SIB), stereotypy, and disruptive classroom conduct. Due to its defining characteristic—reinforcement for the absence of behavior—it is sometimes referred to by the equally descriptive title of Omission Training. While all differential reinforcement strategies (including DRA, DRI, and DRL) aim to reduce unwanted behavior, DRO is often selected when it is difficult to define or prompt a specific, incompatible replacement behavior, or when the immediate goal is simply cessation of the problematic response.

Core Principles and Operational Mechanics

The success of Differential Reinforcement of Other Behavior hinges upon the meticulous definition and execution of the reinforcement schedule. Operationally, the procedure requires the interventionist to first establish a precise measure of the target behavior, typically its frequency or duration during a baseline period. This baseline data is crucial for determining the initial length of the reinforcement interval, often set slightly shorter than the average time between occurrences of the problem behavior (the mean inter-response time, or IRT). This strategic initial setting ensures a high probability of success and subsequent delivery of reinforcement, which is vital for establishing the contingency quickly.

A defining characteristic of DRO is its reliance on a time-based contingency. Unlike Differential Reinforcement of Alternative Behavior (DRA), which reinforces a specified adaptive response, DRO reinforces the individual for engaging in anything that is not the target behavior. For instance, if the target behavior is yelling, the individual is reinforced for silence, reading, playing, or sitting quietly—as long as they are not yelling. This non-specific nature simplifies implementation but also requires careful monitoring, as it carries the inherent risk of accidentally reinforcing a behavior that is non-target but still marginally undesirable (e.g., reinforcing excessive fidgeting while reducing aggression).

The mechanism by which DRO decreases behavior is primarily through the establishment of a powerful competing contingency. By making highly desirable consequences contingent upon the non-occurrence of the target response, the procedure effectively weakens the association between the problem behavior and any intrinsic or natural reinforcement it might previously have received. If the problem behavior is maintained by attention, for example, the DRO procedure ensures that attention is delivered only during periods when the behavior is absent. Over time, the individual learns that accessing positive outcomes is optimally achieved by refraining from the targeted undesirable response.

Historical Context and Theoretical Foundations

Differential Reinforcement of Other Behavior is firmly rooted in the principles of operant conditioning, a psychological framework pioneered by B.F. Skinner. Skinner’s extensive research demonstrated that behavior is a function of its consequences, and interventions like DRO utilize the manipulation of these consequences to achieve behavioral change. DRO represents a sophisticated application of reinforcement schedules, specifically leveraging the power of positive reinforcement to suppress behavior rather than relying on punishment, which carries significant ethical and practical drawbacks.

The theoretical foundation of DRO lies in its functional relationship with extinction. When the target behavior occurs, the scheduled positive reinforcement is withheld, effectively placing that specific instance of the behavior on extinction. However, DRO is generally considered superior to simple extinction procedures because it simultaneously reinforces the absence of the behavior. Simple extinction can often lead to an “extinction burst”—a temporary increase in the frequency, duration, or intensity of the problem behavior—as the individual escalates attempts to elicit the previously earned consequence. DRO mitigates this risk by ensuring that the individual continues to access reinforcement, albeit contingent on the omission of the problem behavior, thus minimizing frustration and distress.

The development of differential reinforcement procedures marked a significant advancement in behavior modification during the mid-20th century, offering humane and effective alternatives to control procedures. While DRO was initially conceptualized alongside Differential Reinforcement of Incompatible Behavior (DRI) and Differential Reinforcement of Alternative Behavior (DRA), it carved out a unique role. DRO became the go-to strategy when the goal was immediate suppression and when the practitioner did not yet possess sufficient data to identify a specific, functionally equivalent replacement behavior. Its versatility allowed it to be applied across a broader range of behavioral issues than the more restrictive DRI procedure.

Procedural Variations of DRO Implementation

Although the basic premise of reinforcing the absence of behavior remains constant, DRO can be implemented through several procedural variations, each tailored to the specific characteristics of the target behavior and the setting. The two primary categories relate to how the interval is measured: Interval DRO and Momentary DRO. Interval DRO (DRO-I) requires the non-occurrence of the behavior for the entire duration of the specified time period. If the behavior occurs even once within the interval, the reinforcement is denied, and the interval timer is immediately reset back to zero. This total-interval requirement makes DRO-I highly effective for suppressing low-frequency, high-intensity behaviors, such as aggressive outbursts.

A significant variation is the Momentary DRO (DRO-M). In this procedure, reinforcement is contingent only upon the absence of the target behavior at the precise moment the scheduled interval terminates. If the behavior occurred five seconds before the timer ended, but is absent at the moment of the interval conclusion, reinforcement is still delivered. DRO-M is often preferred when the target behavior is highly frequent or when the continuous monitoring required for DRO-I is impractical, such as in busy classroom or group settings. While DRO-M is easier to implement and maintain, DRO-I typically yields faster and more comprehensive suppression of the target response because the contingency is stricter.

Furthermore, the scheduling of the reinforcement itself can vary, leading to Fixed Interval DRO (FI-DRO) and Variable Interval DRO (VI-DRO). FI-DRO, where the time period remains the same (e.g., every five minutes), is the standard and initial approach, offering predictability that helps the learner adjust rapidly to the contingency. Conversely, Variable Interval DRO (VI-DRO) utilizes a changing, unpredictable interval length (e.g., an average of five minutes, but varying randomly between three and seven minutes). VI-DRO is generally employed during the maintenance phase of treatment. Introducing variability makes the contingency less predictable, which is crucial for promoting the generalization of the behavior reduction across different times and environments, ultimately fostering greater behavioral independence.

Steps for Effective Implementation

Effective deployment of a DRO procedure requires systematic planning and adherence to rigorous behavioral assessment steps. The initial and most critical step is the completion of a thorough Functional Behavior Assessment (FBA) to determine the maintaining variables (the function) of the target behavior. While DRO does not strictly require teaching a replacement behavior, understanding function ensures that the selected reinforcer is powerful enough to compete with the natural reinforcement the problem behavior is currently accessing. Following the FBA, the target behavior must be defined clearly, objectively, and measurably, ensuring high inter-observer agreement.

The second crucial step involves establishing the initial interval length (T). Practitioners typically analyze baseline data to calculate the mean Inter-Response Time (IRT)—the average time elapsed between occurrences of the problem behavior. The initial DRO interval is often set at a duration slightly less than this mean IRT (e.g., 80% of the mean IRT). This conservative approach ensures that the individual immediately contacts reinforcement frequently, maximizing the opportunity for successful learning and minimizing frustration. If the interval is set too long initially, the individual may rarely or never earn the reinforcer, leading to the failure of the entire procedure.

Finally, once the initial interval is mastered and the target behavior frequency shows reliable reduction, the procedure must move into the crucial phase of interval thinning, or fading the schedule. Interval thinning involves systematically and gradually increasing the time required for non-occurrence (e.g., moving from 5 minutes to 7 minutes, then 10 minutes, and so on). This gradual increase ensures that the reinforcement schedule becomes leaner, mirroring the natural reinforcement schedules encountered in daily life. Successful thinning is essential for maintaining the behavioral gains over the long term and preventing dependency on dense schedules of reinforcement.

Advantages and Potential Disadvantages

The primary advantage of DRO is its ethical and humanitarian alignment with modern behavior intervention standards. As a positive reinforcement technique, it avoids the use of punishment, minimizing the potential for negative side effects such as emotional distress, avoidance, or aggression toward the interventionist. Furthermore, DRO is highly versatile; because it reinforces “other” behavior rather than a specific alternative, it can be applied quickly and effectively to almost any behavior targeted for reduction, regardless of whether a functional replacement behavior has been identified or taught. This makes it an excellent choice for initial stabilization of highly dangerous or disruptive behaviors.

However, DRO is not without its limitations, the most significant of which is the potential for the accidental reinforcement of an undesirable non-target behavior. Since reinforcement is delivered for any behavior other than the defined target, if the individual engages in a second, problematic behavior during the interval (e.g., rocking or verbal protesting), that behavior could inadvertently be strengthened simply because it was not the specified target. This necessitates continuous, vigilant observation by the practitioner to ensure that the “other behavior” being reinforced is, at minimum, neutral or adaptive. If a second undesirable behavior is observed, it must be either added to the DRO contingency definition or addressed through a concurrent intervention.

Another potential disadvantage relates to the efficiency of behavior change. While highly effective at suppressing the target behavior, DRO does not inherently teach the individual a more adaptive or functionally appropriate way to meet their needs. Unlike DRA or DRI, which explicitly teach replacement skills, DRO leaves a behavioral vacuum. If the underlying function of the behavior (e.g., seeking attention) is not met by the reinforced “other behavior,” the behavior reduction may be fragile, and relapse is possible. Therefore, best practice often dictates that DRO should be implemented alongside skill-building components to ensure meaningful, lasting behavioral change and improved quality of life.

Ethical Considerations and Responsible Application

The responsible application of Differential Reinforcement of Other Behavior is governed by strict ethical guidelines established by professional bodies in behavior analysis. Central to these guidelines is the requirement that all behavior reduction programs must be based on a thorough FBA. Implementing DRO without understanding the function of the behavior risks merely suppressing the symptom without addressing the underlying cause, which is ethically unsound and likely to lead to poor long-term outcomes. The intervention must be clinically necessary and socially significant, meaning the target behavior must genuinely impede the individual’s learning, social integration, or safety.

Furthermore, ethical practice demands that the procedure is implemented with fidelity and consistency. Inconsistency in administering the DRO schedule—such as resetting the timer inconsistently or failing to deliver the reinforcer when earned—can rapidly undermine the effectiveness of the procedure, potentially leading to increased behavioral variability and frustration for the learner. Therefore, extensive training of all staff and caregivers involved in the implementation is a non-negotiable ethical requirement, ensuring high treatment integrity. Data collection must also be continuous and accurate to allow the interventionist to make timely, data-driven decisions regarding interval thinning or procedural adjustments.

Finally, even though DRO reinforces “other behavior,” ethical mandates strongly encourage pairing DRO with procedures that teach functional communication and alternative, adaptive skills. While DRO is effective for rapid suppression, teaching the individual a specific, appropriate response (like asking for a break instead of engaging in aggression) ultimately empowers the individual with replacement skills. Therefore, the most ethically robust treatment packages often combine DRO for rapid reduction with DRA or skill-training components to ensure that the individual gains true behavioral competence and self-management capabilities necessary for generalization and maintenance in natural environments.

Applications Across Diverse Settings

Differential Reinforcement of Other Behavior is a versatile and powerful tool utilized across numerous professional and organizational settings due to its straightforward implementation and high efficacy. In clinical and therapeutic environments, particularly those serving individuals with developmental disabilities such as Autism Spectrum Disorder (ASD), DRO is frequently employed to address severe challenges. For instance, it is highly effective in reducing chronic self-injurious behaviors (SIB) by setting a short interval and reinforcing the absence of SIB with access to highly preferred sensory input or adult attention. Similarly, it is used to decrease high-frequency, low-intensity stereotypy (repetitive motor behaviors) when those behaviors interfere with learning.

In educational settings, DRO offers teachers a manageable strategy for promoting classroom compliance and reducing disruptive behaviors. A teacher might use DRO to address a student who frequently calls out answers without raising their hand. By reinforcing the student with tokens or preferred activities for the absence of calling out during defined 10-minute intervals, the teacher systematically increases the student’s ability to inhibit the impulsive behavior. This application enhances the learning environment for the entire class while positively shaping the student’s conduct.

Beyond clinical and educational contexts, the principles of DRO are also applicable in Organizational Behavior Management (OBM) and employee performance settings. For example, a manager seeking to reduce the frequency of safety violations might implement a DRO contingency where the entire team receives a bonus or extended break time if zero safety incidents are reported during a specific weekly interval. This utilizes the omission training paradigm to reinforce collective adherence to safety protocols. Whether applied to complex clinical profiles or general performance management, DRO remains a cornerstone technique for achieving durable behavior reduction through positive, time-contingent reinforcement.