1. Context
2. Objective
3. Evidence Acquisition
3.1. Search Strategy
3.2. Criteria for Including Studies
3.3. Criteria for Excluding Studies
3.4. Study Identification
3.5. Study Selection
3.6. Quality Assessment
3.7. Data Synthesis
4. Results
| Author, Year | Design, Level of Evidence | Setting, Population | N | Intervention | Duration | Objectives | Result | Limitation |
|---|---|---|---|---|---|---|---|---|
| Meyers et al, 2018 (18) | Quasi-experimental, 2 | Prison, SMI | 58 | CM + psychotherapy and psychoeducational group, individual counseling | 1 year based on the patients` function | Risk of self-injury, n* of requesting protective custody, decrease in crime steps, non-compliance, N of visits, minor or major violations, drug violations, N of physical attack | The N of requests for custody (M = 0.16, P < .05) and supervision (M = 0.66, P < 0.05) and the N of drug abuse decreased. The N of visits raised; But there was a slight increase in the scores of mental health condition (M = 3.63, P < 0.05) due to rises of minor violence on average between Time 1 (M = 0.54) and Time 4 (M = 1.04, P < 0.05). | Limited data available to assess program outcomes, the program is restricted to one facility, lack of control group, lack of individualized reinforcement, lack of procedural integrity |
| Travis, and Stornay, 2013 (19) | Single subject with ABA design, 4 | Residential forensic facility, adults with schizoaffective disorder and mild IDD | 3 | Skills training+ TE (Behavioral Skills Training | 3 weeks intervention and 3 months follow-up | Rate of aggressive and replacement responses that each participant shows after presenting 15 antecedent stimuli | During Aa, the average percentage of aggressive responses for each participant was 54%, 71%, and 64% which decreased to 12%, 10%, and 9% during Ba. The average percentage of replacement responses during Aa was 34%, 15%, and 26% for each participant, and during Ba, they increased to 79%, 78%, and 82%. During the generalization phase, the mean percentage of replacement responses was 74%, 71%, and 75%. During 3 months after the intervention, the N of community trips increased from 1 community trip for all participants to 5, 5, and 2. | Possibility of increased replacement behaviors due to the presentation of the experimenter in post-treatment sessions, lack of functional analysis, no follow-up, and the participants, themselves, did not evaluate the treatment efficacy in the social validity process |
| Park, and Lee, 2012 (6) | Quasi-experimental,2 | hospital, SMI | 44 | Short-Term TE | 8 weeks | Verbal and physical aggression, and damaging objects (Overt Aggression Scale) | After eight weeks, the experimental group showed a significant reduction in aggressive behavior for verbal attacks, from 87% to 63%, and for physical attacks and property damage, from 91% to 78%. However, the control group's scores showed increased aggressive behaviors and had significant differences (intervention group F = 194.46, P < 0.001). | Lack of randomization, lack of individualized reinforcement |
| Taylor, and Mudford, 2012 (17) | Single subject with ABAB design, 3 | Residential therapeutic community, youth with substance abuse and mental health disorders | 16 | TE (Thumbs Up) | 60 sessions | Decrease of inappropriate statements (verbal states that contain swear words, venting, neg-raving), increase of positive statements (showing approval, empathizing, describing desirable state or mood) | Positive statements increased from Aa to Ba for most residents (11/16). A sign test with an alpha level of 0.05 showed statistical significance (Z = 2.89, P = 0.006). All participants had a low rate of verbal statements in Aa and did not change during thumbs up (remained at near zero levels). | Limited time of intervention, no follow-up and generalization, limited data collection due to lack of control over participants entering and leaving the program, lack of individualized reinforcer |
| Tomaka, 2009 (10) | Single subject with ABAB design for 2 participants and AB design for 1 participant, 4 | Residential group homes, IDD with behavioral issues | 3 | A level system with built-in TE+ skills training | Nearly 3 months (it was different for each participant) | Frequency of problematic sexual behavior, assault, damaging objects, stripping, pulling the siren of the emergency system, and escaping | Two of the participants in an ABAB design could make it to and maintain the status of level 4 after approximately 78 and 90 days. The average replacement behavior per day for the first client was A1: 2.7, B1: 4.1, A2: 3.2, B2: 4.6, and for the second one was A1: 2.3, B1: 3.7, A2: 2.5, B2: 5.3. The third participant in an AB design who avoided performing the inappropriate behaviors, demonstrated more replacement behaviors in an average of 2.2 to 2.6 per day and made it to Level 3 after 95 days | Some vague descriptions of the topographies of the behaviors, limited time of fully training and monitoring staff implementing the level system. |
| Holmqvist, et al. 2007 (20) | Quasi-experimental, comparison of two interventions in two institute, 2 | Adolescents in residential care | 57 | Aggression replacement training+ TE compared to the relationally oriented treatment model | 10 weeks, twice a week and 1 hour for each session, 1 year follow-up | Rate of sentence register & the police suspension register (measured by penalty value score), admission and guilt feelings | The institutions had no significant differences in the rate of criminal activity. The values were: F = 0.69, P = 0.51 for the major sentence; F = 0.80, P = 0.46 for all sentences; F = 2.60, P = 0.09 for the major suspicion; and F = 0.22, P = 0.81 for the three major suspicions. 20% of the participants had no major crime after two years; 17 out of 56 had not been sentenced, while the remaining 6 had minor crimes. The effect size of all sentences was 0.71, while the score for the three major suspicions was 0.30. After B a, 9 adolescents were not reported as a criminal, and 5 participants were reported for minor crimes. | Lack of procedural integrity, limited N of participants, no control group, and randomization |
| Coogan et al. 2007 (21) | Single subject with ABAB design, 4 | School, adolescents with inappropriate behavior | 5 | Group CM + Self- Monitoring | 53 days | Any behavior that involves inappropriate physical contact, speaking, hostility, playfulness, distraction, noise-making, or leaving one's seat | The average percentage of disruptive behavior in student 1 was 25% in A1, 3.07% in B1, 21.06% in A2, and decreased to 5.71% in B2. For student 2 it was 30.44% in A1, 5.96% in B1, 20.33% in A2, and 8.17% in B2. In student 3, 30.35% in A1, 6.37% in B1, 25.33% in A2, and 8.95% in B2. For student 4, 38.25% in A1, 5.81% in B1, 32.33% in A2, and 12.6% in B2. In student 5, 34% in A1, 3.96% in B1, 19.89% in A2, and decreased to 4.96% in B2. Students 1, 2, and 3 had significant effect sizes of 1.88, 1.36, and 1.30, respectively. Whereas, students 4 and 5 had the largest effect sizes (2.24 and 2.26, respectively). | Lack of follow-up |
| Luby, 2007 (22) | Single subject with ABA design, 4 | Residential school, an adolescent with ASD | 1 | TE | 9 weeks | Frequency of assault, damaging objects, and inappropriate statements on the behavioral datasheet | The TE was an ineffective method to modify the participant's behavior. However, assault decreased from Stage I of 0 to 10 to Stage III of 0 to 9 (the highest frequency: 49th day & the lowest frequency: 42th - 45th day), and damaging objects decreased from Stage I of 0 to 6 to Stage III of 0 to 2 (the highest frequency: 45th day & the lowest frequency: 42th - 45th day); but these behaviors did not stabilize. Disruptive talk increased from Stage I of 0 to 7 to Stage III of 0 to 15. | Single subject design, the unwillingness of the participant to earn tokens, and the antecedent stimulus that triggered the target behavior were not addressed and most of the target behaviors occurred in the classroom, lunchtime, and cafeteria |
| Bisconer et al.2006 (23) | Single subject with AB a design, 4 | Psychiatric hospital, schizoaffective bipolar type, mild IDD | 1 | Social skill training+ contingent reinforcement | 42 months | Aggression towards others, personnel, and self, frequency of restrictive interventions usage | A reduction in the rate of aggression led to a lesser need for restrictive treatments to manage aggression and a reduction in staff injury. In addition, the TE has let the participant learn appropriate behaviors and skills, and to move to a less restrictive community home. | Lack of procedural integrity and follow-up sessions |
| Freeman, and Dexter-Mazza, 2004 (24) | Single subject with ABAB design,4 | Classroom of a Residential facility, ADHD, conduct disorder, adjustment disorder, and learning disorder | 1 | Self-monitoring, skill training + TE | 3 - 4 times per week, 24 sessions | Disruptive behavior and off-task behavior | Self-monitoring plus matching (adult feedback as a social reinforcer) was effective in preventing disruptive and off-task behavior in the classroom. The frequency of problem behavior decreased from 40% of intervals in the baseline to 12.06% when therapy was introduced. The mean percentage occurrence of targeted behaviors was M = 21.26 in A1, it decreased to M = 20.89 in B1, and in withdrawal and reimplementation of the matching process, they reached 28.6 and 13.2. | Significant discrepancy between the N of observations during Ba, with earlier phases with a greater N of observations, no follow-up, and contingency of reward with accurate self-monitoring made it difficult to determine which variable was responsible for changes |
| Wolfe et al. 2003 (4) | Single subject with A-B1-BC1-B2-BC2 design, 4 | School, adolescents with Emotional and Behavioral Disorders | 3 | TE+ cooperative games | 35 sessions based on the school year (started in the second semester and ended with the termination of the school year) | Pro-social (compliment, supporting, thanking, handshakes, or high fives) and anti-social actions (swearing, physical threats, name-calling, assault) | Prosocial verbal and physical behaviors increased during BC1 and BC2. B1 and B2 had lower means compared to the means of BC1 and BC2. Antisocial verbal behaviors were higher in the A and B1 phases, but in BC1 these behaviors decreased. During B2 behaviors returned to baseline but with the re-initiation of the BC2 these behaviors again decreased and were virtually non-existent. During A and BC1 antisocial physical behaviors were high; but when BC1 was initiated these behaviors dropped to zero and remained there till the end. | A small period of observation, during B2 the primary observer was not able to attend 2/3 sessions which is a concern for inter-rater agreements. |
| Zlomke and Leland 2003 (25) | Single subject with ABCB design,4 | School and residential group homes, youth with bipolar and conduct disorder | 1 | TE + self- monitoring | 19 weeks | Decrease of minor behavior (non-compliance and being out of the seat), disruptive behavior ( excessive talking out loud and singing), and aggression | In the school setting, A significant reduction in challenging behaviors occurred from a baseline mean of 118 after implementing TE. The decrease continued during the TE phase (mean = 63). Introducing self-monitoring alongside the TE resulted in a significant decrease in challenging behaviors. (mean = 7.75). However, the occurrence of targeted behaviors did not change in the group home. During the maintenance phase (4 weeks) compared to the baseline, the rate of challenging behaviors remained low, in both the school and group home (mean = 13). | Lack of procedural integrity, antecedent stimulus that triggered the target behavior were not addressed |
Abbreviations: TE, token economy; CM, contingency management; SMI, severe mental illness; N, number; ADHD, attention-deficit/hyperactivity disorder; IDD, intellectual disability disorder; ASD, Autism spectrum disorder.
a A: Baseline phase, B: Intervention phase, BC: Combination of the interventions.
| Study | Selection Bias | Study Design | Confounders | Blinding | Data Collection Methods | Withdrawals and Drop-outs | Study Quality |
|---|---|---|---|---|---|---|---|
| Park, and Lee, (2012) (6) | 2 | 2 | 1 | 2 | 1 | 1 | 1 |
| Wolfe et al. (2003) (4) | 3 | 2 | 3 | 2 | 1 | 1 | 3 |
| Tomaka, 2009 (10) | 3 | 2 | 3 | 2 | 1 | 1 | 3 |
| Freeman and Dexter-Mazza, (2004) (24) | 3 | 2 | 3 | 2 | 1 | 1 | 3 |
| Coogan et al. (2007) (21) | 3 | 2 | 3 | 2 | 1 | 1 | 3 |
| Luby, 2007 (22) | 3 | 2 | 3 | 2 | 1 | 1 | 3 |
| Zlomke and Leland (2003) (25) | 3 | 2 | 3 | 2 | 1 | 1 | 3 |
| Holmqvist et al. (2007) (20) | 2 | 2 | 1 | 2 | 1 | 1 | 1 |
| Taylor and Mudford (2012) (17) | 2 | 2 | 3 | 2 | 1 | 1 | 2 |
| Bisconer et al. 2006 (23) | 3 | 2 | 3 | 2 | 3 | 1 | 3 |
| Meyers et al, (2018) (18) | 2 | 2 | 3 | 2 | 3 | 1 | 3 |
| Travis and Sturmey (2013) (19) | 3 | 2 | 3 | 2 | 1 | 1 | 3 |
a 1, strong; 2, moderate; 3, weak.

