Objective: In the past decades, many different types of psychotherapy for adult depression have been developed. Method: In this meta-analysis we examined the effects of 15 different types of psychotherapy using 385 comparisons between a therapy and a control condition: Acceptance and commitment therapy, mindfulness-based cognitive behavior therapy (CBT), guided self-help using a self-help book from David Burns, Beck’s CBT, the “Coping with Depression” course, two subtypes of behavioral activation, extended and brief problem-solving therapy, self-examination therapy, brief psychodynamic therapy, non-directive counseling, full and brief interpersonal psychotherapy, and life review therapy. Results: The effect sizes ranged from g = 0.38 for the “Coping with Depression” course to g = 1.10 for life review therapy. There was significant publication bias for most therapies. In 70% of the trials there was at least some risk of bias. After adjusting studies with low risk of bias for publication bias, only two types of therapy remained significant (the “Coping with Depression” course, and self-examination therapy). Conclusions: We conclude that the 15 types of psychotherapy may be effective in the treatment of depression. However, the evidence is not conclusive because of high levels of heterogeneity, publication bias, and the risk of bias in the majority of studies.

Clinical or methodological significance of this article: Although many types of psychotherapy for adult depression have been examined in randomized trials, only a relatively small number of generic types of psychotherapy have been examined in meta-analyses. In this meta-analysis, we examined the effects of fifteen more specific types of therapy. This is important for clinical practice because these therapies can be used in routine care. Although most of them may be effective, the research is limited by high levels of heterogeneity, publication bias, and the risk of bias in the majority of studies.

Several different types of psychotherapies have been found to be effective in the treatment of adult depression, including cognitive behavior therapy (CBT) (Cuijpers, Sijbrandij, et al., 2013; Furukawa et al., 2014), behavioral activation therapy (BAT) (Dimidjian, Barrera Jr, Martell, Muñoz, & Lewinsohn, 2011; Ekers, Richards, & Gilbody, 2008; Shinohara et al., 2013), interpersonal psychotherapy (IPT) (Churchill et al., 2010; Cuijpers, Donker, Weissman, Ravitz, & Cristea, 2016), problem-solving therapy (PST) (Cuijpers, de Wit, Kleiboer, Karyotaki, & Ebert, 2017; Malouff, Thorsteinsson, & Schutte, 2007), non-directive counseling (Cuijpers et al., 2012) and brief psychodynamic therapy (Driessen et al., 2013). Meta-analyses of trials that directly compare these therapies with each other typically indicate that there are no differences, or no major differences, between their effects (Barth et al., 2013; Cuijpers, 2017; Cuijpers, van Straten, Andersson, & van Oppen, 2008).

How to define these different types of therapies is not straightforward, however. There are no “official” definitions for these types of therapy. On the one hand, there are the very broad generic categories of psychodynamic, cognitive–behavioral and humanistic categories of psychotherapy (Wampold & Imel, 2015). These categories are, however, not well delineated, and there are all kinds of therapies that are not easily captured in one of these categories, such as IPT or couple therapy. On the other end of the spectrum of definitions are the therapies that are defined by the use of a specific manual, such as CBT according to the manual of Beck, Rush, Shaw, and Emery (1979), or IPT according to the manual of Klerman, Weissman, Rounsaville, and Chevron (1984). But most reports on psychological treatments only describe the actual therapies very briefly, which makes it impossible to make clear categories of therapies. Furthermore, in many studies, the authors refer to a specific manual, but also report that they have made adaptations to a specific population, setting or aims of the intervention, or they have inserted parts from other therapies in the original therapy. This makes it even more complicated to define and operationalize types of therapies.

Most meta-analyses have solved this problem by using more generic “brand” names, such as CBT (when cognitive restructuring is a core element of the therapy), BAT (when pleasant activity scheduling is one of the core elements of the therapy), or PST (when problem-solving techniques are the core element of the therapy). However, such generic brand names often include several different subtypes of therapy with different manuals or different approaches. For many of these subtypes, no meta-analyses have ever been done. This information is important, however, because when only some of these subtypes are effective, it indicates which manuals or approaches should be advised to clinicians and patients. Moreover, generic brand names do not indicate how these therapies should be conducted in routine practice, while manuals and specific approaches do.

In the current study, we build on the approach to categorizing psychotherapies for depression that we used in an earlier meta-analysis (Cuijpers, van Straten, Warmerdam, & Andersson, 2008). In that earlier study, we closely examined the therapies that were used in 91 comparative outcome studies on depression and categorized them into clusters of therapies of 5 or more studies. We formulated definitions of the major types of psychological treatment that were found, and checked whether the interventions from the studies met these descriptions (the definitions are given in Cuijpers, van Straten, Warmerdam, & Andersson, 2008). The exact formulation of the definitions was discussed in detail by the authors, who are experts in the research and practice of psychological treatments for depression.

In the current study, we started with the major categories and definitions that were developed in the earlier study (“generic” therapies), and tried to make subcategories and more specified definitions of the treatments (“specific” therapies). We did this using the same approach as we used for the broad categories, by examining the interventions described in the studies, as well as the literature on each of the treatments (the exact methods and references are given in the Methods section). We continued this process until we had treatments that each had a sufficient number of studies (at least 5) and were based on one specific manual or method that could clearly be distinguished from other psychological treatments of the same category.

Our goal was to examine the effects of each of these specific treatments compared to control conditions (waiting list, care as usual, placebo, other) and to explore in meta-regression analyses whether we could find differences between the specific psychological treatments.

We used an existing database of studies on the psychological treatment of depression. This database has been described in detail elsewhere (Cuijpers, Reijnders, & Karyotaki, 2018), and has been used in a series of earlier published meta-analyses (Cuijpers, 2017). For this database, we searched four major bibliographical databases (PubMed, PsycInfo, Embase and the Cochrane Library) by combining terms (both index terms and text words) indicative of depression and psychotherapies, with filters for randomized controlled trials. The full search string for one database (PubMed) is given in Appendix A. We also searched a number of bibliographical databases to identify trials in non-Western countries (Cuijpers, Karyotaki, Reijnders, Purgato, & Barbui, 2018), because the number of trials on psychological treatments in these countries is growing rapidly. Furthermore, we checked the references of earlier meta-analyses on psychological treatments of depression. The database is continuously updated and was developed through a comprehensive literature search (from 1966 to January 1, 2018). All records were screened by two independent researchers and all papers that could possibly meet inclusion criteria according to one of the researchers were retrieved in full text. The decision to include or exclude a study in the database was also made by the two independent researchers, and disagreements were solved through discussion.

We included studies that were: (i) a randomized trial (ii) in which a psychotherapy (iii) for adult depression was (iv) compared with a control group (waiting list, care-as-usual, placebo, other inactive treatment) or another treatment (psychological or pharmacological). Depression could be established with a diagnostic interview or with a score above a cut-off on a self-report measure. We defined psychotherapy according to Norcross (Campbell, Norcross, Vasquez, & Kaslow, 2013):

Psychotherapy is the informed and intentional application of clinical methods and interpersonal stances derived from established psychological principles for the purpose of assisting people to modify their behaviors, cognitions, emotions, and/or other personal characteristics in directions that the participants deem desirable.

Because we found in a previous meta-analysis (Cuijpers, Noma, Karyotaki, Cipriani, & Furukawa, 2019) that there are no significant differences between treatment formats (individual, group, guided self-help, internet-based therapy) as long as there is human involvement, we allowed any of these treatment formats.

Co-morbid mental or somatic disorders were not used as an exclusion criterion. Because of the differences in control conditions, studies on inpatients were excluded, in order to reduce heterogeneity (Cuijpers et al., 2011). We also excluded maintenance studies, aimed at people who had already recovered or partly recovered after an earlier treatment.

As indicated earlier, we used the broad categories and definitions of psychotherapies that were developed in an earlier study as a starting point (Cuijpers, van Straten, Warmerdam, et al., 2008). We removed one category (social skills training, because there were only a few controlled trials on this treatment) and added two other categories that were examined in more than 10 trials (the cluster of third wave therapies and life review therapies). We critically read the controlled trials on psychotherapies for depression and tried to identify more specific treatments. Wherever possible, we built on the literature for each of these therapy types. We stopped with further specification when the number of studies was 5 or fewer. The resulting specified types of therapy and the literature on which these specified types were based are given in .

As in our previous meta-analyses using our database of randomized trials, we assessed the validity of included studies using four criteria of the “Risk of bias” assessment tool, developed by the Cochrane Collaboration (Higgins et al., 2011). This tool assesses possible sources of bias in randomized trials, including the adequate generation of allocation sequence; the concealment of allocation to conditions; the prevention of knowledge of the allocated intervention (masking of assessors); and dealing with incomplete outcome data (this was assessed as positive when intention-to-treat analyses were conducted, meaning that all randomized patients were included in the analyses). Assessment of the validity of the included studies was conducted by two independent researchers, and disagreements were solved through discussion.

We also coded participant characteristics (depressive disorder or scoring high on a self-rating scale; recruitment method; target group); characteristics of the psychotherapies (treatment format; number of sessions); and general characteristics of the studies (type of control group; country where the study was conducted). Treatment format was coded as individual, group or guided self-help (including internet-based guided self-help).

For each comparison between a psychotherapy and a control condition, the effect size indicating the difference between the two groups at post-test was calculated (Hedges’ g) (Hedges & Olkin, 1985). Effect sizes of 0.8 can be assumed to be large, while effect sizes of 0.5 are moderate, and effect sizes of 0.2 are small (Cohen, 1988). Effect sizes were calculated by subtracting (at post-test) the average score of the psychotherapy group from the average score of the control group, and dividing the result by the pooled standard deviation. Because some studies had relatively small sample sizes we corrected the effect size for small sample bias (Hedges & Olkin, 1985). If means and standard deviations were not reported, we used the procedures of the Comprehensive Meta-Analysis software (see below) to calculate the effect size using dichotomous outcomes (Borenstein, Hedges, Higgins, & Rothstein, 2009); and if these were not available either, we used other statistics (such as t-value or p-value) to calculate the effect size.
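The effect size calculation described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the Comprehensive Meta-Analysis implementation: the function name `hedges_g` and the example numbers are hypothetical, and the small-sample correction uses the common approximation J = 1 - 3 / (4df - 1).

```python
import math

def hedges_g(mean_control, sd_control, n_control,
             mean_therapy, sd_therapy, n_therapy):
    """Post-test standardized mean difference with small-sample correction.

    Positive values mean lower depression scores in the therapy group,
    because the therapy mean is subtracted from the control mean.
    """
    df = n_control + n_therapy - 2
    # Pooled standard deviation of the two groups
    pooled_sd = math.sqrt(((n_control - 1) * sd_control ** 2
                           + (n_therapy - 1) * sd_therapy ** 2) / df)
    d = (mean_control - mean_therapy) / pooled_sd
    # Approximate correction factor for small samples (assumed form)
    j = 1 - 3 / (4 * df - 1)
    return j * d

# Hypothetical trial: control mean 20 (SD 8, n = 30), therapy mean 15 (SD 8, n = 30)
example_g = hedges_g(20, 8, 30, 15, 8, 30)
```

With these made-up numbers the uncorrected difference is 0.625, and the correction shrinks it slightly, to roughly 0.62.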

In order to calculate effect sizes we used all measures examining depressive symptoms (such as the Beck Depression Inventory/BDI (Beck, Ward, Mendelson, Mock, & Erbaugh, 1961); the BDI-II (Beck, Steer, & Brown, 1996); or the Hamilton Rating Scale for Depression/HAMD-17 (Hamilton, 1960)). If more than one depression measure was used in a study, the effect sizes for these measures were pooled within the study, before the effect sizes were pooled across studies, so that each comparison had only one effect size.

To calculate pooled mean effect sizes, we used the computer program Comprehensive Meta-Analysis (version 3.3070; CMA). Because we expected considerable heterogeneity among the studies, we employed a random-effects pooling model in all analyses.
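As an illustration of random-effects pooling, the sketch below uses the DerSimonian-Laird method-of-moments estimator of the between-study variance. That choice is an assumption made for the example; the text does not state which estimator the software applied. The function name `pool_random_effects` and the input numbers are hypothetical.

```python
import math

def pool_random_effects(effects, variances):
    """Pool effect sizes under a random-effects model (DerSimonian-Laird).

    Returns the pooled effect, its standard error, the between-study
    variance tau^2, and Cochran's Q.
    """
    k = len(effects)
    w = [1 / v for v in variances]                 # fixed-effect weights
    sum_w = sum(w)
    fixed = sum(wi * y for wi, y in zip(w, effects)) / sum_w
    q = sum(wi * (y - fixed) ** 2 for wi, y in zip(w, effects))
    c = sum_w - sum(wi ** 2 for wi in w) / sum_w
    # Method-of-moments estimate, truncated at zero
    tau2 = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0
    w_re = [1 / (v + tau2) for v in variances]     # random-effects weights
    pooled = sum(wi * y for wi, y in zip(w_re, effects)) / sum(w_re)
    se = math.sqrt(1 / sum(w_re))
    return pooled, se, tau2, q

# Three hypothetical comparisons with equal sampling variance
pooled, se, tau2, q = pool_random_effects([0.2, 0.5, 0.8], [0.04, 0.04, 0.04])
ci_95 = (pooled - 1.96 * se, pooled + 1.96 * se)
```

Because the between-study variance is added to every study's sampling variance, the random-effects weights are more equal than fixed-effect weights and the confidence interval is wider when the studies disagree.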

Numbers-needed-to-be-treated (NNT) were calculated using the formulae provided by Furukawa (1999), in which the control group’s event rate was set at a conservative 19% (based on the pooled response rate of 50% reduction of symptoms across trials in psychotherapy for depression) (Cuijpers, Turner, Koole, van Dijke, & Smit, 2014). As a test of homogeneity of effect sizes, we calculated the I2-statistic, which is an indicator of heterogeneity in percentages. A value of 0% indicates no observed heterogeneity, and larger values indicate increasing heterogeneity, with 25% as low, 50% as moderate, and 75% as high heterogeneity (Higgins, Thompson, Deeks, & Altman, 2003). We calculated 95% confidence intervals around I2 (Ioannidis, Patsopoulos, & Evangelou, 2007), using the non-central chi-squared-based approach within the heterogi module for Stata (Orsini, Bottai, Higgins, & Buchan, 2006). In addition, we calculated the prediction interval, which indicates the range in which the true effect size of 95% of all populations will fall.
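The conversion from an effect size to an NNT, and the I2 statistic, can be sketched as follows. The NNT formula is one common statement of the Furukawa conversion, NNT = 1 / (Φ(g + Φ⁻¹(CER)) − CER), with the control event rate fixed at the 19% mentioned above. The function names are hypothetical, and I2 is computed here from Cochran's Q, which is an assumption about the form used.

```python
from statistics import NormalDist

def nnt_from_g(g, control_event_rate=0.19):
    """Convert a positive standardized mean difference into an NNT."""
    std_normal = NormalDist()
    # Expected event rate in the treated group implied by g
    treated_event_rate = std_normal.cdf(
        g + std_normal.inv_cdf(control_event_rate))
    return 1 / (treated_event_rate - control_event_rate)

def i_squared(q, k):
    """Heterogeneity as a percentage, from Cochran's Q over k comparisons."""
    df = k - 1
    if q <= 0:
        return 0.0
    return max(0.0, (q - df) / q) * 100

nnt_overall = nnt_from_g(0.72)    # overall pooled effect reported in the Results
nnt_low_risk = nnt_from_g(0.48)   # pooled effect for comparisons with low risk of bias
```

Under these assumptions the two calls give values close to the NNTs of 4.04 and 6.44 reported in the Results for g = 0.72 and g = 0.48.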

We tested for publication bias by inspecting the funnel plot on primary outcome measures and by Duval and Tweedie’s trim and fill procedure (Duval & Tweedie, 2000) as implemented in CMA. This procedure yields an estimate of the effect size after the publication bias has been taken into account, through imputing negative studies that should have been available according to the asymmetry of the funnel plot, but were not found in the systematic searches. We also conducted Egger’s test of the intercept to quantify the bias captured by the funnel plot and to test whether it was significant.
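Egger's test can be illustrated with a small regression. In the usual formulation, each standardized effect (effect size divided by its standard error) is regressed on precision (one divided by the standard error), and an intercept that differs from zero signals funnel plot asymmetry. The sketch below is a plain ordinary least squares version with a hypothetical function name. It returns the intercept and its standard error but leaves out the p-value, which would need a t distribution.

```python
import math

def egger_intercept(effects, std_errors):
    """Egger's regression: standardized effect on precision (needs >= 3 studies)."""
    y = [g / se for g, se in zip(effects, std_errors)]   # standardized effects
    x = [1 / se for se in std_errors]                    # precision
    n = len(y)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y)) / sxx
    intercept = mean_y - slope * mean_x
    # Residual variance and standard error of the intercept
    resid_var = sum((yi - intercept - slope * xi) ** 2
                    for xi, yi in zip(x, y)) / (n - 2)
    se_intercept = math.sqrt(resid_var * (1 / n + mean_x ** 2 / sxx))
    return intercept, se_intercept, slope

# Hypothetical data in which smaller studies (larger SE) show larger effects
ses = [0.1, 0.2, 0.3, 0.4]
effects = [0.3 + 2 * se for se in ses]
intercept, se_intercept, slope = egger_intercept(effects, ses)
```

In this constructed example the effects grow by exactly 2 units per unit of standard error, so the intercept recovers that value and the slope recovers the underlying effect of 0.3.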

In order to examine whether the effects of the different types of therapy differed significantly from each other, we conducted a meta-regression analysis with the effect size as the dependent variable. As predictors we entered the categories of therapies, as well as three variables that have been consistently found to be significant predictors of the effects of therapies: type of control condition (Mohr et al., 2014), risk of bias (Cuijpers, van Straten, Bohlmeijer, Hollon, & Andersson, 2010), and whether or not the study was conducted in a Western country (Cuijpers et al., 2018). No other characteristic of psychotherapies or studies has been found to consistently predict outcome (Cuijpers, 2017; Cuijpers, van Straten, Andersson, et al., 2008; Cuijpers, van Straten, et al., 2009).

Of the 385 psychotherapy conditions, 167 (43%) used an individual format, 121 (31%) a group format, 77 (20%) guided self-help, 11 (3%) telephone, and 9 (2%) used a mixed treatment format. The number of sessions ranged from 1 to 60, with 113 conditions (29%) having 6 or fewer sessions, 202 (52%) with 7 to 12, 60 (16%) with 13 to 24, and 3 (1%) with more than 24 sessions (in 3 comparisons the number of sessions was not reported).

In 204 of the 385 comparisons between a treatment and a control condition, CBT was used as the intervention (53%), 21 used BAT (5%), 19 used third wave therapies (5%), 30 used PST (8%), 27 used IPT (7%), 12 used psychodynamic therapy (3%), 19 used non-directive supportive therapy (5%), and the remaining 53 used another type of treatment (14%).

In 127 studies (41%), participants were (partly) recruited through the community, 77 studies (25%) recruited only from clinical samples, and 105 (34%) used other recruitment strategies (such as screening general medical patients or pregnant women). In 161 studies (52%) participants had to meet criteria for a diagnosed mood disorder, while in 148 studies (48%) they had to score above a cut-off on a self-rating depression scale. Of the 309 studies, 174 (56%) were aimed at adults in general, 47 (15%) were aimed at older adults, 21 (7%) at student populations, 42 (14%) at women with postpartum depression, 64 (21%) at patients with comorbid general medical disorders and 37 (12%) were aimed at other specific target groups.

Of the 309 studies, 262 were conducted in a Western country (85%). A total of 27 studies (9%) were conducted before 1991, 105 (34%) between 1991 and 2010, and 177 (57%) after 2010. For the control group, 146 studies used care-as-usual (47%), nine studies used placebo (3%), 111 studies used a waiting list (36%), and the remaining 42 used another control group (14%).

The effect size was calculated based on the means, standard deviations and N in 345 comparisons (90%); in 28 comparisons (7%) it was based on dichotomous outcomes, and in 12 comparisons (3%) other statistics (e.g., t or p-value) were used to calculate the effect size.

The risk of bias in most studies was considerable. A total of 169 of the 309 studies reported an adequate sequence generation (55%). 144 studies reported allocation to conditions by an independent (third) party (47%). 83 studies reported using blinded outcome assessors (27%), and 202 used only self-report outcomes (65%). In 184 studies intent-to-treat analyses were conducted (60%). Only 91 studies (29%) met all quality criteria (using a self-report measure was rated as positive for blinding of outcome assessors). 136 studies (44%) met two or three of the criteria and the 82 remaining studies met no or only one criterion (27%).

The overall pooled effect size for all psychotherapies was g = 0.72 (95% CI: 0.67 ∼ 0.78), which corresponds with an NNT of 4.04. Heterogeneity was high (I2 = 81; 95% CI: 79 ∼ 81), and the prediction interval ranged from −0.17 to 1.61 ().

There were several studies in which more than one psychotherapy was compared with the same control group. These effect sizes are not independent of each other and may affect the pooled effect sizes and heterogeneity. We examined this by conducting two analyses in which only one effect size per study was included, one with only the highest effect size of the study and one with only the lowest effect size of the study. As can be seen in , the overall pooled effect size and the level of heterogeneity were not affected considerably. We also conducted an analysis in which outliers, specifically studies with very large effect sizes (g > 2), were excluded. This resulted in a somewhat smaller effect size (g = 0.61), but heterogeneity remained moderate to high (I2 = 65).

Duval and Tweedie’s trim and fill procedure pointed to a considerable risk of publication bias in all main analyses. After imputation of missing studies, the effect size dropped below g = 0.50 in most analyses ().

We first examined the effects of the generic types of psychotherapy, based on our previous definitions of psychotherapy (Cuijpers, van Straten, Warmerdam, et al., 2008). These definitions are given in . The effect sizes for each of these therapies ranged from g = 0.39 for psychodynamic therapy to g = 1.05 for BAT. Heterogeneity was high in all therapies, except in counseling where heterogeneity was moderate (I2 ranged from 45% for counseling to 87% for PST and IPT). The NNTs ranged from 3 to 8. Because the effects of the therapies may differ across type of control group, we have also given the effect sizes for each type of therapy for the three types of control group (waiting list, care-as-usual, and the other control groups) in .

Indications for publication bias were found for CBT, BAT, PST, psychodynamic therapy, and counseling. The adjusted effect sizes were considerably smaller than the unadjusted effect sizes for all these therapies. No indications for publication bias were found for third wave therapies and IPT.

We examined whether the generic types of therapy differed from each other in a multivariate meta-regression analysis (), in which we adjusted for the three variables that have been consistently found to be associated with the effect size (type of control condition, risk of bias, and whether or not the study was conducted in a Western country). As can be seen in , a test for the whole set of generic therapies was not significant (p = .73), although BAT was found to be more effective than CBT (the reference).

The effects of the fifteen more specific types of psychotherapy are given in Tables II and III. The effect sizes ranged from g = 0.38 for the “Coping with Depression” course to g = 1.10 for Life review therapy. All effect sizes were significantly different from zero. The NNTs ranged from 3 to 8. Heterogeneity was high (I2 > 75%) for contextual behavioral activation, extended and brief PST, full and brief IPT, and life review. Heterogeneity was low to moderate (I2 < 50%) for acceptance and commitment therapy, pleasant events behavioral activation, guided self-help with Burns’ book, the “Coping with Depression” course, and non-directive counseling.

Indications for publication bias were found for the majority of therapies, except for acceptance and commitment therapy, pleasant events behavioral activation, guided self-help with Burns’ book, and full and brief IPT. The effect size was reduced by more than 50% after adjustment for publication bias in contextual BAT, extended and brief PST, and psychodynamic therapy.

In the meta-regression analyses (), we found that overall the specific types of therapies did not differ significantly from each other (p = 0.16). However, when the “Coping with Depression” course was compared to the reference group (Beck’s CBT), it was significantly less effective (p < 0.001).

Because we found a highly significant association between the effect size and risk of bias, we conducted sensitivity analyses with the set of studies with low risk of bias. In 103 comparisons (27%) risk of bias was low. The pooled effect size of these comparisons was g = 0.48, which corresponds with an NNT of 6.44. Heterogeneity was high (I2 = 69) and the prediction interval ranged from −0.05 to 1.01.

For the generic therapies, non-directive counseling and IPT had fewer than 5 studies and were not included in these analyses. The effect sizes from the other generic therapies ranged from g = 0.27 (PST) to 0.78 (BAT). Heterogeneity was still considerable for all therapies, except for third wave therapies. There were indications of publication bias in all generic types of therapy. In a multivariate meta-regression analysis in which we adjusted for control group, and whether or not the study was conducted in a Western country, we found no indication that one type of generic therapy was more or less effective than other therapies.

Only four specific therapies had five or more studies with low risk of bias (). Effect sizes ranged from g = 0.34 for the “Coping with Depression” course, to g = 0.87 for Beck’s CBT. Heterogeneity was considerable in all four therapies. When the effect sizes were adjusted for publication bias (using Duval and Tweedie’s trim and fill procedure ()), only two therapies remained effective: the “Coping with Depression” course and self-examination therapy.

We examined the effects of 7 generic types of psychotherapy for adult depression, as well as those of 15 more specific types of psychotherapy. We found that all therapies, generic and specific, had significant, moderate to large effects on adult depression. Psychodynamic therapy, the “Coping with Depression” course and a specific version of PST (self-examination therapy) had effect sizes smaller than g = 0.5. All other therapies had effect sizes ranging from g = 0.57 for full IPT to g = 1.07 for extended PST. This is good news in the sense that all therapies had significant effects on depression. However, these findings were much less positive after taking heterogeneity, publication bias and risk of bias into account.

One problem for many of the examined types of psychotherapy is that the level of heterogeneity was high, and many of the prediction intervals were broad and included zero. This means that it is difficult to predict the effect size of the next study that is done with such a therapy, and that study may just as well find negative effects. The effect sizes differ so much within one type of therapy that the true effect size cannot be reliably predicted.

Publication bias is another problem for this body of research. Although we examined this with indirect evidence (the asymmetry of the funnel plot), we found strong indications that the effects of several of the therapies are overestimated because negative studies are not published. It may be argued that this evidence is indirect and that there may be other causes for these findings. However, our results are in line with other research in which we found that NIH-funded trials on psychotherapy for depression were indeed often not published, and that this affected the overall effects found for psychotherapy to a degree comparable to what we found here (Driessen, Hollon, Bockting, Cuijpers, & Turner, 2015). After adjustment for publication bias, four specific types were no longer significantly different from zero (contextual behavioral activation therapy, extended and brief PST, and psychodynamic therapy).

Risk of bias is another important problem in research on psychotherapies for depression. In 70% of the trials (92/309) there was at least some risk of bias, and the studies with low risk of bias clearly indicated smaller effect sizes than the ones that had (at least some) risk of bias. Only four of the 15 specific types of therapy had 5 or more trials with low risk of bias, and the effects found in these studies were more modest than what was found for all studies (including the ones with risk of bias). When the studies with low risk of bias were adjusted for publication bias, only two types of therapy remained significant (the “Coping with Depression” course, and self-examination therapy).

These findings suggest that the effects of the therapies are considerably overestimated when all studies are taken together and when heterogeneity, publication bias, and risk of bias are not taken into account. In an earlier study, we already found that the effects of CBT for depression and anxiety disorders have been overestimated considerably because of publication bias and risk of bias (Cuijpers, Cristea, Karyotaki, Reijnders, & Huibers, 2016). Now we have the same problem for other, more specific types of psychotherapy for depression.

The present study adds considerably to the existing evidence for a number of specific types of therapies for depression. For example, previous systematic reviews were not able to document the efficacy of some specific types of psychotherapy for depression (e.g., ACT), or the efficacy had, to the best of our knowledge, not yet been explored in recent meta-analyses (e.g., life review therapy). For several of the examined therapies, such as mindfulness-based CBT, acceptance and commitment therapy, and life review therapy, it was not yet clear how their effects relate to those of other types of psychotherapy for depression. Our findings indicate that these types of therapies may not only be effective in the treatment of depression, but may also result in effect sizes comparable to those of more established treatments such as CBT.

We did not find any significant difference between the effect sizes of the generic therapies. This suggests that these therapies have comparable effects, although it should be noted that trials in which therapies are directly compared with each other give much stronger evidence on whether therapies are indeed equally effective (Barth et al., 2013; Cuijpers, van Straten, Andersson, et al., 2008). We also found no significant differences between the specific therapies, although there were some indications that the “Coping with Depression” course was less effective than other therapies. This may be a true differential effect (the “Coping with Depression” course differs from other therapies in that it is a psychoeducational treatment; Lewinsohn, Antonuccio, Steinmetz, & Teri, 1984), but it is also possible that the studies in which this course is used are conducted with more complicated target groups. The course is often used with difficult populations, such as alcoholics or juvenile delinquents with depression, because it can be easily adapted to different populations. In a separate meta-analysis of the “Coping with Depression” course that we conducted some time ago, we found that trials in which the course was directly compared with other interventions did not indicate significant differences from other therapies (Cuijpers, Munoz, Clarke, & Lewinsohn, 2009). This suggests that the course may not be less effective than other therapies. For now, however, we must assume that it does have smaller effects, based on the results of the current study. It was also, however, one of the two therapies that remained significant after excluding studies with risk of bias and after adjustment for publication bias.

That most therapies seem to have comparable effects may be interpreted as indirect evidence that the therapies work through common, non-specific or universal mechanisms (Wampold & Imel, 2015). That is certainly a good possibility. However, it must be noted that there are other reasons that may explain these equal, comparable effects (Cuijpers, Reijnders, & Huibers, 2018). Comparable effects cannot be seen as evidence that the effects are realized by the same mechanisms. Depression is a complex disorder, with many different dimensions and characteristics. It is very well possible that a therapy changes one specific dimension or characteristic, which in turn changes and improves the whole system of depression-related characteristics of the patient. Furthermore, all characteristics of the patient, the therapist and the interaction between patient and therapist may be so complicated that therapies work through numerous different pathways, that all lead to improvement, but that may not be detected in meta-analyses because only averages across all studies and patients are examined.

One important issue is that we measured the effects of psychotherapies compared to control conditions, including waiting list, care-as-usual and other control conditions. These control conditions, however, differ considerably from each other (Gold et al., 2017; Mohr, Spring, Freedland, Beckner, Arean, Hollon et al., 2009). This introduces heterogeneity in our analyses, although after adjustment for control condition in the multivariate metaregression analyses, the results did not change. It should also be mentioned that comparing psychotherapies with control conditions can result in estimates of the effects of these therapies, but comparisons between therapies result in much stronger evidence for potential differences.

The results of this meta-analysis should be considered in the light of some important limitations. One important limitation is that the categories of the therapies we examined in this meta-analysis are not always straightforward. Therapies may be based on specific manuals or methods, but are typically still adapted to some extent to the population or setting where they are used. “Pure” manuals are only seldom used. Categorizing therapies therefore always involves some inherent uncertainty. This is made worse by the fact that authors often describe their treatments very briefly, constrained by the word limits of the journals where the studies are published. Other limitations of this meta-analysis include the considerable risk of bias in the majority of studies, which we already discussed, and the problem of publication bias. We also only looked at short-term outcomes and not at longer-term outcomes, because the latter are usually not reported, or are reported over widely differing follow-up periods, and because they almost always come from naturalistic follow-up studies. Furthermore, the generalizability of our findings is limited to patients treated in an outpatient setting, because, due to considerable differences, we excluded studies focusing on inpatients. In our analyses we did not examine all potential moderators of outcome, although previous meta-analytic research has not indicated that, for example, treatment format (Cuijpers et al., 2019), number of sessions (Cuijpers, Huibers, Ebert, Koole, & Andersson, 2013), or characteristics of participants in psychotherapies (Cuijpers, Karyotaki, Reijnders, et al., 2018) are associated with outcome. Another limitation is that the number of studies for some categories was very small, and power may have been too low to find significant effects.
We also did not record the agreements and disagreements between raters of the study characteristics, because the ratings were made over a period of more than 10 years with every update of our dataset. Furthermore, we focused only on depressive symptoms as outcomes and did not focus on other outcomes, such as quality of life or functional limitations. We did not examine therapist effects either, which may have influenced the outcomes.

Despite these limitations, we can conclude that the 15 types of therapy that were examined in this meta-analysis may be effective in the treatment of depression, but the evidence is not conclusive because of high levels of heterogeneity, publication bias, and the risk of bias in the majority of studies.

Table I. Definitions of psychological treatments of depression.

Generic types of therapy / Specific types of therapy
Cognitive Behavior Therapy (CBT)
In CBT the therapists focus on the impact a patient’s present dysfunctional thoughts have on current behavior and future functioning. CBT is aimed at evaluating, challenging and modifying a patient’s dysfunctional beliefs (cognitive restructuring). In this form of treatment the therapist mostly emphasizes homework assignments and outside-of-session activities. Therapists exert an active influence over therapeutic interactions and topics of discussion, use a psychoeducational approach, and teach patients new ways of coping with stressful situations.
CBT according to Beck’s manual
The manual developed by Beck and colleagues (1979) is the most widely used manual for CBT (which includes a module on behavioral activation, see below). Interventions were coded for this treatment when they explicitly referred to the manual and indicated that this manual was used in the intervention.
“Coping with Depression” Course
The “Coping with Depression” course is a psychoeducational intervention for depression that has been examined in many studies. The manual was developed in the late 1970s (Lewinsohn et al., 1984) and has since then been adapted for several specific target populations. A meta-analysis of these studies was published earlier (Cuijpers, Munoz, et al., 2009). An intervention was coded for this treatment when it explicitly referred to this manual (although it could be adapted for the target population).
Guided self-help with the book “Feeling good” from David Burns
The bestseller “Feeling good” by David Burns (1980), based on Beck’s CBT, was used in a considerable number of studies as bibliotherapy in which the patient read the book and received weekly feedback and support from a coach.
Behavioral activation therapy (BA)
We considered an intervention to be behavioral activation when the registration of pleasant activities and the increase of positive interactions between a person and his or her environment were the core elements of the treatment. Social skills training could be a part of the intervention. We defined subtypes of behavioral activation according to Mazzucchelli, Kane, and Rees (2009). They describe four subtypes of behavioral activation, but only two of these subtypes have sufficient trials to be reported here.
Pleasant activity scheduling
The first type of behavioral activation was developed in the 1960s by Lewinsohn and colleagues (Dimidjian et al., 2011; Lewinsohn, 1974). This type of behavioral activation mostly consisted of monitoring and pleasant activity scheduling. We categorized an intervention as pleasant activity scheduling when it referred to the work of Lewinsohn.
Contextual behavioral activation
This version builds on the behavioral activation component of cognitive behavior therapy. It includes activity scheduling, self-monitoring, graded task assignment, role-playing, functional analysis, mental rehearsal, and, in newer versions, mindfulness. We categorized an intervention as contextual behavioral activation when it referred to the work of Jacobson or Martell and colleagues (Jacobson, Martell, & Dimidjian, 2001; Martell, Addis, & Jacobson, 2001).
Problem-solving therapy (PST)
We defined PST as a psychological intervention in which the following elements had to be included: definition of personal problems, generation of multiple solutions to each problem, selection of the best solution, the working out of a systematic plan for this solution, and evaluation as to whether the solution has resolved the problem. We have examined PST in a recent meta-analysis (Cuijpers et al., 2017) and also worked out the subtypes of PST that are used in this study.
Extended problem-solving therapy
Extended PST focuses not only on the problem-solving skills themselves, but also on changing those attitudes or beliefs that may inhibit or interfere with attempts to engage in the remaining problem-solving tasks. “Social problem-solving”, developed by Nezu in the 1970s (Nezu, 1986; Nezu & D’Zurilla, 1979), is the most important type of extended problem-solving. It is typically conducted in a group format of 10 or more sessions. We (arguably) considered an intervention as extended PST when it had 10 sessions or more.
Brief problem-solving therapy
Brief PST, which was originally developed for primary care in the 1990s (PST-PC; Mynors-Wallis, Gath, Day, & Baker, 2000; Mynors-Wallis, Gath, Lloyd-Thomas, & Tomlinson, 1995), focuses on the core elements of problem-solving and can be used by trained nurses. We considered an intervention as brief PST when it had 9 sessions or less.
Self-examination therapy (SET)
SET is aimed at helping patients determine the major goals in their lives, invest energy only in those problems that are related to what matters, and learn to accept those situations that cannot be changed. Problem-solving skills are the core element of this approach. SET is typically used in a guided self-help format. We considered an intervention as SET when it was based on self-examination therapy (Bowman, Ward, Bowman, & Scogin, 1996) and was conducted in a guided self-help format.
Third wave cognitive behavioral therapies
Third wave therapies are a heterogeneous group of therapies that introduce several new techniques to cognitive behavior therapies. They have in common that they abandon or only cautiously use content-oriented cognitive interventions and the use of skills deficit models to delineate the core maintaining mechanisms of the addressed disorders (Kahl, Winter, & Schweiger, 2012). We found sufficient randomized trials for two types of 3rd wave therapies: Acceptance and Commitment Therapy and Mindfulness-based CBT.
Acceptance and Commitment Therapy (ACT)
In brief, ACT is a form of behavioral therapy that focuses on decreasing experiential avoidance and increasing value-based behavior (Hayes, Strosahl, & Wilson, 1999). Acceptance and mindfulness strategies are used in different ways to increase psychological flexibility.
Mindfulness-based CBT (MBCT)
MBCT is a psychological treatment that combines meditation or mindfulness exercises with cognitive restructuring techniques. It was developed for the prevention of relapse in people with recurrent depression (Segal, Williams, & Teasdale, 2002), but a considerable number of studies have now examined the effects in acute depression.
Interpersonal psychotherapy (IPT)
IPT is a brief and highly structured manual-based psychotherapy that addresses interpersonal issues in depression, to the exclusion of all other foci of clinical attention. IPT has no specific theoretical origin although its theoretical basis can be seen as coming from the work of Sullivan, Meyer, and Bowlby. The current form of the treatment was developed by the late Gerald Klerman and Myrna Weissman in the 1980s (Klerman et al., 1984).
Full IPT
We distinguished two types of IPT, one in which the full manual was used or referred to without attempt to make it more brief (full IPT), and the other in which a brief version of IPT was used (brief IPT). Full IPT may have been adapted to the population or setting where it was used (including the number of sessions unless there were 10 sessions or less), but there was no mention of substantial reduction of the number of sessions.
Brief IPT
IPT has also been used in settings and populations where full IPT is not feasible. In a considerable number of studies, brief versions of IPT have been developed in which the core of IPT was retained, but the number of sessions was considerably reduced. Interpersonal counseling (Weissman et al., 2014) is one of the brief versions of IPT. We (arguably) considered an intervention as brief IPT when it had 10 sessions or less.
Psychodynamic Therapy
The primary objective in (short-term) psychodynamic therapy is to enhance the patient’s understanding, awareness and insight about repetitive conflicts (intrapsychic and intrapersonal). An assumption in psychodynamic therapy is that a patient’s childhood experiences, past unresolved conflicts, and historical relationships significantly affect the patient’s present life situation. In this form of treatment the therapist concentrates on the patient’s past, unresolved conflicts, historical relationships, and the impact these have on the patient’s present functioning. Furthermore, in psychodynamic therapy, the therapists explore a patient’s wishes, dreams, and fantasies. The time limitations and the focal explorations of the patient’s life and emotions distinguish psychodynamic therapy from psychoanalytic psychotherapy. We did not identify subtypes of psychodynamic therapy.
Non-directive supportive therapy
We defined non-directive therapy as any unstructured therapy without specific psychological techniques other than those common to all approaches such as helping people to ventilate their experiences and emotions and offering empathy. It is not aimed at solutions, or acquiring new skills. It is based on the assumption that relief from personal problems may be achieved through discussion with others. These non-directive therapies are commonly described in the literature as either counseling or supportive therapy. We did not identify subtypes of non-directive supportive therapy.
Life review therapy
Reminiscence is a naturally occurring process of recalling the past, that is hypothesized to resolve conflicts from the past and make up the balance of one’s life (Bohlmeijer, Smit, & Cuijpers, 2003; Butler, 1963). Since the beginning of the 1970s, reminiscence has been used by therapists as a specific treatment of depression in older adults. In these “life review” therapies the patients work through the memories of all phases in their life with the aim of re-evaluation of their life, resolving conflicts or assessing adaptive coping-responses. We defined life review therapies as all therapies that are aimed at the systematic evaluation of the lives of participants.

Table II. Effects of 15 therapies compared with control groups: Hedges’ gᵃ.

 | Ncomp | g | 95% CIᵇ | I² | 95% CIᶜ | Prediction interval | NNT | gadj | 95% CIᵈ | Nimp
All comparisons | 385 | 0.72 | 0.67∼0.78 | 81 | 79∼82 | −0.17∼1.61 | 4.04 | 0.46 | 0.40∼0.52 | 103
One ES/study (highest) | 309 | 0.73 | 0.67∼0.79 | 83 | 81∼84 | −0.19∼1.65 | 3.98 | 0.45 | 0.38∼0.52 | 86
One ES/study (lowest) | 309 | 0.67 | 0.61∼0.73 | 81 | 79∼83 | −0.20∼1.54 | 4.39 | 0.43 | 0.36∼0.50 | 77
Outliers excluded (g > 2) | 361 | 0.61 | 0.56∼0.65 | 68 | 64∼71 | −0.01∼1.23 | 4.89 | 0.44 | 0.40∼0.49 | 89
Generic types
 • CBT | 205 | 0.73 | 0.65∼0.80 | 80 | 77∼82 | −0.16∼1.62 | 3.98 | 0.47 | 0.39∼0.56 | 52
 • BAT | 21 | 1.05 | 0.80∼1.30 | 77 | 65∼84 | 0.02∼2.08 | 2.64 | 0.65 | 0.38∼0.92 | 9
 • 3rd wave therapies | 19 | 0.85 | 0.63∼1.07 | 75 | 59∼83 | −0.02∼1.72 | 3.35 | e | | 0
 • PST | 30 | 0.75 | 0.53∼0.97 | 87 | 82∼90 | −0.37∼1.87 | 3.86 | 0.31 | 0.06∼0.56 | 11
 • IPT | 27 | 0.60 | 0.34∼0.86 | 87 | 82∼90 | −0.71∼1.91 | 4.99 | e | | 0
 • Psychodynamic | 12 | 0.39 | 0.16∼0.62 | 70 | 37∼82 | −0.35∼ | | | ∼0.54 | 2
 • Non-directive supportive | 19 | 0.58 | 0.42∼0.75 | 45 | 0∼67 | 0.07∼ | | | ∼0.61 | 6
 • Life review | 14 | 1.10 | 0.68∼1.51 | 89 | 83∼92 | −0.52∼2.72 | 2.51 | 0.61 | 0.13∼1.08 | 5
 • Other | 52 | 0.70 | 0.56∼0.84 | 78 | 71∼82 | −0.15∼1.54 | 4.18 | 0.60 | 0.44∼0.76 | 4
Specific types
 • 3rd wave therapies − ACT | 8 | 0.74 | 0.61∼0.87 | 0 | 0∼56 | 0.58∼0.90 | 3.92 | e | | 0
   − MBCT | 7 | 0.71 | 0.41∼1.01 | 58 | 0∼80 | −0.15∼1.57 | 4.11 | 0.49 | 0.16∼0.81 | 3
 • CBT − GSH: Burns | 7 | 0.97 | 0.62∼1.32 | 39 | 0∼73 | 0.09∼1.85 | 2.88 | e | | 0
   − CBT: Beck | 37 | 0.95 | 0.77∼1.14 | 68 | 53∼76 | 0.01∼1.89 | 2.95 | 0.61 | 0.40∼0.82 | 14
   − CBT: CWD | 26 | 0.38 | 0.27∼0.49 | 38 | 0∼61 | 0.02∼0.74 | 8.38 | 0.28 | 0.16∼0.40 | 7
 • Behavioral activation − BAT: Pleasant events | 7 | 1.04 | 0.77∼1.30 | 0 | 0∼58 | 0.70∼1.38 | 2.67 | e | | 0
   − BAT: Contextual | 7 | 1.06 | 0.46∼1.65 | 87 | 74∼92 | −0.94∼3.06 | 2.62 | 0.49 | −0.14∼1.26 | 3
 • Problem-solving therapy − PST: Extended | 8 | 1.07 | 0.50∼1.63 | 88 | 78∼92 | −0.81∼2.95 | 2.59 | 0.25 | −0.34∼0.85 | 4
   − PST: Brief | 14 | 0.81 | 0.42∼1.19 | 90 | 86∼93 | −0.74∼2.36 | 3.53 | 0.26 | −0.18∼0.69 | 6
   − PST: SET | 8 | 0.42 | 0.21∼0.64 | 63 | 0∼81 | −0.21∼1.05 | 7.49 | 0.28 | 0.05∼0.50 | 3
 • IPT − IPT: Full | 14 | 0.57 | 0.31∼0.83 | 77 | 58∼85 | −0.37∼1.51 | 5.29 | e | | 0
   − IPT: Brief | 13 | 0.64 | 0.13∼1.15 | 91 | 87∼94 | −1.41∼2.69 | 4.63 | e | | 0
 • Psychodynamic | 12 | 0.39 | 0.16∼0.62 | 70 | 37∼82 | −0.35∼ | | | ∼0.54 | 2
 • Non-directive supportive | 19 | 0.58 | 0.42∼0.75 | 45 | 0∼67 | 0.07∼ | | | ∼0.61 | 6
 • Life review | 14 | 1.10 | 0.68∼1.51 | 89 | 83∼92 | −0.52∼2.72 | 2.51 | 0.61 | 0.13∼1.08 | 5
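
The relation between the g and NNT columns of Table II can be illustrated with a short calculation. The sketch below is an assumption, not the authors' documented procedure: this section does not state how the NNT values were derived. It uses Furukawa's conversion from a standardized mean difference to an NNT, with an assumed control-group event rate of 19% (the hypothetical `cer` default), which appears to reproduce several tabulated pairs, such as g = 0.72 with NNT = 4.04 and g = 0.38 with NNT = 8.38.

```python
from statistics import NormalDist


def nnt_from_g(g: float, cer: float = 0.19) -> float:
    """Convert a standardized mean difference into a number needed to treat.

    Assumes normally distributed outcomes and a control event rate `cer`.
    The default of 0.19 is an assumed value, chosen only because it matches
    the tabulated NNTs; it is not stated in this section.
    """
    std_normal = NormalDist()
    # Event rate in the treated group implied by shifting the control
    # distribution by g standard deviations.
    eer = std_normal.cdf(g + std_normal.inv_cdf(cer))
    if eer == cer:
        raise ValueError("g = 0 implies no benefit, so the NNT is undefined")
    return 1.0 / (eer - cer)


if __name__ == "__main__":
    rows = [("All comparisons", 0.72), ("CBT: CWD", 0.38), ("Life review", 1.10)]
    for label, g in rows:
        print(f"{label}: g = {g:.2f}, NNT = {nnt_from_g(g):.2f}")
```

Because the conversion depends on the assumed event rate, the same g yields a different NNT under a different `cer`, so the NNT column is best read as conditional on that choice.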

Table III. Effects of 15 therapies stratified across different types of control groups: Hedges’ gᵃ.

 | Waiting list | | | | Care-as-usual | | | | Other control group | | |
 | Ncomp | g | 95% CI | I² | Ncomp | g | 95% CI | I² | Ncomp | g | 95% CI | I²
All comparisons | 157 | 0.73 | 0.69∼0.77 | 75 *** | 165 | 0.63 | 0.55∼0.70 | 80 *** | 63 | 0.57 | 0.43∼0.70 | 85 ***
One ES/study (highest) | 111 | 0.96 | 0.85∼1.07 | 80 *** | 147 | 0.65 | 0.57∼0.73 | 81 *** | 51 | 0.56 | 0.41∼0.71 | 86 ***
One ES/study (lowest) | 111 | 0.84 | 0.73∼0.94 | 77 *** | 147 | 0.63 | 0.55∼0.71 | 82 *** | 51 | 0.49 | 0.35∼0.63 | 84 ***
Outliers excluded (g ≥ 2) | 140 | 0.73 | 0.66∼0.79 | 46 *** | 160 | 0.57 | 0.50∼0.63 | 73 *** | 61 | 0.47 | 0.37∼0.57 | 70 ***
Generic types
 • CBT | 90 | 0.96 | 0.83∼1.08 | 76 *** | 86 | 0.59 | 0.50∼0.69 | 77 *** | 28 | 0.54 | 0.35∼0.73 | 86 ***
 • BAT | 10 | 1.42 | 0.91∼1.93 | 80 *** | 8 | 0.85 | 0.53∼1.18 | 76 *** | 3 | 0.67 | 0.34∼0.99 | 11
 • 3rd wave therapies | 13 | 0.87 | 0.59∼1.16 | 81 *** | 4 | 0.93 | 0.49∼1.38 | 53 | 2 | 0.71 | 0.25∼1.16 | 47
 • PST | 16 | 0.80 | 0.52∼1.08 | 80 *** | 8 | 0.99 | 0.43∼1.54 | 93 *** | 6 | 0.27 | −0.03∼0.57 | 72 ***
 • IPT | 2 | 0.88 | 0.24∼1.52 | 55 | 19 | 0.63 | 0.29∼0.97 | 90 *** | 6 | 0.35 | 0.09∼0.61 | 28
 • Psychodynamic | 1 | 1.09 | 0.19∼1.99 | 0 | 8 | 0.32 | 0.08∼0.55 | 66 ** | 3 | 0.38 | −0.34∼1.09 | 77 *
 • Non-directive supportive | 4 | 1.11 | 0.32∼1.91 | 65 * | 13 | 0.42 | 0.30∼0.54 | 0 | 2 | 0.97 | 0.52∼1.42 | 0
 • Life review | 4 | 0.61 | 0.29∼0.93 | 40 | 5 | 0.83 | 0.44∼1.22 | 46 | 5 | 1.74 | 0.62∼2.86 | 95 ***
 • Other | 21 | 0.61 | 0.49∼0.72 | 0 | 18 | 0.70 | 0.51∼0.88 | 70 *** | 13 | 0.75 | 0.32∼1.18 | 92 ***
Specific types
 • 3rd wave therapies − ACT | 6 | 0.73 | 0.59∼0.86 | 0 | 1 | 0.66 | −0.01∼1.33 | 0 | 1 | 0.92 | 0.49∼1.35 | 0
   − MBCT | 4 | 0.49 | 0.27∼0.71 | 0 | 3 | 1.02 | 0.45∼1.59 | 65 | 0 | | |
 • CBT − GSH: Burns | 7 | 0.97 | 0.62∼1.32 | 39 | 0 | | | | 0 | | |
   − CBT: Beck | 22 | 1.26 | 0.99∼1.54 | 62 *** | 11 | 0.62 | 0.39∼0.85 | 49 * | 4 | 0.54 | 0.28∼0.80 | 33
   − CBT: CWD | 10 | 0.48 | 0.35∼0.61 | 0 | 14 | 0.37 | 0.21∼0.52 | 39 | 2 | 0.13 | −0.24∼0.49 | 72
 • Behavioral activation − BAT: Pleasant events | 6 | 1.03 | 0.75∼1.30 | 0 | 0 | | | | 1 | 1.11 | 0.29∼1.93 | 0
   − BAT: Contextual | 2 | 3.14 | −1.86∼8.15 | 0 | 4 | 0.55 | 0.22∼0.87 | 34 | 1 | 0.85 | 0.18∼1.52 | 0
 • Problem-solving therapy − PST: Extended | 5 | 1.86 | 1.10∼2.62 | 65 * | 2 | 0.33 | 0.12∼0.55 | 0 | 1 | −0.19 | −0.54∼0.16 | 0
   − PST: Brief | 3 | 0.68 | −0.13∼1.49 | 78 * | 6 | 1.24 | 0.42∼2.05 | 94 *** | 5 | 0.37 | 0.06∼0.68 | 69 *
   − PST: SET | 8 | 0.42 | 0.21∼0.64 | 63 ** | 0 | | | | 0 | | |
 • IPT − IPT: Full | 2 | 0.88 | 0.24∼1.52 | 55 | 7 | 0.63 | 0.26∼1.00 | 80 *** | 5 | 0.31 | 0.04∼0.58 | 31
   − IPT: Brief | 0 | | | | 12 | 0.63 | 0.09∼1.17 | 92 *** | 1 | 0.75 | −0.07∼1.57 | 0
 • Psychodynamic | 1 | 1.09 | 0.19∼1.99 | 0 | 8 | 0.32 | 0.08∼0.55 | 66 ** | 3 | 0.38 | −0.34∼1.09 | 77 *
 • Non-directive supportive | 4 | 1.11 | 0.32∼1.91 | 65 * | 13 | 0.42 | 0.30∼0.54 | 0 | 2 | 0.97 | 0.52∼1.42 | 0
 • Life review | 4 | 0.61 | 0.29∼0.93 | 40 | 5 | 0.83 | 0.44∼1.22 | 46 | 5 | 1.74 | 0.62∼2.86 | 95 ***

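Table IV. Standardized regression coefficients of generic and specific treatments of depression: multivariate metaregression analyses.

 | | Coeff. | SE | p | p (set) | Coeff. | SE | p | p (set)
Generic therapies | CBT | Ref. | | | 0.47 | | | |
 | 3rd wave | 0.13 | 0.17 | 0.43 | | | | |
 | Behavioral activation | 0.33 | 0.17 | 0.05 | | | | |
 | Non-directive supportive | 0.01 | 0.17 | 0.97 | | | | |
Specific therapies | Beck’s CBT | | | | | Ref. | | | 0.16
 | ACT | | | | | − | | |
 | MBCT | | | | | − | | |
 | CWD | | | | | −0.46 | 0.16 | <0.01 |
 | Burns GSH | | | | | − | | |
 | BAT: pleasant events | | | | | − | | |
 | BAT: Contextual | | | | | | | |
 | PST extended | | | | | − | | |
 | PST brief | | | | | − | | |
 | PST SET | | | | | −0.43 | 0.23 | 0.07 |
 | Full IPT | | | | | −0.34 | 0.19 | 0.07 |
 | Brief IPT | | | | | − | | |
 | Psychodynamic | | | | | −0.37 | 0.20 | 0.07 |
 | Life review | | | | | | | |
 | Non-directive supportive | | | | | − | | |
Western vs non-Western | | 0.49 | 0.22 | <0.001 | | 0.23 | 0.16 | <0.14 |
Control group | Waiting list | Ref. | | | <0.01 | Ref. | | | 0.06
 | Care as usual | −0.30 | 0.09 | 0.001 | | − | | |
 | Placebo | −0.43 | 0.22 | 0.05 | | −0.50 | 0.20 | 0.01 |
 | Other | −0.32 | 0.13 | 0.01 | | − | | |
Risk of bias (continuous) | | −0.11 | 0.03 | <0.001 | | − | | |
Intercept | | 1.14 | 0.10 | <0.001 | | 1.31 | 0.13 | <0.001 |
R² analog | | 0.21 | | | | 0.21 | | |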

Table V. Effects of 15 therapies compared with control groups from randomized trials with low risk of bias: Hedges’ gᵃ.

 | Ncomp | g | 95% CIᵇ | I² | 95% CIᶜ | Prediction interval | NNT | gadj | 95% CIᵈ | Nimp
All studies (low risk of bias) | 103 | 0.48 | 0.42∼0.55 | 69 | 62∼74 | −0.05∼1.01 | 6.44 | 0.36 | 0.29∼0.44 | 21
Generic types
 • CBT | 55 | 0.50 | 0.40∼0.60 | 73 | 64∼79 | −0.12∼ | | | ∼0.46 | 12
 • 3rd wave therapies | 6 | 0.60 | 0.43∼0.76 | 0 | 0∼61 | 0.37∼0.83 | 4.99 | 0.54 | 0.40∼0.69 | 3
 • BAT | 8 | 0.78 | 0.52∼1.03 | 73 | 31∼85 | −0.01∼1.57 | 3.69 | 0.62 | 0.34∼0.91 | 2
 • PST | 10 | 0.27 | 0.14∼0.41 | 54 | 0∼76 | −0.14∼0.68 | 12.25 | 0.21 | 0.06∼0.36 | 2
 • Psychodynamic | 5 | 0.52 | 0.09∼0.95 | 79 | 31∼89 | −0.98∼2.02 | 5.88 | 0.23 | −0.24∼0.69 | 2
Specific types
 • CBT according to Beck | 5 | 0.87 | 0.23∼1.52 | 89 | 74∼93 | −1.56∼3.30 | 3.26 | 0.41 | −0.29∼1.11 | 2
 • CBT: CWD | 13 | 0.34 | 0.18∼0.49 | 56 | 0∼75 | −0.14∼0.82 | 9.49 | 0.23 | 0.08∼0.39 | 4
 • Psychodynamic | 5 | 0.52 | 0.09∼0.95 | 79 | 31∼89 | −0.98∼2.02 | 5.88 | 0.23 | −0.24∼0.69 | 2
 • PST SET | 6 | 0.35 | 0.14∼0.56 | 62 | 0∼82 | −0.28∼0.98 | 9.19 | 0.28 | 0.06∼0.50 | 1
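
To make the g, I², and prediction interval columns in the tables above more concrete, the following sketch shows how such quantities can be derived from a set of study-level effect sizes. It is illustrative only: the estimator (DerSimonian-Laird), the use of a normal quantile for the prediction interval (a t quantile with k − 2 degrees of freedom is the more usual choice, which can be passed through the hypothetical `crit` parameter), the function name, and the input data are all assumptions, since this section does not describe the software or estimator that was used.

```python
import math
from statistics import NormalDist


def random_effects_summary(effects, variances, crit=None):
    """Pool effect sizes with a DerSimonian-Laird random-effects model.

    `effects` are study-level Hedges' g values, `variances` their sampling
    variances. `crit` is the critical value for the intervals; it defaults
    to the normal 97.5% quantile (an approximation for the prediction
    interval, which usually uses a t quantile with k - 2 df).
    """
    k = len(effects)
    if k < 2 or k != len(variances):
        raise ValueError("need at least two studies with matching variances")
    if crit is None:
        crit = NormalDist().inv_cdf(0.975)

    # Fixed-effect weights and Cochran's Q.
    weights = [1.0 / v for v in variances]
    sum_w = sum(weights)
    fixed = sum(w * y for w, y in zip(weights, effects)) / sum_w
    q = sum(w * (y - fixed) ** 2 for w, y in zip(weights, effects))
    df = k - 1

    # Between-study variance (tau^2) and I^2 as a percentage.
    c = sum_w - sum(w * w for w in weights) / sum_w
    tau2 = max(0.0, (q - df) / c)
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0

    # Random-effects pooled estimate and its standard error.
    re_weights = [1.0 / (v + tau2) for v in variances]
    pooled = sum(w * y for w, y in zip(re_weights, effects)) / sum(re_weights)
    se = math.sqrt(1.0 / sum(re_weights))

    half_ci = crit * se
    half_pi = crit * math.sqrt(tau2 + se ** 2)
    return {
        "g": pooled,
        "ci": (pooled - half_ci, pooled + half_ci),
        "tau2": tau2,
        "i2": i2,
        "prediction_interval": (pooled - half_pi, pooled + half_pi),
    }


if __name__ == "__main__":
    # Hypothetical effect sizes and variances, not taken from the dataset.
    summary = random_effects_summary([0.2, 0.5, 0.8], [0.01, 0.01, 0.01])
    for name, value in summary.items():
        print(name, value)
```

The prediction interval adds the between-study variance to the uncertainty of the pooled estimate, which is why it can include zero or negative values even when the confidence interval around g does not, as in several rows of the tables.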

