Skip to main content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Cochrane Database Syst Rev. 2008 Oct; 2008(4): CD000448.
Published online 2008 Oct 8. doi: 10.1002/14651858.CD000448.pub3
PMCID: PMC7032678
PMID: 18843608

St John's wort for major depression

Monitoring Editor: Klaus Linde,corresponding author Michael M Berner, Levente Kriston, and Cochrane Common Mental Disorders Group
Technische Universität München / Klinikum rechts der Isar, Institut für Allgemeinmedizin / Institute of General Practice, Wolfgangstr. 8, MünchenGermany, 81667
University Medical Center Freiburg, Department of Psychiatry and Psychotherapy, Hauptstrasse 5, Freiburg im BreisgauGermany, D‐79104
Klaus Linde, ed.nehcneum-ut.zrl@edniL.sualK.

Abstract

Background

In some countries extracts of the plant Hypericum perforatum L. (popularly called St. John's wort) are widely used for treating patients with depressive symptoms.

Objectives

To investigate whether extracts of hypericum are more effective than placebo and as effective as standard antidepressants in the treatment of major depression; and whether they have fewer adverse effects than standard antidepressant drugs.

Search methods

Trials were searched in computerised databases, by checking bibliographies of relevant articles, and by contacting manufacturers and researchers.

Selection criteria

Trials were included if they: (1) were randomised and double‐blind; (2) included patients with major depression; (3) compared extracts of St. John's wort with placebo or standard antidepressants; (4) included clinical outcomes assessing depressive symptoms.

Data collection and analysis

At least two independent reviewers extracted information from study reports. The main outcome measure for assessing effectiveness was the responder rate ratio (the relative risk of having a response to treatment). The main outcome measure for adverse effects was the number of patients dropping out due to adverse effects.

Main results

A total of 29 trials (5489 patients) including 18 comparisons with placebo and 17 comparisons with synthetic standard antidepressants met the inclusion criteria. Results of placebo‐controlled trials showed marked heterogeneity. In nine larger trials the combined response rate ratio (RR) for hypericum extracts compared with placebo was 1.28 (95% confidence interval (CI), 1.10 to 1.49) and from nine smaller trials was 1.87 (95% CI, 1.22 to 2.87). Results of trials comparing hypericum extracts and standard antidepressants were statistically homogeneous. Compared with tri‐ or tetracyclic antidepressants and selective serotonin reuptake inhibitors (SSRIs), respectively, RRs were 1.02 (95% CI, 0.90 to 1.15; 5 trials) and 1.00 (95% CI, 0.90 to 1.11; 12 trials). Both in placebo‐controlled trials and in comparisons with standard antidepressants, trials from German‐speaking countries reported findings more favourable to hypericum. Patients given hypericum extracts dropped out of trials due to adverse effects less frequently than those given older antidepressants (odds ratio (OR) 0.24; 95% CI, 0.13 to 0.46) or SSRIs (OR 0.53, 95% CI, 0.34‐0.83).

Authors' conclusions

The available evidence suggests that the hypericum extracts tested in the included trials a) are superior to placebo in patients with major depression; b) are similarly effective as standard antidepressants; c) and have fewer side effects than standard antidepressants. The association of country of origin and precision with effects sizes complicates the interpretation.

Plain language summary

St. John's wort for treating depression.

Depression is characterised by depressed mood and/or loss of interest or pleasure in nearly all activities and a variety of other symptoms for periods longer than two weeks. Extracts of St. John's wort (botanical name Hypericum perforatum L.) are prescribed widely for the treatment of depression.

We have reviewed 29 studies in 5489 patients with depression that compared treatment with extracts of St. John's wort for 4 to 12 weeks with placebo treatment or standard antidepressants. The studies came from a variety of countries, tested several different St. John's wort extracts, and mostly included patients suffering from mild to moderately severe symptoms. Overall, the St. John's wort extracts tested in the trials were superior to placebo, similarly effective as standard antidepressants, and had fewer side effects than standard antidepressants. However, findings were more favourable to St. John's wort extracts in studies form German‐speaking countries where these products have a long tradition and are often prescribed by physicians, while in studies from other countries St. John's wort extracts seemed less effective. This differences could be due to the inclusion of patients with slightly different types of depression, but it cannot be ruled out that some smaller studies from German‐speaking countries were flawed and reported overoptimistic results.

Patients suffering from depressive symptoms who wish to use a St. John's wort product should consult a health professional. Using a St. John's wort extract might be justified, but important issues should be taken into account: St. John's wort products available on the market vary to a great extent. The results of this review apply only to the preparations tested in the studies included, and possibly to extracts with similar characteristics. Side effects of St. John's wort extracts are usually minor and uncommon. However, the effects of other drugs might be significantly compromised.

Background

Description of the condition

Depressive disorders are characterised by depressed mood and/or loss of interest or pleasure in nearly all activities in the presence of other symptoms such as loss of appetite, fatigue and lack of energy, sleep disturbance, restlessness or irritability, feelings of worthlessness or inappropriate guilt, difficulty in thinking, concentrating or making decisions and thoughts of death or suicide or attempts at suicide (Candy 2008). Depressive disorders are the largest source of non‐fatal disease burden in the world, accounting for 12% of years lived with disability (Ustun 2004). There are two major classification systems to diagnose depressive disorders, the Diagnostic and Statistical Manual of Mental Disorders (DSM; current version DSM‐IV‐TR) and the International Statistical Classification of Diseases and Related Health Problems (ICD; current version ICD‐10). DSM‐IV defined depressive diagnoses include recurrent or persistent major depression and minor depression. ICD‐10 diagnoses include recurrent or persistent depression with mild, moderate or severe episodes. According to the DSM‐IV diagnostic classification, either depressed mood or a loss of interest or pleasure in daily activities consistently for at least a two week period has to be present to diagnose a major depressive disorder. The ICD‐10 system uses the term depressive episode instead of major depressive disorder, but lists similar criteria.

Description of the intervention

Major depressive episodes are most commonly treated with antidepressant medication. Current first line dugs are selective serotonin reuptake inhibitors (SSRI) or tricyclic and related antidepressants (http://guidance.nice.org.uk/CG23). However, the size of effects over placebo in clinical trials has been modest (Turner 2008; Kirsch 2008), and although SSRIs are better tolerated than older antidepressants, side effects still occur in a relevant proportion of patients.

Extracts of the plant Hypericum perforatum L. (St. John's wort), a member of the Hypericaceae family, have been used in folk medicine for a long time for a range of indications including depressive disorders. Extracts of St. John's wort are licensed and widely used in Germany for the treatment of depressive, anxiety and sleep disorders. In recent years, hypericum extracts have also become increasingly popular in other countries.

How the intervention might work

The exact mechanism of action of the antidepressant effects of hypericum extracts is still unclear. Hypericum extracts contain at least seven constituents or groups of components that may contribute to its pharmacological effects (Nahrstedt 1997). These include naphthodianthrons (e.g., hypericins), flavonoids (e.g., quercetin), biflavonoids (e.g., biapigenin), xanthons, and phloroglucinol derivatives (e.g., hyperforin). Hypericum extracts have been shown to be active in a number of standard animal models that are used to indicate antidepressant effects (Wheatley 1998; Caccia 2005; Wurglics 2006). While some isolated substances, as for example hyperforin, have been shown to have antidepressant activity, the total extract seems to be more effective (Reichling 2003).

Why it is important to do this review

Hypericum extracts have been tested in a number of clinical trials since the 1980s. The first two versions of this review and other systematic reviews published between 1995 and 2000 concluded that hypericum extracts are more effective than placebo and are comparable to older antidepressants in the treatment of mild to moderate depressive disorders (Ernst 1995; Linde 1996; Linde 1998; Kim 1999; Gaster 2000; Williams 2000). Several trials included in these reviews were criticised because they included patients with few and/or mild symptoms who did not meet criteria for major depression, were conducted by primary care physicians who were not experienced in depression research, and/or used low doses of comparator drugs (Shelton 2001). In the 2005 update of our review (Linde 2005a; Linde 2005b) several new well‐designed placebo‐controlled trials were included, some of which had negative findings (Shelton 2001; HDTSG 2002) and which had spurred renewed debate about the efficacy of hypericum extracts. We systematically investigated possible reasons for the contradictory findings. We found that larger, more precise studies yielded less positive results, suggesting that small studies with a higher risk of bias might overestimate the effects of hypericum extracts over placebo. The analyses also showed that effects over placebo were less pronounced in studies restricted to patients with major depression. Finally, we had the impression that studies originating from German‐speaking countries (Germany, Austria, and Switzerland) had more positive results than studies originating from other countries independently from precision and formal diagnosis, although multiple regression analyses did not identify this as an independent predictor.

Since we completed the search for our 2005 update, again, several new well‐designed trials restricted to patients with major depression have been published. To sharpen the focus of this review, to reduce clinical heterogeneity, and to reflect the fact that almost all new high‐quality trials of hypericum extracts are restricted to patients with major depression, we decided to limit the review now to this group of patients.

Objectives

This updated review aimed to investigate whether extracts of hypericum:

  • are more effective than placebo and
  • as effective as standard antidepressant drugs, and
  • whether they have less adverse effects compared to standard antidepressant drugs

in the treatment of major depression in adults.

In addition, we investigated possible reasons for varying results across studies, with a focus on precision of the studies, baseline severity of depression, and country of origin.

Methods

Criteria for considering studies for this review

Types of studies

To be included trials had to be double‐blind and randomised.

Types of participants

Patients had to suffer from major depression (meeting DSM‐IV or ICD‐10 criteria). Trials in children (< 16 years) were not eligible. In previous versions of this review (Linde 1998; Linde 2005a) trials not restricted to patients with major depression had been included.

Types of interventions

Experimental intervention
Preparations of hypericum (St. John's wort). Trials investigating combinations of hypericum with other herbs were excluded.

Control intervention
Placebo or synthetic antidepressants (tricyclic and related antidepressants, selective serotonine reuptake inhibitors, serotonine‐noradrenaline reuptake inhibitors). Trials using clearly inadequate synthetic antidepressants (e.g., benzodiatepines) or a dosage clearly below the lower thresholds recommended in current guidelines (Härter 2003, ICSI 2007) were excluded.

Experimental and control treatments had to be given for at least four weeks.

Main comparisons
The following comparisons were performed:
1. hypericum extracts vs. placebo
2. hypericum extracts vs. standard antidepressants

Types of outcome measures

Primary outcome
To be included, trials had to measure clinical outcomes such as depression scales or symptoms. Trials that measured physiological parameters only were excluded. The primary outcomes of interest were

1. Effectiveness: treatment response

2. Safety: the proportion of patients who dropped out due to adverse effects

Secondary outcomes
1. Effectiveness: remission, depression scales such as the Hamilton Depression Scale (HAMD), the Clinical Global Impression Index (CGI), the Montgomery‐Asberg Depression Rating Scale (MADRS), patient‐rated depression scales

2. Safety: total proportion of drop‐outs, proportion of patients reporting adverse effects

Search methods for identification of studies

For the first version of the review we searched for published and unpublished eligible trials in the following ways:

1. Electronic searches
a) Clinical Trials Register of the Cochrane Collaboration Depression, Anxiety & Neurosis Group (CCDANTR)
b) database of the Cochrane Field for Complementary Medicine
c) full text searches in Medline SilverPlatter CD‐ROM from 1983 onwards and Embase 1989 onwards using the terms 'St. John's wort', 'Johanniskraut' (German for St John's wort), 'hyperic*')
d) full text searches in Psychlit and Psychindex 1987 ‐ 1997 CD‐ROM
e) searches in the private database Phytodok, Munich.

2. Searching other resources
a) Checking bibliographies of obtained articles
b) Contacting pharmaceutical companies and authors.
There were no language restrictions.

For the updated version of the review, we searched for published and unpublished eligible trials in the following ways:

1. Electronic searches
For the update, regular electronic searches were performed in CCDANTR (last search July 2007) and PubMed (screening all hits for text word "hypericum", last search July 8, 2008).

2. Searching other resources
We screened bibliographies of published articles, and repeatedly contacted experts, researchers, and manufacturers inquiring for new trials. One reviewer (KL) initially screened reference lists to identify controlled studies on hypericum preparations in humans. All possibly relevant studies or publications were then checked formally for eligibility.

Data collection and analysis

Selection of studies

Two reviewers independently decided on eligibility for the revised inclusion criteria. Disagreements were resolved through discussion. Due to reading errors, disagreements occurred for two trials in which not all patients had major depression and which were excluded after discussion (Vorbach 1994, Winkel 2000). In two trials both reviewers had problems with assessing eligibility: For one small, older trial (Lehrl 1993) the publication did not state that inclusion was limited to patients with major depression, but we had a statement of the sponsor obtained for our 1998 update that all patients met the criteria. As this information could not be verified for this update, we decided to exclude the trial. A Chinese trial (Gu 2001) referred to a Chinese classification. As this classification is not completely comparable to ICD‐10 and DSM‐IV, we decided to also exclude this trial.

Data extraction and management

Primary study characteristics and results were extracted by at least two independent reviewers using a pretested form. In particular, we extracted diagnoses and main inclusion criteria, age, gender, duration of episodes, baseline depression scores, country of origin, number and type of study centers, numbers of patients who were randomised and analysed and who completed protocols, the number and reasons for drop‐outs and withdrawals, numbers of patients reporting adverse effects, and the number and type of adverse effects that were reported.

We assessed numbers of patients who were classified as responders based on score improvements on the Hamilton Rating Scale for Depression (HAMD), the Clinical Global Impression Index (CGI; subscale global improvement rating as at least "much improved"), or any other clinical response measurement. Missing or additional information was sought from authors/sponsors.

Most trials measured clinical outcomes with the Hamilton Depression Scale (HAMD) and the Clinical Global Impression Index (CGI). The HAMD is an observer‐rated scale that focuses mainly on somatic symptoms of depression (Hamilton 1960). The original version includes 21 items, but a version with 17 items is more commonly used in clinical trials. Most studies using the HAMD report the number of 'treatment responders' (patients achieving a score less than 10 and/or less than 50% of the baseline score). When available, we extracted means and standard deviations before, during and after treatment as well as the number of 'responders'. The CGI (CGI 1970) is an observer rated instrument with three items (severity of illness, global improvement, and an efficacy index). We extracted the number of patients rated as 'much improved' or 'very much improved' for global improvement. As recently the Montgomery‐Asberg Depression Rating Scale (MADRS (Montgomery 1979)) and the remission criterion for the HAMD (usually a score of less than 8 at the end of treatment) have been gaining importance as outcome criteria, we also checked all trials for the reporting of these measures. As the DS (Depression Scale von Zerssen (von Zerssen 1996)) was the most often used patient‐rated instrument in the included trials, we extracted post‐treatment data for this scale, if available. For additional post‐hoc analyses, one reviewer (KL) also extracted data for other self‐rating instruments.

Assessment of risk of bias in included studies

The main part of the update process of this review was completed before the new risk of bias tool of the Cochrane Collaboration (Higgins 2008) was available. The methodological quality of each trial was assessed by at least two independent reviewers using scales developed by A. Jadad et al. (Jadad 1996) and by one of the reviewers (KL). The results of the quality scoring are displayed in the table of included studies.

The Jadad scale has three items adding up to a maximum score of five points. 0, 1 or 2 points can be given for randomisation (explicit statement that allocation was randomised and description of an adequate generation of the random sequence), 0, 1 or 2 points for double‐blinding (explicit statement that patients and evaluators were blinded and that treatments were indistinguishable), 0 or 1 point for description of drop‐outs and withdrawals (numbers and reasons for all compared groups separately). The display in the table of included studies is as follows (examples): 2‐2‐1 (full score in every item), 1‐0‐0 (only statement on randomisation).
The second quality scale, the "Internal Validity Scale" (IV), which has been used in other reviews on complementary medicine (Linde 1996b, Linde 1997) has six items with possible scores of 0, 0.5 or 1 point for each. Items 1 through 6 refer to statement of random allocation, adequacy of randomization concealment, baseline comparability, blinding of patients, blinding of evaluators, and likelihood of selection bias after allocation, respectively. Results are displayed by item in the table of included studies (e.g., 1‐1‐1‐0.5‐1‐1 represents a full score, with the exception of blinding of patients which was stated but treatment and placebo might have been distinguishable).

The assessments in the Jadad and IV scores are solely based on the information provided in the publication (as additional information could not be gathered for all studies). In the table 'Characteristics of included studies', however, additional information provided from authors or sponsors was included. This table also contains information on allocation concealment and attrition.

Measures of treatment effect

Our primary outcome measure, to assess the effectiveness of St John's wort versus placebo and versus other antidepressants, was the proportion of responders (according to the Hamilton Depression Scale (first preference) or other responder measures (second preference)) at the end of treatment, or in case of treatment phases longer than 6 weeks, at the time point defined for primary outcome measurement by the study investigators.

Secondary outcome measures were: proportion of responders according to HAMD, proportion of responders according to CGI, mean HAMD after treatment (or, if this was not available, difference after treatment ‐ baseline), at 2, 4, 6 to 8 weeks, and mean DS score after treatment (or, if this was not available, difference after treatment ‐ baseline).

The main outcome measure for the safety analysis was the proportion of patients who dropped out due to adverse effects. Secondary measures were the total proportion of drop‐outs and the proportion of patients reporting adverse effects.

Dichotomous outcomes
We used responder rate ratios (= relative risks = proportion of responders in the treatment group/proportion of responders in the control group) and their 95% confidence intervals for the analysis of treatment response. Responder rate ratios greater than 1 indicate better response in the hypericum group.

Due to highly variable frequency of side or adverse effects reported, odds ratios instead of rate ratios were calculated in the safety analyses. Odds ratios less than 1 indicate that fewer events occurred in the hypericum group.

Continuous outcomes
For HAMD and DS scores we calculated mean differences (also termed weighted mean differences). Negative mean differences indicate better response in the hypericum group.

Unit of analysis issues
Two trials with more than one hypericum group were included in the analyses as follows: Laakmann 1998 included an extract available on the market and an additional experimental extract with low hyperforin content which was never on the market and only used for control reasons. We did not include the data from the group receiving the experimental low‐hyperforin extract in the analyses. Kasper 2006 et al tested two dosages (600 and 1200 mg) of an available product. We pooled the data from these two groups to prevent that the control group of this trial would have been included in the analyses twice.

Dealing with missing data

Dichtomous outcomes
Responder proportions were calculated according to the intention to treat principle, counting drop‐outs as non‐responders. For the comparison hypericum extracts vs. standard antidepressants responder proportions were also calculated on a per protocol basis (as this is considered more appropriate to assess the equivalence of two treatments).

Continuous outcomes
If means and standard deviations from intent to treat analysis with missing values replaced were available, we preferably used these data. In other cases we used analysis based on available data.

Obtaining missing data
If the number of patients responding to treatment and means and/or standard deviations of HAMD scores after completion were not reported, we always tried to contact first or corresponding authors and/or sponsors to obtain these data. In general, we also tried to obtain other missing details on methods and secondary outcomes from authors or sponsors, but the extent to which we were doing this depended on the cooperation of authors/sponsors and the amount of missing information in the publications. We did not impute or recalculate missing standard deviations as these were unavailable only for a few secondary outcomes in a minority of trials.

We tried to contact authors and/or sponsors of 27 of the 29 included trials; for two trials (Kalb 2001; Laakmann 1998) this was considered unnecessary. We did not receive responses for five trials (Behnke 2002; Brenner 2000; Fava 2005; Harrer 1999; Moreno 2005). Very limited additional information was available or needed for three studies (Bjerkenstedt 2005; Volz 2000; Woelk 2000). We obtained relevant additional information to a variable extent from authors, sponsors, or both for the remaining 19 trials.

Data synthesis

The following comparisons were performed:
1. hypericum extracts vs. placebo: a) for dichotomous outcomes (response rate ratios); b) for continuous outcomes; c) for drop‐outs and adverse effects
2. hypericum extracts vs. standard antidepressants: a) for dichotomous outcomes; b) for continuous outcomes; c) for drop‐outs and adverse effects

All main analyses were performed using RevMan 5.

Due to the clinical diversity of the studied populations, the hypericum extracts and the comparison drugs used, we considered that the included studies did not estimate a common underlying effect, but rather that each individual study estimated its single and unique underlying effect. Thus, the application of random effects model in all analyses seemed to be appropriate.

The primary analysis for the comparison of response rate ratios (= relative risks) under treatment with hypericum extracts or placebo was a random‐effects intention to treat meta‐analysis stratified by study precision (above or below median of variance of treatment effect).

Assessment of heterogeneity
Heterogeneity of trials' results was tested with the Chi‐squared test, and the I‐squared statistic was calculated to give an estimate of the degree of heterogeneity. I‐squared values over 50% indicate considerable heterogeneity (Higgins 2003).

Investigation of heterogeneity and subgroup analyses
Predefined subgroup analyses were performed (a) including only trials with response operationalised with the HAMD score; (b) including only trials with response operationalized with the CGI; (c) for the type of extract investigated; and (d) comparing trials originating from German‐speaking countries and from other countries. Weighted mean differences for HAMD scores were calculated after therapy, at 2 to 3, 4, 6 to 8 weeks, and for differences compared to baseline values. For DS scores we calculated after therapy values, for MADRS score after therapy values and differences compared to baseline. As only relatively few studies used the DS, we performed an additional post‐hoc random effects analysis calculating standardised mean differences for any available patient‐rating scale (preferably end of treatment values, but if these were not available also differences from baseline) to investigate whether findings from physician‐rated instruments could be broadly reproduced.

The primary analysis for the comparison of responder rate ratios under treatment with hypericum extracts or standard antidepressants was a random effects intent to treat meta‐analysis stratified for type of synthetic antidepressant (selective serotonine inhibitors or older antidepressants). Predefined subgroup analyses were performed (a) using per protocol data; (b) stratified for country (German‐speaking Europe versus other countries); (c) including only trials with response operationalised with the HAMD score; and (d) including only trials with response operationalised with the CGI.

Additional meta‐regression analyses were performed to investigate the influence of country of origin (German‐speaking versus not German speaking), precision and HAMD baseline values on study findings (responder ratio and mean difference of HAMD scores after treatment) both in placebo and standard antidepressants comparisons. According to current recommendations of experts (Thompson 1999, Lipsey 2000), random effects meta‐regression analyses were carried out using the restricted information maximum likelihood (REML) method. A main advantage of this approach is that it accounts for residual between‐trial heterogeneity. Both univariable and multiple regression models were fitted. We calculated the proportion of explained heterogeneity variance by dividing the heterogeneity explained by the independent variable(s) through the total heterogeneity variance present in random‐effects meta‐analysis. When referring to a whole model, this coefficient was termed R2. When referring to the contribution of single covariates the coefficient was termed β2. In univariable meta‐regression analyses these coefficients are mathematically equal. In multiple meta‐regression analyses, sum of β2 values for all covariates may be slightly different from R2. For all meta‐regression analyses the Statistical Package for the Social Sciences (SPSS; Chicago, Illinois) v13.0 software using additional macros by Wilson (Wilson 2002; Lipsey 2000) was used.

Assessment of reporting biases
Visual analysis of funnel plots was performed to identify possible publication bias (Sterne 2001). Furthermore, the asymmetry coefficient was calculated for formal examination of publication bias (Egger 1997).

Results

Description of studies

Results of the search

A total of 79 possibly relevant studies were identified and checked formally for eligibility.

Included studies

Twenty nine trials including a total of 5489 (range 30 to 388) patients met inclusion criteria (see Characteristics of included studies).
Eighteen trials had a placebo‐control group (Bjerkenstedt 2005; Bracher 2001; Fava 2005; Gastpar 2006; HDTSG 2002; Hänsgen 1996; Kalb 2001; Kasper 2006; Laakmann 1998; Lecrubier 2002; Montgomery 2000; Moreno 2005; Philipp 1999; Schrader 1998; Shelton 2001; Uebelhack 2004; Volz 2000; Witte 1995), and 17 trials compared hypericum with standard antidepressants (Behnke 2002; Bjerkenstedt 2005; Brenner 2000; Fava 2005; Gastpar 2005; Gastpar 2006; Harrer 1993; Harrer 1999; HDTSG 2002; Moreno 2005; Philipp 1999; Schrader 2000; Szegedi 2005; van Gurp 2002; Vorbach 1997; Wheatley 1997; Woelk 2000). Six trials had both a placebo and a standard antidepressant control group (Bjerkenstedt 2005; Fava 2005; Gastpar 2006; HDTSG 2002; Moreno 2005; Philipp 1999). Eight trials are newly included since the last update (Bracher 2001; Fava 2005; Gastpar 2005; Gastpar 2006; Kasper 2006; Moreno 2005; Szegedi 2005; Uebelhack 2004) and one trial which had been included based on an abstract reference only is now included fully (Bjerkenstedt 2005). These eight new trials included a total of 1947 (range 72 to 388) patients. Details on patients, methods, interventions, outcomes, and results of all included studies are described in the table of included studies.

Types of participants
The severity of depression was described as mild to moderate in 19 trials, and as moderate to severe in 9 trials (one trial did not classify severity). Eighteen trials were from German‐speaking countries, four from the US, two from the UK, and one each from Brazil, Canada, Denmark, France and Sweden. Patients were recruited in private practices in all trials from German‐language countries, in the trials from Sweden (Bjerkenstedt 2005) and Canada (van Gurp 2002), and in one of the trials from the UK (Wheatley 1997). The second trial from the UK (Montgomery 2000) and the trial from France (Lecrubier 2002) were performed both in private practices and psychiatric outpatient departments. Three trials from the US (Shelton 2001; HDTSG 2002; Fava 2005) and the Brazilian trial (Moreno 2005) were performed in academic and/or community psychiatry research clinics. Two trials from the US and Denmark (Brenner 2000; Behnke 2002) did not report on the setting.

Types of intervention
A variety of hypericum preparations were studied in the trials. The range of daily extract doses varied between 240 and 1800 mg, but in most trials 500 to 1200 mg were used. The standard antidepressants used as active comparators were fluoxetine (6 trials, dosage 20 to 40 mg), sertraline (4 trials, 50 to 100 mg), imipramine (in 3 trials, dosage 100 to 150 mg), citalopram (1 trial, 20 mg), paroxetine (1 trial, 20 to 40 mg), maprotiline (1 trial, 75 mg), and amitriptyline (1 trial, 75 mg). The comparator dosage of maprotiline and amitriptyline were slightly below of those recommended in current guidelines (Härter 2003, ICSI 2007) and in most other studies at the minimum of recommended dosages. The treatment periods lasted 4 (1 trial), 6 (19), 7 (1), 8 (5) or 12 weeks (4 trials). Four trials included some long‐term follow‐up or continuation treatment after the main trial phase (Brenner 2000; Gastpar 2005; Shelton 2001; Szegedi 2005).

Types of outcome
The most frequently used instrument used for outcome measurement was the Hamilton Rating Scale for Depression (used in all trials). A variety of other ratings scales and instruments were used in addition.

Excluded studies

Fifty trials (see Characteristics of excluded studies) did not meet inclusion criteria: eight trials were not limited to patients with depression (Albertini 1986; Bendre 1980; Dittmer 1992; Hottenrott 1997; Maisenbacher 1995; Panijel 1985; Sindrup 2000; Volz 2002), four trials were on prevention or treatment of depressive symptoms in patients suffering primarily from other diseases (Häring 1996; Li 2005; Mo 2004; Werth 1989), two measured physiological parameters only (as EEG) in depressed patients (Czekalla 1997; Kugler 1990b), five did not include a placebo or standard drug comparison group (Bernhadt 1993; Lenoir 1999; Martinez 1993; Spielberger 1985; Zeller 2000), eight involved healthy volunteers (Brockmöller 1997; Herberg 1992; Johnson 1992; Johnson 1993; Schmidt 1993b; Schulz 1993; Staffeldt 1993; Wienert 1991), three tested combinations of hypericum and other herbal extracts (Ditzler 1992; Kniebel 1988; Steger 1985), and two compared hypericum extract with medications which are no longer considered adequate for depression (diazepam or bromazepam) (Kugler 1990a; Warnecke 1986); one of these trials also was not explicitly randomized. Due to the new exclusion criterion, we excluded 17 trials not restricted to patients with major depression. Fifteen had been included in the previous version of the review (Halama 1991; Harrer 1991; Hoffmann 1979; Hübner 1993; König 1993; Lehrl 1993; Osterheider 1992; Quandt 1993; Reh 1992; Schlich 1987; Schmidt 1989; Schmidt 1993; Sommer 1994; Vorbach 1994; Winkel 2000) while two were not (Agrawal 1994, for which it had not been possible to obtain a full copy and Gu 2001, which was newly identified in the update searches). Finally, we excluded one previously included trial as the standard antidepressant treatment was far below recommended dosages (30 mg Amitriptyline daily; Bergmann 1993).

Risk of bias in included studies

The majority of the trials were of high quality. The median quality scores were 5 (out of 5, range 2 to 5) for the Jadad scale and 4.5 (out of 6; range 2 to 6) for the IV scale (see quality rating of the single trials in the Characteristics of included studies).

Sequence generation/allocation concealment
The information on how the random sequence was generated was reported or provided on request for 18 trials (in all cases a computer program). Twenty two trials reported an adequate method of allocation concealment (most often consecutively numbered medication).

Blinding
All trials were described as double‐blind, but only one trial reported that blinding was tested (HDTSG 2002). In this three‐armed trial (hypericum vs. sertraline vs. placebo) about a third (as expected by chance alone) of guesses made by physicians were correct for hypericum and placebo patients, but in 66% of sertraline patients (p = 0.001).

Incomplete outcome data
In some trials attrition rates were high (for example, Fava 2005; see Characteristics of included studies). All placebo‐controlled trials included an intent to treat analysis.

Effects of interventions

Comparison 1: Hypericum extracts versus placebo

1. Effectiveness

a) Responder analyses

Sixteen of the 18 placebo‐controlled trials reported the number of patients classified as responders based on score reduction on the HAMD scale, one trial reported response according to the MADRS scale (Bracher 2001), and one trial only reported the proportion of patients rated at least as "improved" for the CGI (Volz 2002). Patients receiving hypericum extracts were significantly more likely to be responders (RR = 1.48; 95%CI 1.23 to 1.77; see comparison 1.1 and Figure 1) but study results were highly heterogeneous (I² = 75%). Effects in favour of hypericum extracts were less pronounced in more precise trials (RR = 1.28; 95%CI 1.10 to 1.49) compared to less precise trials (RR = 1.87; 95%CI 1.22 to 2.87) but heterogeneity was still strong in both subgroups (I² = 61% and 79%, respectively).

An external file that holds a picture, illustration, etc.
Object name is nCD000448-AFig-FIG01.jpg

Forest plot of comparison: 1 Hypericum mono‐preparations vs. placebo A. Dichotomous measures, outcome: 1.1 Responder ‐ grouped by precision ‐ primary analysis.

Findings were similar if response rates were based only on the trials reporting response according to the HAMD scale, or on the CGI (see comparisons 1.2 and 1.3). If trials investigating defined extracts were analysed separately (subgroups of trials testing the same extracts; see comparison 1.4), heterogeneity was strong in 3 of 4 subgroups.

Trials from German‐speaking countries reported more positive findings than trials from other countries (RR = 1.78; 95%CI 1.42 to 2.25 vs.1.07; 95% CI 0.88 to 1.31, respectively; see comparison 1.5 and Figure 2). Six trials reported remission rates. These were significantly higher in patients receiving hypericum extracts than in those receiving placebo (RR = 2.77; 95%CI 1.80 to 4.26; I² = 29%; see comparison 1.6).

An external file that holds a picture, illustration, etc.
Object name is nCD000448-AFig-FIG02.jpg

Forest plot of comparison: 1 Hypericum mono‐preparations vs. placebo A. Dichotomous measures, outcome: 1.5 Responder among studies from German‐speaking countries and other studies.

There was significant funnel plot asymmetry for the main responder analysis (coefficient = 2.19, p = 0.03; Figure 3).

An external file that holds a picture, illustration, etc.
Object name is nCD000448-AFig-FIG03.jpg

Funnel plot of comparison: 1 Hypericum mono‐preparations vs. placebo A. Dichotomous measures, outcome: 1.1 Responder ‐ grouped by precision ‐ primary analysis.

In univariable meta‐regression analyses, country of origin (studies from German‐speaking countries showing larger effect sizes; p = 0.002), precision (more precise studies showing smaller effects; p = 0.032) and baseline values (higher values associated with smaller effect sizes; p = 0.048) were significantly associated with effects sizes. In multiple analyses the association remained significant for country of origin (p = 0.035) and precision (p = 0.017) but became non‐significant for HAMD baseline values. Altogether over half of the variance (R² = 0.51) could be explained by these three variables. Findings of meta‐regression analyses are summarised in Appendix 1.

b) Analyses of depression scales

Analyses based on mean HAMD values yielded similar findings. At the completion of treatment HAMD values were 3.04 (95%CI 1.78 to 4.29) score points lower in hypericum groups compared to placebo groups, but there was strong heterogeneity (I² = 86%; see comparison 2.1). Effects over placebo were significant after 2 (comparison 2.2), 4 (comparison 2.3), 6 to 8 weeks of treatment (comparison 2.4), and for changes from baseline to end of treatment (comparison 2.5). Significant effects over placebo were also reported for the MADRS (comparisons 2.6, and 2.7). Studies from German‐language countries reported much larger effects over placebo (weighted mean difference = 4.29, 95%CI 2.97 to 5.61 score points; comparison 2.8) than studies from other countries (MD = 0.77 score points, 95%CI 0.20 to 1.74 score points).

There was no significant funnel plot asymmetry (coefficient in the analysis of HAMD values at completion of treatment = ‐2.12, p = 0.35).

In multiple meta‐regression analysis, country of origin was significantly associated with effects size (larger effects in trials from German‐speaking countries; p < 0.001) but not precision and HAMD baseline values (R² = 0.63; see Appendix 1).

The four trials reporting results for the patient‐rated von Zerssen Depression Scale (D‐S) showed a significant effect of hypericum extracts over placebo (comparison 2.9). Post‐hoc analyses using available data from 12 placebo‐controlled trials for a variety of self‐rating instruments also confirmed analyses based on physician‐rated outcomes. The pooled standardised mean difference (SMD) was ‐0.47 (95% CI ‐0.64 to ‐0.30; I² = 74%; see comparison 2.10). Trials from German‐speaking countries again reported more favourable findings than trials from other countries (SMDs of ‐0.57 and ‐0.17 respectively; see comparison 2.11).

2. Safety

Primary outcome
The number of patients dropping out for adverse effects was similar among patients receiving hypericum extracts and placebo (OR = 0.92, 95%CI 0.45 to 1.88, I2=0%; see comparison 3.1).

Secondary outcomes
The total number of patients dropping out and the number of patients dropping out for any reason were similar among patients receiving hypericum extracts and placebo (comparisons 3.2 and 3.3).

Comparison 2: Hypericum extracts versus standard antidepressants

1. Effectiveness

a) Responder analyses
All 17 trials comparing hypericum extracts to standard antidepressant treatment reported the number of responders according to the HAMD score. Based on an intention to treat approach the pooled responder rate ratio was 1.01 for all 17 trials (95%CI 0.93 to 1.09; I² = 17%; see comparison 4.1 and Figure 4). For the five trials comparing hypericum extracts with older antidepressants, the pooled estimate was 1.02 (95%CI 0.90 to 1.15; I² = 0%), and 1.00 for the 12 trials with selective serotonine reuptake inhibitors (95%CI 0.90 to 1.12; I² = 29%).

An external file that holds a picture, illustration, etc.
Object name is nCD000448-AFig-FIG04.jpg

Forest plot of comparison: 4 Hypericum mono‐preparations vs. standard antidepressants. A. Dichotomous measures, outcome: 4.1 Responder (intent to treat) ‐ primary analysis.

If analyses were based on per protocol data, the pooled responder rate ratio was 0.96 (95%CI 0.88 to 1.05; I² = 43%; see comparison 4.2). Analysis based on the CGI also found no relevant differences (RR = 1.01; 95%CI 0.94 to 1.09; I² = 24%; see comparison 4.3). In trials originating from German‐speaking countries findings were slightly more favourable to hypericum than in trials from other countries (RR 1.04 and 0.90, respectively; see comparison 4.4 and Figure 5). In the four trials reporting remission rates the response rate ratio was 1.24 (95%CI 1.02 to 1.50; I² = 0%; see comparison 4.5).

An external file that holds a picture, illustration, etc.
Object name is nCD000448-AFig-FIG05.jpg

Forest plot of comparison: 4 Hypericum mono‐preparations vs. standard antidepressants. A. Dichotomous measures, outcome: 4.4 Responder among studies from German‐speaking studies and other studies.

The asymmetry coefficient for the main responder analysis was ‐1.07 (p = 0.09; see funnel plot in Figure 6). In univariable meta‐regression analysis, there was a significant association between country of origin and response (trials from German‐speaking countries favouring hypericum; p = 0.037). In the multivariable meta‐regression analysis, none of the three tested predictors proved significant (R² = 0.24).

An external file that holds a picture, illustration, etc.
Object name is nCD000448-AFig-FIG06.jpg

Funnel plot of comparison: 4 Hypericum mono‐preparations vs. standard antidepressants. A. Dichotomous measures, outcome: 4.1 Responder (intent to treat) ‐ primary analysis.

b) Analyses of depression scales
Analyses based on HAMD scores confirmed the findings of the responder analysis (see comparisons 5.1 to 5.5, and 5.8). Analyses of MADRS‐ and D‐S‐values are difficult to interpret, as only few trials reported these outcomes (see comparisons 5.6, 5.7, 5.9). There was no funnel plot asymmetry (0.30, p = 0.73). In the multivariable meta‐regression analysis, trials with higher HAMD baseline values showed less favourable results (p = 0.010), while country of origin and precision had no significant influence (R² = 0.44).

Again, post‐hoc analyses using available data for a variety of self‐rating instruments from 10 trials comparing hypericum extracts and standard antidepressants confirmed analyses based on physician‐rated outcomes. The pooled SMD was 0.01 (95%CI ‐0.13 to 0.15; I² = 43%; see comparison 5.10). The pooled SMDs in trials from German‐speaking countries was ‐0.02 compared to 0.10 in trials from other countries (comparison 5.11).

2. Safety

Primary outcome
Patients allocated to hypericum extracts were less likely to drop out from studies due to adverse effects than patients allocated to older standard antidepressants (OR = 0.24; 95%CI 0.13 to 0.46; I² = 0%) or to SSRIs (OR = 0.53; 95%CI 0.34 to 0.83; I² = 0%; see comparison 6.1 and Figure 7).

An external file that holds a picture, illustration, etc.
Object name is nCD000448-AFig-FIG07.jpg

Forest plot of comparison: 6 Safety ‐ Hypericum mono‐preparations vs. standard antidepressants, outcome: 6.1 Number of patients discontinuing treatment/dropping out due to adverse/side effects.

Secondary outcomes
Attrition for any reason was significantly lower for hypericum extracts compared to older antidepressants (OR = 0.67; 95%CI 0.47 to 0.95; I² = 0%), but not compared to SSRIs (OR = 0.83; 95% 0.63 to 1.08; I² = 0%; see comparison 6.2).

The number of patients reporting side effects was significantly higher in patients receiving older standard antidepressants (OR = 0.39; 95% 0.30 to 0.50; I² =0%); compared to SSRIs, the difference just missed significance (OR = 0.70; 95%CI 0.49 to 1.00; I² =57%; see comparison 6.3).

Discussion

Summary of main results

Overall, the findings from newer trials seem to corrobate the evidence in favour of hypericum extracts. The available data suggest that the hypericum extracts tested in the included trials a) are superior to placebo in patients with major depression; b) are similarly effective as standard antidepressants; c) and have less side effects than standard antidepressants. There are two issues which complicate the interpretation of our findings: 1) While the influence of precision on study results in placebo‐controlled trials is less pronounced in this updated version of our review compared to the previous version (Linde 2005a), results from more precise trials still show smaller effects over placebo than less precise trials. 2) Results from German‐language countries are considerably more favourable for hypericum than trials from other countries.

Interpretation of the findings and limitations

For this update we excluded for the first time from our review all trials which were not restricted to patients with major depression. This does not mean that we believe that major depression is necessarily the only or best indication for hypericum extracts. Some authors argue that patients with signs of atypical depression might be particularly suited for treatment with hypericum extracts (Murck 2002, Murck 2005), the National Institutes of Health are currently funding a trial in patients with minor depression (www.clinicaltrials.gov, identifier NCT00048815), and the findings from older trials (some of which, however, seem methodologically questionable) not restricted to patients with major discussion were very positive (Linde 1998). As pointed out in the introduction, we now focus on major depression to make our review better comparable to overviews on standard antidepressants, to have a more comparable set of studies for analysis, and also because almost all new trials of hypericum extracts are on this indication.

In spite of the tightened inclusion criteria, the findings of the placebo‐controlled trials are still quite heterogeneous. In the trials a variety of hypericum extracts has been tested and daily doses cover a wide range. Differences in the interventions might contribute to some extent to the observed heterogeneity, but they do not seem to be a major factor. In three of four subgroup analyses of single extracts there was strong heterogeneity; for one extract the 95% confidence intervals of the two available trials (Gastpar 2006; Uebelhack 2004) did not even overlap, indicating that the results are hardly compatible. However, some of the factors leading to considerable heterogeneity between study findings could be identified. Considering country of origin, precision, and baseline depression severity of included patients explained 50 to 60 percent of the variance between trial results in comparisons with placebo and 20 to 40 percent in comparison with standard antidepressants. Nevertheless, it has to be stated that meta‐regression analyses are (even if a priori defined) entirely of observative nature. Findings on the association of baseline depression severity and effect size estimates may be biased through structural dependence and regression to the mean, and thus should be interpreted with caution (Higgins 2008). Furthermore, inferences drawn from a meta‐regression analysis on aggregate data may differentiate from inferences drawn from a meta‐regression analysis on individual data (Deeks 2006; e.g. 'ecological bias').

The finding that more precise placebo‐controlled trials yielded less positive results than less precise trials could indicate publication bias (trials with positive results are more likely to be published than trials with negative results) or bias within studies (smaller trials with less rigorous methods yielding overoptimistic results). We cannot rule out, but doubt, that selective publication of overoptimistic results in small trials strongly influences our findings. There is some evidence that "negative" trials without demonstrable differences between extracts and placebo were published less often as full articles than trials with "positive" findings. Our extensive searches identified three "negative" trials that were only published as abstracts or theses. Two that were conducted in the early 1990s (these were included in earlier versions of this review Linde 1996; Linde 1998; Linde 2005a) involved patients without documented major depression (König 1993; Osterheider 1992), and one that was conducted in the late 1990s involved patients with major depression (Montgomery 2000). One positive trial included in our last update, but now excluded, was published as an abstract and as a chapter in a not widely available book (Winkel 2000). One comparably large, positive trial (Bracher 2001) has been published only in a short report as a supplement to a German medical newspaper. This trial is an example that sponsors or manufacturers of herbal medicines sometimes have very limited interest in a major publication if their trial includes a new aspect (in that case a once daily dosage), as there is no patent protection for herbal extracts and results can be exploited by competitors, too. We suspect that there are few additional relevant unpublished trials. Few manufacturers of hypericum extracts sponsor research trials, and the five manufacturers whose products were tested in most of the trials told us they had (with the exception of one smaller negative trial) no other unpublished trials that possibly met our inclusion criteria. Through personal communication we were informed that there are at least one or two unpublished negative trials on tea preparations of Hypericum. However, tea preparations are phytochemically very different from alcoholic extracts and have to be evaluated separately.

We found that the quality of the majority of trials was adequate, and we detected no systematic differences in design aspects known to be potential sources of bias. All trials were double‐blind. Though adequacy of blinding was not formally assessed in most trials, achieving similarity between hypericum and placebo preparations is not particularly difficult. All trials were randomised, and most concealed allocation assignments by using consecutively numbered identical medication containers. Reported drop‐out rates were low in the majority of trials. Investigators involved in older trials may have had less training and/or experience with diagnostic standards and rating scales for depressive symptoms (Shelton 2001), but this issue, if true, is likely to affect generalisation of findings rather than internal validity. Finally, though we found no systematic differences in major factors generally related to trial quality, our subjective judgement was that larger trials tended to be of better quality than smaller trials. The dosages of standard antidepressants were (with two exceptions) within the range recommended in current guidelines (e.g., Härter 2003), but at the lower limit.

Our finding that studies from German‐speaking countries yielded more favourable results than trials performed elsewhere is difficult to interpret. As our analyses are partly data‐driven, they must be considered cautiously. However, the consistency and extent of the observed association suggest that there are important differences in trials performed in different countries. One possibility is that studies performed in German‐speaking countries with a long history of hypericum prescription by physicians enrolled slightly different patients in spite of similar inclusion criteria. With one exception (the extremely positive trial by Uebelhack 2004 performed in a research clinic of a contract research organisation), all German studies recruited patients in private practices, while a number of trials from other countries were performed in academic research settings or hospital outpatient units. Depression with atypical or reversed vegetative features might be present more often in primary care outpatient populations (Murck 2005).The trend that trials with higher HAMD baseline values reported slightly less favourable results also suggests that effectiveness of hypericum extracts might differ between subgroups of depressive patients. While we did not systematically investigate this issue, it seems to us that the trials from countries other than Germany might be more often investigator‐initiated. A closer link of trial planning, performance and analysis with manufacturer interests might influence study findings. This could result possibly in true bias, but also in conditions making a true positive outcome more likely. For example, for at least three trials (Kasper 2006; Schrader 1998; Uebelhack 2004) with large effects performed in German‐speaking countries authors or sponsors reported in the publication or in personal communications that contact times and interaction with patients were limited to minimise placebo response rates. Increasing placebo group response rates due to the intensive care and monitoring in antidepressant trials are considered by some researchers as a potential reason for the problem to show specific effects (Posternak 2007). One could also speculate whether unblinding might lead German physicians (who often use hypericum extracts in their usual practice) to give more positive ratings and (the possibly more sceptic) colleagues from elsewhere to more negative ratings. However, as hypericum extracts have no characteristic side effects such a problem seems only relevant in comparisons with standard antidepressants.

Potential biases in the review process

The work for the first version of this review started in 1993 and three previous versions are available (Linde 1996; Linde 1998; Linde 2005a). During that period a large number of new trials became available, diagnostic classifications used for including patients into studies have changed and the quality of trials has improved. In parallel the methods of our review were adapted. The changes over time make it difficult to report our searches and their results in a consistent and transparent manner. The way how we approached authors/sponsors for obtaining missing information and the contents of inquiries were not fully systematic and have changed over time. This could imply that additional data necessary for some secondary analyses were obtained for a selected subset of studies. However, data for the main analysis were available for all or almost all trials, therefore, major biases seem highly unlikely. A potential source of bias in the responder analyses could be slightly variable responder definitions in the primary studies. Response according to the HAMD was either defined as at least 50% reduction, a HAMD score < 10 (or 11) after treatment, at least one or the combination of both. Whether these definitions were truly made a priori in each study could not be assessed. Decisions on the inclusion of subgroup analyses (for example, regarding precision or country effects) for updates were driven by findings in previous versions of the review. Therefore, these analyses must be interpreted with caution. Publication and small study bias have been discussed in the previous section.

Authors' conclusions

Implications for practice

In older systematic reviews and meta‐analyses of hypericum extracts (Ernst 1995; Linde 1996; Linde 1998; Kim 1999; Gaster 2000; Williams 2000) the findings of the included studies were mostly positive, but reviewers drew cautious conclusions due to methodological limitations. The quality of trials on average clearly improved over recent years. However, study findings became more often contradictory, and in the last version of our review larger trials restricted to patients with major depression showed only minimal effects over placebo (Linde 2005a). With the addition of several new, partly large trials, the cumulative evidence now suggests that hypericum extract have a modest effect over placebo in a similar range as standard antidepressants (Kirsch 2008; Turner 2008). The direct comparisons with older antidepressants and selective serotonine reuptake inhibitors seem to confirm this impression. The available clinical trials now also show that hypericum extracts have fewer side effects than both older antidepressants and selective serotonine reuptake inhibitors. This would imply that an attempt of treating mild to moderate major depression with one of the hypericum preparations positively tested in clinical trials is clearly justified. However, the differences in the findings from different countries make clear‐cut recommendations difficult. The evidence for severe major depression is still insufficient to draw conclusions.

Many patients buy St John's wort products from health food stores and might not disclose this to their physicians (Smith 2004). Such uncontrolled use is problematic as serious interactions can occur with a number of frequently used drugs (Ernst 1999; Hammerness 2003; Knüppel 2004; Whitten 2006). Therefore, physicians should regularly ask their patients about hypericum intake. However, it must be kept in mind that drug interactions are not a problem unique to hypericum extracts, but also common for standard antidepressants (Nieuwstraten 2006).

It has to be emphasised that the quality of hypericum preparations can differ considerably. The composition of a product depends on the raw plant material used, the extraction process, and the solvents. In consequence, the amounts of bioactive constituents in different products can vary enormously. A recent study has shown that a number of products available on the German market contain only minor amounts of bioactive constitutents (Wurglics 2003). The hypericum extracts tested in clinical trials have to be considered high quality products. Results obtained with these extracts cannot be extrapolated directly to other products. In our meta‐analysis, the type of extract did not contribute to the explanation of heterogeneity. This does not mean, however, that all products tested in the available trials are equally effective. Standardisation of a product on a defined component (for example, hyperforin or hypericin) does not resolve the problem, as currently the exact mechanism for the antidepressant effects of hypericum extracts is still unclear, and available research indicates that several components are relevant. The findings of this review most likely apply to products (using ethanol 50 to 60% or methanol 80% for extraction from dried plant material) with daily extract dosages of 500 to 1200 mg with a ratio of raw material to extract of 3‐7:1.

Implications for research

There is a clear need to investigate the reasons for the differences in findings from trials originating from German‐speaking countries and those from other countries. Mulitnational trials would seem desirable, but it is unlikely that there will be funding for such studies in the near future. Individual patient data meta‐analysis of existing trials could be a possible tool to investigate predictors of treatment response in a more accurate manner. The authors will try to obtain such data from researchers and/or sponsors.

What's new

DateEventDescription
8 February 2009AmendedContact details updated

History

Protocol first published: Issue 3, 1997
Review first published: Issue 4, 1998

DateEventDescription
15 July 2008New citation required and conclusions have changed1) The title of the review has been changed from 'St. John's wort for depression' to 'St. John's wort for major depression'. This reflects that inclusion has been now limited to trials in patients suffering from major depression only.
2) The modification of selection criteria resulted in the exclusion of 16 of 37 studies included in the previous version. Eight new trials have been included. The review now covers 29 trials with 18 comparisons of a hypericum extract with a placebo and 17 with a standard antidepressant (six three‐armed trials).
3) Compared to the previous version our conclusions are now slightly more favourable, as modest effects over placebo have also been shown in several large trials, and as side effects appear to be less frequent compared to both older antidepressants and selective serotonin reuptake inhibitors.
4) The list of authors has been amended.
23 May 2008AmendedConverted to new review format.
12 December 2007New citation required and conclusions have changedSubstantive amendment

Acknowledgements

The authors would like to thank
‐ the researchers and sponsors who provided additional information on trials
‐ Eva Israel and Karsten Schmidt who helped with assessing eligibility and doing extractions
‐ Cindy Mulrow and Matthias Egger who were involved in previous versions of the review

Appendices

Appendix 1. Summary of meta‐regression analyses

Table 1: Univariable meta‐regression analysis of response rate ratio (relative risk of response) in comparison to placebo

Explanatory variableCoefficient (B)95% confidence intervalStatistical significance (p)Proportion of heterogeneity explained (R2)Interpretation
Country0.530.20 to 0.86.0020.29Studies form German‐speaking countries result statistically significant in 0.53 higher effect size estimates on average than studies form non‐German‐speaking countries.
Precision‐0.11‐0.21 to ‐0.01.0320.16Studies with one unit increase in precision (1/SE) result statistically significant in 0.11 lower effect size estimates on average.
Baseline HAMD‐0.12‐0.23 to 0.0480.16Studies with one point increase in mean baseline HAMD severity result statistically significant in 0.12 lower effect size estimates on average.

 

Table 2: Multiple meta‐regression analysis of response rate ratio (relative risk of response) in comparison to placebo

Explanatory variableCoefficient (B)95% confidence intervalStatistical significance (p)Proportion of heterogeneity explained (Beta2)Interpretation
Country0.320.02 to 0.61.0350.13Studies form German‐speaking countries result statistically significant in 0.32 higher effect size estimates on average than studies form non‐German‐speaking countries.
Precision‐0.09‐0.18 to ‐0.02.0170.17Studies with one unit increase in precision (1/SE) result statistically significant in 0.09 lower effect size estimates on average.
Baseline HAMD‐0.07‐0.16 to 0.03.1610.06No statistically significant association was found.

 

Table 3: Univariable meta‐regression analysis of response rate ratio (relative risk of response) in comparison to standard antidepressants

Explanatory variableCoefficient (B)95% confidence intervalStatistical significance (p)Proportion of heterogeneity explained (R2)Interpretation
Country0.180.01 to 0.36.0370.23Studies form German‐speaking countries result statistically significant in 0.18 higher effect size estimates on average than studies form non‐German‐speaking countries.
Precision0.030 to 0.06.0650.18No statistically significant association was found.
Baseline HAMD0.01‐0.02 to 0.04.5460.02No statistically significant association was found.

 

Table 4: Multiple meta‐regression analysis of response rate ratio (relative risk of response) in comparison to standard antidepressants

Explanatory variableCoefficient (B)95% confidence intervalStatistical significance (p)Proportion of heterogeneity explained (Beta2)Interpretation
Country0.14‐0.10 to 0.38.2440.17No statistically significant association was found.
Precision0.01‐0.04 to 0.05.8320.01No statistically significant association was found.
Baseline HAMD0.01‐0.02 to 0.04.7040.01No statistically significant association was found.

 

Table 5: Univariable meta‐regression analysis of mean difference in post‐treatment HAMD scores in placebo comparisons

Explanatory variableCoefficient (B)95% confidence intervalStatistical significance (p)Proportion of heterogeneity explained (R2)Interpretation
Country3.801.95 to 5.66<.0010.54Studies form German‐speaking countries result statistically significant in 3.80 HAMD points higher effect size estimates on average than studies form non‐German‐speaking countries.
Precision3.02‐2.10 to 8.15.2470.09No statistically significant association was found.
Baseline HAMD‐0.57‐1.27 to 0.14.1170.15No statistically significant association was found.

 

Table 6: Multiple meta‐regression analysis of mean difference in post‐treatment HAMD scores in placebo comparisons

Explanatory variableCoefficient (B)95% confidence intervalStatistical significance (p)Proportion of heterogeneity explained (Beta2)Interpretation
Country3.521.70 to 5.33<.0010.46Studies form German‐speaking countries result statistically significant in 3.52 HAMD points higher effect size estimates on average than studies form non‐German‐speaking countries.
Precision2.43‐1.14 to 5.99.1830.05No statistically significant association was found.
Baseline HAMD‐0.24‐0.76 to 0.29.3780.03No statistically significant association was found.

 

Table 7: Univariable meta‐regression analysis of mean difference in post‐treatment HAMD scores in standard antidepressant comparisons

Explanatory variableCoefficient (B)95% confidence intervalStatistical significance (p)Proportion of heterogeneity explained (R2)Interpretation
Country0.02‐1.64 to 1.68.9820No statistically significant association was found.
Precision0.04‐1.71 to 1.79.9660No statistically significant association was found.
Baseline HAMD‐0.24‐0.48 to 0.0450.26Studies with one point increase in mean baseline HAMD severity result statistically significant in 0.24 lower effect size estimates on average.

 

Table 8: Multiple meta‐regression analysis of mean difference in post‐treatment HAMD scores in standard antidepressant comparisons

Explanatory variableCoefficient (B)95% confidence intervalStatistical significance (p)Proportion of heterogeneity explained (Beta2)Interpretation
Country1.34‐0.76 to 3.45.2110.22No statistically significant association was found.
Precision‐2.11‐4.62 to 0.40.0990.60No statistically significant association was found.
Baseline HAMD‐0.45‐0.79 to 0.11.0100.88Studies with one point increase in mean baseline HAMD severity result statistically significant in 0.45 lower effect size estimates on average.

 

Notes

Edited (no change to conclusions)

Data and analyses

Comparison 1

Hypericum mono‐preparations vs. placebo A. Dichotomous measures
Outcome or subgroup titleNo. of studiesNo. of participantsStatistical methodEffect size
1 Responder ‐ grouped by precision ‐ primary analysis183064Risk Ratio (M‐H, Random, 95% CI)1.48 [1.23, 1.77]
1.1 Less precise trials91020Risk Ratio (M‐H, Random, 95% CI)1.87 [1.22, 2.87]
1.2 More precise trials92044Risk Ratio (M‐H, Random, 95% CI)1.28 [1.10, 1.49]
2 Responder ‐ according to HAMD162706Risk Ratio (M‐H, Random, 95% CI)1.51 [1.22, 1.87]
2.1 Less precise trials8948Risk Ratio (M‐H, Random, 95% CI)1.94 [1.19, 3.18]
2.2 More precise trials81758Risk Ratio (M‐H, Random, 95% CI)1.28 [1.06, 1.53]
3 Responder ‐ according to CGI (Clinical Global Impression Index at least "much improved")132306Risk Ratio (M‐H, Random, 95% CI)1.47 [1.24, 1.74]
3.1 Less precise trials7869Risk Ratio (M‐H, Random, 95% CI)1.74 [1.30, 2.33]
3.2 More precise trials61437Risk Ratio (M‐H, Random, 95% CI)1.26 [1.06, 1.50]
4 Responder ‐ grouped by extract18 Risk Ratio (M‐H, Random, 95% CI)Subtotals only
4.1 extract LI 1606981Risk Ratio (M‐H, Random, 95% CI)1.31 [0.92, 1.86]
4.2 extract WS 55702699Risk Ratio (M‐H, Random, 95% CI)1.57 [0.96, 2.56]
4.3 extract WS 55722170Risk Ratio (M‐H, Random, 95% CI)1.47 [1.05, 2.06]
4.4 extract STW3‐VI2401Risk Ratio (M‐H, Random, 95% CI)3.59 [0.41, 31.56]
4.5 other extracts6813Risk Ratio (M‐H, Random, 95% CI)1.45 [1.08, 1.93]
5 Responder among studies from German‐speaking countries and other studies183064Risk Ratio (M‐H, Random, 95% CI)1.48 [1.23, 1.77]
5.1 Studies from German‐speaking countries111770Risk Ratio (M‐H, Random, 95% CI)1.78 [1.42, 2.25]
5.2 Studies from other countries71294Risk Ratio (M‐H, Random, 95% CI)1.07 [0.88, 1.31]
6 Remission (HAMD score < 8 or < 7)61236Odds Ratio (M‐H, Random, 95% CI)2.77 [1.80, 4.26]
An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-001-01.jpg
Analysis

Comparison 1 Hypericum mono‐preparations vs. placebo A. Dichotomous measures, Outcome 1 Responder ‐ grouped by precision ‐ primary analysis.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-001-02.jpg
Analysis

Comparison 1 Hypericum mono‐preparations vs. placebo A. Dichotomous measures, Outcome 2 Responder ‐ according to HAMD.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-001-03.jpg
Analysis

Comparison 1 Hypericum mono‐preparations vs. placebo A. Dichotomous measures, Outcome 3 Responder ‐ according to CGI (Clinical Global Impression Index at least "much improved").

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-001-04.jpg
Analysis

Comparison 1 Hypericum mono‐preparations vs. placebo A. Dichotomous measures, Outcome 4 Responder ‐ grouped by extract.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-001-05.jpg
Analysis

Comparison 1 Hypericum mono‐preparations vs. placebo A. Dichotomous measures, Outcome 5 Responder among studies from German‐speaking countries and other studies.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-001-06.jpg
Analysis

Comparison 1 Hypericum mono‐preparations vs. placebo A. Dichotomous measures, Outcome 6 Remission (HAMD score < 8 or < 7).

Comparison 2

Hypericum mono‐preparations vs. placebo. B. Continuous measures
Outcome or subgroup titleNo. of studiesNo. of participantsStatistical methodEffect size
1 Mean HAMD (Hamilton Rating Scale for Depression) scores after therapy172871Mean Difference (IV, Random, 95% CI)‐3.04 [‐4.29, ‐1.78]
2 Mean HAMD (Hamilton Rating Scale for Depression) scores after 2 to 3 weeks of treatment132299Mean Difference (IV, Random, 95% CI)‐1.22 [‐2.07, ‐0.37]
3 Mean HAMD (Hamilton Rating Scale for Depression) score after 4 weeks of treatment111634Mean Difference (IV, Random, 95% CI)‐1.65 [‐2.78, ‐0.52]
4 Mean HAMD (Hamilton Rating Scale for Depression) scores after 6 to 8 weeks of treatment152578Mean Difference (IV, Random, 95% CI)‐2.97 [‐4.31, ‐1.63]
5 Difference HAMD (Hamilton Rating Scale for Depression) baseline ‐ end of treatment172931Mean Difference (IV, Random, 95% CI)‐3.03 [‐4.67, ‐1.39]
6 MADRS after treatment3640Mean Difference (IV, Random, 95% CI)‐3.86 [‐7.30, ‐0.42]
7 Difference MADRS baseline ‐ end of treatment41015Mean Difference (IV, Random, 95% CI)‐3.01 [‐4.88, ‐1.14]
8 Mean HAMD after treatment in studies from German‐speaking countries and other studies172871Mean Difference (IV, Random, 95% CI)‐3.04 [‐4.29, ‐1.78]
8.1 Studies from German‐speaking countries111720Mean Difference (IV, Random, 95% CI)‐4.29 [‐5.61, ‐2.97]
8.2 Studies from other countries61151Mean Difference (IV, Random, 95% CI)‐0.77 [‐1.74, 0.20]
9 Mean Depression Scale von Zerssen (D‐S) after therapy/difference baseline ‐ after therapy4411Mean Difference (IV, Random, 95% CI)‐3.72 [‐5.32, ‐2.12]
10 Various self‐rating scales132330Std. Mean Difference (IV, Random, 95% CI)‐0.47 [‐0.64, ‐0.30]
10.1 von Zerssen Depression Scale (D‐S) after treatment3313Std. Mean Difference (IV, Random, 95% CI)‐0.62 [‐1.11, ‐0.14]
10.2 von Zerssen Depression Scale (D‐S) difference baseline ‐ after treatment2170Std. Mean Difference (IV, Random, 95% CI)‐0.90 [‐2.02, 0.22]
10.3 von Zerssen Adjective Mood Scale2401Std. Mean Difference (IV, Random, 95% CI)‐0.61 [‐0.84, ‐0.37]
10.4 Beck Depression Inventory1195Std. Mean Difference (IV, Random, 95% CI)‐0.28 [‐0.56, 0.00]
10.5 Beck Depression Inventory difference baseline ‐ after treatment2553Std. Mean Difference (IV, Random, 95% CI)‐0.31 [‐0.71, 0.09]
10.6 Zung Self Rating Depression Scale (SDS) difference baseline ‐ after treatment1146Std. Mean Difference (IV, Random, 95% CI)‐0.37 [‐0.73, ‐0.02]
10.7 Symptom Checklist (SCL‐58) depression score1375Std. Mean Difference (IV, Random, 95% CI)‐0.16 [‐0.37, 0.04]
10.8 von Zerssen Paranoid‐Depressivitäts‐Skala1177Std. Mean Difference (IV, Random, 95% CI)‐0.18 [‐0.48, 0.11]
11 Various self‐rating scales in studies from German‐speaking countries and other countries132330Std. Mean Difference (IV, Random, 95% CI)‐0.47 [‐0.64, ‐0.30]
11.1 Studies from German‐speaking countries101531Std. Mean Difference (IV, Random, 95% CI)‐0.57 [‐0.77, ‐0.37]
11.2 Studies from other countries3799Std. Mean Difference (IV, Random, 95% CI)‐0.17 [‐0.31, ‐0.04]
An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-002-01.jpg
Analysis

Comparison 2 Hypericum mono‐preparations vs. placebo. B. Continuous measures, Outcome 1 Mean HAMD (Hamilton Rating Scale for Depression) scores after therapy.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-002-02.jpg
Analysis

Comparison 2 Hypericum mono‐preparations vs. placebo. B. Continuous measures, Outcome 2 Mean HAMD (Hamilton Rating Scale for Depression) scores after 2 to 3 weeks of treatment.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-002-03.jpg
Analysis

Comparison 2 Hypericum mono‐preparations vs. placebo. B. Continuous measures, Outcome 3 Mean HAMD (Hamilton Rating Scale for Depression) score after 4 weeks of treatment.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-002-04.jpg
Analysis

Comparison 2 Hypericum mono‐preparations vs. placebo. B. Continuous measures, Outcome 4 Mean HAMD (Hamilton Rating Scale for Depression) scores after 6 to 8 weeks of treatment.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-002-05.jpg
Analysis

Comparison 2 Hypericum mono‐preparations vs. placebo. B. Continuous measures, Outcome 5 Difference HAMD (Hamilton Rating Scale for Depression) baseline ‐ end of treatment.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-002-06.jpg
Analysis

Comparison 2 Hypericum mono‐preparations vs. placebo. B. Continuous measures, Outcome 6 MADRS after treatment.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-002-07.jpg
Analysis

Comparison 2 Hypericum mono‐preparations vs. placebo. B. Continuous measures, Outcome 7 Difference MADRS baseline ‐ end of treatment.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-002-08.jpg
Analysis

Comparison 2 Hypericum mono‐preparations vs. placebo. B. Continuous measures, Outcome 8 Mean HAMD after treatment in studies from German‐speaking countries and other studies.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-002-09.jpg
Analysis

Comparison 2 Hypericum mono‐preparations vs. placebo. B. Continuous measures, Outcome 9 Mean Depression Scale von Zerssen (D‐S) after therapy/difference baseline ‐ after therapy.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-002-10.jpg
Analysis

Comparison 2 Hypericum mono‐preparations vs. placebo. B. Continuous measures, Outcome 10 Various self‐rating scales.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-002-11.jpg
Analysis

Comparison 2 Hypericum mono‐preparations vs. placebo. B. Continuous measures, Outcome 11 Various self‐rating scales in studies from German‐speaking countries and other countries.

Comparison 3

Safety ‐ Hypericum mono‐preparations vs. placebo
Outcome or subgroup titleNo. of studiesNo. of participantsStatistical methodEffect size
1 Number of patients discontinuing treatment/dropping out for adverse effects ‐ primary analysis162784Odds Ratio (M‐H, Random, 95% CI)0.92 [0.45, 1.88]
2 Number of patients dropping out162784Odds Ratio (M‐H, Random, 95% CI)0.87 [0.67, 1.12]
3 Number of patients reporting adverse effects142496Odds Ratio (M‐H, Random, 95% CI)0.98 [0.78, 1.23]
An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-003-01.jpg
Analysis

Comparison 3 Safety ‐ Hypericum mono‐preparations vs. placebo, Outcome 1 Number of patients discontinuing treatment/dropping out for adverse effects ‐ primary analysis.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-003-02.jpg
Analysis

Comparison 3 Safety ‐ Hypericum mono‐preparations vs. placebo, Outcome 2 Number of patients dropping out.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-003-03.jpg
Analysis

Comparison 3 Safety ‐ Hypericum mono‐preparations vs. placebo, Outcome 3 Number of patients reporting adverse effects.

Comparison 4

Hypericum mono‐preparations vs. standard antidepressants. A. Dichotomous measures
Outcome or subgroup titleNo. of studiesNo. of participantsStatistical methodEffect size
1 Responder (intent to treat) ‐ primary analysis172810Risk Ratio (M‐H, Random, 95% CI)1.01 [0.93, 1.09]
1.1 vs. older antidepressants51016Risk Ratio (M‐H, Random, 95% CI)1.02 [0.90, 1.15]
1.2 vs. SSRIs121794Risk Ratio (M‐H, Random, 95% CI)1.00 [0.90, 1.12]
2 Responder (per protocol)172306Risk Ratio (M‐H, Random, 95% CI)0.96 [0.88, 1.05]
2.1 vs. older antidepressants5854Risk Ratio (M‐H, Random, 95% CI)0.93 [0.78, 1.11]
2.2 vs. SSRIs121452Risk Ratio (M‐H, Random, 95% CI)0.97 [0.87, 1.08]
3 Responders according to CGI (Clinical Global Impression Index at least "much improved")122234Risk Ratio (M‐H, Random, 95% CI)1.01 [0.94, 1.09]
3.1 vs. older antidepressants4692Risk Ratio (M‐H, Random, 95% CI)0.97 [0.87, 1.09]
3.2 vs. newer antidepressants81542Risk Ratio (M‐H, Random, 95% CI)1.03 [0.92, 1.15]
4 Responder among studies from German‐speaking studies and other studies172769Risk Ratio (M‐H, Random, 95% CI)1.00 [0.93, 1.09]
4.1 Studies from German‐speaking countries91952Risk Ratio (M‐H, Random, 95% CI)1.04 [0.96, 1.13]
4.2 Studies from other countries8817Risk Ratio (M‐H, Random, 95% CI)0.90 [0.76, 1.06]
5 Remission (HAMD score < 8)4685Risk Ratio (M‐H, Random, 95% CI)1.24 [1.02, 1.50]
5.1 vs. older antidepressants00Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
5.2 vs. SSRIs4685Risk Ratio (M‐H, Random, 95% CI)1.24 [1.02, 1.50]
An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-004-01.jpg
Analysis

Comparison 4 Hypericum mono‐preparations vs. standard antidepressants. A. Dichotomous measures, Outcome 1 Responder (intent to treat) ‐ primary analysis.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-004-02.jpg
Analysis

Comparison 4 Hypericum mono‐preparations vs. standard antidepressants. A. Dichotomous measures, Outcome 2 Responder (per protocol).

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-004-03.jpg
Analysis

Comparison 4 Hypericum mono‐preparations vs. standard antidepressants. A. Dichotomous measures, Outcome 3 Responders according to CGI (Clinical Global Impression Index at least "much improved").

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-004-04.jpg
Analysis

Comparison 4 Hypericum mono‐preparations vs. standard antidepressants. A. Dichotomous measures, Outcome 4 Responder among studies from German‐speaking studies and other studies.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-004-05.jpg
Analysis

Comparison 4 Hypericum mono‐preparations vs. standard antidepressants. A. Dichotomous measures, Outcome 5 Remission (HAMD score < 8).

Comparison 5

Hypericum mono‐preparations vs. standard antidepressants. B. Continuous measures
Outcome or subgroup titleNo. of studiesNo. of participantsStatistical methodEffect size
1 Mean HAMD (Hamilton Rating Scale for Depression) after therapy121889Mean Difference (IV, Random, 95% CI)‐0.39 [‐1.23, 0.45]
1.1 vs. older antidepressants3477Mean Difference (IV, Random, 95% CI)‐0.06 [‐1.82, 1.71]
1.2 vs. SSRIs91412Mean Difference (IV, Random, 95% CI)‐0.52 [‐1.55, 0.51]
2 Mean HAMD (Hamilton Rating Scale for Depression) scores after 2 or 3 weeks of treatment91529Mean Difference (IV, Random, 95% CI)‐0.12 [‐1.02, 0.78]
2.1 vs. older antidepressants3477Mean Difference (IV, Random, 95% CI)‐0.05 [‐1.31, 1.20]
2.2 vs. SSRIs61052Mean Difference (IV, Random, 95% CI)‐0.25 [‐1.50, 1.00]
3 Mean HAMD (Hamilton Rating Scale for Depression) scores after 4 weeks of treatment91367Mean Difference (IV, Random, 95% CI)‐0.34 [‐1.48, 0.80]
3.1 vs. older antidepressants3477Mean Difference (IV, Random, 95% CI)0.02 [‐1.11, 1.15]
3.2 vs. SSRIs6890Mean Difference (IV, Random, 95% CI)‐0.69 [‐2.44, 1.06]
4 Mean HAMD (Hamilton Rating Scale for Depression) scores after 6 to 8 weeks of treatment101659Mean Difference (IV, Random, 95% CI)‐0.34 [‐1.24, 0.57]
4.1 vs. older antidepressants2391Mean Difference (IV, Random, 95% CI)‐0.21 [‐2.56, 2.14]
4.2 vs. SSRIs81268Mean Difference (IV, Random, 95% CI)‐0.38 [‐1.46, 0.69]
5 Difference HAMD (Hamilton Rating Scale for Depression) baseline ‐ end of treatment101652Mean Difference (IV, Random, 95% CI)‐0.35 [‐1.23, 0.52]
5.1 vs. older antidepressants1210Mean Difference (IV, Random, 95% CI)‐1.20 [‐3.29, 0.89]
5.2 vs. SSRIs91442Mean Difference (IV, Random, 95% CI)‐0.25 [‐1.21, 0.71]
6 MADRS after treatment1108Mean Difference (IV, Random, 95% CI)‐0.90 [‐4.73, 2.93]
6.1 vs. older antidepressants00Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
6.2 vs. SSRIs1108Mean Difference (IV, Random, 95% CI)‐0.90 [‐4.73, 2.93]
7 Difference MADRS baseline ‐ end of treatment2352Mean Difference (IV, Random, 95% CI)‐2.90 [‐5.10, ‐0.70]
7.1 vs. older antidepressants00Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
7.2 vs. SSRIs2352Mean Difference (IV, Random, 95% CI)‐2.90 [‐5.10, ‐0.70]
8 Mean HAMD after treatment in studies from German‐speaking countries and other studies152423Mean Difference (IV, Random, 95% CI)‐0.39 [‐1.23, 0.45]
8.1 Studies from German‐speaking countries91888Mean Difference (IV, Random, 95% CI)‐0.43 [‐1.28, 0.41]
8.2 Studies from other countries6535Mean Difference (IV, Random, 95% CI)‐0.44 [‐2.67, 1.79]
9 Mean D‐S (Depression Scale von Zerssen) scores after therapy4360Mean Difference (IV, Random, 95% CI)2.66 [0.83, 4.50]
9.1 vs. older antidepressants2272Mean Difference (IV, Random, 95% CI)2.81 [0.77, 4.85]
9.2 vs. SSRIs288Mean Difference (IV, Random, 95% CI)2.04 [‐2.13, 6.21]
10 Various self‐rating scales101570Std. Mean Difference (IV, Random, 95% CI)0.01 [‐0.13, 0.15]
10.1 von Zerssen Depression Scale (D‐S) after treatment4360Std. Mean Difference (IV, Random, 95% CI)0.28 [0.07, 0.49]
10.2 Beck Depression Inventory183Std. Mean Difference (IV, Random, 95% CI)‐0.01 [‐0.44, 0.42]
10.3 Beck Depression Inventory difference baseline ‐ after treatment2466Std. Mean Difference (IV, Random, 95% CI)‐0.12 [‐0.53, 0.29]
10.4 Zung Self Rating Depression Scale (SDS) difference baseline ‐ after treatment1205Std. Mean Difference (IV, Random, 95% CI)‐0.11 [‐0.38, 0.17]
10.5 von Zerssen Adjective Mood Scale2456Std. Mean Difference (IV, Random, 95% CI)‐0.06 [‐0.24, 0.13]
11 Various self‐rating scales in studies from German‐speaking countries and other countries101570Std. Mean Difference (IV, Random, 95% CI)0.01 [‐0.13, 0.15]
11.1 Studies from German‐speaking countries61177Std. Mean Difference (IV, Random, 95% CI)‐0.02 [‐0.21, 0.18]
11.2 Studies from other countries4393Std. Mean Difference (IV, Random, 95% CI)0.10 [‐0.10, 0.29]
An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-005-01.jpg
Analysis

Comparison 5 Hypericum mono‐preparations vs. standard antidepressants. B. Continuous measures, Outcome 1 Mean HAMD (Hamilton Rating Scale for Depression) after therapy.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-005-02.jpg
Analysis

Comparison 5 Hypericum mono‐preparations vs. standard antidepressants. B. Continuous measures, Outcome 2 Mean HAMD (Hamilton Rating Scale for Depression) scores after 2 or 3 weeks of treatment.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-005-03.jpg
Analysis

Comparison 5 Hypericum mono‐preparations vs. standard antidepressants. B. Continuous measures, Outcome 3 Mean HAMD (Hamilton Rating Scale for Depression) scores after 4 weeks of treatment.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-005-04.jpg
Analysis

Comparison 5 Hypericum mono‐preparations vs. standard antidepressants. B. Continuous measures, Outcome 4 Mean HAMD (Hamilton Rating Scale for Depression) scores after 6 to 8 weeks of treatment.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-005-05.jpg
Analysis

Comparison 5 Hypericum mono‐preparations vs. standard antidepressants. B. Continuous measures, Outcome 5 Difference HAMD (Hamilton Rating Scale for Depression) baseline ‐ end of treatment.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-005-06.jpg
Analysis

Comparison 5 Hypericum mono‐preparations vs. standard antidepressants. B. Continuous measures, Outcome 6 MADRS after treatment.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-005-07.jpg
Analysis

Comparison 5 Hypericum mono‐preparations vs. standard antidepressants. B. Continuous measures, Outcome 7 Difference MADRS baseline ‐ end of treatment.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-005-08.jpg
Analysis

Comparison 5 Hypericum mono‐preparations vs. standard antidepressants. B. Continuous measures, Outcome 8 Mean HAMD after treatment in studies from German‐speaking countries and other studies.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-005-09.jpg
Analysis

Comparison 5 Hypericum mono‐preparations vs. standard antidepressants. B. Continuous measures, Outcome 9 Mean D‐S (Depression Scale von Zerssen) scores after therapy.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-005-10.jpg
Analysis

Comparison 5 Hypericum mono‐preparations vs. standard antidepressants. B. Continuous measures, Outcome 10 Various self‐rating scales.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-005-11.jpg
Analysis

Comparison 5 Hypericum mono‐preparations vs. standard antidepressants. B. Continuous measures, Outcome 11 Various self‐rating scales in studies from German‐speaking countries and other countries.

Comparison 6

Safety ‐ Hypericum mono‐preparations vs. standard antidepressants
Outcome or subgroup titleNo. of studiesNo. of participantsStatistical methodEffect size
1 Number of patients discontinuing treatment/dropping out due to adverse/side effects162785Odds Ratio (M‐H, Random, 95% CI)0.41 [0.29, 0.60]
1.1 vs. older antidepressants51016Odds Ratio (M‐H, Random, 95% CI)0.24 [0.13, 0.46]
1.2 vs. SSRIs111769Odds Ratio (M‐H, Random, 95% CI)0.53 [0.34, 0.83]
2 Number of patients dropping out162785Odds Ratio (M‐H, Random, 95% CI)0.77 [0.62, 0.95]
2.1 vs. older antidperessants51016Odds Ratio (M‐H, Random, 95% CI)0.67 [0.47, 0.95]
2.2 vs. SSRIs111769Odds Ratio (M‐H, Random, 95% CI)0.83 [0.63, 1.08]
3 Number of patients reporting adverse effects142663Odds Ratio (M‐H, Random, 95% CI)0.56 [0.43, 0.74]
3.1 vs. older antidepressants51016Odds Ratio (M‐H, Random, 95% CI)0.39 [0.30, 0.50]
3.2 vs. SSRIs91647Odds Ratio (M‐H, Random, 95% CI)0.70 [0.49, 1.00]
An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-006-01.jpg
Analysis

Comparison 6 Safety ‐ Hypericum mono‐preparations vs. standard antidepressants, Outcome 1 Number of patients discontinuing treatment/dropping out due to adverse/side effects.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-006-02.jpg
Analysis

Comparison 6 Safety ‐ Hypericum mono‐preparations vs. standard antidepressants, Outcome 2 Number of patients dropping out.

An external file that holds a picture, illustration, etc.
Object name is nCD000448-CMP-006-03.jpg
Analysis

Comparison 6 Safety ‐ Hypericum mono‐preparations vs. standard antidepressants, Outcome 3 Number of patients reporting adverse effects.

Characteristics of studies

Characteristics of included studies [ordered by study ID]

MethodsConcealment: unclear
Blinding: double‐blind
Drop‐outs/withdrawals: 6 of 35 in hypericum group, 3 of 35 in fluoxetine group
Jadad score: 1‐1‐1
IV score: 1‐0‐1‐0.5‐0.5‐0.5
ParticipantsNumber of patients included/analyzed: 70/61
Demographics: 47 female, mean age 51 (treatment group) and 48 (control group)
Diagnosis: mild to modedrate depression (ICD 10 F32.0 and 32.1)
Setting: multicenter trial from Denmark
Baseline: HAMD score treatment group 20.0 +/‐ 3.2, control group 20.7 +/‐ 2.9
InterventionsTreatment: Hypericum extract (Calmigen) 2x1 coated tablet daily (300 mg extract) for 6 weeks
Control: Fluoxetine 2x20 mg for 6 weeks
OutcomesObservation period: 6 weeks
Physician‐rated: Hamilton Depression Scale (HAMD, 17 items; response = at least 50% reduction), Clinical Global Impression Index (CGI)
Patient‐rated: von Zerssen Depression Scale, Clinical Global Impression Index (CGI), global assessment
NotesContract research organization which performed the study contacted for additional information but no answer was received
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Unclear riskB ‐ Unclear

MethodsConcealment: consecutively numbered pharmacy
Blinding: double‐blind (double‐dummy)
Drop‐outs/withdrawals: ITT analysis on 163 of 174 patients randomized
Jadad score: 1‐2‐0
IV score: 1‐1‐1‐0.5‐0.5‐0.5
ParticipantsNumber of patients included/analyzed: 177 /163
Demographics: 129 female, mean age 50 years
Diagnosis: mild to moderate major depression (DSM‐IV 296.31 or 32)
Setting: 15 practices (psychiatry, neurology, GP) in Sweden
Baseline: HAMD score hypericum group 24.9, fluoxetine group 23.8, placebo group 25.2
InterventionsTreatment: Hypericum extract LI 160 (Jarsin 300) 3x1 coated table (900 mg) for 6 weeks Control 1: Fluoxetine 1x1 capsule (20 mg) for 6 weeks
Control 2: Placebo for 4 weeks, then randomized to hypericum or fluoxetine
OutcomesObservation period: 6 weeks
Physician‐rated: Hamilton Depression Scale (HAMD), Montgomery Asberg Depression Rating Scale (MADRS), Clinical Global Impression Index (CGI)
NotesDue to the request of the ethical review board patients allocated to placebo had to be re‐randomized to hypericum or fluoxetine after 4 weeks. Prioir to publication of the main reference authors provided some additional information; after publication of the full report no further unpublished information was considered necessary
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: consecutively numbered pharmacy
Blinding: double‐blind
Drop‐outs/withdrawals: ITT analysis on 207 of 218 patients randomized
Jadad score: not performed (see notes)
IV score: not performed (see notes)
ParticipantsNumber of patients included/analyzed: 218/207
Demographics: 167 female, mean age 44 years
Diagnosis: mild to moderate major depression (DSM‐IV 296.31 or 32)
Setting: 17 practices (GP, various other) in Germany
Baseline: HAMD score hypericum group 19.7, placebo group 19.7
InterventionsTreatment: Hypericum extract HYP611 (Felis 650) 1x1 coated table (650 mg) for 6 weeks
Control: Placebo
OutcomesObservation period: 6 weeks
Physician‐rated: Hamilton Depression Scale (HAMD), Montgomery Asberg Depression Rating Scale (MADRS), Clinical Global Impression Index (CGI)
Patient‐rated: von Zerssen Paranoid Depression Scale (PDS)
NotesThis trial was identified during the revision process. It is only published as a short report in a sponsored supplement to a nonscientific journal. The sponsor (Hexal AG, Holzkirchen, Germany) allowed a reviewer (KL) to extract detailed information from the full unpublished study report
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ adequate

MethodsConcealment: unclear
Blinding: double‐blind (double‐dummy)
Drop‐outs/withdrawals: 7 of 15 hypericum patients, 3 of 15 sertraline patients
Jadad score: 1‐2‐0
IV score: 1‐0‐1‐1‐1‐0
ParticipantsNumber of patients included/analyzed: 30/28
Demographics: 19 female, mean age 45 years
Diagnosis: mild to moderate depressive disorder
Setting: community hospital in USA
Baseline: HAMD score hypericum group 21.3 +/‐ 3.2, sertraline group 21.7 +/‐ 2.7
InterventionsTreatment: Hypericum extract LI 160 3x1 tablet (900 mg extract) for 7 weeks
Control: Sertraline 3x1 capsule (75 mg) for 7 weeks
OutcomesObservation period: 7 weeks
Physician‐rated: Hamilton Depression Scale (HAMD, 17 items; response = at least 50% reduction), Clinical Global Impression Index (CGI)
Patient‐rated: von Zerssen Depression Scale (D‐S)
NotesSmall study. High drop‐out rate with more losses in the hypericum group. In spite of intention to treat analysis bias cannot be ruled out
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Unclear riskB ‐ Unclear

MethodsConcealment: unclear
Blinding: double‐blind
Drop‐outs/withdrawals: 18/45 in the hypericum group, 23 of 47 in the fluoxetine group and 22 of 43 in the placebo group
Jadad score: 1‐1‐0
IV score: 1‐0‐1‐0.5‐0.5‐0
ParticipantsNumber of patients included/analyzed: 135/135
Demographics: 57% female, mean age 37 years
Diagnosis: mild to moderate major depression
Setting: 2 psychiatric outpatient departments in the US
Baseline: HAMD score (17 items) 19.6 +/‐ 3.5 in the hypericum group, 19.9 +/‐ 2.9 in the fluoxetine and 19.6 +/‐ 3.1 in the placebo group
InterventionsTreatment: Hypericum extract LI 160 3x1 tablets (900 mg) for 12 weeks (+ 1 capsule placebo)
Control 1: Fluoxetine 1x20 mg (capsule) for 12 weeks (+ 3x1 placebo tablets)
Control 2: Placebo (1x1 capsule, 3x1 tablets)
OutcomesObservation period: 12 weeks (+ 1 week placebo run‐in)
Physician‐rated: Hamilton Depression Scale (HAMD, 17 items; response = at least 50% reduction), Clinical Global Impression Index (CGI)
Patient‐rated: Beck Depression Inventory (BDI)
NotesHigh drop‐out rate (intent to treat analysis); recruitment stopped before planned sample size was reached due to decision of the sponsor (lichtwer Pharma, Berlin). Only remission rates reported (and use instead of responder data for analyses)
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Unclear riskB ‐ Unclear

MethodsConcealment: numbered pharmacy
Blinding: double‐blind (double‐dummy technique)
Drop‐out/withdrawals17 of 123 (hypericum), 19 of 118 (sertaline)
Jadad score: 2‐2‐1
IV score: 1‐1‐1‐0.5‐0.5‐0.5
ParticipantsPatients included/analyzed: 241/118 (in per protocol analysis)
Demographics: 74% female, mean age 19 years
Diagnosis: moderate depression (F32.1 or F33.1)
Setting: 18 primary care physicians in Germany
Baseline: HAMD values 22.0 +/‐ 1.1 vs. 22.1 +/‐ 1.1
InterventionsInterventions: Hypericum extract STW3 1x1 tablet (612 mg extract) + 1 capsule placebo daily for 12 weeks
Control: 1x1 tablet sertraline (50 mg) + 1 tablet placebo daily for 12 weeks
After the 12 week‐treatment phase there was an optional continuation phase
OutcomesObservation period: 12 weeks for the main comparison + 12 weeks continuation phase
Physician‐rated: Hamitlon Depression Scale (HAMD, 17 items; response = HAMD score < 10 or at least 50% reduction), Clinical Global Impression Index (CGI)
Patient‐rated: Adjective Mood Scale (BfS)
NotesAdditional information provided by first author and sponsor.
The number of adverse effects in the hypericum group is much higher than in the sertraline group (189 vs. 112). For the number of patients reporting adverse effects the difference is less pronounced (74 vs. 60; 1 patient in the hypericum gorup had reported 26 adverse effects). In the hypericum group there was a higher number of infections (48 vs. 26; not observed in any other trial before). The number of adverse effects which were possibly causally related with the treatment was higher in the sertraline group (12 vs. 16). Also the intensity of side effects was more pronounced in the sertraline group.
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: numbered pharmacy
Blinding: double‐blind (double‐dummy technique)
Drop‐outs/withdrawals: 6 of 131 (hypericum), 6 of 127 (citalopram), 8 of 130 (placebo)
Jadad score: 2‐2‐1
IV score: 1‐1‐1‐0.5‐0.5‐1
ParticipantsPatients included/analyzed: 388/388
Demographics: 67% female, mean age 50 years
Diagnosis: moderate depression (ICD‐10 F32.1 or F33.1)
Setting: 21 general practitioners and internists in Germany
Baseline: HAMD score hypericum group 21.9 +/‐ 1.2, citalopram group 21.8 +/‐1.2, placebo group 22.0 +/‐ 1.2
InterventionsTreatment: Hypericum extract STW3‐VI 1x1 tablet (900 mg extract) for 6 weeks
Control 1: Placebo for 6 weeks
Control 2: Citalopram 1x1 tablet 20 mg for 6 weeks
OutcomesObservation period: 6 weeks
Physician‐rated: Hamitlon Depression Scale (HAMD, 17 items; response = HAMD score < 10 or at least 50% reduction), Clinical Global Impression Index (CGI)
Patient‐rated: Adjective Mood Scale (BfS)
NotesAdditional information provided by first author and sponsor
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: consecutively numbered pharmacy
Blinding: Double‐blind
Drop‐outs/withdrawals: 7 of 51 in hypericum group, 9 of 51 in maprotiline group
Jadad score: 2‐2‐1
IV score: 1‐1‐1‐0.5‐0.5‐0.5
ParticipantsNumber of patients included/analyzed: 102 /86
Demographics: 73 female, mean age 44 years in the Hypericum group and 48 years in the maprotiline group
Diagnosis: single, moderately severe depressive episode (ICD 10 F32.1) (according to information from the sponsor patients met DSM‐III‐R criteria for major depression); Setting: 6 practices in Germany (neurology/psychiatry)
Baseline: HAMD scores 20.5 +/‐ 3.7 in hypericum group and 21.5 +/‐ 3.9 in maprotiline group
InterventionsTreatment: Hypericum extract LI 160 (Jarsin 300) 3x1 coated tablet (900 mg extract) daily for 4 weeks
Control: Maprotiline 3x1 coated tablet (75 mg) daily for 4 weeks
OutcomesObservation period 4 weeks
Physician‐rated: Hamilton Depression Scale (HAMD, 17 items; response = at least 50% reduction or score < 10), Clinical Global Impression Index (CGI), global assessment
Patient rated: Depression Scale von Zerssen D‐S, global assessment
NotesAdditional information provided from sponsor (Lichtwer, Berlin, Germany)
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: unclear
Blinding: double‐blinding
Drop‐outs/withsdrawals: 8 of 77 hypericum patients and 16 of 84 fluoxetine patients
Jadad score: 1‐1‐1
IV score: 1‐0‐1‐0.5‐0.5‐0.5
ParticipantsNumber of patients included/analyzed: 161/149
Demographics: 129 female, mean age 69 years
Diagnosis: mild to moderate depression (ICD 10 F32.0 or F32.1)
Setting: 17 practices in Germany
Baseline: HAMD score hypericum group 16.6, fluoxetine group 17.2
InterventionsTreatment: Hypericum extract LoHyp‐57 2x2 coated tablets (800 mg) for 6 weeks
Control: Fluoxetine 2x2 coated tablets (20 mg) for 6 weeks
OutcomesObservation period: 6 weeks
Physcian‐rated: Hamilton Depression Scale (HAMD, 17 items; response = at least 50% reduction or score < 11), Clinical Global Impression Index (CGI)
Patient‐rated: Self‐Rating Scale for Depression (SDS), Fragebogen Alltagsleben (German quality of life questionnaire)
NotesReporting of results partly insufficient
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Unclear riskB ‐ Unclear

MethodsConcealment: central telephone randomization
Blinding: double‐blind (double‐dummy)
Drop‐outs/withdrawals: 31 of 113 in hypericum group, 32 of 116 in placebo group, and 32 of 111 in sertraline group
Jadad score: 2‐2‐1
IV score: 1‐1‐1‐0.5‐0.5‐0.5
ParticipantsNumber of patients included/analyzed: 340/340
Demographics: 66% female, mean age 43 years
Diagnosis: major depression (DSM‐IV)
Setting: 12 academic and community psychiatric research clinics in the US
Baseline: HAMD scores 23.1 +/‐ 2.7 (hypericum), 22.7 +/‐ 2.7 (placebo), 22.5 +/‐ 2.5 (sertraline)
InterventionsTreatment: Hypericum LI 160 extract 900 to 1500 mg for 8 weeks
Control 1: Placebo for 8 weeks
Control 2: Sertraline 50 to 100 mg for 8 weeks
OutcomesObservation period: 1 weeks run‐in, 8 weeks treatment, 18 weeks follow‐up for responders
Physician‐rated: Hamilton Depression Scale (HAMD, response = at least 50% reduction or score < 10), Clinical Global Impression Index (CGI), Global Assessment of Functioning (GAF)
Patient‐rated: Beck Depression Inventory (BDI), Sheehan Disability Scale (SDS)
NotesTrial funded by public institution (NIH). Large proportion of patients with chronic depression. Some unblinding detected (sole trial which reported a check of blinding). Authors provided additional data
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: numbered pharmacy
Blinding: double‐blind
Drop‐outs/withdrawals: 3 of 54 in hypericum group, 4 of 54 in placebo group
Jadad score: 2‐2‐1
IV score: 1‐1‐1‐1‐1‐0.5
ParticipantsNumber of patients included/analyzed: 108/101
Demographics: 66 female, mean age 52 years
Diagnosis: major depression (DSM‐III‐R, HAMD score > 15)
Setting: 17 practices in Germany (neurologists/psychiatrist, general practitioners)
Baseline: HAMD score 21.8 +/‐ 2.8 (hypericum), 20.4 +/‐ 3.4 (placebo)
InterventionsTreatment: Hypericum extract LI160 (Jarsin 300) 3x1 coated tablet daily (900 mg extract) for 4 weeks
Control: Placebo
For further 2 weeks both groups received Hypericum
OutcomesObservation period: 4 weeks
Physician‐rated: Hamilton Depression Scale (HAMD, 21 items; response = at least 50% reduction or score < 10), Clinical Global Impression Index (CGI) after 2 and 4 weeks
Patient‐rated: Depression Scale von Zerssen (D‐S), complaints check list (BEB) after 2 and 4 weeks
NotesAdditional information provided by author and sponsor (Lichtwer, Berlin, Germany). This trial was first published in 1993 (in German, 1994 in English) with 72 patients and re‐published with 108 patients in 1996 (without refering to the earlier publications)
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: numbered pharmacy
Blinding: double‐blind
Drop‐outs/withdrawals: none
Jadad score: 2‐2‐1
IV score: 1‐1‐1‐1‐1‐1
ParticipantsPatients included/analyzed: 72/72
Demographics: 48 female, mean age 48 years
Diagnosis: mild to moderate depression (DSM‐IV 296.21/31/22/32)
Setting: 11 practices (psychiatry, internal medicine, GP) in Germany
Baseline: HAMD score 19.7 +/‐ 3.4 hpyericum group, 20.1 +/‐ 2.6 placebo group
InterventionsTreatment: Hypericum extract WS 5572 (Neuroplant) 3x1 coated tablet (900 mg) for 6 weeks
Control: Placebo for 6 weeks
OutcomesObservation period: 3‐7 days run‐in, 6 weeks treatment
Physician‐rated: Hamilton Depression Scale (HAMD, 17 items; response = at least 50% reduction), Clinical Global Impression Index (CGI)
Patient‐rated: von Zerssen Depression Scale (D‐S), Patient's Global Assessment Scale (GPA)
NotesTrial with adaptive design stopped early due to significant superiority at preplanned interim analysis
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: numbered pharmacy
Blinding: double‐blind
Drop‐out/withdrawals: 12 of 123 (hypericum 600 mg), 19 of 127 (hypericum 1200 mg), 8 of 82 (placebo)
Jadad score: 2‐2‐1
IV score: 1‐1‐0.5‐1‐1‐1
ParticipantsPatientns included/analyzed: 332/324
Demographics: 63% female, mean age 46 years
Diagnosis: mild or moderate, single or recurrent, major depressive episode (DSM IV 296.21/22, 296.31/32)
Setting: 11 psychiatric and 5 GP practices in Germany
Baseline: HAMD score hypericum 600 mg group 22.8 +/‐ 3.3, hypericum 1200 mg group 22.6 +/‐3.8, placebo group 23.6 +/‐ 4.2
InterventionsTreatment 1: 1 tablet hypericum extract (600 mg) daily + 1 placebo tablet for 6 weeks
Treatment 2: 2x1 tablet hypericum extract (total daily extract dosage 1200 mg) daily for 6 weeks
Control: 2x1 tablet placebo daily for 6 weeks
OutcomesObservation period: 6 weeks
Physician‐rated: Hamilton depression Scale (HAMD, 17 items; response = atleast 50% score reduction), Clinical Global Impression Index (CGI), Montgomery‐Asberg Depression Scale (MADRS)
Patient‐rated: Beck Depression Inventory (BDI), qualtiy of life (SF‐36)
NotesRandomization in 3:3:2 ratio
Additional information provided by sponsor
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: unclear
Blinding: double‐blind
Drop‐outs/withdrawals: 3 in group hypericum 1, 1 in group hypericum 2, 4 in placebo group
Jadad score: 2‐2‐1
IV score: 1‐0‐1‐1‐1‐1
ParticipantsPatients included/analyzed: 147/147
Demographics: 117 female, mean age 49 years
Diagnosis: mild to moderate depression (DSM‐IV)
Setting: 11 practices in Germany
Baseline: HAMD score 20.9 +/‐ 3.1 (hypericum 1), 20.3 +/‐ 2.7 (hypericum 2), 21.2 +/‐ 3.3 (placebo)
InterventionsTreatment 1: Hypericum extract WS 5572 (5% hyperforin) 3x1 coated tablet (900 mg) for 6 weeks
Treatment 2: Hypericum extract WS 5573 (0.5% hyperforin) 3x1 coated tablet (900 mg) for 6 weeks
Control: Placebo for 6 weeks
OutcomesObservation period: 6 weeks
Physician‐rated: Hamilton Depression Scale (HAMD, 17 items; response = at least 50% reduction), Clinical Global Impression Index (CGI)
Patient‐rated: von Zerssen Depression Scale (D‐S), global assessment
NotesIncludes a group receiving a second hypericum extract with very low hyperforin content (not included in meta‐analysis as this tests an extract which was not marketed)
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: numbered pharmacy
Blinding: double‐blind
Drop‐outs/withdrawals: 18 of 186 (hypericum), 25 of 189 placebo)
Jadad score: 1‐2‐0
IV score: 1‐0‐1‐1‐1‐0.5
ParticipantsPatients included/analyzed: 375/375
Demographics: 297 female, mean age 41 years
Diagnosis: mild to moderate depression (DSM‐IV 296.21/22/31/32)
Setting: 26 psychiatric centers and practices in France
Baseline: HAMD score hypericum group 21.9 +/‐ 1.7, placebo group 21.9 +/‐ 1.7
InterventionsTreatment: Hypericum extract WS 5570 3x1 tablet (900 mg) for 6 weeks
Control: placebo for 6 weeks
OutcomesObservation period: 3‐7 days run‐in, 6 weeks treatment
Physician‐rated: Hamilton Depression Scale (HAMD, 17 items; response = at least 50% reduction), Montgomery‐Asberg Depression Rating Scale (MADRS), Clinical Global Impression Index (CGI)
Patient‐rated: Symptom Check List (SCL‐58)
NotesTrial with preplanned interim analysis with 169 patients (no significant difference)
Additional information provided by sponsor (Schwabe, Karlsruhe)
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: unclear
Blinding: double‐blind
Drop‐outs/withdrawals: unclear
Jadad score: 1‐2‐0
IV score: 1‐0‐0‐1‐1‐0.5
ParticipantsPatients included/analyzed: 247/?
Demographics: 183 female, mean age 43 years
Diagnosis: mild to moderate depression (DSM‐IV 296.2x/3x)
Setting: 18 GPs and psychiatric outpatient clinics in the UK
Baseline: HAMD about 22 in both groups (data extrapolated from figure in poster)
InterventionsTreatment: Hypericum extract LI 160 3x1 coated tablet (900 mg) for 12 weeks
Control: Placebo for 12 weeks
OutcomesObservation period: 3 to 7 days run‐in, 12 weeks treatment
Physician‐rated: Hamilton depression Scale (HAMD, 17 items; response = at least 50% reduction or score < 10), Montgomery‐Asberg Depression Rating Scale (MADRS), Clinical Global Impression Index (CGI)
Patient‐rated: None mentioned
NotesAvailable as congress abstract and poster handout only. Additional information from sponsor (Lichtwer, Berlin)
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Unclear riskB ‐ Unclear

MethodsConcealment: unclear
Blinding: double‐blind (double‐dummy technique)
Drop‐outs/withdrawals: 19 of 72 (no details for single groups reported)
Jadad score: 1‐1‐0
IV score: 1‐0‐0‐0.5‐0.5‐0
ParticipantsNumber of patients included/analyzed: 72/66
Demographics: 83% female, mean age 40 years
Diagnosis: mild to moderate major depression (DSM‐IV criteria)
Setting: Institute of Psychiatry, University Sao Paulo, Brasil
Baseline: extrapolated from figure HAMD score hypericum group 15, placebo 17, fluoxetine, 15
InterventionsTreatment: Hypericum extract Iperisan 3x1 (900 mg) daily for 8 weeks
Control 1: Placebo
Control 2: Fluoxetine 20 mg/day for 8 weeks (1 capsule fluoxetine and 2 capsules placebo per day)
OutcomesObservation period: 1‐week prerandomzationw ash‐out, 8 weeks post randomization
Physician‐rated: Hamilton Depression Scale (HAMD, 21 items, response = at least 50% redution, remission < 8 on the 17 item scale), Montgomery‐Asberg Depression rating Scale (MADRS), Clinical Global Impression Index (CGI)
Patient‐rated: none
NotesPublication misses important details. Authors inquired for additional information but no response received
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Unclear riskB ‐ Unclear

MethodsConcealment: numbered pharmacy
Blinding: Double‐blind
Drop‐outs/withdrawals: 13 of 106 (hypericum), 11 of 110 (imipramine), 9 of 47 (placebo)
Jadad score: 2‐2‐1
IV score: 1‐1‐1‐0.5‐0.5‐0.5
ParticipantsPatients included/analyzed: 263/251
Demographics: 197 female, mean age 47 years
Diagnosis: moderate depression (ICD‐10 F32.1/F32.2)
Setting: 18 GPs in Germany
Baseline: HAMD score 22.7 +/‐ 4.2 (hypericum), 22.2 +/‐ 4.2 (imipramine), 22.7 +/‐ 4.0 (placebo)
InterventionsTreatment: Hypericum extract STEI 300 3x1 capsule (1050 mg) for 8 weeks
Control 1: Imipramine 100 mg for 8 weeks
Control 2: Placebo for 8 weeks
OutcomesObservation period: 1 week screening, 8 weeks treatment
Physician‐rated: Hamilton Depression Scale (HAMD, 17 items; response = at least 50% reduction), Hamilton Anxiety Scale (HAMA), Clinical Global Impression Scale (CGI)
Patient‐rated: Zung Depression Scale, Quality of life (SF‐36)
NotesAdditional information from sponsor (Steiner, Berlin). Primary outcome comparison with placebo after 6 weeks, with imipramine after 8 weeks
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: numbered pharmacy
Blinding: double‐blind
Drop‐outs/withdrawals: 1 of 81 (hypericum), 2 of 81 (placebo)
Jadad score: 2‐2‐1
IV score: 1‐0‐0‐1‐1‐1
ParticipantsPatients included/analyzed: 162/159
Demographics: 108 female, mean age 47 (hypericum) vs. 39 (placebo)
Diagnosis: mild to moderate depression (F32.0/32.1)
Setting: 16 private practices in Germany
Baseline: HAMD values 20.1 +/‐ 2.8 (hypericum) vs. 18.7 +/‐ 3.5
InterventionsTreatment: 2x1 coated tablet (500 mg extract) Hypericum extract ZE117 daily for 6 weeks
Control: Placebo
OutcomesObservation period: 6 weeks
Physician‐rated: Hamilton Depression Scale (HAMD, 21 items; response = at least 50% reduction or score < 10), Clinical Global Impression Index (CGI)
Patient‐rated: visual analogue scale
NotesAdditional information provided by author and sponsor (Zeller AG, Romanshorn, Switzerland). Baseline HAMD score lower in placebo group. Low response rate in placebo group.
In meta‐analyses HAMD values were calculated for the 17‐item version (data provided by author)
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: numbered pharmacy
Blinding: double‐blind (double‐dummy)
Drop‐outs/withdrawals: 1 of 126 (hypericum), 1 of 113 (fluoxetine)
Jadad score: 1‐2‐1
IV score: 1‐0‐1‐0.5‐0.5‐1
ParticipantsPatients included/analyzed: 230/228
Demographics: 157 female, mean age 46 years
Diagnosis: mild to moderate depression (ICD‐10 F32.0/32.1)
Setting: 7 practices (internal medicine) in Germany
Baseline: HAMD values 19.6 +/‐ 3.1 (hypericum), 19.5 +/‐ 2.4 (fluoxetine)
InterventionsTreatment: Hypericum extract Ze 117 2x1 coated tablet (500 mg) for 6 weeks
Control: Fluoxetine 1x1 capsule (20 mg) for 6 weeks
OutcomesObservation period: 6 weeks
Physician‐rated: Hamilton Depression Scale (HAMD, 21 items; response = at least 50% reduction or score < 10), Clinical Global Impression (CGI)
Physician‐rated: visual analogue scale
NotesAdditional information (particulary responder rates and 17‐item HAMD scores) provided by author and sponsor (Zeller AG, Romanshorn, Switzerland)
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: numbered pharmacy
Blinding: double‐blind
Drop‐outs/withdrawals:15 of 98 (hypericum), 13 of 102 (placebo)
Jadad score: 2‐2‐1
IV score: 1‐1‐1‐1‐1‐1
ParticipantsPatients included/analyzed: 200/195
Demographics: 64% female, mean age 42 years
Diagnosis: major depression (DSM‐IV)
Setting: 11 academic medical centers in the USA
Baseline: mean HAMD values above 22 in both groups
InterventionsTreatment: Hypericum extract LI 160 3x1 to 4x1 tablet (900 to 1200 mg) for 8 weeks
Control: placebo for 8 weeks
OutcomesObservation period: 1 week run‐in, 8 weeks treatment
Phsician‐rated: Hamilton Depression Scale (HAMD, 17 items, at least 50% reduction), Clinical Global Impression Index (CGI), Hamilton Anxiety Scale (HAMA)
Patient‐rated: Beck Depression Inventory (BDI)
NotesAverage duration of current depressive episode more than 2 years. Additional information provided from authors
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsAllocation concealment: numbered pharmacy
Blinding: double‐blind (double dummy technique)
Drop‐out/wothdrawals: 17 of 125 in hypericum group, 29 of 126 in the paroxetine group
Jadad score: 2‐2‐1
IV score: 1‐1‐1‐0.5‐0.5‐0.5
ParticipantsNumber of patients included/analyzed: 251/244
Demographics: 69% female, mean age 47 years
Diagnosis: moderate or severe unupolar major depression (DSM IV 296.22/23, 296.32/33)
Setting: 21 Psychiatric primary care centers in Germany
Baseline: HAMD score 25.5.+/‐ 2.7 (hypericum), 25.5 +/‐ 2.9 (paroxetine)
InterventionsTreatment: 3x1 tablet (900 mg daily) hypericum extract WS 5570 daily for 6 weeks (in patients whose depression score was not improved by at least 20% 3x 600 mg were used after 2 weeks)
Control: 1x1 tablet (20 mg) paroxetine daily for 6 weeks (in patients whose depression score was not improved by at least 20% 40 mg were used after 2 weeks) packed in capsules
For each drug an indentically matched placebo was available
OutcomesObservation period: 6 weeks (+ 16 weeks continuation phase)
Physician‐reated: Hamilton Depression Scale (HAMD, 17 items, at least 50% reduction), Clinical Global Impression (CGI), Montgomery‐Asberg Depression Rating Scale (MADRS)
Patient‐rated: Beck Depression Inventory (BDI)
NotesAdditional information provided by sponsor
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcelment: consecutively numbered pharmacy
Blinding: double‐blind
Drop‐out/withdrawals: none
Jadad score: 2‐2‐1
IV score: 1‐0‐1‐1‐1‐1
ParticipantsNumber of patients included/analyzed: 140/140
Demographics: 67% female, mean age 45 years
Diagnosis: moderate depressive disorder (ICD‐10 F32.1 or F33.1)
Setting: clinical trial center in Germany
Baseline: 22‐.8 +/‐ 1.1 in hypericum group, 22.8 +/‐ 1.2 in placebo group
InterventionsTreatment: 1x1 tablet (900 mg) STW3‐VI daily for 6 weeks
Control: Placebo
OutcomesObservation period: 6 weeks
Physician‐rated: Hamitlon Depression Scale (HAMD, 17 items; response = HAMD score < 10 or at least 50% reduction), Clinical Global Impression Index (CGI)
Patient‐rated: Adjective Mood Scale (BfS)
NotesTrial performed in a single clinical trial unit with short recruitment period and low placebo response. Additional information from sponsor
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: independent pharmacist
Blinding: double‐blind
Drop‐outs/withdrawals: 16 of 45 (hypericum), 17 of 45 (sertraline)
Jadad score: 2‐2‐1
IV score: 1‐1‐1‐0.5‐0.5‐0
ParticipantsPatients included/analyzed: 90/87
Demographics: 61% female, mean age 40 years)
Diagnosis: major depression (DSM IV)
Setting: 12 community based offices for family medicine in Canada
Baseline: mean HAMD values 18.8 +/‐ 3.6 (hypericum group), 19.7 +/‐ 3.5 (sertraline group)
InterventionsTreatment: 3x1 to 3x2 (in case of insufficient response at 4 weeks) capsules (900‐1800 mg) Hypericum extract daily for 12 weeks
Control: 3x1 to 3x2 (in case of insufficient response at 4 weeks) capsules Sertraline (50 to 100 mg) daily for 12 weeks
In 9 patients of each group the dosage was doubled during the trial
OutcomesObservation period: 6 weeks
Physician‐rated: Hamilton Depression Scale (HAMD, 17 items; response = at least 50% reduction or score < 10)
Patient‐rated: Beck Depression Inventory (BDI)
NotesAdditional information provided from author. High drop‐out rate
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: numbered pharmacy
Blinding: double‐blind
Drop‐outs/withdrawals:2/71 (hypericum), 5/71 (placebo)
Jadad score: 1‐2‐0
IV score: 1‐1‐1‐1‐1‐0.5
ParticipantsPatients included/analyzed: 142/140
Demographics: 81% female, mean age 47 years
Diagnosis: mild or moderate episode of a major depression (DSM‐IV)
Setting: 17 practices for psychiatry, neurology, internal medicine and GP in Germany
Baseline: HAMD 21.0 +/‐ 2.0 (hypericum group), 20.7 +/‐ 1.9 (placebo group)
InterventionsTreatment: 2x1 capsule (500 mg) Hypericum extract D‐0496 daily for 6 weeks
Control: Placebo
OutcomesObservation period: 1 week placebo run in, 6 weeks treatment
Phyiscian‐rated: Hamilton Depression Scale (HAMD, 21 items; response = at least 50% reduction), Clinical Global Impression Index (CGI)
Patient‐rated: von Zerssen Depression Scale (D‐S)
NotesManufacturer/sponsor no longer existing. Author could provide only minimal additional information
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: numbered pharmacy
Blinding: double‐blind
Drop‐outs/withdrawals: 9 of 107 in hypericum group, 14 of 102 in imipraimine group
Jadad score: 2‐2‐1, IV score: 1‐1‐0.5‐0.5‐0.5‐0.5
ParticipantsNumber of patients included/analyzed: 209/209 (186 per protocol)
Demographics: 154 female, mean age 49 years
Diagnosis: severe episode of a major depression
Setting: 20 psychiatric practices
Baseline: HAMD score 25.3 +/‐ 4.7 (hypericum group), 26.1 +/‐ 4.8 (imipramine group)
InterventionsTreatment: 3x2 coated tablets (1800 mg extract) Hypericum extract LI 160 daily for 6 weeks
Control: 3x2 coated tablets imipramine (150 mg) daily for 6 weeks
OutcomesObservation period: 6 weeks
Physician‐rated: Hamilton Depression Scale (HAMD, 17 items; response = at least 50% reduction), Clinical Global Impression Index (CGI)
Patient‐rated: Depression‐Scale (D‐S von Zerssen)
NotesAdditional information provided from sponsor
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: numbered pharmacy
Blinding: double‐blind
Drop‐outs/withdrawals: 20 of 87 in hypericum group, 24 of 78 amitriptyline group
Jadad score: 2‐2‐0
IV score: 1‐1‐0.5‐0.5‐0.5‐0.5
ParticipantsNumber of patients included/analyzed: 165/156
Demographics: 126 female, mean age 40 years
Diagnosis: major depression (DSM‐IV)
Setting: 18 general practices and one hospital outpatient clinic in the UK
Baseline: HAMD score 20.6 +/‐ 2.1 (hypericum group), 20.8 +/‐ 2.3 (amitriptyline group)
InterventionsTreatment: 3x1 coated tablet (900 mg extract) Hypericum extract LI 160 daily for 6 weeks
Control: 3x1 coated tablet amitriptyline (75 mg) daily for 6 weeks
OutcomesObservation period: 6 weeks
Physician‐rated: Hamilton Depression Scale (HAMD, 17 items; response = at least 50% reduction or score < 10), Montgomery‐Asberg Reating Scale for Depression (MADRS), Clinical Global Impression Index (CGI)
NotesSome additional information provided from sponsor. HAMD mean values extrapolated from figure
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: numbered pharmacy
Blidning: double‐blind
Drop‐outs/withdrawals: 1 of 48 in hypericum group, 1 of 49 in placebo group
Jadad score: 1‐2‐1
IV score: 1‐0‐0.5‐1‐1‐0.5
ParticipantsNumber of patients included/analyzed: 97/95
Demographics: 64 female, mean age 43 years
Diagnosis: depression (ICD‐10 F32.1)
Setting: 5 general practitioners in Germany
Baseline: HAMD score 24.6 +/5.4 (hypericum group), 22.7 +/‐ 4.4 (placebo group)
InterventionsTreatment: 2x1 capsules (200 to 240 mg) Hypericum extract (Psychotonin forte) daily for 6 weeks
Control: placebo
OutcomesObservation period: 6 weeks
Physician‐rated: Hamilton Depression Scale (HAMD, 21 items; response = at least 50% reduction or score < 10), Clinical Global Impression Index (CGI)
Patient‐rated: Depression Scale (D‐S von Zerssen), State Trait Anxiety Inventory (STAI)
NotesAdditional information provided by author
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Low riskA ‐ Adequate

MethodsConcealment: unclear
Blinding: Double‐blind (double dummy)
Drop‐outs/withdrawals: 15/157 (hypericum), 32 of 167 (imipramine)
Jadad score: 2‐2‐0
IV score: 1‐0‐1‐0.5‐0.5‐0
ParticipantsPatients included/analyzed: 324/324
Demographics: 71% female, mean age 46 years
Diagnosis: mild to moderate depression (ICD10 F32.0 or 1, F33.0 or 1)
Setting: 40 practices for psychiatry, internal medicine, GP in Germany
Baseline: HAMD 22.4 +/‐3.4 (hypericum), 22.1 +/‐ 2.9 (imipramine)
InterventionsTreatment: 2x1 coated tablet (500 mg extract) Hypericum extract ZE 117 daily for 6 weeks
Control: Imipramine 150 mg daily for 6 weeks
OutcomesObservation period: 6 weeks
Physician‐rated: Hamilton Depression Scale (HAMD, 17 items; at least 50% reduction), Clinical Global Impression Index (CGI)
Patient‐rated: global assessment
NotesSome additional information by manufacturer of the extract
Risk of bias
BiasAuthors' judgementSupport for judgement
Allocation concealment?Unclear riskB ‐ Unclear

Characteristics of excluded studies [ordered by study ID]

StudyReason for exclusion
Agrawal 1994Not major depression (Study in patients with fatigue and minor depression)
Albertini 1986Not depression (RCT of hypericum in homeopathic preparation for dental neuralgia)
Bendre 1980Not depression (CCT of homeopathic preparations of hypericum and arnica in dental practice)
Bergmann 1993Inadequate dosage of amitriptyline (30 mg daily) ‐ RCT included in previous versions of this review
Bernhadt 1993RCT comparing two doses of hypericum for depression
Brockmöller 1997RCT on pharmacokinetics and photosensitivity in healthy volunteers
Czekalla 1997Report of results on ECG analyses undertaken in the Vorbach 1997 study
Dittmer 1992Only a minority of patients suffered from depression (RCT of Hypericum for "psychovegetative complaints")
Ditzler 1992Placebo‐controlled randomized controlled trial in depressed patients using a fixed combination of several herbs including hypericum
Gu 2001RCT comparing 900 mg of a hypericum extract with fluoxetine in patients with depression according to the Chinese classification CCMD‐2. As it is not clear whether all included patients meet criteria for major depression the study was excluded. Response rates were similar in both groups
Halama 1991Not major depression (placebo‐controlled trial included in previous versions of the review)
Harrer 1991Not major depression (placebo‐controlled trial included in previous versions of the review)
Herberg 1992RCT comparing hypericum and a combination of Hypericum and Valerian on concentration, reaction etc. in healthy volunteers
Hoffmann 1979Not major depression (placebo‐controlled trial included in previous versions of the review)
Hottenrott 1997Not depression (RCT of a combination of hypericum and vitamine E to enhance performance of athlets)
Häring 1996Placebo‐controlled RCT of hypericum to prevent reactive depression in 28 patients with solid tumors undergoing chemotherapy
Hübner 1993Not major depression (placebo‐controlled trial included in previous versions of the review)
Johnson 1992RCT on neurophysiological effects of hypericum in healthy volunteers
Johnson 1993RCT comparing neurophysiological effects of hypericum and maprotiline in healthy volunteers
Kniebel 1988RCT comparing a combination of hypericum and valerian vs. amitriptyline in depressive patients
Kugler 1990aRCT comparing hypericum and bromazepam in depressed patients (included in earlier versions of this review; inclusion has now been limited to commonly recommended standard antidepressants)
Kugler 1990bRCT on pharmacodynamics in depressed patients (reason for exclusion: no symptomatic outcomes related to depressive symptoms)
König 1993Not major depression (placebo‐controlled trial included in previous versions of the review)
Lehrl 1993Diagnosis not decscribed as major depression in publication. An information collected from the sponsor for a previous version of the review that patients met criteria for major depression could not be verified (placebo‐controlled trial included in previous versions of the review)
Lenoir 1999RCT comparing three dosages of a hypericum extract in depressive patients
Li 2005Not major depression (placebo‐controlled RCT in depressive patients undergoing coronary artery bypass grafting)
Maisenbacher 1995Not all patients suffered from depression (RCT of hypericum for anxiety)
Martinez 1993RCT comparing light therapy and a combination of light therapy and hypericum in patients with seasonal affective disorder
Mo 2004Not major depression (RCT comparing an hypericum extract and fluoxetine in post‐stroke depression)
Osterheider 1992Not major depression (placebo‐controlled trial included in previous versions of the review)
Panijel 1985Not depression (RCT of a combination of hypericum and valerian for anxiety)
Quandt 1993Not major depression (placebo‐controlled trial included in previous versions of the review)
Reh 1992Not major depression (placebo‐controlled trial included in previous versions of the review)
Schlich 1987Not major depression (placebo‐controlled trial included in previous versions of the review)
Schmidt 1989Not major depression (placebo‐controlled trial included in previous versions of the review)
Schmidt 1993Not major depression (placebo‐controlled trial included in previous versions of the review)
Schmidt 1993bRCT investigating possible interactions of hypericum and alcohol in healthy volunteers
Schulz 1993RCT investigating the effects of hypericum on the sleep‐EEG in elderly volunteers
Sindrup 2000RCT in patients with polyneuropathy (depression not mentioned)
Sommer 1994Not major depression (placebo‐controlled trial included in previous versions of the review)
Spielberger 1985RCT comparing two hypericum preparations for depression
Staffeldt 1993RCT on pharmacokinetics in healthy volunteers
Steger 1985RCT of a comination of of hypericum and valerian vs. desipramine for depression
Volz 2002RCT hypreicum vs. placebo in patients with somatoform disorders
Vorbach 1994Not all patients met criteria for major depression (comparison with maprotiline; included in previous versions of the review)
Warnecke 1986Open controlled trial of hypericum vs. diazepam in women with climacteric depression; method of allocation unclear (included in earlier version of this review; now inclusion limited to explicitely randomized, double‐blind trials)
Werth 1989RCT of hypericum vs. amitriptyline for about two weeks in patients undergoing amputation to prevent reactive depression (included in earlier version of this review)
Wienert 1991RCT on photosensitivtity after application of a combination of hypericum and valerian
Winkel 2000Not major depression (placebo‐controlled trial included in previous versions of the review)
Zeller 2000(Nonrandomized?) comparison of different administration schedules of a hypericum extract in patients with mild to moderate depression (not restricted to major depression)

Contributions of authors

Study concept and design: Linde, Berner, Kriston
Acquisition of data: Linde, Berner, Kriston
Analysis and interpretation of data: Linde, Berner, Kriston
Drafting and revision of the manuscript: Linde, Berner, Kriston
Statistical expertise: Kriston
Study coordination: Linde

Sources of support

Internal sources

  • Centre for Complementary Medicine Research, Department of Internal Medicine II, Technische Universität München, Germany.
  • Department of Psychiatry and Psychotherapy, University of Freiburg, Germany.

External sources

  • No sources of support supplied

Declarations of interest

Michael Berner has received a grant for research on hypericum in the past and has received fees for speaking on conferences from a manufacturer (Schwabe). Klaus Linde once received reimbursement for travel expenses for speaking on a meeting organised by a manufacturer (Schwabe). Levente Kriston once received reimbursement for travel expenses (Schwabe).

References

References to studies included in this review

Behnke 2002 {published data only}

  • Behnke K, Jensen GS, Graubaum HJ, Gruenwald J. Hypericum perforatum versus fluoxetine in the treatment of mild to moderate depression. Advances in Therapy 2002;19:43‐52. [PubMed] [Google Scholar]

Bjerkenstedt 2005 {published data only}

  • Bjerkenstedt L, Edman GV, Alken RG, Mannel M. Hypericum extract LI 160 and fluoxetine in mild to moderate depression. A randomized, placebo‐controlled multi‐center study in outpatients. European Archives of Psychiatry and Clinical Neuroscience 2005;255:40‐7. [PubMed] [Google Scholar]

Bracher 2001 {published and unpublished data}

  • Bracher A. Trial on HYP611 confirms effectiveness [HYP611‐Studie bestätigt Wirksamkeit]. Ärztliche Praxis 2001;51 (suppl.):3‐4. [Google Scholar]

Brenner 2000 {published data only}

  • Brenner R, Azbel V, Madhusoodanan S, Pawlowska M. Comparison of an extract of hypericum (LI 160) and sertraline in the treatment of depression: a double‐blind, randomized pilot study. Clinical Therapeutics 2000;22:411‐9. [PubMed] [Google Scholar]
  • Brenner R, Madhusoodanan S, Pawlowska M. Efficacy of continuation treatment with hypericum perforatum in depression. Journal of Clinical Psychiatry 2002;63:455. [PubMed] [Google Scholar]

Fava 2005 {published data only}

  • Fava M, Alpert J, Nierenberg AA, Mischoulon D, Otto MW, Zajecka J, Murck H, Rosenbaum JF. A double‐blind, randomized trial of St John's wort, fluoxetine, and placebo in major depressive disorder. Journal of Clinical Psychopharmacology 2005;25:441‐7. [PubMed] [Google Scholar]

Gastpar 2005 {published and unpublished data}

  • Gastpar M, Singer A, Zeller K. Efficacy and tolerability of hypericum extract STW3 in long‐term treatment with a once‐daily dosage in comparison with sertraline. Pharmacopsychiatry 2005;38:78‐87. [PubMed] [Google Scholar]

Gastpar 2006 {published and unpublished data}

  • Gastpar M, Singer A, Zeller K. Comparative efficacy and safety of a once‐daily dosage of hypericum extract STW3‐VI and citalopram in patients with moderate depression: a double‐blind, randomised, multicentre, placebo‐controlled study. Pharmacopsychiatry 2006;39:66‐75. [PubMed] [Google Scholar]

Harrer 1993 {published and unpublished data}

  • Harrer G, Hübner WD, Podzuweit H. Effectiveness and tolerance of the hypericum extract LI 160 compared to maprotiline: a multi‐center double‐blind study. Journal of Geriatric Psychiatry and Neurology 1994;7(suppl 1):S24‐8. [PubMed] [Google Scholar]
  • Harrer G, Hübner WD, Podzuweit H. Wirksamkeit und Verträglichkeit des Hypericum‐Präparates LI 160 im Vergleich mit Maprotilin. Nervenheilkunde 1993;12:297‐301. [Google Scholar]

Harrer 1999 {published data only}

  • Harrer G, Schmidt U, Kuhn U, Biller A. Comparison of equivalence between the St. John's wort extract LoHyp‐57 and fluoxetine. Arzneimittel‐Forschung/Drug Research 1999;49:289‐96. [PubMed] [Google Scholar]

HDTSG 2002 {published and unpublished data}

  • Hypericum Depression Trial Study Group. [Effect of Hypericum perforatum (St John's wort) in major depressive disorder. A randomized controlled trial]. JAMA 2002;287:1807‐14. [PubMed] [Google Scholar]
  • Vitiello B, Shader RI, Parker CB, Ritz L, Harlan W, Greenblatt DJ, Gadde KM, Krishnan KRR, Davidson JRT. Hyperforim plasma level as a marker of treatment adherence in the National Institutes of Health Hypericum Depression Trial. Journal of Clin Psychopharmacology 2005;25:243‐9. [PubMed] [Google Scholar]

Hänsgen 1996 {published and unpublished data}

  • Hängsen KD, Vesper J. Antidepressive Wirksamkeit eines hochdosierten Hypericum‐Extraktes. Muenchener medizinische Wochenschrift 1996;138(3):29‐33. [Google Scholar]
  • Hänsgen KD, Vesper J, Ploch M. Multizentrische Doppelblindstudie zur antidepressiven Wirksamkeit des Hypericumextraktes LI 160. Nervenheilkunde 1993;12:285‐9. [Google Scholar]
  • Vesper J, Ploch M. Multi‐center double‐blind study examining the antidepressant effectiveness of the hypericum extract LI 160. Journal of Geriatric Psychiatry and Neurology 1994;7(suppl 1):S15‐8. [PubMed] [Google Scholar]

Kalb 2001 {published data only}

  • Kalb R, Trautmann‐Sponsel RD, Kieser M. Efficacy and tolerability of hypericum extract WS 5572 versus placebo in mildly depressed patients. Pharmacopsychiatry 2001;34:96‐103. [PubMed] [Google Scholar]

Kasper 2006 {published and unpublished data}

  • Kasper S, Anghelescu IG, Szegedi A, Dienel A, Kieser M. Superior efficacy of St John's wort extract WS 5570 compared to placebo in patients with major depression: a randomized, double‐blind, placeb‐controlled, multi‐center trial [ISRCTN77277298]. BMC Medicine 2006;4:14. [PMC free article] [PubMed] [Google Scholar]

Laakmann 1998 {published data only}

  • Laakmann G, Dienel A, Kieser M. Clinical significance of hyperforin for the efficacy of hypericum extracts on depressive disorders of different severities. Phytomedicine 1998;5:435‐42. [PubMed] [Google Scholar]
  • Laakmann G, Schüle C, Baghai T, Kieser M. St John's wort in mild and moderate depression: the relevance of hyperforin for the clinical efficacy. Pharmacopsychiatry 1998;31 (suppl):54‐9. [PubMed] [Google Scholar]

Lecrubier 2002 {published and unpublished data}

  • Lecrubier Y, Bech P. The Ham D6 is more homogeneous and as sensitive as the Ham D17. European Psychiatry 2007;22:252‐5. [PubMed] [Google Scholar]
  • Lecrubier Y, Clerc G, Didi R, Kieser M. Efficacy of St. John's wort extract WS 5570 in major depression: a double‐blind, placebo‐controlled trial. American Journal of Psychiatry 2002;159:1361‐6. [PubMed] [Google Scholar]

Montgomery 2000 {published and unpublished data}

  • Montgomery SA, Hübner WD, Grigoleit HG. Efficacy and tolerability of St. John's wort extract compared with placebo in patients with a mild to moderate depressive disorder. Phytomedicine 2000;7 (suppl II):107. [Google Scholar]

Moreno 2005 {published data only}

  • Moreno RA, Teng CT, Almeida KM, Tavares H jr. Hypericum perforatum versus fluoxetine in the treatment of mild to moderate depression: a randomized double‐blind trial in a Brazilian sample. Revista Brasileira de Psiquiatria 2005;28:29‐32. [PubMed] [Google Scholar]

Philipp 1999 {published and unpublished data}

  • Philipp M, Kohnen R, Hiller KO. Hypericum extract versus imipramine or placebo in patients with moderate depression: randomized multicentre study of treatment for 8 weeks. BMJ 1999;319:1534‐9. [PMC free article] [PubMed] [Google Scholar]

Schrader 1998 {published and unpublished data}

  • Schrader E, Meier B, Brattström A. Hypericum treatment of mild‐moderate depression in a placebo‐controlled study. A prospective, double‐blind, randomized, placebo‐controlled, mulitcentre study. Human Psychopharmacology 1998;13:163‐9. [Google Scholar]

Schrader 2000 {published and unpublished data}

  • Schrader E. Equivalence of St John's wort (Ze 117) and fluoxetine: a randomized, controlled study in mild‐moderate depression. International Clinical Psychopharmacology 2001;15:61‐8. [PubMed] [Google Scholar]

Shelton 2001 {published and unpublished data}

  • Gelenberg AJ, Shelton RC, Crits‐Christoph P, Keller MB, Dunner DL, Hirschfeld RMA, Thase MAE, Russel JM, Lydiard RB, Gallop RJ, Todd L, Hellersteim DJ, Goodnick PJ, Keitner GI, Stahl SM, Halbreich U, Hopkins KS. The effectiveness of St. John's wort in major depressive disorder: a naturalistic phase 2 follow‐up in which non‐responders were provided alternate medication. Journal of Clinical Psychiatry 2004;65:1114‐9. [PubMed] [Google Scholar]
  • Shelton RC, Keller MB, Gelenberg A, Dunner DL, Hirschfeld R, Thase ME, et al. Effectiveness of St. John's wort in major depression. A randomized controlled trial. JAMA 2001;285:1978‐86. [PubMed] [Google Scholar]

Szegedi 2005 {published and unpublished data}

  • Anghelescu IG, Kohnen R, Szegedi A, Klement S, Kieser M. Comparison of hypericum extract WS 5570 and paroxetine in ongoing treatment after recovery from an episode of moderate to severe depression: results from a randomized multicenter study. Pharmacopsychiatry 2006;39:213‐9. [PubMed] [Google Scholar]
  • Szegedi A, Kohnen R, Dienel A, Kieser M. Acute treatment of moderate to severe depression with hypericum extract WS 5570 (St John's wort): randomised controlled double blind non‐inferiority trial versus paroxetine. BMJ 2005;330:503‐6. [PMC free article] [PubMed] [Google Scholar]

Uebelhack 2004 {published and unpublished data}

  • Uebelhack R, Gruenwald J, Graubaum HJ, Busch R. Efficacy and tolerability of hypericum extract STW 3‐VI in patients with moderate depression: a double‐blind, randomized, placebo‐controlled clinical trial. Advances in Therapy 2004;21:265‐75. [PubMed] [Google Scholar]

van Gurp 2002 {published and unpublished data}

  • Gurp G, Meterissian GB, Haiek LN, McCusker J, Bellavance F. St John's wort or sertraline? Randomized controlled trial in primary care. Canadian Family Physician 2002;48:905‐12. [PMC free article] [PubMed] [Google Scholar]

Volz 2000 {published data only}

  • Volz HP, Eberhardt R, Grilll G. Efficacy and tolerance of the St John's wort extract D‐0496 in mild to moderate depression ‐ a placebo‐controlled, double‐blind 6‐week trial [Wirksamkeit und Verträglichkeit des Johanniskrautextraktes D‐0496 bei leichten bis mittelschweren depressiven Episoden. Plazebokontrollierte Doppelblindstudie über 6 Wochen]. Nervenheilkunde 2000;19:401‐5. [Google Scholar]

Vorbach 1997 {published and unpublished data}

  • Vorbach EU, Arnoldt KH, Hübner WD. Efficacy and tolerability of St. John's wort extract LI 160 versus imipramine in patients with severe depressive episodes according to ICD‐10. Pharmacopsychiatry 1997;30(suppl.2):81‐5. [PubMed] [Google Scholar]

Wheatley 1997 {published and unpublished data}

  • Wheatley D. LI 160, an extract of St. John's wort, versus amitriptyline in mildly to moderately depressed outpatients ‐ a controlled 6‐week clinical trial. Pharmacopsychiatry 1997;30(suppl.2):77‐80. [PubMed] [Google Scholar]

Witte 1995 {published and unpublished data}

  • Witte B, Harrer G, Kaptan T, Podzuweit H, Schmidt U. Behandlung depressiver Verstimmungen mit einem hochkonzentrierten Hypericumpräparat ‐ eine multizentrische plazebokontrollierte Doppelblindstudie. Fortschritte der Medizin 1995;113:404‐8. [PubMed] [Google Scholar]

Woelk 2000 {published data only}

  • Woelk H for the Remotiv/Imipramine Study Group. Comparison of St John's wort and imipramine for treating depression: randomised controlled trial. BMJ 2000;321:536‐9. [PMC free article] [PubMed] [Google Scholar]

References to studies excluded from this review

Agrawal 1994 {published data only}

  • Agrawal A, Dixit SP, Dubey GP, Gupta BS. Clinical evaluation of anti‐depressant properties of basant (hypericum perforatum). Pharmacopsychoecologia 1994;7:253‐6. [Google Scholar]

Albertini 1986 {published data only}

  • Albertini H. Evaluation d'un traitement homeopathique de la nevralgie dentaire. In: Boiron J, Belon P, Hariveau E editor(s). Recherche en homeopathie. Lyon: Fondation francaise pour la recheche en homeopathie, 1986:75‐7. [Google Scholar]

Bendre 1980 {published data only}

  • Bendre VV, Dharmadhikari SD. Arnica montana and hypericum in dental practice. Hahnemann Gleanings 1980;47:70‐2. [Google Scholar]

Bergmann 1993 {published and unpublished data}

  • Bergmann R, Nübner J, Demling J. Behandlung leichter bis mittelschwerer Depressionen. Therapiewoche Neurol Psychiatr 1993;7:235‐40. [Google Scholar]

Bernhadt 1993 {published data only}

  • Bernhardt M, Liske E, Ebeling L. Hypericum perforatum in der Therapie leichter bis mittelschwerer Depressionen: Vergleich der antidepressiven Wirksamkeit von zwei unterschiedlichen Dosierungsschemata. Bonn, 5. Phytotherapie‐Kongreß 1993.

Brockmöller 1997 {published data only}

  • Brockmöller J, Reum T, Bauer S, Kerb R, Hübner WD, Roots I. Hypericin and pseudohypericin: pharmacokinetics and effects on photosensitivity in humans. Pharmacopsychiatry 1997;30(suppl.2):94‐101. [PubMed] [Google Scholar]

Czekalla 1997 {published data only}

  • Czekalla J, Gastpar M, Hübner WD, Jäger D. The effect of hypericum extract on cardiac conduction as seen in the electrocardiogram compared to that of imipramine. Pharmacopschychiatry 1997;30(supp.2):86‐8. [PubMed] [Google Scholar]

Dittmer 1992 {published data only}

  • Dittmer TLJ. Die Behandlung von psychovegetativen Störungen mit Johanniskraut‐Öl (Jukunda‐Rot‐Öl‐Kapseln). Naturheilpraxis mit Naturmedizin 1992;45:118‐22. [Google Scholar]

Ditzler 1992 {published and unpublished data}

  • Ditzler K, Gessner B, Schatton WFH, Willems M. Clinical trial on Neuropas versus placebo in patients with mild to moderate depressive symptoms: a placebo‐controlled, randomised double‐blind study. Complementary Therapies in Medicine 1994;2:5‐13. [Google Scholar]
  • Ditzler K, Schatton W. Johanniskraut bei leichten bis mittelschweren Depressionen ‐ Ergebnisse einer plazebo‐kontrollierten Doppelblindstudie. Heilkunst 1992;104:263‐70. [Google Scholar]

Gu 2001 {published data only}

  • Gu N, Li H, Shu L. Zhang H, Weng Z, Ma C, Wang M. Multicenter study of St John's wort extract in treatment of mild to moderate depression. Chinese Journal of Clinical Pharmacy 2001;10:271‐4. [Google Scholar]

Halama 1991 {published and unpublished data}

  • Halama P. Wirksamkeit des Hypericum‐Extraktes LI 160 bei 50 Patienten einer psychiatrischen Fachpraxis. Nervenheilkunde 1991;10:305‐7. [Google Scholar]
  • Halama P. Wirksamkeit des Johanniskrautextraktes LI 160 bei depressiver Verstimmung. Nervenheilkunde 1991;10:250‐3. [Google Scholar]

Harrer 1991 {published and unpublished data}

  • Harrer G, Schmidt U, Kuhn U. "Alternative" Depressionsbehandlung mit einem Hypericum‐Extrakt. Therapiewoche Neurologie 1991;5:710‐6. [Google Scholar]

Herberg 1992 {published data only}

  • Herberg KW. Psychotrope Phytopharmaka im Test ‐ Alternative zu synthetischen Psychopharmaka. Therapiewoche 1992;44:704‐13. [Google Scholar]

Hoffmann 1979 {published data only}

  • Hoffmann J, Kühl ED. Therapie von depressiven Zuständen mit Hypericin. Zeitschrift für Allgemeinmedizin 1979;55:776‐82. [PubMed] [Google Scholar]

Hottenrott 1997 {published data only}

  • Hottenrott K, Sommer HM, Lehrl S, Hauer H. Der Einfluss von Vitamin E und Johanniskraut‐Trockenextrakt auf die Ausdauerleistungsfähigkeit von Wettkampfsportlern. Eine placebo‐kontrollierte Doppelblindstudie mit Langstreckenläufern und Triathleten. Deutsche Zeitschrift für Sportmedizin 1997;48:22‐7. [Google Scholar]

Häring 1996 {published and unpublished data}

  • Häring B, Hauns B, Hermann C, Hübner WD, Maier‐Lenz H, Marschner N. A double‐blind, placebo‐controlled pilot study of LI 160 in combination with chemotherapy in patients with solid tumors. Abstracts of the 2nd International Congress on Phytomedicine. Munich, 1996:SL‐88.

Hübner 1993 {published and unpublished data}

  • Hübner WD, Lande S, Podzuweit H. Behandlung larvierter Depressionen mit Johanniskraut. Nervenheilkunde 1993;12:278‐280. [Google Scholar]
  • Hübner WD, Lande S, Podzuweit H. Hypericum treatment of mild depression with somatic symptoms. Journal of Geriatric Psychiatry and Neurology 1994;7(suppl 1):S12‐5. [PubMed] [Google Scholar]

Johnson 1992 {published data only}

  • Johnson D. Neurophysiologische Wirkungen von Hypericum im Doppelblindversuch mit Probanden. Nervenheilkunde 1991;10:316‐7. [Google Scholar]
  • Johnson D, Siebenhüner G, Hofer E, Sauerwein‐Giese E, Frauendorf A. Einfluß von Johanniskraut auf die ZNS‐Aktivität. Neurologie und Psychiatrie 1992;6:436‐44. [Google Scholar]

Johnson 1993 {published data only}

  • Johnson D, Ksciuk H, Woelk H, Sauerwein‐Giese E, Frauendorf A. Effetcs of hypericum extract LI 160 compared with maprotiline on resting EEG and evoked potentials in 24 volunteers. Journal of Geriatric Psychiatry and Neurology 1994;7(suppl 1):S44‐6. [PubMed] [Google Scholar]
  • Johnson D, Ksciuk H, Woelk H, Sauerwein‐Giese E, Frauendorf A. Wirkungen mit Johanniskraut‐Extrakt LI 160 im Vergleich mit Maprotilin auf Ruhe‐EEG und evozierte Potentiale bei 24 Probanden. Nervenheilkunde 1993;12:328‐30. [Google Scholar]

Kniebel 1988 {published and unpublished data}

  • Hiller KO, Rahlfs V. Therapeutische Äquivalenz eines hochdosierten Phytopharmakons mit Amitriptylin bei ängstlich‐depressiven Verstimmungen ‐ Reanalyse einer randomisierten Studie unter besonderer Beachtung biometrischer und klinischer Aspekte. Forschende Komplementärmedizin 1995;2:123‐32. [Google Scholar]
  • Kniebel R, Burchard JM. Zur Therapie depressiver Verstimmungen in der Praxis. Zeitschrift für Allgemeinmedizin 1988;64:689‐96. [Google Scholar]

Kugler 1990a {published data only}

  • Kugler J, Weidenhammer W, Schmidt A, Groll S. [Therapie depressiver Zustände. Hypericum‐Extrakt Steigerwald als Alternative zur Benzodiazepin‐Behandlung]. Zeitschrift für Allgemeinmedizin 1990;66 (suppl 1):21‐9. [Google Scholar]

Kugler 1990b {published data only}

  • Kugler J, Schmidt A, Groll S, Weidenhammer W. Zur Pharmakodynamik eines Hypericum‐Extraktes. Untersuchungen bei Patienten mit depressiven Zuständen im Vergleich zu Bromazepam und Placebo. Zeitschrift für Allgemeinmedizin 1990;66(suppl.):13‐20. [Google Scholar]

König 1993 {published data only}

  • König CD. Hypericum perforatum L. (gemeines Johanniskraut) als Therapeutikum bei depressiven Verstimmungszuständen ‐ eine Alternative zu synthetischen Arzneimitteln. University of Basel: Thesis, 1993. [Google Scholar]

Lehrl 1993 {published and unpublished data}

  • Lehrl S, Woelk H. Ergebnisse von Messungen der kognitiven Leistungsfähigkeit bei Patienten unter der Therapie mit Johanniskraut. Nervenheilkunde 1993;12:281‐4. [Google Scholar]
  • Lehrl S, Woelk H. Psychometrische Messung der Leistungkapazität unter antidepressiver Therapie mit Johanniskraut. Nervenheilkunde 1991;10:313‐5. [Google Scholar]

Lenoir 1999 {published data only}

  • Lenoir S, Degenring FH, Saller R. A double‐blind randomised trial to investigate three different concentrations of a standardised fresh plant extract obtained from the shoot tips of hypericum perforatum L. Phytomedicine 1999;6:141‐6. [PubMed] [Google Scholar]

Li 2005 {published data only}

  • Li YM, Li GY, Wang XL, & Yu LF. Rehabilitation effect of Saint John's Wort extract on depression and myocardial function after coronary artery bypass grafting: A randomized grouping, placebo‐control and blind evaluation. Chinese Journal of Clinical Rehabilitation 2005;9(28):38‐9. [Google Scholar]

Maisenbacher 1995 {published data only}

  • Maisenbacher J, Schmidt U, Schenk N. Therapie mit Hypericum bei Angstzuständen. Therapiewoche Neurologie Psychiatrie 1995;9:65‐60. [Google Scholar]

Martinez 1993 {published data only}

  • Kasper S. Treatment of seasonal affective disorder (SAD) with hypericum extract. Pharmacopsychiatry 1997;30(suppl.2):89‐93. [PubMed] [Google Scholar]
  • Martinez B, Kasper S, Ruhrmann B, Möller HJ. Hypericum in der Behandlung von saisonal abhängigen Depressionen. Nervenheilkunde 1993;12:302‐7. [Google Scholar]
  • Martinez B, Kasper S, Ruhrmann B, Möller HJ. Hypericum in the treatment of seasonal affective disorders. Journal of Geriatric Psychiatry and Neurology 1994;7(suppl 1):S29‐33. [PubMed] [Google Scholar]

Mo 2004 {published data only}

  • Mo WY, Yu HY, Yang Y. Comparison of Saint John' Wort extract with fluoxetine in the treatment of post‐stroke depression. Chinese Journal of New Drugs 2004;13:942‐4. [Google Scholar]

Osterheider 1992 {published and unpublished data}

  • Osterheider M, Schmidtke A, Beckmann H. Behandlung depressiver Syndrome mit Hypericum (Johanniskraut ‐ eine placebokontrollierte Doppelblindstudie. Fortschritte Neurologie Psychiatrie 1992;60(suppl.2):210‐1. [Google Scholar]

Panijel 1985 {published data only}

  • Panijel J. Die Behandlung mittelschwerer Angstzustände. Therapiewoche 1985;41:4659‐68. [Google Scholar]

Quandt 1993 {published and unpublished data}

  • Quandt J, Schmidt U, Schenk N. Ambulante Behandlung leichter und mittelschwerer depressiver Verstimmungen. Der Allgemeinarzt 1993;15(2):97‐102. [Google Scholar]

Reh 1992 {published and unpublished data}

  • Reh C, Laux P, Schenk N. Hypericum‐Extrakt bei Depressionen ‐ eine wirksame Alternative. Therapiewoche 1992;42:1576‐81. [Google Scholar]

Schlich 1987 {published and unpublished data}

  • Schlich D, Braukmann F, Schenk N. Behandlung depressiver Zustände mit Hypericinium. Psycho 1987;13:440‐7. [Google Scholar]

Schmidt 1989 {published and unpublished data}

  • Schmidt U, Schenk N, Schwarz I, Vorberg G. Zur Therapie depressiver Verstimmungen. Psycho 1989;15:665‐71. [Google Scholar]

Schmidt 1993 {published and unpublished data}

  • Schmidt U. Konstantes Reaktionsvermögen unter antidepressiver Theerapie mit dem Hypericum‐Präparat LI 160. Nervenheilkunde 1991;10:311‐2. [Google Scholar]
  • Schmidt U, Sommer H. Johanniskraut‐Extrakt zur ambulanten Therapie der Depression. Fortschritte der Medizin 1993;111:339‐42. [PubMed] [Google Scholar]

Schmidt 1993b {published data only}

  • Schmidt U, Harrer G, Kuhn U, Berger‐Deinert W, Luther D. Wechselwirkungen von Hypericin‐Extrakt mit Alkohol. Nervenheilkunde 1993;12:314‐9. [Google Scholar]

Schulz 1993 {published data only}

  • Schulz H, Jobert M. Der Einfluß von Johanniskraut auf das Schlaf‐EEG bei älteren Probandinnen. Nervenheilkunde 1993;12:323‐7. [Google Scholar]
  • Schulz H, Jobert M. Effetcs of hypericum extract on the sleep EEG in older volunteers. Journal of Geriatric Psychiatry and Neurology 1994;7(suppl 1):S39‐43. [PubMed] [Google Scholar]

Sindrup 2000 {published data only}

  • Sindrup SS, Madsen C, Bach FW, Gram LF, Jensen TS. [St. John's wort has no effect on pain in polyneuropathy]. Pain 2000;91:361‐365. [PubMed] [Google Scholar]

Sommer 1994 {published and unpublished data}

  • Harrer G, Sommer H. Therapie leichter/mittelschwerer Depressionen mit Hypericum. Münchener medizinische Wochenschrift 1993;135:305‐9. [Google Scholar]
  • Harrer G, Sommer H. Treatment of mild/moderate depressions with Hypericum. Phytomedicine 1994;1:3‐8. [PubMed] [Google Scholar]
  • Sommer H. Besserung psychovegetativer Beschwerden durch Hypericum im Rahmen einer multizentrischen Dopppelblindstudie. Nervenheilkunde 1991;10:308‐10. [Google Scholar]
  • Sommer H, Harrer G. Placebo‐controlled double‐blind study examining the effectiveness of an hypericum preparation in 105 mildly depressed patients. Journal of Geriatric Psychiatry and Neurology 1994;7(suppl 1):S9‐11. [PubMed] [Google Scholar]
  • Sommer H, Harrer G. Placebo‐kontrollierte Studie zur Wirksamkeit eine Hypericumpräparates bei 105 Patienten mit Depressionen. Nervenheilkunde 1993;12:274‐7. [Google Scholar]

Spielberger 1985 {published data only}

  • Spielberger E. Johanniskraut‐Präparat lindert selbst mittelschwere Depressionen. Ärztliche Praxis 1985;37:2546‐7. [Google Scholar]

Staffeldt 1993 {published data only}

  • Staffeldt B, Kerb R, Brockmöller J, Ploch M, Roots I. Pharmacokinetics of hypericin and pseudohypericin after oral intake of the hypericum perforatum extract LI 160 in healthy volunteers. Journal of Geriatric Psychiatry and Neurology 1994;7(suppl 1):S47‐53. [PubMed] [Google Scholar]
  • Staffeldt B, Kerb R, Brockmöller J, Ploch M, Roots I. Pharmakokinetik von Hypericin und Pseudohypericin nach oraler Einnahme des Johanniskraut‐Extrakts LI 160 bei gesunden Probanden. Nervenheilkunde 1993;12:331‐8. [Google Scholar]

Steger 1985 {published and unpublished data}

  • Steger W. Depressive Verstimmungen. Zeitschrift für Allgemeinmedizin 1985;61:914‐8. [Google Scholar]

Volz 2002 {published data only}

  • Volz HP, Murck H, Kasper S, Möller HJ. St John's wort extract (LI 160) in somatoform disorders: results of a placebo‐controlled trial. Psychopharmacology 2002;164:294‐300. [PubMed] [Google Scholar]

Vorbach 1994 {published and unpublished data}

  • Vorbach EU, Hübner WD, Arnoldt KH. Effectiveness and tolerance of the hypericum extract LI 160 in comparison with imipramine: randomized double‐blind study in 135 outpatients. Journal of Geriatric Psychiatry and Neurology 1994;7(suppl 1):S19‐23. [PubMed] [Google Scholar]
  • Vorbach EU, Hübner WD, Arnoldt KH. Wirksamkeit und Verträglichkeit des Hypericum‐Extraktes LI 160 im Vergleich mit Imipramin. Nervenheilkunde 1993;12:290‐6. [Google Scholar]

Warnecke 1986 {published data only}

  • Warnecke G. Beeinflussung klimakterischer Depressionen. Zeitschrift für Allgemeinmedizin 1986;62:1111‐3. [Google Scholar]

Werth 1989 {published data only}

  • Werth W. Psychotonin M versus Imipramin in der Chirurgie. Der Kassenarzt 1989;15:64‐8. [Google Scholar]

Wienert 1991 {published data only}

  • Wienert V, Classen R, Hiller KO. Zur Frage der Photosensibilisierung von Hypericin in einer Baldrian‐Johanniskraut‐Kombination ‐ klinisch‐experimentelle, plazebokontrollierte Vergleichsstudie. Lübeck‐Travemünde, 3rd Phytotherapy Congress 1991.

Winkel 2000 {published data only}

  • Winkel R, Koritsch HD, Piayda H, Trenckmann. [Phytopharmakotherapie der komorbiden depressiven Störungen bei Alkoholabhängigkeit ‐ Wirksamkeit des Johanniskrautextraktes LI 160 bei komorbid‐depressiv Alkoholerkrankten]. In: Trenckmann U editor(s). Phytopharmaka und seelische Störungen. Neu‐Isenburg: LinguaMed, 2000:43‐51. [Google Scholar]
  • Winkel R, Koritsch HD, Piayda H, Trenckmann U. St. John's wort extract LI 160 in depressive, alcohol addicted patients. Phytomedicine 2000;7 (suppl II):19. [Google Scholar]

Zeller 2000 {published data only}

  • Zeller K. [Once daily administration of hypericum extract (STW 3): a convenient treatment]. In: Chrubasik S, Roufogalis BD editor(s). Herbal medicinal products for the treatment of pain. Lismore: Southern Cross University Press, 2000:164‐8. [Google Scholar]

Additional references

Caccia 2005

  • Caccia S. Anti‐depressant‐like components of hypericum perforatum extracts: an overview of their pharmacokinetics and metabolism. Current Drug Metabolism 2005;6:531‐43. [PubMed] [Google Scholar]

Candy 2008

  • M Candy B, Jones L, Williams R, Tookman A, King. Psychostimulants for depression. Cochrane Database of Systematic Reviews 2008, Issue 2. [DOI: 10.1002/14651858.CD006722.pub2] [PubMed] [CrossRef] [Google Scholar]

CGI 1970

  • National Institutes of Mental Health. CGI ‐ Clinical Global Impressions. In: Guy W editor(s). ECDEU Assessment Manual for Psychopharmacology. Rockville, MD: US Department of Heath, Education, and Welfare Public Health Service Alcohol, Drug Abuse, and Mental Health Administration, 1970:217‐222. [Google Scholar]

Deeks 2006

  • Deeks JJ, Higgins JPT, Altman DG on behalf of the Cochrane Statistical Methods Group. Analysing and presenting results. In: Higgins JPT, Green S editor(s). Cochrane Handbook for Systematic Reviews of Interventions 4.2.6 [updated September 2006]. In: The Cochrane Library, Issue 4, 2006. Chichester: John Wiley & Sons, Ltd, 2006. [Google Scholar]

DSM‐IV‐TR

  • American Psychiatric Association. DSM‐IV‐TR®. http://www.dsmivtr.org/index.cfm.

Egger 1997

Ernst 1995

  • Ernst E. St. John’s wort, an anti‐depressant? A systematic, criteria‐based review. Phytomedicine 1995;2:67‐71. [PubMed] [Google Scholar]

Ernst 1999

  • Ernst E. Second thoughts about safety of St John's wort. Lancet 1999;354:2014‐5. [PubMed] [Google Scholar]

Gaster 2000

  • Gaster B. St John's wort for depression. A systematic review. Archives of Internal Medicine 2000;160:152‐6. [PubMed] [Google Scholar]

Hamilton 1960

Hammerness 2003

  • Hammerness P, Basch E, Ulbricht C. St. John's wort: a systematic review of adverse effects and drug interactions for the consultation psychiatrist. Psychosomatics 2003;44:271‐82. [PubMed] [Google Scholar]

Higgins 2003

Higgins 2008

  • Higgins JPT, Altman DG on behalf of the Cochrane Statistical Methods Group and the Cochrane Bias Methods Group. Chapter 8:  Assessing risk of bias in included studies. In: Higgins JPT, Green S editor(s). Cochrane Handbook for Systematic Reviews of Interventions Version 5.0.0 [updated February 2008]. The Cochrane Collaboration, 2008. [Google Scholar]

Härter 2003

  • Härter M, Bermejo I, Schneider F, Katz S, Gaebel W, Hegerl U, Niebling W, Berger M. Clinical practice guideline for the diagnosis and treatment of depressive disorders in primary care [Versorgungsleitlinien zur Diagnostik und Therapie depressiver Störungen in der hausärztlichen Praxis]. Zeitschrift für ärztliche Fortbildung und Qualitätssicherung 2003;7 (suppl. IV):16‐35. [PubMed] [Google Scholar]

ICD‐10

  • World Health Organization. International Statistical Classification of Diseases and Related Health Problems, 10th Revision, Version for 2007. http://www.who.int/classifications/apps/icd/icd10online/.

ICSI 2007

  • Institute for Clinical Systems Improvement (ICSI). Health care guideline: major depression in adults in primary care. www.icsi.org 2007.

Jadad 1996

  • Jadad AR, Moore RA, Carrol D, Jenkinson C, Reynolds DJM, Gavaghan DJ, McQuay HJ. Assessing the quality of reports of randomized clinical trials: is blinding necessary?. Controlled Clinical Trials 1996;17:1‐12. [PubMed] [Google Scholar]

Khan 2000

  • Khan A, Warner HA, Brown WA. Symptom reduction and suicide risk in patients treated with placebo in antidepressant clinical trials. Archives of General Psychiatry 2000;57:311‐7. [PubMed] [Google Scholar]

Kim 1999

  • Kim HL, Streltzer J, Goebert D. St John's wort for depression: a meta‐analysis of well‐defined clinical trials. Journal of Nerval and Mental Diseases 1999;187:532‐9. [PubMed] [Google Scholar]

Kirsch 2002

  • Kirsch I, Moore TJ, Scoboria A, Nicholls SS. The emporor's new drugs: an analysis of antidepressant medication submitted to the U.S. Food and Drug Administration. Prevention & Treatment 2002;5:article 23. [Google Scholar]

Kirsch 2008

  • Kirsch I, Deacon BJ, Huedo‐Medina TB, Scoboria A, Moore TJ, Johnson BT. Initial severity and antidepressant benefits: a meta‐analysis of data submitted to the Food and Drug Administration. PLoS Medicine 2008;5:260‐8. [PMC free article] [PubMed] [Google Scholar]

Knüppel 2004

  • Knüppel L, Linde K. Adverse effects of St. John's wort ‐ a systematic review. Journal of Clinical Psychiatry 2004;65:1470‐9. [PubMed] [Google Scholar]

Linde 1996

  • Linde K, Ramirez G, Mulrow CD, Pauls A, Weidenhammer W, Melchart D. St John's wort for depression ‐ an overview and meta‐analysis of randomized clinical trials. BMJ 1996;313:253‐8. [PMC free article] [PubMed] [Google Scholar]

Linde 1996b

  • Linde K, Worku F, Melchart D, Wiesner‐Zechmeister M, Pothmann R, Weinschütz T, Melchart D. Randomized clinical trials of acupuncture for asthma ‐ a systematic review. Forschende Komplementärmedizin 1996;3:148‐55. [Google Scholar]

Linde 1997

  • Linde K, Clausisus N, Ramirez G, Melchart D, Eitel F, Hedges LV, Jonas WB. Are the clinical effects of homoeopathy placebo effects? A meta‐analysis of placebo‐controlled trials. Lancet 1997;350:834‐43. [PubMed] [Google Scholar]

Lipsey 2000

  • Lipsey M, Wilson DB. Practical meta‐analysis. Applied social research methods series, Vol. 49. London: Sage Publications, 2000. [Google Scholar]

Montgomery 1979

  • Montgomery SA, Asberg M. A new depression scale designed to be sensitive to change. British Journal of Psychiatry 1979;134:382‐9. [PubMed] [Google Scholar]

Murck 2002

  • Murck H. Atypical depression and related disorders ‐ neurobiological basis for their treatment with St. John's wort [Die atypische Depression und verwandte Erkrankungen ‐ neurobiologische Grundlagen für ihre Behandlung mit Johanniskraut‐Extrakt]. Wiener medizinische Wochenschrift 2002;152:398‐403. [PubMed] [Google Scholar]

Murck 2005

  • Murck H, Fava M, Alpert J, Nierenberg AA, Mischoulon D, Otto MW, Zajecka J, Mannel M, Rosenbaum JF. Hypericum extract in patients with MDD and reversed vegetative signs: re‐analysis from data of a double‐blind, randomized trial of hypericum extract, fluoxetine, and placebo. International Journal of Neuropsychopharmacology 2005;8:215‐21. [PubMed] [Google Scholar]

Nahrstedt 1997

  • Nahrstedt A, Butterweck V. Biologically active and other chemical constitutents of the herb hypericum perforatum L. Pharmacopsychiatry 1997;30(suppl 2):129‐34. [PubMed] [Google Scholar]

Nieuwstraten 2006

  • Nieuwstraten C, Labiris NR, Holbrock A. Systematic overview of drug interactions with antidepressant medications. Canadian Journal of Psychiatry 2006;51:300‐16. [PubMed] [Google Scholar]

Posternak 2007

  • Posternak MA, Zimmerman M. Therapeutic effect of follow‐up assessments on antidepressant and placebo response rates in antidepressant effiacy trials. British Journal of Psychiatry 2007;190:287‐92. [PubMed] [Google Scholar]

Reichling 2003

  • Reichling J, Hostanska K, Saller R. St. John's wort (Hypericum perforatum L.) ‐ multicompound preparations versus single substances [Johanniskraut (Hypericum perforatum L.) ‐ Vielstoffgemische kontra phytogene Einzelstoffe]. Forschende Komplementärmedizin und Klassische Naturheilkunde 2003;10 (suppl 1):28‐32. [PubMed] [Google Scholar]

Smith 2004

  • Smith L, Ernst E, Ewings P, Myers P, Smith C. Co‐ingestion of herbal medicines and warfarin. British Journal of General Practice 2004;54:439‐41. [PMC free article] [PubMed] [Google Scholar]

Sterne 2001

  • Sterne JA, Egger M, Davey Smith G. Systematic reviews in health care: investigating and dealing with publication and other biases in meta‐analysis. BMJ 2001;323:101‐5. [PMC free article] [PubMed] [Google Scholar]

Thompson 1999

  • Thompson SD, Sharp SJ. Explaining heterogeneity in meta‐analyses: a comparison of methods. Statitics in Medicine 1999;18:2693‐708. [PubMed] [Google Scholar]

Turner 2008

  • Turner EH, Matthews MM, Linardatos E, Tell RA, Rosenthal R. Selective publication of antidepressant trials and its influence on apparent efficacy. New England Journal of Medicine 2008;358:252‐60. [PubMed] [Google Scholar]

Ustun 2004

  • Ustun TB, Ayuso‐Mateos JL, Chatterji S, Mathers C, Murrary CJ. Global burden of depressive disorders in the year 2000. British Journal of Psychiatry 2004;184:386‐92. [PubMed] [Google Scholar]

Volz 1997

  • Volz HP. Controlled clinical trials of hypericum extract in depressed patients ‐ an overview. Pharmacopsychiatry 1997;30(suppl 2):72‐6. [PubMed] [Google Scholar]

von Zerssen 1996

  • Zerssen D. D‐S. In: AMDP, CIPS editor(s). Rating scales for psychiatry. Weinheim: Beltz Test, 1990:section 2.8. [Google Scholar]

Wheatley 1998

  • Wheatley D. Hypericum extract ‐ potential in the treatment of depression. CNS Drugs 1998;9:431‐440. [Google Scholar]

Whitten 2006

  • Whitten DL, Myers SP, Hawrelak JA, Wohlmuth H. The effect of St John's wort extracts on CYP3A: a systematic review of prospective clinical trials. British Journal of Clinical Pharmacology 2006;62:512‐26. [PMC free article] [PubMed] [Google Scholar]

Williams 2000

  • Williams JW jr, Mulrow CD, Chiquette E, Hitchcock NP, Aguilar C, Cornell J. A systmatic review of newer pharmacotherapies for depression in adults: evidence report summary. Annals of Internal Medicine 2000;132:743‐56. [PubMed] [Google Scholar]

Wilson 2002

  • Wilson DB. SPSS, Stata, and SAS macros for performing meta‐analytic analyses. http://mason.gmu.edu/˜dwilsonb/ma.html. last uptated: 15‐1‐2002. accessed: 10‐11‐2003.

Wurglics 2003

  • Wurglics M, Schulte‐Löbbert S, Dingermann T, Schubert‐Zsilavecs M. Rational and traditional hypericum preparations [Rationale und traditionelle Johanniskraut‐Präparate]. Deutsche Apotheker Zeitung 2003;143:1454‐8. [Google Scholar]

Wurglics 2006

  • Wurglics M, Schubert‐Zsilavecz M. Hypericum perforatum: a "modern" herbal antidepresant: pharmacokinetics of active ingredients. Clinical Pharmacokinetics 2006;45:449‐68. [PubMed] [Google Scholar]

References to other published versions of this review

Linde 1998

  • Linde K, Mulrow CD. St John's wort for depression. Cochrane Database of Systematic Reviews 1998, Issue 4. [PubMed] [Google Scholar]

Linde 2005a

  • Linde K, Mulrow CD, Berner M, Egger M. St John's wort for depression. Cochrane Database of Systematic Reviews 2005, Issue 2. [PubMed] [Google Scholar]

Linde 2005b

  • Linde K, Berner M, Egger M, Culrow C. St John's wort for depression ‐ meta‐analysis of randomised controlled trials. British Journal of Psychiatry 2005;186:99‐107. [PubMed] [Google Scholar]

Articles from The Cochrane Database of Systematic Reviews are provided here courtesy of Wiley

-