Top Banner
PMR Classification Study Protocol Version 1 15/05/2007 Page 1 A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication for long-term glucocorticoid therapy in the community. PMR is the rheumatic disease subject to the widest variations of clinical practice since it is managed both in primary and secondary care setting by general practitioners, rheumatologists and non-rheumatologists. There is considerable uncertainty related to diagnosis and outcomes in patients presenting with the polymyalgic syndrome. Furthermore, the course of the disease is heterogeneous and unpredictable. Glucocorticoid treatment leads to a rapid and dramatic improvement in disease symptoms in the majority of patients but treatment typically lasts for several years and results in serious adverse effects, of which osteoporosis, fractures, diabetes and infections are among the worst. As a result, treatment of PMR remains largely empiric and unsatisfactory. There is an unmet clinical need for the treatment of PMR due to the long-term adverse effects of prolonged use of glucocorticoids. The foremost factor that has hampered development of rational therapeutic approaches to management of PMR is the lack of standardized diagnostic criteria and reliable, valid and sensitive outcome measures. These deficiencies have led to an inability to accurately distinguish this clinical disease entity from other conditions presenting with the polymyalgic syndrome, and difficulty with evaluation and comparison of the efficacy of different therapeutic approaches, including novel drug therapies. In order to tackle these challenges, an international PMR Consortium has convened and initiated a multinational effort. The long-term goal of the Consortium is to standardize classification criteria and outcome measures in patients with PMR and facilitate the conduct of well-designed clinical studies that incorporate these standardized and clinically relevant outcomes. The objective of this particular application is to develop and quantitatively evaluate the reliability, validity and sensitivity to change of various measures for disease activity, response and remission in patients with PMR in a multinational prospective observational study. To this end, our specific aims are as follows: Specific Aim 1. To develop a reliable and valid composite disease activity score and remission criteria in PMR Aim 1a. To evaluate the validity, redundancy and reproducibility of core disease activity domains based on patient-reported (pain, stiffness, function, global assessment, fatigue, QOL), physician- reported (global assessment, tenderness, pain on motion and limitation, therapy), laboratory (ESR, CRP) and ultrasound -based measures Aim 1b. To derive a preliminary composite disease activity score and remission criteria incorporating these core domains Aim 1c. To assess the discriminant validity of core domains and composite score in differentiating PMR from conditions mimicking PMR Specific Aim 2. To develop a preliminary definition of improvement for the evaluation of response to therapy in PMR Aim 2a. To evaluate the within group sensitivity to change (responsiveness) and discriminant validity of core disease activity domains as measured by patient-based, physician-based, laboratory and ultrasound-based measures Aim 2b. To derive a preliminary composite definition of improvement based on disease activity (Aim 1b) and change in core disease activity domains Specific Aim 3. To create an infrastructure for storage of biospecimens in order to facilitate future research on novel disease-specific biomarkers and their therapeutic potential This will be the first comprehensive effort to develop classification and outcome criteria in PMR. The findings of this project will be used to develop preliminary evidence-based, consensus guidelines for
30

A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

May 22, 2018

Download

Documents

lynguyet
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 1

A SPECIFIC AIMS

Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication for long-term glucocorticoid therapy in the community. PMR is the rheumatic disease subject to the widest variations of clinical practice since it is managed both in primary and secondary care setting by general practitioners, rheumatologists and non-rheumatologists. There is considerable uncertainty related to diagnosis and outcomes in patients presenting with the polymyalgic syndrome. Furthermore, the course of the disease is heterogeneous and unpredictable . Glucocorticoid treatment leads to a rapid and dramatic improvement in disease symptoms in the majority of patients but treatment typically lasts for several years and results in serious adverse effects , of which osteoporosis, fractures, diabetes and infections are among the worst. As a result, treatment of PMR remains largely empiric and unsatisfactory. There is an unmet clinical need for the treatment of PMR due to the long-term adverse effects of prolonged use of glucocorticoids.

The foremost factor that has hampered development of rational therapeutic approaches to management of PMR is the lack of standardized diagnostic criteria and reliable, valid and sensitive outcome measures. These deficiencies have led to an inability to accurately distinguish this clinical disease entity from other conditions presenting with the polymyalgic syndrome, and difficulty with evaluation and comparison of the efficacy of different therapeutic approaches, including novel drug therapies. In order to tackle these challenges, an international PMR Consortium has convened and initiated a multinational effort. The long-term goal of the Consortium is to standardize classification criteria and outcome measures in patients with PMR and facilitate the conduct of well-designed clinical studies that incorporate these standardized and clinically relevant outcomes. The objective of this particular application is to develop and quantitatively evaluate the reliability, validity and sensitivity to change of various measures for disease activity, response and remission in patients with PMR in a multinational prospective observational study. To this end, our specific aims are as follows:

Specific Aim 1. To develop a reliable and valid com posite disease activity score and remission criteria in PMR

Aim 1a. To evaluate the validity, redundancy and reproducibility of core disease activity domains based on patient-reported (pain, stiffness, function, global assessment, fatigue, QOL), physician-reported (global assessment, tenderness, pain on motion and limitation, therapy), laboratory (ESR, CRP) and ultrasound -based measures

Aim 1b. To derive a preliminary composite disease activity score and remission criteria incorporating these core domains

Aim 1c. To assess the discriminant validity of core domains and composite score in differentiating PMR from conditions mimicking PMR

Specific Aim 2. To develop a preliminary definition of improvement for the evaluation of response to therapy in PMR

Aim 2a. To evaluate the within group sensitivity to change (responsiveness) and discriminant validity of core disease activity domains as measured by patient-based, physician-based, laboratory and ultrasound-based measures

Aim 2b. To derive a preliminary composite definition of improvement based on disease activity (Aim 1b) and change in core disease activity domains

Specific Aim 3. To create an infrastructure for sto rage of biospecimens in order to facilitate future research on novel disease-specific biomarker s and their therapeutic potential

This will be the first comprehensive effort to develop classification and outcome criteria in PMR. The findings of this project will be used to develop preliminary evidence-based, consensus guidelines for

Page 2: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 2

evaluating outcomes in PMR and form the basis for international guidelines. The conduct of the proposed project will be optimized by several key e lements . First, this project will be implemented as a multinational effort. A PMR Consortium consisting of highly committed rheumatologists and non-rheumatologists with synergistic experience on a number of fronts (epidemiology, rheumatology, general medicine, health services research, biostatistics) is actively working together to develop the methodology for the prospective validation study. Recent and ongoing work by the PMR Consortium provides important insights into the objectives and the methodologies proposed herein. This international, collaborative and broadly based effort has a high likelihood of resulting in wider acceptability and utilization of the newly developed criteria. Given the uncertainties associated with PMR diagnosis and fast evolution of clinical findings and symptoms, it is of paramount importance to develop classification and outcome measures simultaneously and this is ensured in this proposal. The infrastructure for the proposed study has already b een established with 22 centers in 10 countries having participated in preliminary studies and agreeing to participate in the current effort. In this context, we demonstrated the feasibility of the major methodological components central to this proposal through our ongoing collaborative work. This effort is also endorsed by the American College of Rheumatology (ACR) and the European League Against Rheumatism (EULAR).

B BACKGROUND AND SIGNIFICANCE

B.1 Problems with PMR diagnosis and management

PMR is an inflammatory condition of unknown etiology characterized by aching and morning stiffness in the proximal regions of the extremities and torso, and elevated markers of inflammation including the erythrocyte sedimentation rate (ESR). Few population-based studies have evaluated the epidemiological aspects of PMR because of the lack of a diagnostic hallmark and universally accepted classification criteria. PMR is extremely rare in persons under 50 years of age. The prevalence among those over 50 is estimated to be around 0.74%1. The overall age- and sex-adjusted annual incidence per 100,000 population aged ≥50 years in Olmsted County is estimated at 58.7 (95% confidence interval: 52.8, 64.7)2. The incidence of PMR increases with age, with a peak in those 70–80 years of age. PMR is more frequent in females than in males in all age groups. PMR is probably a polygenic disease in which multiple environmental and genetic factors influence susceptibility and severity. The increased incidence at higher latitudes (i.e. in Scandinavian countries and in USA communities with a strong Scandinavian ethnic background) and occasional familial cases support the etiological role of both environmental and genetic causes3.

In the past, PMR was considered a manifestation of giant-cell arteritis (GCA) or a variant of elderly-onset rheumatoid arthritis (EORA). This fact can in part explain the lack of efforts to standardize its diagnosis. Therefore, the criteria most frequently used for diagnostic purposes are empirical. They have been defined by clinical experts who had studied the disease extensively. The criteria defined by Hunder at the Mayo Clinic4 and by Healey at Mason Clinic5 are similar, the only difference being the inclusion in the Healey criteria of the response to steroids. So far, the only effort to develop diagnostic criteria for PMR was a collaborative study performed by 11 British rheumatology units (Bird criteria)6. There are several similarities and dissimilarities between these criteria, for example, different cut-offs for age and ESR values and the need to exclude other conditions.

There are 3 diagnostic challenges in PMR. First, PMR is a heterogeneous disease and many features of PMR predispose the unwary clinician to diagnostic error7. The proximal pain and stiffness syndrome - the main symptoms of PMR - can occur in many other illnesses. A third of the patients have systemic symptoms such as fever, anorexia and weight loss. Half the patients may have distal musculoskeletal manifestations such as peripheral arthritis, distal swelling with pitting edema and

Page 3: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 3

carpal tunnel syndrome. PMR is also associated with GCA in 10% to 20% of cases. Although ESR of at least 40 mm/hour is considered an important finding for the diagnosis of PMR, an acute phase response and elevated ESR values can occur in other settings such as neoplasia, rheumatoid arthritis and infection. Furthermore, some studies have reported that a normal ESR at diagnosis accounted for 7–20% of the patients with PMR8-10 PMR and GCA are regarded as urgent indications for the start of steroid therapy. Many clinicians and 2 of the published diagnostic criteria use a response to steroids as a main defining feature of these conditions11, 12. This may encourage diagnostic error since steroids are potent anti-inflammatory agents that can mask symptoms from a host of serious conditions ranging from osteoarthritis, rotator cuff problems, RA, cancer, infection, migraine and intra-cranial tumor, especially if used in traditional high doses and for protracted lengths of time.

Second, there is uncertainty about the origin of symptoms that deter mines duration of treatment. Reports from the Mayo Clinic in the USA originally described a short duration of disease both for PMR (11 months median duration of treatment with 75% of patients off steroids at 2 years) and giant cell arteritis4. However, reports from Europe describe a more prolonged illness requiring several years of steroid therapy marked by frequent relapses13-16. The fact that the majority of patients with PMR remain on long-term steroids suggests persistent symptoms. Kyle and Hazleman found that only 24% of patients were off steroids at 2 years13. A recent cohort study reported a high incidence of relapses, this was largely irrespective of original steroid responsiveness and up to a third complained of proximal pain at 12 months in the absence of raised inflammatory indices. Results of our Delphi survey of generalists corroborate these observations17.

A third difficulty is the wide variations in clinical practice. In comparison to other rheumatic diseases, PMR is managed both in the primary and secondary care setting by general practitioners, rheumatologists and non-rheumatologists18-20. Rheumatologist referral is mostly for management of steroid complications21. There is a lack of recognition of PMR, particularly in primary care. Bahlas et al reported low accuracy of PMR diagnosis and an over-utilization of diagnostic investigations by family practitioners18. Among the various rheumatologic diseases, the lowest agreement between primary care physicians and rheumatologists is on the subject of diagnosis of PMR22. In another study in the north-eastern USA, 128 physicians were presented with videos of patients presenting with symptoms of PMR and depression23. The study was designed to assess whether patient or physician attributes influenced clinical decision-making. Only 9 of the 128 physicians identified PMR as the most likely diagnosis whereas 84 considered depression most likely23.

In conclusion, PMR is a heterogeneous disease. The re is considerable uncertainty and variations related to diagnosis and outcomes in pat ients presenting with the polymyalgic syndrome. The heterogeneity of the disease along w ith wide variations in practice in the management of PMR hinders easy diagnosis and recogn ition of changes in disease activity status. These difficulties present a challenge whe n evaluating patients in the clinic and when conducting clinical studies. There is an urgent ne ed for guidelines for safe diagnosis, ongoing monitoring of the disease, vigilance regarding an a lternative diagnosis and early referral of patients with atypical features and poor steroid re sponse. Our proposed study will provide information on several of these issues.

B.2 Unmet clinical need in PMR

The mainstay of therapy in PMR is oral steroids24. Yet, the true outcome of standard steroid therapy, which may only partially control disease and symptoms, is not known. Prednisone, 15 to 25 mg, usually suppresses inflammation dramatically25. However, up to 60% of patients experience disease relapse during steroid tapering and several studies indicate that steroid treatment can only rarely be discontinued before 2 years12, 24, 26, 27. A recent study identified complete, partial and non-responders

Page 4: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 4

to initial standard steroid therapy with partial and non-responders having earlier relapses leading to greater cumulative dose28. Observational studies indicate that higher doses of steroids are not more effective in reducing the incidence of relapses but this has never been tested in clinical trials.

In a large series of patients taking steroids, the median reported starting dose of prednisone was 20 mg/d4; Other studies reported similar figures. Recent studies indicate that oral steroids should be administered for 18 to 36 months29, 30. The obvious risk for side effects of this long-term treatment, particularly osteoporosis, hypertension, hyperglycemia, and cataracts31, support the need for improved

therapeutic options, particularly in patients at high risk for steroid-related toxicity31, 32.

Randomized controlled clinical trials of different therapeutic agents are rare in PMR. In one small clinical trial, intramuscular methylprednisolone acetate on a monthly basis was shown to have benefit in reducing cumulative steroid dose and toxicity while maintaining control of disease16. A number of studies examined the efficacy of methotrexate with conflicting findings33-37. There is some evidence to suggest that methotrexate use can reduce the duration of steroid therapy and disease flares. Other than these small studies, there are only 1-2 trials with newer biologics. Infliximab was recently found to be ineffective in a small clinical trial38.

In conclusion, it is imperative to evaluate the eff icacy of disease modifying anti-rheumatic drugs in PMR. Unlike the extensive experience in r heumatoid arthritis, previous therapeutic studies in PMR are based on small patient numbers, have examined only short-term effects and have not evaluated patient-based outcomes such as f unctional ability and quality of life. Classification criteria for PMR will ensure recruit ment of individuals with similar clinical entity, and response criteria will ensure standardized defi nitions of therapeutic response and outcomes and qualitative comparisons across trials using the same metrics. In addition, this project will provide an infrastructure for storage of biospecimens. Combined with the phenotypic information collected as part of the pro posed study, the biospecimen infrastructure will facilitate future research on novel disease-sp ecific biomarkers (e.g. IL-1, IL-6, proteomic and genomics applications) and their therapeutic po tential.

B.3 Need for outcomes and classification criteria f or PMR There are few standard definitions for diagnosis, clinical response and remission in PMR39. Several different outcome measures have been used in the past, and there is no consensus about the optimal endpoints for evaluating treatment (see Table 1). Moreover, none of these remission definitions have been validated, especially with patient-based outcome measures. These remission definitions are arbitrary in a number of fronts, in particular importance of various criteria components at different stages of the disease, time/duration of the disease activity and remission states, correlation with prognostic markers and failure to account for the continuity of disease activity process. For example, validity and sensitivity to change of different measures can be different early in the disease course versus later (e.g. pain resolves early but laboratory values remain high for longer). There are other unanswered questions, such as, how many patients with proximal pain and stiffness respond to steroids even though they are not thought to have PMR? Can a patient have PMR even if he/she doesn't respond to steroids at a specified dose?

Page 5: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 5

ESR, erythrocyte sedimentation rate; CRP, C-reactive protein

There is also no consensus on the role of imaging for the evaluation of suspected PMR. Magnetic resonance imaging (MRI) in PMR patients with active disease showed that bilateral subacromial and subdeltoid bursitis was present in 100% of PMR patients. Gleno-humeral synovitis and long head biceps tenosynovitis were detected in 77% and 54% of the cases47. Ultrasound evaluations revealed evidence of subacromial/subdeltoid bursitis in almost all patients with PMR and in only 22% of controls. The lesions were bilateral in 96% of the PMR patients and in only 4% of controls. A recent systematic review of ultrasound in GCA clearly finds it a useful investigation48.

Needless to say, there is an urgent need to develop consensus criteria for PMR. The one advantage of the investigators of this proposal is the body of experience with criteria development in oth er rheumatic diseases . International, gold standard guidelines for identifying a core set of outcomes to be measured to define improvement in rheumatic diseases, OMERACT, have now been established in adult RA49-51, juvenile arthritis52, ankylosing spondylitis53 and osteoarthritis and osteoporosis54. The

Table 1. Remis sion criteria applied in the literature

History criteria

Clinical criteria Laboratory criteria

Therapy

ESR

[ mm/1 st hour ]

CRP [ mg/dl ]

Blood count

Plasma fibrinogen

[ µmol/l ]

Martinez- Taboada40

Absence of clinical symptoms

Normal laboratory

values

Meliconi41 Absence of clinical symptoms

< 40 < 0,5

Cantini9 Absence of clinical symptoms

< 40 < 0,5

Dasgupta16

Reduction of Pain (VAS ≥

50%) Morning stiffness <30

min

< 20 Hb>12 g/dl n.a.

Feinberg42 Absence of pain

Normal values

Ferraccioli35 Absence of myalgia

Normal values

Normal values

Mertens43 Absence of clinical symptoms

Krogsgaard44

Absence of muscular pain

morning stiffness

muscular tenderness

Normal values

Normal values

Wagner45

morning stiffness for just some minutes

Absence of clinical symptoms < 20

≤ 5mg Corticosteroids and

no methotrexate

Delecoeuillerie27 Absence of clinical symptoms

Ayoub46 Absence of clinical symptoms

Behn15 Absence of clinical symptoms

< 30

Chuang4 Absence of clinical

symptoms Normal laboratory values

Discontinuation of steroids or NSAID other than Aspirin

If treated with Aspirin: dose less

than 1,8g/ day

Page 6: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 6

OMERACT guidelines were developed through a series of evidence-based, consensus conferences, the aim of which was to identify the most scientifically rigorous measures to evaluate clinical and patient-based outcomes in clinical trials. There is a clear need for similar guidelines to define a core set of outcomes for use in future research on PMR. The first step in this process is to obtain validated classification criteria, rigorously measured outcome data (clinical and patient-based), a standard validated definition of disease activity and remission and improvement in a large cohort of patients in a prospective study of outcomes in PMR. The goal of this proposal is that each outcome measure in PMR meets the standards of the "OMERACT Filter"55-57 of truth (face, content, construct, and criterion validity), discrimination (reliability and sensitivity to change), and feasibility. We will adopt a data-driven process to achieve this goal.

B.4 Significance This project is significant in a number of aspects. First, this will be the first comprehensive assessment of outcome measures in patients with PMR . The results emanating from this study will provide reliable and quantifiable criteria for early distinction of PMR from other similar conditions, initiation of timely therapeutic interventions and consideration of clinically relevant outcomes that matter to the patients and also define the course of the disease. Second, it brings together the expertise and commitment of an international group of investigators. Third, validated and standardized classification and outcome criteria will open the way to well-designed clinical studies and new therapeutic targets with the aim of improving patient care in PMR. Fourth, the proposed biospecimen storage will facilitate identification of therapeutic targets. Lastly, the benefits are not just limited to the clinical research setting. These standardized criteria will be used to monitor health status and response to treatment in the clinical setting, case management and adjudication in litigation environments, for benchmarking activities, systematic reviews, cost-effectiveness analyses, clinical decision making and estimation of the burden of PMR to patients and the society.

C PRELIMINARY STUDIES

This project will specifically build on a strong infrastructure of international network of rheumatologists with interest in PMR and who are committed to development of standardized PMR outcomes criteria. Specifically, the PMR work group was convened in 2005 and already demonstrated their commitment to this effort and efficiency in working together with the series of activities over the last year. These are briefly summarized below and more extensively in a manuscript58.

The PMR Consortium of international experts in PMR, biostatistics and epidemiology was formed as a work group in 2005 in order to develop classification criteria for PMR. This effort was supported by the ACR Classification and Response Criteria Subcommittee. The Consortium has evolved with over 50 investigators in 10 countries. It meets 2 to 3 times a year. Completed projects include a submitted manuscript on methodology for study of PMR, work of a consensus conference for classification criteria and an extended Delphi study of these criteria. The current study has emerged out of these efforts.

Literature review : An extensive literature review was performed by the PMR work group in collaboration with the British Society for Rheumatology. Briefly, this literature review indicated that there was a dearth of clinical studies of PMR conducted within the primary care and rheumatology setting and very few randomized controlled treatment trials. Methodological problems encountered in the published literature included: small sample sizes that were not powerful enough to measure differences in clinically important outcomes with high precision, diversity of measurement instruments

Page 7: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 7

that were mostly unvalidated59, 60, problems with internal validity, absence of strict randomization and blind assessment of observer–rated outcomes, inadequate information on content and quality of interventions and protocol adherence, lack of patient data prior to steroid interventions and use of a variety of diagnostic entry criteria with blurred distinction between isolated PMR and GCA with polymyalgia61. There were only a few observational studies of long-term outcomes15, 30, 31, 46, 62-65.

A Delphi survey of imaging including ultrasound in PMR was conducted in 2005 and indicated that physicians did not believe that currently, there was a role for routine use of MRI or FDGPET scans in the investigation of PMR. However musculoskeletal ultrasound had promise due to widespread availability (65% of experts considered musculoskeletal ultrasound to be routinely available), feasibility and good research evidence, and the utility of ultrasound in PMR diagnosis. The PMR Consortium imaging work group met in the Netherlands in June 2006 to standardize the examination of shoulders and hips by ultrasound for the purposes of the current study.

Development of consensus-based initial candidate cr iteria by the PMR Consortium The PMR work group first convened in 2005 and discussed several issues on the nature of the diagnostic and evaluation process in PMR. The results of the work group’s deliberations were presented to the attendees of the 3rd International Conference on GCA and PMR held at St. Johns College, Cambridge and feedback was received. At the end of the conference, the work group members agreed that the diagnosis of PMR was a stepped process using core and supportive inclusion/exclusion criteria and a standardized initial steroid response. Over 50 items were rated. Of these, 43 receiving at least 50% agreement, including 10 core items that were agreed upon by expert consensus. These 43 items were evaluated in a subsequent extended Delphi survey.

Testing for wider acceptability of consensus based initial candidate criteria in an extended Delphi survey 17, 58, 66. A wider Delphi survey was used to determine if there is widespread acceptance of the views of the PMR work group and to receive input from other rheumatologists, primary care physicians and non-rheumatologists. Responses were received from 111 practicing rheumatologists from a mix of academic and private practice settings (49 US, 62 from 15 countries in Northern/Western Europe and Canada) and 53 generalists/internists (29 US, 24 UK). Over 70% of Delphi survey respondents agreed with the experts on 7 of 10 core candidate criteria. These were age >50 years, duration >2 weeks, bilateral shoulder and/or pelvic girdle aching, morning stiffness duration of >45 minutes, erythrocyte sedimentation rate (ESR), C-reactive protein (CRP) and rapid steroid response (> 75% global response within one week to prednisolone/prednisone 15-20 mg). Little agreement was observed on the remaining 3 core items: abrupt onset in <1 week (25%), systemic signs/symptoms (38%) and neck aching (35%). Among physical signs, >70% of respondents agreed with the experts on the importance of assessing shoulder (84%) and/or hip (76%) pain on motion, but agreement was low for peripheral signs like carpal tunnel, tenosynovitis and peripheral arthritis. Seventy five % or more of both groups agreed that a diagnosis of rheumatoid arthritis, lupus, vasculitis, inflammatory myopathy, septic arthritis, active neoplasia, active thyroid disease or drug-related myalgia would exclude the PMR diagnosis in a classification criteria study of the polymyalgic syndrome. Therefore, this Delphi survey clearly demonstrated that there is agreement between experts and practicing rheumatologists and generalists/internists on the i mportance of the majority of potential PMR criteria examined.

In conclusion, these preliminary studies demonstrat e the previous research experience, competence and commitment of the research team who have already laid the framework for the proposed research to develop outcome measures in PM R. A comprehensive and sensible list of potential criteria had been developed through an extensive literature review, consensus meetings and 3 separate Delphi surveys. The select ed criteria to be evaluated as part of this

Page 8: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 8

proposal are the most relevant, crucial clinical ch aracteristics of PMR and likely to facilitate classification and comprehensive assessment of pati ents. Therefore, the initial face and content validity of the selected list of outcome cr iteria have already been evaluated according to the OMERACT filter and more recent recommendatio ns by the Classification and Response Criteria Subcommittee of the ACR Quality Measures C ommittee. The next steps are to establish the reproducibility, redundancy and valid ity of these criteria for defining disease activity, remission and response in a prospective o bservational study. Still, we do recognize that criteria developed as part of this effort will be considered preliminary until tested in the context of a clinical trial.

D RESEARCH DESIGN AND METHODS

The core component of this project is a prospective observational study. General research methods including descriptions of the study design, recruitment process and study measurements are outlined in section D.1. Measurements and our rationale for inclusion of each of the measurements are described in section D.2. The statistical methods and power considerations are then described, according to each specific aim, in sections D.3 - D.5.

D.1 Overview of study design

The specific aims of this proposal will be fulfilled using a prospective observational study. Potential PMR cases (total 120 subjects) will be identified among consecutive patients referred to rheumatology clinics with suspected polymyalgia, i.e., new-onset proximal pain and morning stiffness and followed up over a 6-month period (Figure 1). Patients will be recruited prior to initiation of steroid treatment (i.e. patients should not have been treated with steroids within the last 12 weeks prior to study entry). Steroid treatment will be initiated according to a pre-defined protocol. Various clinical, laboratory and patient-based outcome measurements will be collected using standard questionnaires translated to different languages. Evaluations will be performed at baseline, 1 week, 4 weeks, 12 weeks, 26 weeks (6 months). Control patients will be evaluated at baseline and at 26 weeks. A main purpose of these follow-up evaluations until 26 weeks is to determine whether the initial PMR diagnosis is sustained.

Every effort will be made to choose PMR cases across the whole spectrum of PMR disease severity. Polymyalgia with clinical suspicion of GCA will be studied as part of the inflammatory control group since these patients require different steroid doses.

At each follow-up visit, clinical evaluation will also include response

Potential incidentPMR cases (n=120)

Potential controlsubjects (n=240)

Baseline 1 week 4 weeks 12 weeks 26 weeks

Measurements• Demographics• Vital signs• Weight, height• Family history• Clinical history• Labs• Steroid therapy• Physician&patient

-based measures• Ultrasound

PMR cases(n=126)

Non-PMR controls (n=234)

~15-17%

~10%

Measurements• Labs• Steroid therapy• Physician&patient

-based measures

Measurements• Labs• Steroid therapy• Physician&patient

-based measures

Measurements• Labs• Steroid therapy• Physician&patient

-based measures

Measurements• Labs• Steroid therapy• Physician&patient

-based measures • Ultrasound

= =

FIGURE 1. Study Plan

Page 9: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 9

to steroid therapy (see also measurements section) and opinion on emergence of alternative diagnoses. Patients not considered as having PMR at any time during the study would be evaluated and treated according to accepted clinical practice and they will be excluded from the pool of PMR cases and will continue to be evaluated as non-PMR controls. It is expected that approximately 15% of potential PMR subjects (and potentially up to 17%) will be reclassified as controls, and approximately 10% of controls are expected to qualify as PMR cases by the end of the 26 weeks of follow-up.

Selection of controls (total 240 subjects) is important for the development of classification criteria because it influences both the sensitivity and specificity. In this study, we propose to include control patients from a variety of “suspected” conditions, in order to evaluate the discriminating validity of the

PMR criteria in different patient populations. The control patients will have conditions likely to be representative the types of patients that need to be distinguished from PMR patients (Figure 2). This is essential in order to produce criteria that are useful, both for clinical studies and in clinical practice. We plan to include consecutive elderly (>50 years) patients with new-onset proximal musculoskeletal pain and with a suspected diagnosis of either inflammatory (120 control subjects) or non-inflammatory (120 control subjects) conditions. The controls will be included in the study on the basis of clinician diagnosis. Formal criteria for control conditions will not be developed as part of this study.

Inclusion-exclusion criteria . All patients who present with new onset (≤ 3 months) proximal bilateral shoulder/thigh pain will be eligible for inclusion into the study. Patients with active cancer or those diagnosed with selected conditions prior to the enrollment date will be excluded (see Table 2). A diagnosis other than PMR should be suspected in all controls. No guidelines will be provided for treatment of conditions in the control group.

Inflammatory conditions (should be previously undia gnosed or condition present for ≤3 months)

� New onset rheumatoid arthritis (60 subjects, 3 per center)

� New onset other sero-negative arthritides including spondyloarthritis, psoriatic arthritis etc. (30 subjects, 1-2 per center)

� New onset connective tissue diseases & vasculitides, e.g. ANCA positive vasculitis, GCA with polymyalgia, SLE, inflammatory myopathies (30 subjects, 1-2 per center)

Non-inflammatory conditions (should be previously u ndiagnosed or condition present for ≤3 months)

Potential incident PMR cases (n=120)

Potential control subjects (n=240)

New onset RA (n=60)

Sero ( - ) arthrides (n=30)

CTD, SLE, vasculides , etc. (n=30)

Shoulder conditions (n=60)

Fibromyalgia , OA, etc. (n=30)

Others* (n=30)

* Others include endocrinopathies (thyroid disorders) and neurological disorder (parkinsons disease), malignancies (myeloma , solid cancers esp. metastatic disease presenting as musculoskeletal symptoms, infections (ideally systemic) e.g. chest infections, UTIs, possibly bacterial endocarditis, septic arthritis

20 centers

FIGURE 2. Recruitment Plan

Page 10: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 10

� Shoulder conditions: bilateral rotator cuff syndrome and/or adhesive capsulitis, rotator cuff tear, OA (60 subjects, 3 per center)

� Chronic pain e.g. fibromyalgia, back and hip pain (30 subjects, 1-2 per center)

� Either one of the below conditions (30 subjects, 1-2 per center)

o Endocrinopathies, neurological or drug-related disorders, e.g. Thyroid disorders, Parkinson’s disease, statin-associated myopathy

o Malignancies e.g. myeloma, solid cancers esp. metastatic disease presenting as musculoskeletal ‘red flags’

o Infections (ideally systemic) e.g. chest infections, urinary tract infections, possibly bacterial endocarditis, septic arthritis

Recruitment: Patients will be recruited initially from 20 rheumatology referral clinics in eight countries: USA, UK,

Italy, Spain, Sweden, France, Austria, Germany; up to 22 centers in 10 countries may ultimately be used (including Hungary, Israel). Each center will be recruiting 6 PMR cases and 12 non-PMR controls. Recruitment is expected to last 12 months with an anticipated recruitment rate of two patients per center per month. Full written consent will be obtained from all patients. Participating general practices which have agreed to refer the suspected PMR cases and non-PMR control subjects will send letters of support agreeing to refer untreated patients to the study centers. Recruitment will be monitored closely in the first few months to determine whether additional practices need to be identified in order to maintain the target recruitment level.

Steroid treatment: All eligible PMR patients in the study will receive a standard course of steroid therapy with oral prednisolone taken according to the schedule shown in Table 3. Tapering should be performed according this schedule. All patients will receive an initial dose of 15 mg of prednisone equivalent per day. Subsequent treatment will be individualized on the basis of treatment response, comorbidities, and other relevant clinical considerations. In accordance with recognized treatment methods, deviations from the treatment schedule can be made at the discretion of the clinician to allow steroid doses to be tailored to individual patient requirements. All steroid treatment will be recorded.

Re-evaluation of PMR diagnosis and response: The development of classification and outcomes criteria is inherently exposed to the risk of circularity of reasoning, since the same

rheumatologists who judge whether the patient has the disease or not also develop the list of candidate criteria items. Outcome prediction at follow-up can obviate, at least to some extent, the problem of circularity. Therefore, in order to ensure the quality of the data, and create a standard for the diagnosis and outcome assessment of PMR, we will perform a diagnostic re-evaluation at the end of the study. Each investigator will be randomly assigned to a set of data from PMR cases and non-PMR controls enrolled in the study by different investigators. Each reviewer will be asked to re-evaluate the diagnosis on a 5-point scale. Note that only patients who were originally diagnosed and

Table 3. Steroid schedule

Time Oral Prednisone (dose, frequency)

Week 1-2 15mg daily

Week 3-5 12.5mg daily

Week 6-11 10mg daily

Week 12-15

10mg every other day alternating with 7.5 mg every other

day

Week 16-25 7.5mg daily

Week 26 and thereafter

According to treatment response

Table 2. Exclusion criteria

Active cancer Known Infection-septic arthritis/other Previous steroid usage for this episode of polymyalgia Known PMR Known RA Known other inflammatory arthropathies Known connective diseases Known vasculitides excluding GCA Known active thyroid disease Known Parkinson’s disease Known capsulitis shoulder * Known indicates condition present for >3 months)

Page 11: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 11

treated as PMR will have complete information. Furthermore, blinding is not feasible because steroid treated patients are the ones who originally received PMR diagnosis. Nonetheless, during the re-evaluation, physicians will have information on the course of the disease for these patients, not originally available at the time of initial diagnosis. Therefore, the gold standard for the diagnosis of PMR will be the “clinical diagnosis” on this 5-point scale based upon the patients´ entire course by at least 2 participating physicians with an expertise in PMR. This evaluation will ensure the homogeneity of decision to diagnose PMR and also constitute a reproducibility assessment of the newly developed criteria (i.e. to estimate the reliability of classification based on only the first visit).

D.2 Study Measurements

Several measurements, corresponding to 9 domains of disease activity (see table 4), required to accomplish the 3 specific aims, are described below. Our justification for inclusion of each of these measurements is described in this section. Briefly, each of these domains are deemed important based on either (a) inclusion in existing criteria (pain, stiffness, acute phase reactants, steroid response), (b) expert consensus and Delphi survey (clinical examination and ultrasound imaging), (c) evidence from the literature (functional status, QOL). Values and change in values of each of these measurements over time will be assessed as disease activity and remission criteria (Aim 1), response criteria (Aim 2) and classification criteria (aim 3). All measurements, except ultrasound, will be collected at each visit. Ultrasound evaluations will be performed at baseline and at 26 weeks.

Pain domain : Pain is the dominant feature of PMR and a mandatory criteria in previously proposed EULAR PMR response criteria67. Almost all patients report bilateral shoulder or pelvic girdle pain at initial presentation and pain resolves quickly (50% of patients within weeks) with initiation of steroid therapy28. In our study, we will assess pain using a horizontal 100 mm visual analogue scale in 4 separate locations (shoulder, pelvic, neck and overall, 0=no pain, 100=worst pain).

Morning Stiffness: This is another prominent domain in PMR and had been included in previous diagnostic and response criteria. We will assess the duration of morning stiffness (during the past 24 hours) at each study visit. Stiffness will be assessed by direct questioning of the patient, e.g. “How long did you feel stiff this morning?” with 0 indicating no stiffness.

Functional status & quality of life (QOL): QOL is increasingly recognized as a major contributor of the burden of chronic diseases, especially rheumatic diseases. QOL takes account of the effect of impairments and disability on the patient in addition to other influences including personality, social and physical environment, economic resources and culture. Several generic and disease-specific measures of QOL and functional disability have been validated and are being used in rheumatic diseases68-71. However, evidence is scanty in PMR, as described below. In this study, we propose to use 2 QOL instruments, as described below.

SF-36 (Medical Outcome Study Short Form 36 item health status survey questionnaire) is the most widely used generic health status measure to assess impairment and disability72. The SF-36 is a 36-item questionnaire which measures physical functioning, social functioning, role limitation due to physical health problems, role limitation due to emotional problems, mental health, vitality/energy, bodily pain and general health perceptions. It is the current gold standard measure of QOL and has been shown to be a reliable, valid and responsive measure of outcome in a number of diseases73-76 including rheumatoid arthritis69, 77. The majority of studies that utilized SF-36 demonstrate that patients with rheumatic diseases have significantly lower scores on all SF-36 dimensions than those without, especially for physical functioning and pain. Furthermore, SF-36 scores significantly correlate with disease activity and severity measures69. Another important advantage of SF-36 in PMR is that it can provide additional information about comorbid conditions or treatment side effects that cannot be easily

Page 12: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 12

captured by other measures such as pain, stiffness or clinical examination. This is particularly important in the elderly with several comorbid conditions. Thus, this generic instrument will be complementary to other disease specific measures in our study. We will evaluate SF-36 according to established scoring systems72, 78, 79. The SF-36 is scored to produce eight dimension scores and two physical and mental component summary scores.

We are only aware of 2 unpublished studies that have used SF-36 in PMR p atients . In an unpublished study by one of the co-investigators28, the QOL of PMR patients at presentation, as measured by mean SF-36 physical and mental component summary scores (PCS=31.5, MCS=38.9), was substantially lower than general population norms (PCS=44.7, 53.2 MCS) for 65-74 year olds. By the end of 12 months of follow-up, physical and mental scores had, on average, improved by 12.6 and 11.2 points, respectively. Higher levels of disease activity in all markers were associated with lower physical and mental scores. SF-36 scores correlated significantly with pain, stiffness and inflammatory acute phase reactants. In another unpublished survey conducted by the PI of this application (Maradit Kremers, unpublished data), 105 PMR patients filled the SF-36 questionnaire several years after the resolution of their PMR. Findings indicated that the PMR subjects had worse physical functioning but better mental functioning compared with their peers of the same age and sex. The age- and sex-adjusted physical composite score was 48.45 (SD: 10.69) and mental composite score was 52.85 (SD: 7.27). There was no correlation between the two scores. These findings suggest that physical functioning declines with advancing age but mental health remains stable. Age, sex, ESR at baseline, marital status and time since diagnosis (time from incidence date to survey date) were not associated with physical and mental scores. The lack of any trend with time since diagnosis points to the

Table 4. Domains and measurements

Domains Assessor Scale/measures

Pain Self-reported (VAS 100 mm) Shoulder girdle pain Pelvic girdle pain Neck pain Overall pain

Morning Stiffness Self-reported Duration of morning stiffness (minutes)

Functional status/disability & QOL Self-reported MHAQ SF-36

Global well-being measures Self-reported Physician

Patient global assessment (VAS 100 mm) Physician global assessment (VAS 100 mm)

Fatigue Self-reported VAS-Fatigue 100 mm

Clinical examination Physician Shoulder tenderness, pain on motion & limitation (dichotomous) Hip tenderness, pain on motion & limitation (dichotomous)

Steroid therapy Physician Steroid response Steroid dose Steroid dose change Steroid discontinuation (dichotomous)

Acute phase reactants Laboratory ESR CRP

Imaging Physician Shoulder ultrasound (dichotomous) Pelvic ultrasound (dichotomous)

VAS, visual analog scale; QOL, quality of life; mHAQ, modified Health Assessment Questionnaire; SF-36, Medical Outcomes Study Short Form; ESR, erythrocyte sedimentation rate; CRP, C-reactive protein

Page 13: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 13

conclusion that PMR, although devastating during the course of the disease, has no long-term impact on physical and mental functioning of patien ts shortly after full remission of symptoms.

MHAQ (Stanford Health Assessment Questionnaire (HAQ)) is the most common and widely used disease-specific instrument in rheumatology. HAQ focuses on two dimensions of health: physical disability and pain. In this study, we propose to use the modified HAQ (MHAQ) questionnaire which is a shortened version of the HAQ, with reduction of the 20 items (8 categories) in the original questionnaire to 8 items80. The MHAQ is an 8-item, disease-specific questionnaire which measures dressing/grooming, rising, eating, walking, hygiene, reach, grip and activities on 2- and 3-pt scales to yield an overall score. The MHAQ score is calculated as the mean of the scores for each activity. MHAQ was used in 2 separate PMR studies by one of the co-investigators28, 81. HAQ was responsive to change and correlated well with conventional indices of disease activity in PMR. However, fixed disabilities like osteoarthritis, shoulder capsulitis and systemic diseases affected its interpretation. The sections of the HAQ measuring disability related to inflammatory stiffness/proximal involvement showed greater responsiveness to change than other sections, and hence may have a greater role in evaluating disease activity in PMR. In another larger study, by the end of 12 months of follow-up of PMR patients, the mean change in HAQ scores in PMR patients was -0.83 (-0.95 to -0.72) and the percentage of patients with a score of less than 1 increased from 30% at baseline to 88% at month 1228. Higher levels of ESR, CRP and morning stiffness were all independently associated with lower HAQ scores. Therefore, these 2 studies justify the feasibility of using MHAQ in PMR patients and demonstrate the correlation of MHAQ with disease activity measures and sensitivity to change during the active period of the disease28, 81.

Global well-being measures: Global measures are designed to measure QOL in a comprehensive manner, usually in a single question. Indeed, physician and patient assessment of global status are included in the ACR core dataset for rheumatoid arthritis82. Both the patients and the physicians assessment of global status and overall disease activity will be evaluated on a 100 mm VAS scale. Although the patients’ perception of their health status is poorly assessed by their physicians83, we believe it is important to collect both the patient and physician assessments in this validation study, in order to better capture the clinical practice and treatment decisions of the physicians with respect to their assessment of disease activity at each visit.

Fatigue: In addition to disability, physical parameters such as fatigue are also measurable features of rheumatic diseases and increasingly recognized as a significant contributor to decreased QOL. Fatigue was also demonstrated to improve with treatment indicating its sensitivity to change over time84-87. We chose to assess fatigue as a potential criterion because pain, the most prominent symptom in PMR, has the strongest association with fatigue86. Disease-related fatigue can be evaluated by using a variety of instruments84. We will use a single item 100 mm VAS scale since it performs as well as or better than longer scales with respect to sensitivity to change, and is at least as well correlated with clinical variables as longer scales88.

Clinical examination : The only clinical examination measures to be assessed in our study (by physician) are tenderness, pain on motion and limitation (dichotomous) in shoulders and hip. According to expert consensus, there was >80% agreement on importance of these criteria but Delphi respondents rating was only 50%.

Steroid therapy: The mainstay of therapy in PMR is oral steroids24. Prednisone, 15 to 25 mg, usually suppresses inflammation dramatically. However, up to 60% of patients experience disease relapse during steroid tapering, and several studies indicate that steroid treatment can only rarely be discontinued before 2 years. Most clinicians consider a rapid resolution of symptoms with steroid therapy a diagnostic hallmark of PMR, though some studies report slower response rates in many patients. Lack of response may also indicate an incorrect diagnosis. Therefore, we will assess the

Page 14: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 14

validity of steroid therapy characteristics as potential criteria for classification, disease activity and response.

In our Delphi survey, 99% of respondents agreed that steroid response is an important criterion in PMR. There was variation in respondents’ opinion of what the initial dose should be, what constitutes a rapid response and time to response. In our study, steroid therapy will be examined in a number of ways as potential response and remission criteria. We will specifically consider important components of steroid therapy: rapid, complete and sustained response to low dose steroid therapy (i.e. 15 mg daily). Evaluation at 1 week will define whether the clinical response is rapid and complete. Evaluation at 4 weeks will define whether the laboratory response is complete. Even if there is clinical response at 1 week the laboratory parameters may take another 2-3 weeks to normalize. The global clinical response along with normalization of laboratory abnormalities of acute phase response will distinguish the non-specific steroid response from the ‘PMR’ response. Evaluation at 26 weeks will define whether the response is sustained. We will also examine daily dose (continuous), dose changes (increase, decrease, unchanged) and steroid discontinuation (e.g. time to discontinuation) as potential disease activity and response criteria (e.g. remission while on or off steroid therapy). Steroid therapy will be assessed at each visit and tapering details between visits will be collected through record review and questionnaires.

Acute phase reactants: Erythrocyte sedimentation rate (ESR) by the Westergren method and C-reactive protein (CRP) are part of current practice to evaluate diagnosis and response in PMR. Although they are non-specific, they are the most widely used laboratory tests in the absence of more specific biomarkers. These tests can be performed at all participating centers. The test results will be standardized based on the normal values provided by the centers.

Imaging-ultrasound: Ultrasound has been used for the investigation and management of patients with musculoskeletal diseases for at least 30 years. Technology improvements in the last decade greatly improved and more recently, its use has been directed towards the assessment of patients with inflammatory arthritis89, 90. Ultrasound has recently been used in PMR patients as well. It depicts characteristic pathologic findings of shoulders and hips that can aid in distinguishing PMR from other diseases that may mimic PMR91-97. Typical findings on ultrasound include subdeltoid bursitis and biceps tendon tenosynovitis at the shoulders92, 93, and less frequently synovitis of the glenohumoral joint. In the hips, ultrasound often reveals synovitis and trochanteric bursitis94, 95. Inflammatory shoulder lesions were observed even in patients with normal ESR values and it has been suggested that ultrasound may facilitate the proper diagnosis in patients with the typical proximal symptoms of PMR who also have normal ESR98.

In our study, we propose to employ bilateral ultrasound examinations of hip and shoulders at baseline visit and at 26 weeks of follow-up. Evaluations will be made according to EULAR guidelines99. In particular, we will assess inflammatory changes, such as subdeltoid bursitis, tenosynovitis of the biceps tendons, hip joint synovitis and trochanteric bursitis.

A rheumatologist or a radiologist who is experienced with musculoskeletal ultrasound of shoulders and hips will perform the ultrasound examination. For the shoulders linear probes are necessary that provide a frequency in the range between 6 and 10 MHz. For the hips linear or curved array probes are necessary that provide a frequency in the range between 5 and 8 MHz. Table 5 describes the core criteria to be evaluated. Table 6 outlines the scans that are to be performed.

Page 15: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 15

Table 5. Core ultrasound criteria to be evaluated

Joint Pathology Definition* Parameters to be evaluated

Shoulder Subdeltoid bursitis Anechoic or hypoechoic thickened tissue between rotator cuff and deltoid muscle that may be compressible and displaceable and that may exhibit color signals. Minimum sagittal diameter at any region in any position from internal to external rotation: > 2.0 mm

Present: yes / no

Maximum sagittal diameter

Shoulder Tenosynovitis at long head of biceps tendon

Anechoic or hypoechoic thickened tissue within tendon sheath that may exhibit color signals

Present: yes / no

Maximum diameter

Shoulder Effusion / synovitis Anechoic or hypoechoic thickened tissue that may be compressible or displaceable and that may exhibit color signals.

Axillary region: Sagittal diameter: >3.9 mm and / or joint capsule not parallel to the bone

Posterior region: Sagittal diameter: > 2.1 mm in any position from internal to external rotation. Synovitis is present if detected in axillary and / or posterior region

Present: yes / no

Maximum sagittal diameter at axillary recess and at posterior region

Hip Trochanteric bursitis Anechoic or hypoechoic thickened tissue lateral to the greater trochanter

Present: yes / no

Volume (longitudinal, transverse, and sagittal diameters)

Hip Synovitis / effusion Anechoic or hypoechoic thickened tissue that may be compressible or displaceable and that may exhibit color signals.

Present: yes / no

Sagittal diameter between bone and joint capsule at the middle of the concavity of femoral head / neck

* The text refers to the definitions of the OMERACT special interest group on musculoskeletal ultrasonography90 and to standard reference values for musculoskeletal US100

Table 6. Scans to be performed (according to EULAR guidelines 99)

Joint Scan Pathologies to evaluate Shoulder Biceps tendon transverse Biceps tendon tenosynovitis Biceps tendon longitudinal Biceps tendon tenosynovitis Shoulder anterior transverse Rotator cuff, bursitis, impingement, erosion Shoulder anterior longitudinal Rotator cuff, bursitis, impingement, erosion Shoulder lateral transverse Rotator cuff, bursitis, impingement, erosion Shoulder lateral longitudinal Rotator cuff, bursitis, impingement, erosion Shoulder posterior transverse Rotator cuff, bursitis, impingement, erosion,

synovitis Shoulder posterior longitudinal Rotator cuff, bursitis, impingement, erosion,

synovitis Acromioclavicular joint,

longitudinal to joint Osteoarthritis, synovitis

Axillary Synovitis, erosion, osteoarthritis, axillary arteritis Hip Anterior longitudinal Synovitis, iliopsoas bursitis, osteoarthritis Anterior transverse Synovitis, iliopsoas bursitis, osteoarthritis Greater trochanter longitudinal Trochanteric bursitis Greater trochanter transverse Trochanteric bursitis

Page 16: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 16

Additional measurements include demographics (age, sex, race, ethnicity), vital signs and symptoms, weight, height, family history of rheumatic/autoimmune diseases, comorbidities, comedications and several laboratory measurements (hemoglobin, complete blood count, lipids, serum protein electrophoresis, rheumatoid factor (RF) and anti-cyclic citrullinated peptide (anti-CCP) antibodies (for differential diagnosis).

D.3 Methods for Specific Aim 1. To develop a reliab le and valid composite disease activity score and remission criteria in PMR

Overview : Under aim 1(a), we will evaluate the validity, reliability and reproducibility of each of the core disease activity domains (Table 4) based on patient-reported (pain, stiffness, function, global assessment, fatigue), physician-reported (global assessment, tenderness, pain on motion and limitation of hip and shoulders, steroid therapy), laboratory (ESR, CRP) and ultrasound-based measures. We will consider all of these domains and measures for potential inclusion in the definition of disease activity. We will then derive (aim 1(b)) a composite disease activity score and remission criteria incorporating these core domains. These analyses will be restricted to the estimated 126 subjects who are confirmed to have PMR (see figure 1 in section D.1.). Aim 1(c) will involve assessment of discriminant validity of the core domains and the composite disease activity score in distinguishing PMR patients from other similar conditions mimicking PMR (see figure in section D.1.).

D.3.A Methods for Specific Aim 1(a)

This aim involves evaluation of the validity, redundancy and reproducibility of each of the domains and measures included in Table 4. Validity is the extent to which an instrument specifically measures the phenomenon of interest. The initial face and content validity of our proposed set of domains have already been tested through consensus meetings and Delphi surveys (see Preliminary Studies section). Furthermore, most of the domains are part of previously proposed diagnostic and response criteria for PMR (see Background section). Therefore, aim 1(a) will mainly focus on establishing criterion and construct validity for these domains and associated measures, consistent with several guidelines including the OMERACT filter55, 56, 101.

Criterion validity of each of the domains in our study is the correlation of that domain or a measure of the domain with another domain or measure of PMR. Concurrent criterion validity is the correlation of the measures assessed at the same point in time. Concurrent criterion validity is essentially a correlational measure of convergent construct validity. In our study, these correlation assessments will be performed for all measures performed during a study visit. We will use standard techniques applied in rheumatology and psychometric criteria development102, 103. Convergent construct validity will be assessed by the Spearman’s rank correlations. Values greater than 0.7 will be considered high correlation, 0.4-0.7 moderate and less than 0.4 low correlation. Construct validity of ultrasound measures will be similarly assessed by examining Spearman correlations with patient’s and physician’s global assessment of disease activity as the surrogate measures. Redundancy , or colinearity of measures will be investigated by the Pearson’s correlation coefficient values, where a coefficient >0.7 will be considered to represent evidence of colinearity or redundancy. Steroid treatment characteristics are an essential component of construct validity, as any definition of disease activity is related to the treatment decision-making. Under aim 1(a), we will examine the correlation of steroid dose and dose changes (increased, decreased, unchanged, discontinued) with other disease activity domains.

It is quite likely that the form of relationship or correlation between domains (measures) is different at different levels. For example, pain scores and ESR may be perfectly correlated at high levels but the correlation and the functional form of the relationship may change at low pain levels (i.e. patient has

Page 17: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 17

little pain but the ESR values remain high). We will use scatter plots to investigate the functional form of relationships at different time points and identify change points, if there are any. Furthermore, subgroup analyses will be performed based on baseline patient characteristics, i.e. patients with low ESR, as the correlations between measures may differ in these patients.

Internal consistency is a measure of the degree to which various measures hold together, or the degree to which they measure the same underlying construct. This will be determined by Cronbach’s alpha104 with cut-off values of >0.9 as indication of perfect consistency and <0.6 as poor consistency. Feasibility and acceptability of measures will be determined through an examination of ease of scoring, simplicity and item non-response rate and endorsement frequencies across response categories.

Reproducibility (reliability, consistency) is the extent to which a measurement yields the same result on repeated measurements by the same observer (intra-observer) or by different observers (inter-observer), as compared to the measurement differences between patients. Since we anticipate that the measurements over time are rarely exactly the same in PMR (especially early in the disease course), our analyses will focus on intra- and inter-observer reproducibility of physician-based measures (global assessment, clinical examinations and ultrasound). In a small number of stable patients, it may also be feasible to assess the test-retest reproducibility of patient-based measures by administering questionnaires with a 2-5 days interval. When developing a composite disease activity criteria (as in aim 1b), each measure in the criteria must demonstrate reliability, since an unreliable measure only adds noise to the scale. Therefore, the reliability of each of the individual measures needs to be assessed prior to development of a composite score.

Intra- and inter-observer reproducibility of clinical examinations will be assessed at 2-3 participating sites and will involve at least 2 rheumatologists and 2 generalists at each site. A total of 20 patients with PMR will be evaluated independently by the 4 physicians twice and 1-2 days apart. A similar method will be used to assess the reproducibility of ultrasound assessments. A total of 20 patients with PMR will be evaluated independently by 4 observers at 2 time points (baseline and at 6 months). Ultrasound reproducibility assessment will be performed at 2-3 participating centers. Effort will be made to ensure that the observers have different levels of experience. In both reproducibility assessments, observers will meet ahead and review the study protocol and less experienced observers will receive basic training. The observers will also document the time required for each assessment (feasibility). Statistical analysis of intra-observer and inter-observer reproducibilities will be evaluated using a two-way mixed effect model, leading to an overall components of variance decomposition (between patients, within a patient at different times, between observers, between items, etc). Individual aspects of the decomposition can also be summarized using intra-class correlation coefficients (ICC). Cronbach's alpha is related to the variance of the per-subject latent trait versus the total variance in this model.

D.3.B Methods for Specific Aim 1(b). Derivation of a composite disease activity score

A composite disease activity score will be derived using data from Aim 1(a). A composite disease activity score is more useful than an individual measure of disease activity because it can incorporate several aspects of disease manifestation into a single descriptive indicator. This composite score can provide a uniform outcome measure of disease activity with greater face validity, can avoid multiplicity and can enhance precision.

Since this is a longitudinal study with repeated measures, it is feasible to develop a continuous disease activity score . This continuous score can then be dissected to an ordered categorical score, if deemed necessary, depending on prediction of future outcomes, such as the prognostic value of disease activity scores to predict physician’s decision to initiate or change steroid dose or, likelihood of

Page 18: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 18

reversal of disease activity back to a higher level. Undoubtedly, PMR subjects typically experience periods of inactive disease followed by disease flares. Thus, any PMR disease activity score must allow patients to shift from one disease activity state to a higher disease activity state. Given our extensive analyses under Aim 1(a), disease activity scores will likely perform well in distinguishing patients with high, medium, low and no disease activity states. However, misclassification bias is always possible, especially for those patients on the boundary.

The ultimate test of validity is the ability of an instrument or score to predict phenomena of clinical interest105, 106. The distinction between true PMR cases and the other diseases chosen for study controls is an endpoint of critical importance. We will create candidate models for the composite score using multiple regression, with the 0-5 disease rating scores as the dependent variable. Variables will be grouped into 3 broad classes: imaging measurements, other information known at the initial visit, and follow-up information such as early response. We will assess how important each item is within each group, in terms of both statistical significance and decrease in overall classification error, as well as the overall utility that both imaging and later evaluations add to the initial core measurements. Note, the objective is not to simply derive a model based upon stepwise regression techniques but to consider various models based upon these different predictor sets and to evaluate these models in terms of their association with both physician and patient assessments, their robustness to changes in a single measure, content validity scores of the individual core measures as found in aim 1a and the content validity scores of the various composite measures as obtained from consensus (i.e. experts will be asked to rank the easiness and credibility of the various candidate composite disease activity scores). For a multiple linear regression model which already includes 5 covariates (e.g. one for each domain) with a squared multiple correlation R-square of 0.60, a sample size of 120 will have 90% power to detect at α = 0.050 an increase in R-square of 0.033 due to including 1 additional covariate.

An alternative approach, more in keeping with psychometric tradition, is to first normalize each variable to the same standard deviation, and then form a simple sum of the items. For questionnaire data whose responses are all concordant, e.g. a 5 point Likert, the scaling is unnecessary, but it is mandatory with the different item ranges found in this study. If all the items used have been shown to be reliable and valid, this will decrease the variance of the score, at the possible cost of some precision that could be gained by using "optimal" weights. We will also explore a hierarchical random effects model on the scaled data, where the individual coefficients βi are assumed to come from a common N(µ ,σ) distribution. This strikes a balance between unrestricted regression (µ=∞) and the simple summation (σ =0) model; the optimal value for the coefficient shrinkage will be based on Akiake's Information Criteria (AIC).

In additional analyses, we will assess the construct validity of the composite disease activity scores by examining the correlation between the composite measure and its components. The primary summary will be a guided factor analysis of the individual items, where the first factor is constrained to be the composite score. The spearman correlation matrix will provide augmental data.

One important issue in derivation of the composite disease activity score in PMR is the weighting of individual domains or measures, because the validity of each domain in defining disease activity in PMR is likely different depending on the time since initial diagnosis. For example, early in the disease course, patient-derived measures such as pain and QOL may be important and the main drivers of the disease activity. In contrast, later in the disease course, therapeutic changes typically target high acute phase reactants and therefore, physician-derived measures may be more important. Therefore, the relative weight of each of the measures in the composite disease activity score will differ depending on time since diagnosis. Technically these differences may be thought of as time by modifier interactions. The dependency of repeated measures form the multiple time points for individual patients will be accounted for in the analyses by either longitudinal or resampling methods.

Page 19: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 19

We anticipate that there will be some discordance between patient’s and physician’s asses sments of disease activity. Patients and physicians typically think differently about how to define disease activity. In judging the level of disease activity, patients may rate their complaints higher than, for example, abnormal laboratory results, whereas physicians will tend to give weight to the latter observations, irrespective of patient complaints. Therefore, models examining these outcomes will have different results, depending on the extent of discordance. Examination of the weights will identify in a sense the characteristics yielding the discordance, but because of colinearities these will be difficult to interpret. We will thus also consider regression techniques to identify the most significant predictors for any discordances.

The ideal validation of the composite score is, of course, data from a new study. We plan to assess the reproducibility of the composite disease activi ty score in 2 separate longitudinal studies 28,

30. One of these is a prospective observational study by one of the co-investigators and includes all (except fatigue) of the measures required for validation. The other dataset is retrospective observational data from the Mayo Clinic PMR cohort. Although this retrospective dataset does not include all measures, patients included in this cohort had been followed up for a period of up to 5 years following diagnosis and available data include visit level details for all patients, especially steroid therapy details abstracted at every visit during the 5 years of follow-up. In the current grant, we will get first estimates of predictive validity in two ways, a statistical approach using the main endpoint of aim 1b, and investigation of other related endpoints that were not directly used in the creation of the regression model. For the first, we will use bootstrap calibration105. This repeatedly sets aside a portion of the sample, rebuilds the model of aim 1b on the subset (with the same y and x), and assesses it on the left out portion. The resultant estimates can be used to shrink or recalibrate the coefficients of the regression model so as to provide an unbiased estimate of the prediction error. The second, more direct approach is to consider prediction of other outcome variants, e.g. how does the composite score, developed using only PMR yes/no as the target, work for predicting patient assessment of disease or physician assessment of severity.

D.3.C Methods for Specific Aim 1(c). Development of PMR classification criteria

Under this aim, we will develop classification criteria for PMR. It will involve assessment of discriminant validity of each of the domains and the composite score (developed in aim 1(b)) in distinguishing PMR patients from other similar conditions mimicking PMR. The study population for this aim is both the PMR cases and controls (see figure in section D.1.)

In statistical analyses, sets of classification criteria will be developed. One set will best distinguish PMR cases from the pooled set of controls (i.e. all controls combined) and other sets will best distinguish PMR cases from each control group separately. This analytic approach will provide insight as to what measures are important in a situation of general diagnosis for PMR as well as what measures are important for distinguishing between the PMR diagnosis versus another disease with similar presentation. This is especially relevant in this study since the diagnosis of PMR in different clinical settings (e.g. primary care v rheumatology clinic v other non-rheumatology clinics) may involve different sets of differential diagnoses. Three specific tools will be considered to develop the classification criteria: random forests107, logistic regression and a sum of the individual scores. Random forests employ recursive partitioning (rpart)108 to identify which measures are important. A random forests approach is an improvement over rpart alone in that it produces classifiers with higher accuracy. It also provides a measure of the importance of each measure in terms of being able to distinguish between cases and controls.

Our rationale for using random forests as the primary tool for developing the classification criteria is as follows: The proposed random forest classification technique is based upon rpart methodology. rpart is

Page 20: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 20

a classification procedure that can handle both continuous and categorical variables, simultaneously. It also objectively determines cut-off values for continuous variables, which are needed when performing classification using continuous variables. rpart also performs well when there is missing data because it has a “built-in” method for imputing the missing values. Finally, it also automatically allows for synergistic effects among the variables. All these characteristics make it an extremely useful and flexible tool for developing classification criteria.

Ideally, in developing classification criteria, one would have a training data set, upon which to develop the criteria, and an independent test data set from which to estimate the performance of the classification criteria (i.e. estimate sensitivity and specificity). Currently such an independent data set is not available (i.e. PMR cases and non-PMR controls). Naively using the data that was used to develop a classification criteria to also estimate the sensitivity and specificity of the performance of the classification criteria results in biased estimates of accuracy. Specifically, it results in estimates of sensitivity and specificity that are too high. The random forest technique allows us to obtain estimates of sensitivity and specificity that are unbiased or closer to the true sensitivity and specificity without having an independent test data set. Furthermore, random forests will also indicate the relative importance of each variable with respect to its contribution to the classification criteria. In other words, it provides a measure of the usefulness of the variable in terms of distinguishing PMR cases from subjects with other conditions.

The measures identified by random forests will be used in a logistic regression model. Logistic regression will provide another measure of the importance of each measure in the presence of the other measures in the model. It will also help refine the classifiers in that there may be measures that can be omitted without substantially reducing the classifier’s accuracy. Finally, the logistic regression model will produce a score for each subject based upon the measures in the model (note that the measures will not have equal weight). A subject’s logistic regression score can be used to classify her/him as a PMR case or not. A receiver operator characteristic (ROC) curve will be generated for the scores to determine a cut-off that provides satisfactory sensitivity and specificity.

Since our aim is to develop classification criteria for PMR rather than to test a specific hypothesis, a sample size calculation based on power is not appropriate. On the other hand, the metrics by which classification criteria are judged typically are sensitivity and specificity. Sample size determines how precise we are able to estimate these quantities. Hence, the sample size justification arises from the

level of desired precision for estimating sensitivity and specificity. Another factor that influences the level of precision for our estimates of sensitivity and specificity are the true values of sensitivity and specificity for our classification criteria. Table 8 indicates the level of precision for our estimates of sensitivity and specificity for different sample sizes.

We anticipate that the classification criteria will achieve a minimum sensitivity and specificity of 0.70 (70%). If this cannot be achieved, the criteria will not be very useful. The sample size for the sensitivity analysis will be the number of cases, which could range from N = 100 to 120. For a classification criterion that yielded a true sensitivity of 0.70, we would be able to estimate it to within ± 0.04 (confidence interval size = 0.08). For larger levels of sensitivity, the precision of the sensitivity estimate increases (i.e. the size of the confidence interval decreases). Similarly, the worst-case scenario for specificity would be a sample size of 240 and specificity of 0.70. Our estimate of specificity would be within ± 0.03 of the true specificity. As the true specificity and sample size

Table 8. Size of confidence interval* for sensitivi ty and specificity

Proportion (Sensitivity or Specificity)

Sample size 0.7 0.8 0.9

120 0.082 0.072 0.054

240 0.058 0.051 0.038

* two-sided 95% confidence intervals assumed

Page 21: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 21

increases, the precision of our estimate will improve (i.e. the size of the confidence interval for the estimate will decrease). The proposed sample sizes yield levels of precision such that our estimates will be within at least ±0.04 of the true sensitivity and ±0.03 of the true specificity values.

D.4 Specific Aim 2. To develop a preliminary defini tion of improvement for the evaluation of response to therapy in PMR

Overview : Under aim 2a, we will evaluate within individual sensitivity to change (responsiveness) and discriminant validity of core disease activity domains (as measured by patient-based, physician-based, laboratory and ultrasound-based measures). We will then derive a preliminary composite definition of improvement based on disease activity (Aim 1b) and change in core disease activity domains. We will use a combination of statistical criteria and consensus-formation techniques to derive a validated definition of improvement. These analyses will be restricted to the estimated 126 subjects who are confirmed of having PMR (see figure 1 in section D.1.).

D.4.A Methods for Specific Aim 2(a). Assessment of sensitivity to change and discriminant validity

Sensitivity to change (responsiveness) refers to the ability of each measure to detect clinically important changes between baseline and at different follow-up time points. Clinically important change is a change that either the patients or the treating physicians think is discernible and important. These evaluations will be performed with individual measures as well as the disease activity score derived in aim 1(b). Anchor-based methods will be used to examine sensitivity to change or responsiveness109,

110. Anchor-based methods compare changes in measure values on an anchor, which we will define as a patients’ global rating of change. A patient will be considered to have improved (or deteriorated) only when he/she meets the anchor-based criteria and distribution-based criteria for establishing clinically meaningful change.

Discriminant validity of each measure will be assessed by evaluating the capacity of the measure to distinguish patients who experienced improvement from those who did not. At each follow-up visit, patients will be asked to judge whether the disease had improved, remained stable or had worsened as compared to the previous visit. Treating physician’s decision to change steroid therapy (increase dose, keep same dose, reduce dose, discontinue) can also be considered as a surrogate measure of physician assessment of improvement. Patients who were judged as improving will be compared with those who were judged as not improving (i.e., disease remained stable or worsened) by t-tests or the Mann-Whitney U test, as appropriate. Moreover, the level of agreement between physicians and patients in the evaluation of response to therapy will be assessed with the kappa statistic111.

Convergent construct validity (of the changes over time) will be examined by assessing whether improvement (which is the construct in question) is related to other measures (change in pain scores, ESR) in a manner consistent with prior expectations. Pearson's correlation coefficients will be calculated for the absolute change in measure values over time (e.g. baseline to 6 months). Values >0.7 will be considered as evidence of colinearity or redundancy whereas, values in the mid range 0.4-0.7 will be considered as good convergent construct validity. The internal consistency of the various measures will be determined by Cronbach's alpha104.

Based on these analyses, the various domains of response to therapy will be finalized. Once this is complete, we will use a combination of consensus methods and statistical criteria to define improvement, as outlined in Aim 2(b).

Page 22: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 22

D.4.B Methods for Specific Aim 2(b). Derivation of a composite improvement criteria

This aim involves a combination of consensus-driven and statistical methods to derive the composite definition of improvement in PMR.

First, we will aim to derive consensus-based gold standard definitions of improvement using expert ratings of patient profiles. Each patient’s profile over time will be presented to the experts and they will be instructed to rate each profile as clinically improved or not improved. In order to achieve a gold-standard consensus definition, we will require 75%- 80% consensus among the experts. Since the physicians assessments are frequently in disagreement, we can also adopt probability of agreement approach, using performance characteristic curves based on computer simulations (as previously applied in SLE)112.

Second, we will derive various combinations of improvement criteria (based on consensus-derived definitions) and evaluate their ability to classify patients as improved or not improved. For each consensus-derived improvement criteria evaluated, we will calculate the chi-square test, the sensitivity, specificity and false-positive/negative rates against the gold standard of expert consensus. In these analyses, sensitivity will demonstrate the ability of an improvement definition to identify a patient as improved who had been classified as improved by the gold standard (i.e. consensus definition). Specificity refers to the ability of the definition to identify a patient as not improved who had been classified as not improved by the gold standard. We will also calculate kappa statistics to measure the strength of agreement between the definitions and consensus of the physicians. Only definitions with substantial agreement (e.g. kappa statistics >0.7) and high sensitivity and specificity values and low false-positive and false-negative rates will be evaluated further.

Next we will consider multiple composite models for improvement using multivariable logistic regression. The approach will be similar to that of Aim 1b deriving candidate weights for the core measures and then evaluating the models in terms of their association with improvement, content validity, easiness and credibility. Further, along with the results of Aim 1(a) we will attempt to identify a model with good performance both as a disease activity score and sensitive to improvement.

D.5 Specific Aim 3. To create an infrastructure for storage of biospecimens in order to facilitate future research on novel disease-specifi c biomarkers and their therapeutic potential

All PMR subjects will be asked to give 10 cc of whole blood for future studies at each of the 5 visits, and controls will be asked to give 10 cc of whole blood for these studies at baseline only. The samples will be stored and processed at the Mayo Clinic (Rochester, MN). Each center will collect one EDTA tube (whole blood for DNA extraction) and two "dry" tubes for serum. The EDTA tube will be frozen on site. The dry tubes will be centrifuged, and serum will be aliquoted on site. Aliquotes will be labeled and frozen at -80°C. Specimens will be transported to Mayo Clinic twice a year from each center. DNA extraction and storage of DNA aliquots and the serum aliquots (without thawing), will be managed at the Mayo Clinic.

D.6 Approach to missing data

We expect that the number of missing values will be small in our study, due to short duration of follow-up. Nevertheless, for each analysis, we will first examine the distribution of missing values, and will consider multiple imputation and propensity score methods to deal with those. Multiple imputation corrects the variance deflation aspect of simple imputation by using a distribution of fill-in values113. The propensity score methods for the Cox model were developed by Pugh et al., and further developed by Robins114-116. If the probability of missingness differs between subgroups, but is random within a group (as might occur if PMR subjects on steroids were more likely to receive a certain test

Page 23: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 23

than those who were not on steroids, but the probability of missingness was random within each group of subjects) then this is a very effective analytic strategy. We will compare the results from both the complete case analyses and the multiple imputation analyses and if a difference exists, we will report results from the multiple imputation analyses.

D.7 Strengths and Limitations

The strengths of our proposed study are several-fold. First, the acceptability of the study findings should be high owing to the multinational collaboration. Second, the infrastructure already exists as the participating centers are willing and able to recruit patients within a relatively short frame. Third, the study builds upon strong validation methods developed in other rheumatic diseases. Fourth, biospecimen storage will promote future studies of the proteomics and genomics of PMR in a well-characterized patient cohort.

This proposal does, however, have potential limitations . First, our ability to perform external validation of the newly developed criteria is limited as there are not many clinical trials performed in PMR patients. Second, given the uncertainties associated with the etiology of PMR, these criteria represent clinical remission and not biological remission. Third, the study subjects in this study will be recruited from rheumatology referral centers, rather than a community based patient population. Although this may raise concerns regarding representativeness, we believe this is unlikely because GPs within the coverage area of the participating rheumatology centers will be asked to refer their suspected patients during the duration of the study. Furthermore, there are wide practice variations in individual countries and therefore, it is challenging to ensure compliance with the study protocol in a GP based recruitment plan. Finally, the duration of follow-up (6 months) may be too short to identify sufficient number of patients in complete remission, especially while off steroid therapy. However, future clinical trials in PMR are unlikely to be longer than 6 months and therefore, the criteria developed should be satisfactory.

In conclusion, this will be the first comprehensive effort to develop outcome definitions for PMR. The study infrastructure and methodology are robust and achievable. The proposed approach will take advantage of a unique set of cir cumstances and expertise. Findings emanating from this study will lay the foundation f or international guidelines for evaluating outcomes in PMR.

D.8 Timelines

The 2-year study plan includes recruitment, follow-up and closeout. Project milestones are as follows:

Time (months) Activities

1-2 Staff training, IRB, preparation of study materials;

2-14 Patient recruitment/enrolment;

8-20 Patient follow-up, data entry;

21-22 Initial data analysis;

23-24 Data analysis and preparation of reports.

Page 24: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 24

E HUMAN SUBJECTS RESEARCH

E.1 Protection of Human Subjects

Human subjects involvement and characteristics: This study involves prospective recruitment of 120 subjects with polymyalgic symptoms and 240 control subjects, who are ≥ 50 years and are not pregnant or lactating. Based on our preliminary data, we anticipate that the majority of subjects will be white (>90%) with a mean age of 70 years. We will include in our study every person at the selected recruitment sites who qualifies for inclusion during the duration of the study regardless of race, ethnicity, or socioeconomic status (age ≥ 50 years is an inclusion criteria).

Source of research materials: The research material will involve a physical examination, a venous blood sample, an ultrasound examination and questionnaire data collected at each study visit.

Potential risks: This project does not involve any experimentation upon human subjects. The potential risks of this study are minor and unlikely to occur. Subjects will undergo a venipuncture for blood collection and an ultrasound examination. These procedures will expose subjects to minimal physical risks only. The potential physical risks associated with venipuncture include pain, bruising, hematoma formation, superficial phlebitis, and extremely rare instances of infection or cellulitis. There is no known risk of musculoskeletal ultrasound. There is a potential for incidental ultrasound findings which will require management discussion with the patients.

Recruitment and informed consent: Potential PMR subjects and controls will be identified among consecutive patients referred to secondary care rheumatology clinics with suspected polymyalgia at 20 rheumatology referral centers in eight countries: USA, UK, Italy, Spain, Sweden, France, Austria, Germany. Each center will be recruiting 6 PMR cases and 12 non-PMR controls. Recruitment is expected to last 12 months with an anticipated recruitment rate of two patients per center per month. Written informed consent will be obtained by the study coordinator or by one of the investigators and this will be documented as part of the study documentation. Use of stored blood samples for future analyses will be explained to subjects while informed consent is obtained.

Procedures to minimize risk: The Institutional Review Board of each participating center (some of which are foreign centers in Europe) will review all aspects of this study and the contact procedures will be approved through institutional channels prior to use. The confidentiality will be maintained by using a study patient number for each participant. Only the center staff will know the identity of the patient. Questionnaires will be sent to the data management center (Mayo Clinic) using an identification number. At the data management center, all information on study subjects will be stored in locked file cabinets. Electronic information will be identified only through the non-identifiable study number. Only persons with appropriate authorization will have access to computer files. All members of the staff at the data coordinating center (Mayo Clinic) are trained about the importance of confidentiality and procedures by which this can be maintained. All data will be managed by study number and analyzed anonymously. All reports will be of a summary nature and no individual will be identified.

Potential benefits of the proposed research to the subjects and others: If abnormalities are noted on the ultrasound, it will be recommended that further evaluation be undertaken by physician. No other significant direct benefits to the subjects involved are anticipated, but neither is there any appreciable health risk other than those outlined above. Benefits to future patients with PMR could be substantial through early detection and treatment of this condition.

Importance of the knowledge to be gained: This application will be the first large-scale international study that examines a comprehensive list of outcomes in PMR patients. The knowledge to be gained

Page 25: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 25

will clearly offset the minimal risks to subjects. The importance of the knowledge to be gained is extensively detailed throughout the application.

E.2 Inclusion of Women and Minorities – Inclusion o f Children

In our study, we will include every person greater than 50 years old and who fulfills the inclusion criteria, regardless of race, ethnicity, or socioeconomic status. Based on the epidemiology of PMR, we expect to recruit a slightly higher number of women (60-70%). Since one of the inclusion criteria of the study mandates >50 years of age, there are no major concerns regarding women of childbearing potential, or pregnant or lactating women. Similarly, children are not included.

E.3 Data and safety monitoring plan

Not applicable (observational study)

F LITERATURE CITED

1. Lawrence RC, Hochberg MC, al. e. Estimates of the prevalence of selected arthritic and musculoskeletal diseases in the United States. in press. 2006.

2. Doran MF, Crowson CS, O'Fallon WM, Hunder GG, Gabriel SE. Trends in the incidence of polymyalgia rheumatica over a 30 year period in Olmsted County, Minnesota, USA. Journal of Rheumatology. 2002;29:1694-1697.

3. Maradit Kremers H, Gabriel SE. Epidemiology of the Rheumatic Diseases. In: Harris ED, Ruddy S, Sledge CB, eds. Kelley's Textbook of Rheumatology. Vol 1. 7th ed. Philadelphia: W B Saunders Company; 2004.

4. Chuang TY, Hunder GG, Ilstrup DM, Kurland LT. Polymyalgia rheumatica. A 10-year epidemiologic and clinical study. Annals of Internal Medicine. 1982;97(5):672-680.

5. Healey LA. Long-term follow-up of polymyalgia rheumatica: evidence for synovitis. Semin Arthritis Rheum. 1984;13(4):322-328.

6. Bird HA, Esselinckx W, Dixon AS, Mowat AG, Wood PH. An evaluation of criteria for polymyalgia rheumatica. Ann Rheum Dis. 1979;38(5):434-439.

7. Jones JG, Hazleman BL. Prognosis and management of polymyalgia rheumatica. Annals of Rheumatic Diseases. 1981;40(1):1-5.

8. Gonzalez-Gay MA, Rodriguez-Valverde V, Blanco R, et al. Polymyalgia rheumatica without significantly increased erythrocyte sedimentation rate. Archives of Internal Medicine. 1997;157:317-320.

9. Cantini F, Salvarani C, Olivieri I, et al. Erythrocyte sedimentation rate and C-reactive protein in the evaluation of disease activity and severity in polymyalgia rheumatica: a prospective follow-up study. Seminars in Arthritis & Rheumatism. 2000;30(1):17-24.

10. Proven A, Gabriel SE, O'Fallon WM, Hunder GG. Polymyalgia rheumatica with low erythrocyte sedimentation rate at diagnosis. Journal of Rheumatology. 1999;26(6):1333-1337.

11. Dasgupta B, Kalke S. Polymyalgia Rheumatica. In: Isenberg D, Maddison P, Woo P, Glass D, Breedveld FC, eds. Oxford Textbook of Rheumatology. 3rd ed: Oxford University Press; 2004:977-983.

12. Gonzalez-Gay MA, Garcia-Porrua C, Vazquez-Caruncho M, Dababneh A, Hajeer A, Ollier WE. The spectrum of polymyalgia rheumatica in northwestern Spain: incidence and analysis of variables associated with relapse in a 10 year study. Journal of Rheumatology. 1999;26(6):1326-1332.

13. Kyle V, Hazleman BL. The clinical and laboratory course of polymyalgia rheumatica/giant cell arteritis after the first two months of treatment. Ann Rheum Dis. Dec 1993;52(12):847-850.

14. Soelberg-Sorensen P, Lorenzen I. Giant cell arteritis, temporal arteritis and polymyalgia rheumatica. Acta Medica Scandinavica. 1977;201:207-213.

15. Behn AR, Perera T, Myles AB. Polymyalgia rheumatica and corticosteroids: how much for how long? Annals of the Rheumatic Diseases. 1983;42(4):374-378.

16. Dasgupta B, Dolan AL, Panayi GS, Fernandes L. An initially double-blind controlled 96 week trial of depot methylprednisolone against oral prednisolone in the treatment of polymyalgia rheumatica. Br J Rheumatol. Feb 1998;37(2):189-195.

Page 26: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 26

17. Matteson EL, Salvarani C, Schirmer M, et al. Differences Among Rheumatologists and Generalists in Selection of Classification Criteria for Polymyalgia Rheumatica (PMR): Results of a Delphi survey[abstract]. submitted for the 2006 ACR meeting. 2006.

18. Bahlas S, Ramos-Remus C, Davis P. Utilisation and costs of investigations, and accuracy of diagnosis of polymyalgia rheumatica by family physicians. Clinical Rheumatology. 2000;19(4):278-280.

19. Chakravarty K, Elgabani SH, Scott DG, Merry P. A district audit on the management of polymyalgia rheumatica and giant cell arteritis.[comment]. British Journal of Rheumatology. 1994;33(2):152-156.

20. Frearson R, Cassidy T, Newton J. Polymyalgia rheumatica and temporal arteritis: evidence and guidelines for diagnosis and management in older people. Age Ageing. Jul 2003;32(4):370-374.

21. Maradit Kremers H, Reinalda MS, Crowson CS, Zinsmeister AR, Hunder GG, Gabriel SE. Use of physician services in a population-based cohort of patients with polymyalgia rheumatica over the course of their disease. Arthritis Care and Research. 2005;53(3):395-403.

22. Gamez-Nava JI, Gonzalez-Lopez L, Davis P, Suarez-Almazor ME. Referral and diagnosis of common rheumatic diseases by primary care physicians. British Journal of Rheumatology. 1998;37(11):1215-1219.

23. McKinlay JB, Lin T, Freund K, Moskowitz M. The unexpected influence of physician attributes on clinical decisions: results of an experiment. J Health Soc Behav. Mar 2002;43(1):92-106.

24. Salvarani C, Cantini F, Boiardi L, Hunder GG. Polymyalgia rheumatica and giant-cell arteritis. New England Journal of Medicine. 2002;347(4):261-271.

25. Gran JT. Current therapy of polymyalgia rheumatica. Scand J Rheumatol. 1999;28(5):269-272. 26. Kyle V, Hazleman BL. The clinical and laboratory course of polymyalgia rheumatica/giant cell arteritis

after the first two months of treatment. Annals of the Rheumatic Diseases. 1993;52(12):847-850. 27. Delecoeuillerie G, Joly P, Cohen de Lara A, Paolaggi JB. Polymyalgia rheumatica and temporal arteritis:

A retrospective analysis of prognostic features and different corticosteroid regimens (11 year survey of 210 patients). Annals of Rheumatic Diseases. 1988;47:733-739.

28. Dasgupta B, Hutchings A, Hollywood J, al e. Clinical outcomes, quality of life and diagnostic uncertainty in the first year in polymyalgia rheumatica: a prospective cohort study. Rheumatology. 2006;45(suppl 1):i170.

29. Cimmino MA, Moggiana G, Montecucco C, Caporali R, Accardo S. Long term treatment of polymyalgia rheumatica with deflazacort. Ann Rheum Dis. May 1994;53(5):331-333.

30. Maradit Kremers HM, Reinalda MS, Crowson CS, Zinsmeister AR, Hunder GG, Gabriel SE. Relapse in a population based cohort of patients with polymyalgia rheumatica. J Rheumatol. Jan 2005;32(1):65-73.

31. Gabriel SE, Sunku J, Salvarani C, O'Fallon WM, Hunder GG. Adverse outcomes of anti-inflammatory therapy among patients with polymyalgia rheumatica. Arthritis & Rheumatism. 1997;40(10):1873-1878.

32. Ferraccioli GF, Di Poi E, Damato R. Steroid sparing therapeutic approaches to polymyalgia rheumatica-giant cell arteritis. State of the art and perspectives. Clin Exp Rheumatol. Jul-Aug 2000;18(4 Suppl 20):S58-60.

33. Caporali R, Cimmino MA, Ferraccioli G, et al. Prednisone plus methotrexate for polymyalgia rheumatica: a randomized, double-blind, placebo-controlled trial. Ann Intern Med. Oct 5 2004;141(7):493-500.

34. Krall PL, Mazanec DJ, Wilke WS. Methotrexate for corticosteroid-resistant polymyalgia rheumatica and giant cell arteritis. Cleve Clin J Med. May 1989;56(3):253-257.

35. Ferraccioli G, Salaffi F, De Vita S, Casatta L, Bartoli E. Methotrexate in polymyalgia rheumatica: preliminary results of an open, randomized study. J Rheumatol. Apr 1996;23(4):624-628.

36. Wagener P. [Methotrexate therapy of polymyalgia rheumatica]. Z Rheumatol. Nov-Dec 1995;54(6):413-416.

37. van der Veen MJ, Dinant HJ, van Booma-Frankfort C, van Albada-Kuipers GA, Bijlsma JW. Can methotrexate be used as a steroid sparing agent in the treatment of polymyalgia rheumatica and giant cell arteritis? Ann Rheum Dis. Apr 1996;55(4):218-223.

38. Salvarani C, Manzini C, al. e. Infliximab in the Treatment of Polymyalgia Rheumatica: A Double Blind, Randomized, Placebo Controlled Study[abstract]. Arthritis & Rheumatism. 2005;52(9):Sxx.

39. Leeb BF, Bird HA. A disease activity score for polymyalgia rheumatica. Ann Rheum Dis. Oct 2004;63(10):1279-1283.

Page 27: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 27

40. Martinez-Taboada VM, Blanco R, Fito C, Pacheco MJ, Delgado-Rodriguez M, Rodriguez-Valverde V. Circulating CD8+ T cells in polymyalgia rheumatica and giant cell arteritis: a review. Semin Arthritis Rheum. Feb 2001;30(4):257-271.

41. Meliconi R, Pulsatelli L, Dolzani P, et al. Vascular endothelial growth factor production in polymyalgia rheumatica. Arthritis & Rheumatism. 2000;43(11):2472-2480.

42. Feinberg HL, Sherman JD, Schrepferman CG, Dietzen CJ, Feinberg GD. The use of methotrexate in polymyalgia rheumatica. J Rheumatol. Sep 1996;23(9):1550-1552.

43. Mertens JC, Willemsen G, Van Saase JL, Bolk JH, Dijkmans BA. Polymyalgia rheumatica and temporal arteritis: a retrospective study of 111 patients. Clin Rheumatol. Nov 1995;14(6):650-655.

44. Krogsgaard MR, Lund B, Johnsson B. A longterm prospective study of the equipotency between deflazacort and prednisolone in the treatment of patients with polymyalgia rheumatica. J Rheumatol. Sep 1995;22(9):1660-1662.

45. Wagner P. Methotrexattherapie der Polymyalgia rheumatica. Z Rheumatol. 1995;54:413-416. 46. Ayoub WT, Franklin CM, Torretti D. Polymyalgia rheumatica. Duration of therapy and long-term outcome.

American Journal of Medicine. 1985;79(3):309-315. 47. Cantini F, Salvarani C, Niccoli L, et al. Fat suppression magnetic resonance imaging in shoulders of

patients with polymyalgia rheumatica. J Rheumatol. Jan 2004;31(1):120-124. 48. Salvarani C, Silingardi M, Ghirarduzzi A, et al. Is duplex ultrasonography useful for the diagnosis of

giant-cell arteritis? Annals of Internal Medicine. 2002;137(4):232-238. 49. Fried BJ, Boers M, Baker PRA. A method for achieving consensus on rheumatoid arthritis outcome

measures: the OMERACT conference process. J Rheumatol. 1993;20(3):548-551. 50. Felson DT, Anderson JJ, Boers M, et al. The American College of Rheumatology preliminary core set of

disease activity measures for rheumatoid arthritis clinical trials. The Committee on Outcome Measures in Rheumatoid Arthritis Clinical Trials. Arthritis Rheum. Jun 1993;36(6):729-740.

51. Felson DT, Anderson JJ, Boers M, et al. American College of Rheumatology. Preliminary definition of improvement in rheumatoid arthritis. Arthritis & Rheumatism. 1995;38(6):727-735.

52. Giannini E, Ruperto N, Ravelli A, Lovell D, Felson D, Martini A. Preliminary definition of improvement in juvenile arthritis. Arthritis & Rheumatism. 1997;40:1202-1209.

53. van der Heijde D, Bellamy N, Calin A, Dougados M, Khan MA, van der Linden S. Preliminary core sets for endpoints in ankylosing spondylitis. Assessments in Ankylosing Spondylitis Working Group. J Rheumatol. Nov 1997;24(11):2225-2229.

54. Brooks P, Tugwell P, Boers M. OMERACT II: Conference on Outcome Measures in Rheumatoid Arthritis Clinical Trials. J Rheumatol. 1997;24:763-802; 979-1011;1206-1037.

55. Tugwell P, Bombardier C. A methodologic framework for developing and selecting endpoints in clinical trials. Journal of Rheumatology. 1982;9(5):758-762.

56. Boers M, Brooks P, Strand CV, Tugwell P. The OMERACT filter for outcome measures in rheumatology [editorial]. Journal of Rheumatology. 1998;25(2):198-199.

57. Bellamy N. Clinimetric concepts in outcome assessment: the OMERACT filter. J Rheumatol. Apr 1999;26(4):948-950.

58. Dasgupta B, Salvarani C, Schirmer M, et al. Developing Classification Criteria for Polymyalgia Rheumatica (PMR): comparison of views from an expert panel and wider survey. Annals of Rheumatic Diseases. 2007;under review.

59. Dasgupta B, Hutchings A, Matteson E. Polymyalgia rheumatica: the mess we are now in and what we need to do about it. Arthritis & Rheumatism. 2006;in press.

60. Brooks RC, McGee SR. Diagnostic dilemmas in polymyalgia rheumatica. Arch Intern Med. Jan 27 1997;157(2):162-168.

61. Gonzalez-Gay MA, Garcia-Porrua C, Salvarani C, Olivieri I, Hunder GG. The spectrum of conditions mimicking polymyalgia rheumatica in Northwestern Spain. Journal of Rheumatology. 2000;27(9):2179-2184.

62. Myklebust G, Gran JT. Prednisolone maintenance dose in relation to starting dose in the treatment of polymyalgia rheumatica and temporal arteritis. A prospective two-year study in 273 patients. Scandinavian Journal of Rheumatology. 2001;30(5):260-267.

Page 28: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 28

63. Myklebust G, Gran JT. A prospective study of 287 patients with polymyalgia rheumatica and temporal arteritis: clinical and laboratory manifestations at onset of disease and at the time of diagnosis. British Journal of Rheumatology. 1996;35(11):1161-1168.

64. Weyand CM, Fulbright JW, Evans JM, Hunder GG, Goronzy JJ. Corticosteroid requirements in polymyalgia rheumatica. Archives of Internal Medicine. 1999;159(6):577-584.

65. Kyle V, Hazleman BL. Treatment of polymyalgia rheumatica and giant cell arteritis. II. Relation between steroid dose and steroid associated side effects. Ann Rheum Dis. Aug 1989;48(8):662-666.

66. Dasgupta B, Salvarani C, Schirmer M, et al. Development of Classification Criteria for Polymyalgia Rheumatica (PMR): Results From a Consensus Conference and Extended Delphi Survey[abstract]. Arthritis & Rheumatism. 2006;54(9 suppl):S345.

67. Leeb BF, Bird HA, Nesher G, et al. EULAR response criteria for polymyalgia rheumatica: results of an initiative of the European Collaborating Polymyalgia Rheumatica Group (subcommittee of ESCISIT). Ann Rheum Dis. Dec 2003;62(12):1189-1194.

68. Pincus T, Sokka T. Quantitative measures and indices to assess rheumatoid arthritis in clinical trials and clinical care. Rheum Dis Clin North Am. Nov 2004;30(4):725-751, vi.

69. Ruta DA, Hurst NP, Kind P, Hunter M, Stubbings A. Measuring health status in British patients with rheumatoid arthritis: reliability, validity and responsiveness of the short form 36-item health survey (SF-36). British Journal of Rheumatology. 1998;37(4):425-436.

70. Fioravanti A, Cantarini L, Fabbroni M, Righeschi K, Marcolongo R. Methods used to assess clinical outcome and quality of life in osteoarthritis. Semin Arthritis Rheum. Jun 2005;34(6 Suppl 2):70-72.

71. Haywood KL, Garratt AM, Dziedzic K, Dawes PT. Generic measures of health-related quality of life in ankylosing spondylitis: reliability, validity and responsiveness. Rheumatology (Oxford). Dec 2002;41(12):1380-1387.

72. Ware JE, Jr., Sherbourne CD. The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Medical Care. 1992;30(6):473-483.

73. Ware JE, Snow KK, Kosinski M, Gandek B. SF-36 health survey: manual and interpretation guide. Boston, MA: Nimrod Press; 1993.

74. Ware JE, Jr., Snow KK, Kosinski M, Grandek B. SF-36 physical and mental component summary measures: A user's manual. Boston: The Health Institute, New England Medical Center. 1994.

75. Garratt AM, Ruta DA, Abdalla MI, Buckingham JK, Russell IT. The SF36 health survey questionnaire: an outcome measure suitable for routine use within the NHS?[see comment]. BMJ. 1993;306(6890):1440-1444.

76. Jenkinson C, Coulter A, Wright L. Short form 36 (SF36) health survey questionnaire: normative data for adults of working age.[see comment]. BMJ. 1993;306(6890):1437-1440.

77. Talamo J, Frater A, Gallivan S, Young A. Use of the short form 36 (SF36) for health status measurement in rheumatoid arthritis. British Journal of Rheumatology. 1997;36(4):463-469.

78. McHorney CA, Ware JE, Jr., Lu JF, Sherbourne CD. The MOS 36-item Short-Form Health Survey (SF-36): III. Tests of data quality, scaling assumptions, and reliability across diverse patient groups. Med Care. Jan 1994;32(1):40-66.

79. Wre JE, Kosinski M. SF-36 Physical and Mental Health Summary Scales: a manual for users of version 1, 2nd edition. Lincoln, RI: QualityMetric Incorporated; 2001.

80. Pincus T, Summey JA, Soraci SA, Jr., Wallston KA, Hummon NP. Assessment of patient satisfaction in activities of daily living using a modified Stanford health assessment questionnaire. Arthritis & Rheumatism. 1983;26(11):1346-1353.

81. Kalke S, Mukerjee D, Dasgupta B. A study of the health assessment questionnaire to evaluate functional status in polymyalgia rheumatica. Rheumatology (Oxford). Aug 2000;39(8):883-885.

82. Felson DT, Anderson JJ, Boers M, et al. American College of Rheumatology. Preliminary definition of improvement in rheumatoid arthritis. Arthritis & Rheumatism. 1995;38(6):727-735.

83. Calkins DR, Rubenstein LV, Cleary PD, et al. Failure of physicians to recognize functional disability in ambulatory patients. Ann Intern Med. Mar 15 1991;114(6):451-454.

84. Mayoux-Benhamou MA. Fatigue and rheumatoid arthritis. Ann Readapt Med Phys. Jul 2006;49(6):385-388.

85. Moreland LW, Genovese MC, Sato R, Singh A. Effect of etanercept on fatigue in patients with recent or established rheumatoid arthritis. Arthritis Rheum. Apr 15 2006;55(2):287-293.

Page 29: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 29

86. Pollard LC, Choy EH, Gonzalez J, Khoshaba B, Scott DL. Fatigue in rheumatoid arthritis reflects pain, not disease activity. Rheumatology (Oxford). Jul 2006;45(7):885-889.

87. Rupp I, Boshuizen HC, Jacobi CE, Dinant HJ, van den Bos GA. Impact of fatigue on health-related quality of life in rheumatoid arthritis. Arthritis Rheum. Aug 15 2004;51(4):578-585.

88. Wolfe F. Fatigue assessments in rheumatoid arthritis: comparative performance of visual analog scales and longer fatigue questionnaires in 7760 patients. J Rheumatol. Oct 2004;31(10):1896-1902.

89. Ostergaard M, Szkudlarek M. Imaging in rheumatoid arthritis--why MRI and ultrasonography can no longer be ignored. Scand J Rheumatol. 2003;32(2):63-73.

90. Wakefield RJ, Balint PV, Szkudlarek M, et al. Musculoskeletal ultrasound including definitions for ultrasonographic pathology. J Rheumatol. Dec 2005;32(12):2485-2487.

91. Koski JM. Ultrasonographic evidence of synovitis in axial joints in patients with polymyalgia rheumatica. Br J Rheumatol. Mar 1992;31(3):201-203.

92. Frediani B, Falsetti P, Storri L, et al. Evidence for synovitis in active polymyalgia rheumatica: sonographic study in a large series of patients. J Rheumatol. Jan 2002;29(1):123-130.

93. McGonagle D, Pease C, Marzo-Ortega H, O'Connor P, Gibbon W, Emery P. Comparison of extracapsular changes by magnetic resonance imaging in patients with rheumatoid arthritis and polymyalgia rheumatica. J Rheumatol. Aug 2001;28(8):1837-1841.

94. Salvarani C, Cantini F, Olivieri I, et al. Proximal bursitis in active polymyalgia rheumatica. Annals of Internal Medicine. 1997;127(1):27-31.

95. Cantini F, Niccoli L, Nannini C, et al. Inflammatory changes of hip synovial structures in polymyalgia rheumatica. Clin Exp Rheumatol. Jul-Aug 2005;23(4):462-468.

96. Lange U, Piegsa M, Teichmann J, Neeck G. Ultrasonography of the glenohumeral joints--a helpful instrument in differentiation in elderly onset rheumatoid arthritis and polymyalgia rheumatica. Rheumatol Int. 2000;19(5):185-189.

97. Cantini F, Salvarani C, Olivieri I, et al. Shoulder ultrasonography in the diagnosis of polymyalgia rheumatica: a case-control study. J Rheumatol. May 2001;28(5):1049-1055.

98. Cantini F, Salvarani C, Olivieri I, et al. Inflamed shoulder structures in polymyalgia rheumatica with normal erythrocyte sedimentation rate. Arthritis Rheum. May 2001;44(5):1155-1159.

99. Backhaus M, Burmester GR, Gerber T, et al. Guidelines for musculoskeletal ultrasound in rheumatology. Ann Rheum Dis. Jul 2001;60(7):641-649.

100. Schmidt WA, Schmidt H, Schicke B, Gromnica-Ihle E. Standard reference values for musculoskeletal ultrasonography. Ann Rheum Dis. Aug 2004;63(8):988-994.

101. Singh JA, Solomon DH, Dougados M, et al. Development of classification and response criteria for rheumatic diseases. Arthritis Rheum. Jun 15 2006;55(3):348-352.

102. Esdaile JM, Suissa S, Shiroky JB, Lamping DL, Group THS. A randomized trial of hydroxychloroquine in early rheumatoid arthritis: The HERA study. American Journal of Medicine. 1995;98:156-168.

103. Streiner DL, Norman GR. Heatlh measurement scales: A practical guide to their development and use (2nd ed.). Oxford: Oxford Unviversity Press; 1995.

104. Cronbach LJ. Coefficient alfa and the internal structure of tests. Psychometrika. 1951;16:297-334. 105. Harrell FE, Jr. Regression Modeling Strategies: Springer-Verlag; 2001. 106. Steiner DL, Norman GR. Health Measurement Scales, second edition: Oxford University Press; 1995. 107. Breiman L. Random Forests. Machine Learning. 2001;45(1):5-32. 108. Therneau TM, Atkinson EJ. An Introduction to Recursive Partitioning using the RPART Routines: Mayo

Clinic, Section of Biostatistics Technical Report Series, No. 61; November 1997. 109. Crosby RD, Kolotkin RL, Williams GR. An integrated method to determine meaningful changes in health-

related quality of life. J Clin Epidemiol. Nov 2004;57(11):1153-1160. 110. Crosby RD, Kolotkin RL, Williams GR. Defining clinically meaningful change in health-related quality of

life. J Clin Epidemiol. May 2003;56(5):395-407. 111. Cohen J. A coefficient of agreement for nominal scales. Educational and Psychological Measurement.

1960;20:37-46. 112. The American College of Rheumatology response criteria for systemic lupus erythematosus clinical

trials: measures of overall disease activity. Arthritis Rheum. Nov 2004;50(11):3418-3426. 113. Rubin D, Schenker N. Multiple imputation in health-care databases: An overview and some applications.

Statistics in Medicine. 1991;10:585-598.

Page 30: A SPECIFIC AIMS - EULAR - PMR...A SPECIFIC AIMS Polymyalgia rheumatica (PMR) is the most common inflammatory rheumatic disease of the elderly and represents the most common indication

PMR Classification Study Protocol

Version 1 15/05/2007 Page 30

114. Pugh M, Robins J, Lipsitz S, Harrington D. Inference in the Cox proprtional hazards model with missing covariates. Boston: Harvard School of Public Health, Department of Biostatistics; 1992. Technical Report #758Z.

115. Robins J. An analytic method for randomized trials with informative censoring: Part I. Lifetime Data Analysis. 1995;1:241-254.

116. Robins J. An analytic method for randomized trials with informative censoring: Part II. Lifetime Data Analysis. 1995;1:417-434.