amostragem 4

8/14/2019 amostragem 4

1/9

R E S U M O

Os estudos clnicos, para serem vlidos,devem ser metodologicamente rigorosos. O

conceito de validade interna de um estudo

de crucial importncia: um estudo vlidose os respectivos resultados representam

uma estimativa no enviesada do resultadoverdadeiro. Neste caso, a validade define-se

como interna porque se refere ao grupo dedoentes que esto a ser estudados e no

necessariamente a outros (neste ltimo casoseria a validade externa ou aplicabilidade).

A validade interna da investigao clnica determinada atravs do rigor do desenho do

estudo, da recolha dos dados e dasrespectivas anlises, podendo ser posta em

causa por vises (erros sistemticos) ou peloacaso (variao aleatria dos fenmenossob estudo).

Independentemente do tipo de estudo(analtico, descritivo), as caractersticas da

respectiva amostra de doentes fundamentalpara a validade dos resultados. Os mtodos

de amostragem so fundamentais para arepresentatividade dos doentes do estudo em

relao populao para a qual se desejaextrapolar os resultados conseguidos. Dentre

as caractersticas fundamentais de uma

amostra rigorosa encontra-se a sua correctadimenso. Mesmo o estudo mais bemexecutado pode no conseguir responder

capazmente questo colocada, se adimenso da respectiva amostra for muito

pequena. Por outro lado, estudos comamostras de grandes dimenses so mais

difceis de concluir e muito maisdispendiosos. O objectivo do planeamento da

dimenso correcta da amostra de um estudo

CARDIOLOGIA BASEADA NA EVIDNCIA

Clculo da Dimenso da Amostraem Estudos Clnicos:

Princpios Metodolgicos Bsicos [119]

ANTNIO VAZ CARNEIROCentro de Estudos de Medicina Baseada na Evidncia

Faculdade de Medicina de Lisboa, Lisboa, Portugal

15

A B S T R A C T

Estimating Sample Size in ClinicalStudies: Basic MethodologicalPrinciples

In order to be valid, clinical studies must bemethodologically rigorous. The internalvalidity of a study is of crucial importance: astudy is valid if its results are an unbiasedestimation of the true result. In this case, thevalidity is internal because it refers to thegroup of patients under study and notnecessarily different ones (external validityor applicability). Internal validity in clinicalresearch is achieved through rigorous design,data collection and appropriate analysis, andis threatened by bias (systematic errors) or

chance (random variation of the phenomenaunder study).Regardless of the type of study (analytic,descriptive, etc.), the characteristics of itssample are fundamental for the validity of theresults. The sampling methods are crucial ifthe study patients are to be representative ofthe population to which one desires toextrapolate the results. One of the mostfundamental characteristics of a sample is itssize. Even the best executed study may failto answer the research question if the sample

size is too small. On the other hand, a studywith too large a sample is harder to conductand more costly. The goal of planning thesample size is to estimate the appropriatenumber of research subjects for the study.In this paper we will present and discuss themethodological principles underlyingcalculation of sample size: outcomes, type Iand II error, alpha and beta, study power andvariability.

Recebido para publicao: Novembro de 2003 Aceite para publicao: Dezembro de 2003

Received for publi cation: November 2003 Accepted for publication: December 2003

Rev Port Cardiol 2003;22 (12): 1513-1521


2/9

INTRODUO

Os estudos clnicos, para serem vlidos, de-vem ser metodologicamente rigorosos (1, 2).O conceito de validade interna de um estudo de crucial importncia e varia ligeiramenteconforme o tipo de questo clnica que se querresponder: no caso de estudos diagnsticos outeraputicos, por exemplo, eles so vlidos seos respectivos resultados representam uma es-timativa no enviesada do resultado verda-deiro.

Neste caso, a validade define-se como in-terna porque se refere ao grupo de doentes amostra que esto a ser estudados e no ne-cessariamente a outros (neste ltimo caso seria

a validade externa ou aplicabilidade)(3)

. A vali-dade interna da investigao clnica determi-nada atravs do rigor do desenho do estudo, darecolha dos dados e das respectivas anlises,podendo ser posta em causa por vises (errossistemticos) ou pelo acaso (variao aleatriados fenmenos sob estudo) (4).

Independentemente do tipo de estudo (ana-ltico, descritivo), as caractersticas da respec-tiva amostra de doentes fundamental para avalidade dos resultados. Os mtodos de amos-tragem so fundamentais para a representativi-

dade dos doentes do estudo em relao popu-lao para a qual se deseja extrapolar osresultados conseguidos. Dentre as caractersti-cas fundamentais de uma amostra rigorosa en-contra-se a sua correcta dimenso: mesmo oestudo mais bem executado pode no conse-guir responder capazmente questo colocada,se a dimenso da respectiva amostra for muitopequena. Por outro lado, estudos com amostrasde grandes dimenses so mais difceis deconcluir e muito mais dispendiosos. O objec-1514

INTRODUCTION

In order to be valid, clinical studies must bemethodologically rigorous (1, 2). The internalvalidity of a study is of crucial importance, andvaries slightly depending on the type of clini-cal question being posed. Diagnostic or thera-peutic trials, for example, are valid if their re-sults are an unbiased estimation of the trueresult.

In this case, the validity is internal becauseit refers to the group of patients under study the sample and not necessarily different ones(which would be external validity or applicabi-lity) (3). Internal validity in clinical research isachieved through rigorous design, data col-

lection and appropriate analysis, and is threa-tened by bias (systematic errors) or chance(random variation of the phenomena understudy) (4).

Regardless of the type of study (analytic,descriptive, etc.), the characteristics of itssample are fundamental for the validity of theresults. The sampling methods are crucial ifthe study patients are to be representative ofthe population to which one desires to extrapo-late the results. One of the most fundamentalcharacteristics of a sample is its size: even thebest-executed study may fail to answer the re-search question if the sample size is too small.On the other hand, a study with too large asample is harder to conduct and more costly.The goal of planning the sample size is to esti-mate the appropriate number of research sub-jects for the study (5).

In this paper we will present and discussthe methodological principles underlying esti-mation of sample size. The discussion will not

Palavras-ChaveHipteses de investigao; Dimenso do efeito;

Poder do estudo; Nvel de significncia; Erros tipo I e II;Alfa e beta; Dimenso da amostra; Validade interna;

Cardiologia baseada na evidncia

consiste na estimativa do nmero apropriadode sujeitos para o estudo especfico.

Neste artigo iremos apresentar e discutir osprincpios de base metodolgica que servem

para o clculo da dimenso da amostra deum estudo: resultados (outcomes), erros tipo I

e II, alfa e beta, poder do estudo e

variabilidade.

Key wordsResearch hypothesis; Effect size; Study power;Significance level; Type I and II error; Alpha and beta;Sample size; Internal validity; Evidence-based cardiology


3/9

tivo do planeamento da dimenso correcta daamostra de um estudo consiste na estimativado nmero apropriado de sujeitos para o es-tudo especfico (5).

Neste artigo iremos apresentar e discutir osprincpios de base metodolgica que servempara o clculo da dimenso da amostra de umestudo, mas no discutiremos nem as equaes

especficas, nem as instrues matemticasutilizadas para clculo do nmero final dedoentes de uma amostra. Os leitores interessa-dos podem consultar textos clssicos para esseefeito (6-12).

CONSIDERAES GERAIS

A definio evolutiva da questo cientficainicial que se pretende ver respondida por umdeterminado estudo constitui o que se designacomo hiptese de investigao, que uma

verso final mais elaborada daquela e que in-clui os elementos do estudo (desenho geral,amostragem, marcadores de doena, anlisesestatsticas, etc.). A colocao de uma hiptese necessria na maior parte dos estudos obser-vacionais e em todos os experimentais, quecomparam, de algum modo, dois ou mais gru-pos de doentes (13, 14).

A hiptese de investigao muito impor-tante num estudo, j que nela que assenta aseleco dos testes de significncia estatstica (6).Uma hiptese pode ser simples, se contm

apenas um factor preditivo e uma varivel deresultado (outcome variable): por ex., o taba-gismo associa-se a uma maior incidncia dedoena crebrovascular. Uma hiptese com-plexa se contm mais do que um factor predi-tivo (o tabagismo associado a hipertenso asso-cia-se a uma maior incidncia de doenacrebrovascular), ou do que uma varivel de re-sultado: por ex., o tabagismo associa-se a umamaior incidncia de doena crebrovascular ede cancro do pulmo. Uma hiptese espec-fica se no deixa dvidas ou ambiguidadeacerca dos sujeitos, variveis e testes estatsti-cos a utilizar.

Em termos de testar significncias estatsti-cas, a hiptese de investigao deve ser ex-pressa em termos de diferenas esperadas en-tre os grupos de estudo. Classicamente, ahiptese nula afirma que no existe associa-o entre o factor preditivo e a varivel na po-pulao ou que, por exemplo, no existe dife-rena entre os resultados de duas intervenesteraputicas. A hiptese nula constitui a base 15

cover specific equations or the mathematicalmethods used to estimate the final number ofpatients in a sample. The interested reader canconsult the standard texts in this area (6-12).

GENERAL CONSIDERATIONS

The process of defining the initial clinical

question for which an answer is sought in aparticular trial leads to what is called the re-search hypothesis. This is a more detailed finalversion of the initial question, and covers thestudy elements (overall design, sampling, end-points, statistical analysis, etc.). It is necessaryto put forward a hypothesis in most observa-tional and all experimental studies that in oneway or another compare two or more groups ofpatients (13, 14).

The research hypothesis is a crucial part ofa study, since it is the basis for selecting tests

of statistical significance (6). A hypothesis canbe simple, if it contains only one predictivefactor and one outcome variable (such asSmoking is associated with a higher incidenceof cerebrovascular disease), or complex, if ithas more than one predictive factor (Smokingtogether with hypertension is associated with ahigher incidence of cerebrovascular disease)or more than one outcome variable (Smokingis associated with a higher incidence of cere-brovascular disease and lung cancer). Ahypothesis is termed specific if there is no un-

certainty or ambiguity concerning its subjects,variables and statistical tests.

With regard to testing statistical signifi-cance, the research hypothesis must be expres-sed in terms of the expected differences bet-ween the study groups. Classically, the nullhypothesis states that there is no associationbetween the predictive factor and the variablein the population; for example, that there is nodifference between the results of two therapeu-tic interventions. The null hypothesis is theformal basis for testing statistical significance.By contrast, the alternative hypothesis is thatthere is an association between the predictivefactor and the outcome variable. It is importantto remember that the alternative hypothesiscannot be tested directly; it is accepted if thetest of statistical significance rejects the nullhypothesis (15).

Finally, a one-sided hypothesis specifiesthe direction of the association between thepredictive factor and the outcome variable,


4/9

formal para testar a significncia estatstica.Pelo contrrio, a hiptese alternativa afirmaa existncia da associao entre o factor predi-tivo e a varivel do resultado. importantelembrar que a hiptese alternativa no podeser testada directamente; ela aceite se o testeestatstico de significncia rejeita a hiptesesnula (15).

Finalmente, uma hiptese unilateral (one-sided) especifica a direco da associao en-tre o factor preditivo e a varivel do resultado,por ex., a hipertenso mais frequente emdoentes com doena crebrovascular. Uma hip-tese bilateral (two-sided) afirma apenas queexiste uma relao entre o factor preditivo e avarivel do resultado, por ex., em doentes comdoena crebrovascular, a tenso arterial

muito varivel. mais fcil testar uma hip-tese alternativa com um teste estatstico unila-teral (one-sided) que com um bilateral (two-si-ded) o erro tipo I metade (ver adiante).

PRINCPIOS ESTATSTICOS

A importncia da amostra em estudos clni-cos seja de que tipo for j foi sublinhadaanteriormente. Ela constitui uma das bases dequalquer estudo que se quer vlido.

Para que um investigador possa determinaruma relao de causalidade entre um factor(por exemplo, uma interveno teraputica baixa do colesterol) e um resultado (diminuioda taxa de doena coronria), ele dever co-mear por assumir a hiptese nula de ne-nhuma associao. Depois, baseado nos dadosfinais coligidos na amostra do seu estudo, se-leccionar os testes estatsticos ideais que lhepermitam determinar a existncia de provassuficientes para rejeitar a hiptese nula. Aofaz-lo, ele implicitamente decide a favor dahiptese alternativa (de que h uma associa-o entre a baixa do colesterol e a diminui-o da taxa de doena coronria). O standardpara estes testes o chamado nvel de signifi-cncia (16).

As bases sobre as quais assenta o clculoda dimenso da amostra em estudos analticosou experimentais so vrias, todas necessriaspara um correcto resultado final, e incluem: oserros tipo I e II e as probabilidades de os co-meter (designadas como alfa e beta), a dimen-so do efeito, a variabilidade dos resultados eo poder do estudo. Na restante parte deste ar-tigo iremos definir cada um destes factores.1516

such as hypertension is more common in pa-tients with cerebrovascular disease. A two-sided hypothesis states only that there is a rela-tion between the predictive factor and the out-come variable, such as in patients with cere-brovascular disease, blood pressure is veryvariable. It is easier to test an alternativehypothesis with a one-sided statistical test than

with a two-sided one, as this halves the chan-ces of a type I error (see below).

STATISTICAL PRINCIPLES

The importance of the sample size in anytype of clinical trial has already been emphasi-zed. It is one of the bases of the validity of anystudy.

An investigator who wishes to show a cau-sal relation between a factor (such as a thera-peutic intervention to lower cholesterol) and anoutcome (a reduction in the rate of coronaryartery disease) should begin by assuming thenull hypothesis of no association. Then, basedon the final data collected in the study sample,the most suitable statistical tests should be se-lected that will determine whether there is suf-ficient evidence to reject the null hypothesis.Such rejection implicitly decides in favor ofthe alternative hypothesis, that there is an as-sociation between lowering cholesterol and areduction in the rate of coronary artery dis-ease. The standard for these tests is the levelof significance (16).

There are several aspects relevant to esti-mating the correct sample size in analyticor experimental trials, all of which must betaken into consideration. They include type Iand II error and the probability of committingthem (termed alpha and beta), effect size, va-riability in outcomes, and study power. Therest of this article will discuss each of thesefactors.

Type I and II error, alpha ()and beta ()

As already stated, at the end of the trial theinvestigator will use statistical tests to attemptto reject the null hypothesis in favor of the al-ternative hypothesis. Assuming that there wasno bias that altered the results, and dependingon whether the null hypothesis was true orfalse in the general population, there are fourpossibilities (Table I):


5/9

Erros tipo I e II, alfa () e beta ()

Como j foi dito, no final do estudo o inves-tigador vai lanar mo de testes estatsticos natentativa de rejeitar a hiptese nula e favorecera alternativa. Partindo do princpio que noexistiram vises que alterem sistematicamenteos resultados e dependendo da hiptese nula

ser verdadeira ou falsa na populao em geral,so possveis quatro hipteses (Quadro I).

Quadro I

Verdade na populao vs. resultadosna amostra do estudo

Verdade na populao

Resultados Existe associao No existe associaona amostra entre o factor entre o factordo estudo e o resultado e o resultado

Rejeita a hiptese nula Correcto Erro tipo INo rejeita a hiptese

nula Erro tipo II Correcto

O erro tipo I (falso-positivo) ocorre se uminvestigador rejeita a hiptese nula,quando ela verdadeira na populao;

O erro tipo II (falso-negativo) ocorre seum investigador no rejeita a hiptesenula, quando ela falsa na populao.

Os erros tipo I e II no podem ser evitadospor completo, mas pode diminuir-se a sua pro-babilidade aumentando a dimenso da amostrado estudo, por exemplo (porque se aproxima a

dimenso da amostra da dimenso da popula-o da qual aquela emanou, reduzindo as dife-renas eventualmente existentes entre elas) (6).

A probabilidade (previamente definida) deum investigador cometer um erro tipo I (rejei-tar a hiptese nula quando ela verdadeira)designa-se por alfa () ou, como tambm co-nhecido, por nvel de significncia estats-tica. Por exemplo, se num estudo relacionandoos efeitos do exerccio com os nveis de HDL oalfa de 0,05, ento o investigador define 5 %como a chance mxima de incorrectamente re-

jeitar a hiptese nula (inferindo que o exerc-cio e os nveis de HDL esto associados na po-pulao, quando de facto no esto). O alfatem relao com o valor de P, determinado pe-los testes estatsticos, j que a hiptese nula rejeitada se o valor deP menor que o alfa (17, 18).

Por outro lado, a probabilidade de um in-vestigador cometer um erro tipo II (no rejeitara hiptese nula quando ela de facto falsa)designa-se por beta (). Idealmente, o alfa e obeta deveriam ser iguais a zero, mas isto im- 15

Table I

Truth in the population vs. resultsin the study sample

Truth in the population

Results There is an association There is no associationin the study between the factor between the factorsample and the outcome and the outcome

Reject the null hypothesis Correct Type I error Do not reject the null

hypothesis Type II error Correct

A type I error (false positive) occurs if theinvestigator rejects the null hypothesiswhen it is in fact true in the population;

A type II error (false negative) occurs ifthe investigator does not reject the nullhypothesis when it is in fact false in thepopulation.

Type I and II errors cannot be avoidedcompletely, but they can be made less likelyby such strategies as increasing the studyssample size, as this will bring the sample sizenearer to that of the population from which thesample was taken and thus reduce any diffe-rences between them (6).

The predefined likelihood of an investigatorcommitting a type I error (rejecting the nullhypothesis when it is true) is termed alpha (),and is also known as the level of statistical sig-nificance. For instance, in a study relating theeffects of exercise to HDL levels, if alpha is0.05, then the investigator is setting 5 % as the

maximum chance of wrongly rejecting the nullhypothesis (i.e. inferring that exercise andHDL levels are associated in the population,when in fact they are not). Alpha is related tothe p value, as determined by statistical tests,since the null hypothesis is rejected if p is lessthan alpha (17, 18).

Conversely, the probability of an investiga-tor committing a type II error (failing to rejectthe null hypothesis when it is in fact false) istermed beta (). Ideally, alpha and beta shouldboth be equal to zero, but in practice this is

impossible. Errors can be made less likely byincreasing sample size, using continuous varia-bles or more precise or paired measures, or byusing different-sized groups or more commonoutcomes (7, 19).

If it is particularly important to avoid type Ierrors, such as when testing a high-risk treat-ment, alpha should be set lower. On the otherhand, if it is more important to avoid type IIerrors (false positives), it is beta that should beset to a lower level.


6/9

possvel. Podemos diminuir a possibilidade dasua ocorrncia aumentando a dimenso daamostra, utilizando variveis contnuas, medi-es mais precisas ou emparelhadas, utilizandogrupos de dimenses desiguais ou utilizandoresultados (outcomes) mais comuns (7, 19).

Se para o investigador particularmenteimportante evitar um erro tipo I (por exemplo

quando quer testar a eficcia de uma interven-o teraputica com riscos elevados), ento eledeve colocar o alfa a nveis mais baixos. Pelocontrrio, se quiser evitar o erro tipo II (falso--negativo), ento o beta que dever ser colo-cado a nveis baixos.

Dimenso do efeito

A possibilidade de um estudo detectar umaassociao entre um factor preditivo e um re-sultado depende da dimenso do efeito espe-rado: se este for de grandes dimenses, de

mais fcil deteco. O problema surge quandoo investigador no tem uma estimativa prviada magnitude da associao. Neste caso, nohaver outra hiptese seno ele decidir a prioria dimenso que gostaria de detectar ou, alter-nativamente, que considere clinicamente signi-ficativa: por exemplo, uma diminuio mdiade 30 mmHg da presso sistlica num estudosobre a eficcia de um medicamento anti-hi-pertensivo. Esta medida designa-se por di-menso do efeito (11).

Este passo de enorme importncia no de-

senho do estudo e pode, se for mal calculado,comprometer ou anular os resultados. Existemtrs hipteses de o investigador minimizar estaincerteza: a primeira basear-se em dados pu-blicados que, de qualquer maneira, possa for-necer estimativas realistas; a segunda, acei-tar a menor dimenso que seja clinicamentesignificativa (no exemplo anterior, 20 mmHg);a terceira pode obrigar a um estudo piloto, es-pecialmente quando os dados so pura e sim-plesmente inexistentes.

Uma ltima precauo: para efeitos de cl-culo da dimenso amostral, a medida de asso-ciao deve basear-se no resultado (outcome)principal. As dimenses dos efeitos para os re-sultados secundrios podem ser calculadas de-pois a partir dessa amostra.

Poder do estudo

J vimos anteriormente a definio dos er-ros tipo I e II. Importa agora definir o conceitode poder do estudo. Este define-se como o in-verso de , isto , 1-.1518

Effect sizeThe ability of a trial to detect an associa-

tion between a predictive factor and an out-come depends on the size of the expected ef-fect. The larger the effect, the easier it is todetect. Problems arise when the investigatorhas no previous estimate of the magnitude ofthe association. In this case, there is no alter-

native to deciding a priori what size effect onewould like to detect or would consider clini-cally significant, such as a mean fall of 30mmHg in systolic blood pressure in a trial onthe efficacy of an antihypertensive drug. Thisis known as the effect size (11).

Calculating the effect size is a very impor-tant step in designing the study, as a miscalcu-lation can compromise or even invalidate theresults. There are three ways in which the in-vestigator can minimize the uncertainty. Thefirst is to base the calculation on published

data which can help arrive at a realistic esti-mate; secondly, to accept the smallest clini-cally significant effect size (which could be 20mmHg in the example given); thirdly to carryout a pilot study if necessary, especially whenthere is simply no data available.

As a final precaution, when calculatingsample size, the measure of association shouldbe based on the primary outcome. Effect sizesfor secondary outcomes can then be calculatedon the basis of this sample.

Study powerType I and II errors have now been descri-

bed; we now need to define the concept of thestudys power. This is calculated as the inverseof , that is, 1-.

The power of a study is the probability ofrejecting the null hypothesis in that sample ifthe real effect in the population is equal to theeffect size. If beta is predefined as 0.20, thismeans that the investigator is willing to accepta 20 % possibility of failure to detect a possi-ble association. The study power is 0.80, mean-ing that the study has an 80 % chance of de-tecting an association with the specified effect.Most studies set alpha at 0.05 (ranging be-tween 0.01 and 0.10) and beta at 0.20 (rangingbetween 0.05 and 0.20).

In order to summarize all these factors andtheir interaction, Table II shows the variablesused to calculate the power of a study using apaired Students t test and their effect on sam-ple size (9).


7/9

O poder de um estudo a probabilidade derejeitar a hiptese nula daquela amostra, se oefeito real na populao igual dimenso doefeito. Se o pr-definido como 0,20, istoquer dizer que o investigador est disposto aaceitar 20% de hiptese de falhar a detecode uma eventual associao. O poder do es-tudo de 0,80, isto , o estudo tem umachance de 80 % de detectar uma associaocom o efeito determinado. A maior parte dosestudos coloca o alfa a 0,05 (podendo variarentre 0,01 e 0,10) e o beta a 0,20 (podendo va-riar entre 0,05 e 0,20).

Como resumo de todos estes factores esuas interaces, esto descritas no QuadroII as variveis includas no clculo do poderde um estudo utilizando um teste de t empa-relhado e o seu efeito na dimenso da amos-tra (9).

Quadro II

Variveis includas no clculo do poder de umestudo utilizando um teste de t emparelhado e oseu efeito na dimenso da amostra*

Varivel 1- n

Teste bilateral** 5 20 0,05 0,8 127Teste unilateral 5 20 0,05 0,8 100

Aumento de 10 20 0,05 0,8 25Aumento de 5 25 0,05 0,8 155Diminuio de 5 20 0,01 0,8 160Diminuio de 5 20 0,05 0,9 138

** Os valores a negrito foram variados a partir da primeira linha, para de-monstrar como as alteraes em cada varivel afectam a dimenso daamostra.

** Os testes bilaterais so mais conservadores que os unilaterais, porque,para se obter significncia estatstica so necessrias amostras de maio-res dimenses.

= (delta) dimenso da diferena ou alterao a ser detectada ideal-mente, a diferena mais pequena julgada clinicamente significativapelo investigador.

= (sigma) o desvio padro; o aumento da variabilidade da dife-rena entre as observaes emparelhadas.

= (alfa) o limiar abaixo do qual se define a significncia estatstica(definido pelo investigador).

1- = o poder estatstico do estudo (definido pelo investigador). n = dimenso da amostra.

Adaptado de (9)

Para finalizar: as frmulas utilizadas noclculo da dimenso amostral so baseadasem conceitos matemticos diferentes, con-forme os testes estatsticos seleccionados.Estes devem ser escolhidos antes do inciodo estudo e dependem do tipo de factor pre-ditivo e da varivel includa. No Quadro IIIesto includos alguns testes estatsticos usa-dos neste contexto (11). 15

Table II

Variables used to calculate the power of a studyusing a paired Students t test and their effect onsample size*

Variable 1- n

Two-sided test** 5 20 0.05 0.8 127One-sided test 5 20 0.05 0.8 100

Increased 10 20 0.05 0.8 25

Increased 5 25 0.05 0.8 155Decreased 5 20 0.01 0.8 160Decreased 5 20 0.05 0.9 138

** The values in bold have been varied from those in the first line to de-monstrate how changes in each variable affect the sample size.

** Two-sided tests are more conservative than one-sided tests, since lar-ger sample sizes are required to reach statistical significance.

= (delta) = size of the difference or change to be detected ideallythe smallest difference considered clinically significant by the investi-gator.

= (sigma) standard deviation; the increase in variability of the diffe-rence between paired observations.

= (alpha) = the limit below which statistical significance is deemedby the investigator to have been reached.

1- = the statistical power of the study (as defined by the investigator).

n = sample size. Adapted from (9)

A final note: the formulas used to calculatethe sample size are based on different mathe-matical concepts, depending on the statisticaltests selected. These tests should be chosenbefore the beginning of the trial and depend onthe type of predictive factor and the variableincluded. Table III shows some statistical testsused for this purpose (11).

Table III

Statistical tests used to determinesample size

Variable (outcome)

Predictive factor Dichotomous ContinuousDichotomous Chi-square Students t testContinuous Students t test Correlation coefficient

CONCLUSIONS

For both scientific and ethical reasons,sample size in clinical trials must be carefullyplanned if the results are to be credible (14).Ideally, the trial should be large enough tohave a high probability of detecting a clinicallyimportant difference of a certain magnitude, ifsuch a difference really exists, and showing itto be statistically significant (20).

Studies, whether analytic or experimental(purely descriptive studies do not need hypo-theses), require a hypothesis that specifies theexpected association between the predictivefactors and the outcomes, in order for tests of


8/9

Quadro III

Testes estatsticos para determinaoda dimenso amostral

Varivel (outcome)

Factor preditivo Dicotmica ContnuaDicotmica Chi-quadrado Teste tContnua Teste t Coeficiente de correlao

CONCLUSES

A dimenso da amostra em estudos clnicosnecessita, por razes cientficas e ticas, de serbem planeada, se se quiser que os resultadossejam credveis (14). Idealmente, um estudo de-ver ter uma dimenso suficientemente impor-tante para poder ter uma alta probabilidade dedetectar, como estatisticamente significativa,uma diferena clinicamente importante comuma determinada dimenso, se essa diferenarealmente existir(20).

Os estudos quer sejam analticos, querexperimentais (os puramente descritivos nonecessitam de hipteses) necessitam da colo-cao de uma hiptese que especifique (paraaplicao ulterior de testes de significncia) aassociao esperada entre os factores prediti-vos e os resultados. A hiptese nula que abase formal para testar a significncia estats-tica afirma que no existe associao entre ofactor preditivo e a varivel do resultado napopulao ou que, por exemplo, no existe di-ferena entre os resultados de duas interven-

es teraputicas. A hiptese alternativa quepoder ser uni ou bilateral afirma a existn-cia da associao entre o factor preditivo e avarivel do resultado medido.

Os elementos para o clculo correcto da di-menso da amostra baseada no resultadoprincipal que se quer medir so: 1) os resul-tados (outcomes) em cada grupo, na base dosquais se define o que so as diferenas clini-camente significativas; 2) o erro tipo I (alfa); 3)o poder do estudo (o nvel de erro tipo II oubeta) e 4) (no caso de variveis contnuas) odesvio padro das medidas a variabilidade.

1520

statistical significance to be applied subse-quently. The null hypothesis, the formal basisfor testing statistical significance, states thatthere is no association between the predictivefactor and the outcome variable in the popula-tion; that, for example, there is no differencebetween the outcomes of two therapeutic inter-ventions. The alternative hypothesis (which

can be one- or two-sided) states that there is infact an association between the predictive fac-tor and the measured outcome variable.

The elements required for correct calcula-tion of the sample size based on the primaryoutcome that is to be measured are: 1) theoutcomes in each group, on the basis of whichclinically significant differences are determi-ned; 2) type I error (alpha); 3) the study power(the level of type II error, or beta); and 4) inthe case of continuous variables, the standarddeviation of the measures, or variability.

Pedido de separatas para:Address for reprints:

ANTNIO VAZ CARNEIROCentro de Estudos de MedicinaBaseada na EvidnciaFaculdade de Medicina de LisboaAv. Prof. Egas Moniz1649-035 LISBOA Portugal


9/9

1. Carneiro AV. A Medicina Baseada na Evidncia. Umnovo paradigma para a prtica mdica. Acta Md Port1998;11:717-37.

2. Carneiro AV. Um instrumento para a mudana da prticamdica: a Medicina Baseada na Evidncia. Medicina Int1998;5:133-8.

3. Guyatt GH, Rennie D, editors. Users Guides to the Medi-cal Literature. A Manual for Evidence-Based Clinical Prac-tice. Chicago: JAMA & Archives Journals AMA, 2001.

4. Clinical epidemiology. Baltimore: Williams & Wilkins,1996.

5. Campbell MJ, Julious SA, Altman DG. Estimating samplesizes for binary, ordered categorical, and continuous out-comes in two group comparisons. BMJ 1995;311:1145-8.

6. Designing clinical research. Philadelphia: Lippincott Wil-liams and Wilkins, 2001.

7. An introduction to medical statistics. Oxford: Oxford Uni-versity Press, 2000.

8. Clinical epidemiology & evidence-based medicine. Thou-sand Oaks: Sage Publications, 2001.

9. How to report statistics in medicine. Philadelphia: ACP,1997.

10. Statistical methods in diagnostic medicine. New York:

Wiley, 2002.

152

11. Medical statistics. Malden: Blackwell Science, 2003.

12. Basic & clinical biostatistics. New York: Lange MedicalBooks/McGraw-Hill, 2001.

13. Katz MH. Multivariable analysis: a primer for readers ofmedical research. Ann Int Med 2003;138:644-50.

14. Chan KBY, Man-Son-Ing M, Molnar FJ, Laupacis A.How well is the clinical importance of study results repor-ted? An assessment of randomized controlled trials. CMAJ

2001;165:1197-202.15. Kerry SM, Bland JM. Sample size in cluster randomisa-tion. BMJ 1998;316:549.

16. Fundamentals of clinical trials. New York: Springer,1998.

17. Goodman SN. Toward evidence-based medical statistics.1. The P value fallacy. Ann Int Med 1999;130:995-1004.

18. Goodman SN. Toward evidence-based medical statistics.2. The Bayes factor. Ann Int Med 1999;130:1005-13.

19. Schulz KF, Grimes DA. Sample size slippages in rando-mised trials: exclusions and the lost and wayward. Lancet2002;359:781-5.

20. Sackett DL. Why randomized controlled trials fail butneednt: 2. Failure to employ physiological statistics, or theonly formula a clinician-trialist is ever likely to need (or un-

derstand!). CMAJ 2001;165:1226-37.

BIBLIOGRAFIA / REFERENCES

Hospital Fernando FonsecaServio de Cardiologia

IV INTERNATIONAL SYMPOSIUM

Thrombosisand Thromboembolism

Penha-Longa Golf Resort

SINTRA PORTUGAL

October, 1 and 2, 2004

amostragem 4

Documents

Transcript of amostragem 4