[Paper] Review on Problems with Null Hypothesis Significance Testing in Dental Research and Its Alternatives

이광희

doi:10.5933/jkapd.2013.40.3.223


Journal of the Korean Academy of Pediatric Dentistry (大韓小兒齒科學會誌), v.40 no.3, 2013, pp.223-232

Abstract

Evaluating study results by the p value in null hypothesis significance testing, as used in dental research, involves many problems. It is a logical fallacy to conclude that the null hypothesis is true when it is not rejected. There are many serious misunderstandings about the p value, and researchers should interpret it cautiously when writing papers. Alternatives that can complement or replace null hypothesis significance testing include effect size, confidence intervals, and Bayesian statistics.
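Two of the alternatives named in the abstract, effect size and confidence interval, can be illustrated with a short sketch. The example below is not taken from the paper: the scores are invented, Cohen's d is assumed as the standardized effect size, and the interval uses a large-sample normal approximation rather than the t distribution that a real analysis with groups this small would normally call for. The function names are hypothetical.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def cohens_d(treatment, control):
    """Standardized mean difference using the pooled standard deviation."""
    n1, n2 = len(treatment), len(control)
    s1, s2 = stdev(treatment), stdev(control)
    pooled_sd = sqrt(((n1 - 1) * s1**2 + (n2 - 1) * s2**2) / (n1 + n2 - 2))
    return (mean(treatment) - mean(control)) / pooled_sd


def mean_difference_ci(treatment, control, level=0.95):
    """Approximate CI for the raw mean difference (normal approximation)."""
    n1, n2 = len(treatment), len(control)
    diff = mean(treatment) - mean(control)
    se = sqrt(stdev(treatment) ** 2 / n1 + stdev(control) ** 2 / n2)
    z = NormalDist().inv_cdf(0.5 + level / 2)  # two-sided critical value
    return diff - z * se, diff + z * se


# Hypothetical scores, invented for illustration only.
treatment = [2.0, 4.0, 6.0, 8.0, 10.0]
control = [1.0, 3.0, 5.0, 7.0, 9.0]

d = cohens_d(treatment, control)
low, high = mean_difference_ci(treatment, control)
print(f"Cohen's d = {d:.3f}")
print(f"95% CI for the mean difference: ({low:.2f}, {high:.2f})")
```

Reporting the estimate together with its interval shows both the size of the difference and how imprecisely it is known, which a bare p value does not convey. With these invented values the interval spans zero, so the data are compatible with no difference as well as with a sizeable one; that is not the same as showing that the null hypothesis is true.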

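Questions and Answers

Keyword	Question	Answer extracted from the paper
	What is effect size?	Effect size is an index of the strength of the association between the independent variable and the dependent variable. The difference between the mean of the experimental group and the mean of the control group can serve as an effect size, but a standardized effect size is used when the measured values have no intrinsic meaning, as in studies that use arbitrary scales, or when the results of several studies that used different scales must be combined, as in meta-analysis.
	What convenience does Bayesian statistics offer?	As studies are repeated and results accumulate, Bayesian statistics comes closer to the truth. Its convenience is that, instead of recalculating from every piece of information obtained so far, updating with only the newest information yields the same final figures. In the recent United States presidential election, one poll-analysis site succeeded in making accurate predictions by using a Bayesian method that integrates accumulated poll results with each subsequent poll result38).
	What is the null hypothesis?	The null hypothesis is the opposite of the research hypothesis (alternative hypothesis): it assumes that the effect the study actually seeks to detect does not exist. Significance testing tests the null hypothesis, the opposite of the research hypothesis, rather than the research hypothesis itself in order to avoid the fallacy of affirming the consequent.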
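The updating property described in the Bayesian statistics answer above can be sketched with a conjugate Beta-binomial model. This is an illustrative assumption, not the method used in the paper or by the polling site it mentions: the study counts, the uniform Beta(1, 1) prior, and the function names are all hypothetical.

```python
from fractions import Fraction


def update_beta(prior, successes, failures):
    """Return the Beta posterior (alpha, beta) after observing binomial data."""
    alpha, beta = prior
    return alpha + successes, beta + failures


def beta_mean(params):
    """Posterior mean of a Beta(alpha, beta) distribution, as an exact fraction."""
    alpha, beta = params
    return Fraction(alpha, alpha + beta)


# Hypothetical (successes, failures) counts from three successive studies.
studies = [(7, 3), (12, 8), (30, 20)]
prior = (1, 1)  # uniform Beta(1, 1) prior, assumed for illustration

# Sequential updating: each posterior becomes the prior for the next study.
sequential = prior
for successes, failures in studies:
    sequential = update_beta(sequential, successes, failures)

# Batch updating: pool all studies and update the original prior once.
total_successes = sum(s for s, _ in studies)
total_failures = sum(f for _, f in studies)
batch = update_beta(prior, total_successes, total_failures)

print("sequential posterior:", sequential)
print("batch posterior:     ", batch)
print("posterior mean:      ", float(beta_mean(sequential)))
```

Both routes end at the same posterior, so only the current posterior and the newest study need to be kept, not the full history of earlier data.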

References (48)

1. Seaman JE, Allen IE : Not significant, but important? Know the pitfalls of p-values and formal hypothesis tests. Quality Progress, 2011 August. Available from URL: http://asq.org/quality-progress/2011/08/statistics-roundtable/not-significant-butimportant.html (Accessed on July 8, 2013)
2. Matrixx Initiatives, Inc. v. Siracusano. Available from URL: http://en.wikipedia.org/wiki/Matrixx_Initiatives,_Inc._v._Siracusano (Accessed on July 8, 2013)
3. Meehl PE : Theory-testing in psychology and physics: a methodological paradox. Philosophy Sci, 34:103-115, 1967.
4. Meehl PE : Theoretical risks and tabular asterisks: Sir Karl, Sir Ronald, and the slow progress of soft psychology. J Consult Clin Psychol, 46:806-834, 1978.
5. Cohen J : The earth is round (p<.05). Am Psychol, 49:997-1003, 1994.
6. Schmidt FL, Hunter JE : Eight common but false objections to the discontinuation of significance testing in the analysis of research data. In Harlow LA, Mulaik SA, Steiger JH (eds.) : What if there were no significance tests? Mahwah, NJ, Lawrence Erlbaum Associates, 37-64, 1997.
7. NHST problems. Available from URL: http://www.faculty.biol.ttu.edu/strauss/stats/LectureNotes/20_NHSTProblems.pdf (Accessed on July 8, 2013)
8. Fallacy of affirming the consequent. Available from URL: http://terms.naver.com/entry.nhn?cid1137&docId275047&mobile&categoryId1137 (Accessed on July 8, 2013)
9. Pollard P, Richardson JTE : On the probability of making type I errors. Psychol Bull, 102:159-163, 1987.
10. Reese HW : Problems of statistical inference. Mex J Behav Anal, 25:39-68, 1999.
11. Goodman S : A dirty dozen: twelve p-value misconceptions. Semin Hematol, 45:135-140, 2008.
12. Hubbard R, Lindsay RM : Why p values are not a useful measure of evidence in statistical significance testing. Theory Psychol, 18:69-88, 2008.
13. Sterne JAC, Smith GD : Sifting the evidence - what's wrong with significance tests? BMJ (Clin Res), 322:226-231, 2001.
14. Johnson DH : The insignificance of statistical significance testing. J Wildlife Manag, 63:763-772, 1999.
15. Nurminen M : Statistical significance - a misconstrued notion in medical research. Scand J Work Environ Health, 23:232-235, 1997.
16. Schervish MJ : P values: what they are and what they are not. Am Stat, 50:203-206, 1996.
17. Carver RP : The case against statistical significance testing. Harvard Educat Review, 48:378-399, 1978.
18. Nickerson RS : Null hypothesis statistical testing: a review of an old and continuing controversy. Psychol Methods, 5:241-301, 2000.
19. Berger JO, Sellke T : Testing a point null hypothesis: the irreconcilability of p values and evidence (with comments). J Am Stat Assoc, 82:112-139, 1987.
20. Berger JO, Delampady M : Testing precise hypotheses (with comments). Stat Science, 2:317-352, 1987.
21. Nester MR : An applied statistician's creed. Statistician, 45:401-410, 1996.
22. Berger JO, Berry DA : Statistical analysis and the illusion of objectivity. Am Scientist, 76:159-165, 1988.
23. Hubbard R : Alphabet soup: blurring the distinctions between p's and $\alpha$'s in psychological research. Theory Psychol, 14:295-327, 2004.
24. Sellke T, Bayarri MJ, Berger JO : Calibration of p values for testing precise null hypotheses. Am Statistician, 55:62-71, 2001.
25. Gelman A, Stern H : The difference between 'significant' and 'not significant' is not itself statistically significant. Am Statistician, 60:328-331, 2006.
26. International Committee of Medical Journal Editors : Uniform requirements for manuscripts submitted to biomedical journals. Available from URL: http://www.icmje.org/manuscript_1prepare.html (Accessed on June 27, 2013)
27. Royall RM : The effect of sample size on the meaning of significance tests. Am Statistician, 40:313-315, 1986.
28. Hand DJ : Data mining: statistics and more? Am Statistician, 52:112-118, 1998.
29. Fisher RA : The design of experiments (8th ed.). Edinburgh, Oliver & Boyd, 1966.
30. Fisher BJ : R.A. Fisher: The life of a scientist. New York, Wiley, 1978.
31. Denis DJ : Alternatives to null hypothesis significance testing. Theory & Science, 4(1), 2003. Available from URL: http://theoryandscience.icaap.org/content/vol4.1/02_denis.html (Accessed on July 8, 2013)
32. Rosenthal R : Effect size estimation, significance testing, and the file-drawer problem. J Parapsychol, 56:57-58, 1992.
33. Vaughan GM, Corballis MC : Beyond tests of significance: estimating strength of effects in selected ANOVA designs. Psychol Bulletin, 72:204-213, 1969.
34. Silva-Aycaguer LC, Suarez-Gil P, Fernandez-Somoano A : Null hypothesis significance test in health sciences research (1995-2006): statistical analysis and interpretation. BMC Med Res Methodol, 10:44, 2010.
35. Schmidt FL : Statistical significance testing and cumulative knowledge in psychology: implications for training of researchers. Psychol Methods, 1:115-129, 1996.
36. Cumming G, Finch S : Inference by eye: confidence intervals and how to read pictures of data. Am Psychol, 60:170-180, 2005.
37. Schenker N, Gentleman JF : On judging the significance of differences by examining the overlap between confidence intervals. Am Statistician, 55:182-186, 2001.
38. Wang S, Campbell B : Mr. Bayes goes to Washington. Science, 339:758-759, 2013.
39. Efron B : Bayes' Theorem in the twenty-first century. Science, 340:1177-1178, 2013.
40. FDA : Guidance for the use of Bayesian statistics in medical device clinical trials. Available from URL: http://www.fda.gov/medicaldevices/deviceregulationandguidance/guidancedocuments/ucm071072.htm (Accessed on July 8, 2013)
41. Lilford RJ, Braunholtz D : The statistical basis of public policy: a paradigm shift is overdue. Br Med J, 313:603-607, 1996.
42. Efron B : Why isn't everyone a Bayesian (with discussion)? Am Statist, 40:1-11, 1986.
43. Nurminen M, Mutanen P : Exact Bayesian analysis of two proportions. Scand J Stat, 14:67-77, 1987.
44. Diaconis P, Freedman D : On the consistency of Bayes estimate (with discussion). Ann Math Stat, 14:1-67, 1986.
45. Zhang Y, Todem D, Kim K, Lesaffre E : Bayesian latent variable models for spatially correlated tooth-level binary data in caries research. Stat Modelling, 11:25-47, 2011.
46. Tu YK, Needleman I, Chambrone L, et al. : A Bayesian network meta-analysis on comparisons of enamel matrix derivatives, guided tissue regeneration and their combination therapies. J Clin Periodontol, 39:303-314, 2012.
47. Frosio I, Olivieri C, Lucchese M, et al. : Bayesian denoising in digital radiography: a comparison in the dental field. Comput Med Imaging Graph, 37:28-39, 2013.
48. Freedman L : Bayesian statistical methods. A natural way to assess clinical evidence (editorial). Br Med J, 313:569-570, 1996.
