[논문]유전 통합 데이터 구축과 전장유전체연관분석

추소담

유전 통합 데이터 구축과 전장유전체연관분석
Construction of combined genetic data and GWAS among Korean population 원문보기

추소담 (연세대학교 보건대학원 역학 국내석사)

초록 ▼
AI-Helper

유전 통합 데이터 구축과
전장유전체연관분석

연구 배경
전장유전체연관분석(Genome-Wide Association Studies, GWAS)이란 인간의 질별 발생에 기여하는 유전자를 찾기 위해 수백만 개의 단일염기다형성(SNP)의 유전자형(Genotype)과 표현형(Phenotype)간의 통계적인 연관성을 분석함으로써 질병과 관련된 유전 변이를 찾아내는 방법을 의미한다. 충분한 통계적 파워를 가지고 있는 샘플 사이즈는 GWAS 연구에 있어서 매우 중요한 역할을 하는데, 최근에는 여러 데이터를 통합하는 방법을 이용하여 새로운 유전변이 발굴을 위해 노력하고 있다.

연구 목적
첫째, 한국인을 대상으로 한 6개의 각기 다른 플랫폼을 가진 유전 자료를 이용하여 하나의 통합데이터를 구축하고자 하였다. 또한, 다양한 칩을 여러 형태의 통합 데이터로 구축하여 칩마다의 특성을 확인하고자 하였다. 둘째, 유전 통합 데이터를 이용하여 빌리루빈과 요산의 전장유전체연관분석을 실시하고 통계적 검증력을 확인하고자 하였다. 또한, 여러 통합데이터들의 전장유전체연관분석 결과와 이전 선행연구와의 결과를 함께 비교해보고자 하였다.

연구자료 및 방법
이 연구는 한국인을 대상으로 한 서로 다른 Genotyping platform을 가지고 있는 총 6개의 유전자료를 이용하였다. 모든 자료는 Annotation build 36 버전을 이용하여 모든 정보를 동일하게 맞춰준 후 공통 SNP만을 추출하여 병합 과정을 진행하였다. 모든 유전분석과 유전 데이터 병합은 PLINK 1.9 버전을 사용하였다.

연구결과
연구대상자 수가 늘어날수록 관련성 분석의 p-value와 effect size, 통계적 유의성이 증가하는 모습을 볼 수 있었으며, 연구대상자의 수가 적은 데이터에서는 통계적 유의성을 보이지 않았던 snp이 연구대상자 수가 늘어난 데이터에서는 통계적 유의성을 보이기도 하였다.

결론
이 연구에서는 다양한 칩을 이용하여 여러 통합데이터를 구축하였고, 이에 따른 칩마다의 특성을 확인할 수 있었다. 이를 통하여, 개별 유전자료의 샘플 사이즈의 문제로 allele frequency, missing rate 등의 이유로 분석에 포함되지 못하거나 통계적으로 유의한 결과를 도출하지 못한다는 점을 통합데이터로 보완할 수 있었다. 또한, 이전 선행연구 결과를 재현해냈다는 점과 한국인에 특이적인 유전변이 후보군을 제안할 수 있었다는 점에서 중요한 의미를 가진다고 할 수 있겠다.

Abstract ▼ AI-Helper

Construction of combined genetic data and GWAS among Korean population

Sodam Chu
Graduate School of Public Health
Yonsei University, Seoul, Korea

(Directed by Professor Sun Ha Jee, Ph.D.)

Background and Objectives: Genome-Wide Association Studies (GWAS) analyzes the statistical relationships between the genotype and phenotype of millions of single nucleotide polymorphisms (SNPs). Sample size with sufficient statistical power plays an important role in GWAS research. Recently, efforts have been made to discover new genetic variants using a variety of data integration methods.
Therefore, in this study, a combiend data using genetic data of six different platforms for Koreans has been constructed. In addition, the characteristics of various chips were identified by constructing various types of integrated data. Second, genetic association data of bilirubin and uric acid were analyzed using genetic integration data, and statistical validity was confirmed. In addition, the results of the linkage genome association analysis of several integrated data were compared with those of the previous study.

Methods: This study used a total of six genetic data with different genotyping platforms for Koreans. All data were merged using the Annotation Build 36. All genetic analysis and genetic data merging were performed using PLINK version 1.9.

Results: As the number of study subjects increased, the p-value and effect size of statistical significance increased SNP, which showed no statistical significance in the data with a small number of subjects, showed a statistical significance in a combined data.

Conclusion: In this study, the characteristics of chips within the integrated data. Through this, we could complement the fact that the sample size of individual genetic data were not included in the analysis due to the allele frequency, missing rate, or failure to yield statistically significant results. In addition, this study replicates the results of previous studies and suggests a candidate genetic mutation specific for Koreans.

주제어

학위논문 정보

저자	추소담
학위수여기관	연세대학교 보건대학원
학위구분	국내석사
학과	역학
지도교수	지선하
발행연도	2019
총페이지	ix, 77p.
키워드	유전 통합 데이터 전장유전체연관분석 빌리루빈 요산
언어	kor
원문 URL	http://www.riss.kr/link?id=T15010546&outLink=K
정보원	한국교육학술정보원

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 논문명(한글), 저자명(한글), 학위수여기관, 학위연도, 학위구분, 학과, 총페이지, 키워드, 초록(한글), 초록(영문) 관리번호, 논문명(한글), 논문명(영문), 저자명(한글), 저자명(영문), 학위수여기관, 학위연도, 학위구분, 학과, 총페이지, 키워드, 초록(한글), 초록(영문)
저장형식	Text(ASCII format) Excel format
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

유전 통합 데이터 구축과 전장유전체연관분석
Construction of combined genetic data and GWAS among Korean population 원문보기

초록 ▼
AI-Helper

Abstract ▼ AI-Helper

주제어

학위논문 정보

이 논문을 인용한 문헌

관련 콘텐츠

원문 보기

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

유전 통합 데이터 구축과 전장유전체연관분석 Construction of combined genetic data and GWAS among Korean population 원문보기

초록 ▼ 용어보기논문에서 용어와 풀이말을 자동 추출한 결과로, 시범 서비스 중입니다. AI-Helper

Abstract ▼ AI-Helper

주제어

학위논문 정보

이 논문을 인용한 문헌

관련 콘텐츠

원문 보기

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

유전 통합 데이터 구축과 전장유전체연관분석
Construction of combined genetic data and GWAS among Korean population 원문보기

초록 ▼
AI-Helper