[Patent] Methods and systems for customizable clustering of sub-networks for bioinformatics and health care applications

IPC classification information
Country / Type	United States (US) Patent, Granted
International Patent Classification (IPC 7th edition)	G06F-017/30
Application number	US-0604433 (2015-01-23)
Registration number	US-9690844 (2017-06-27)
Priority information	IN-310/CHE/2014 (2014-01-24); KR-10-2015-0006118 (2015-01-13)
Inventor / Address	Mukherjee, Subhankar; Ahn, Taejin; Bopardikar, Ajit S.; Bhaduri, Anirban; Mallavarapu, Srikanth Rama
Applicant / Address	SAMSUNG ELECTRONICS CO., LTD.
Agent / Address	Leydig, Voit & Mayer, Ltd.
Citation information	Times cited: 0; Cited patents: 4

Abstract

Methods and devices for clustering a plurality of sub-networks of a larger interaction network using an enhanced hierarchical clustering algorithm are disclosed. The methods provide expression-based sub-network generation using differentially expressed markers. The enhanced hierarchical clustering algorithm clusters the generated sub-networks based on a user-defined customizable similarity coefficient. The methods use non-Boolean links to cluster similar sub-networks, which allows indirect relationships among sub-networks to be taken into account. The customizable similarity coefficient enables the methods to be used for diverse applications such as biomarker detection, patient stratification, personalized therapy, drug efficacy prediction, and genetic similarity analysis in genetic diseases. The methods enable patient grouping based on the enhanced hierarchical clustering algorithm.
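As a rough illustration of what a customizable similarity coefficient could look like in code, the sketch below treats each sub-network as a set of marker names and plugs the Jaccard coefficient (one of the measures named in claim 9) into a generic pairwise link computation. The names `jaccard` and `pairwise_links`, the set-of-nodes representation, and the gene symbols in the usage note are illustrative assumptions, not taken from the patent.

```python
from itertools import combinations
from typing import Callable, Dict, FrozenSet, Sequence, Tuple

# Assumption: a sub-network is represented only by its set of node (marker) names.
SubNetwork = FrozenSet[str]
Similarity = Callable[[SubNetwork, SubNetwork], float]


def jaccard(a: SubNetwork, b: SubNetwork) -> float:
    """Jaccard coefficient: shared nodes divided by all nodes in either set."""
    union = a | b
    if not union:
        return 0.0  # two empty sub-networks: define the similarity as 0
    return len(a & b) / len(union)


def pairwise_links(
    subnets: Sequence[SubNetwork], similarity: Similarity = jaccard
) -> Dict[Tuple[int, int], float]:
    """Compute a real-valued (non-Boolean) link for every pair of sub-networks.

    Any function with the Similarity signature can be passed in, which is the
    sense in which the coefficient is "customizable" in this sketch.
    """
    return {
        (i, j): similarity(subnets[i], subnets[j])
        for i, j in combinations(range(len(subnets)), 2)
    }
```

For example, `jaccard(frozenset({"TP53", "MDM2", "ATM"}), frozenset({"TP53", "MDM2", "CHEK2"}))` shares two of four distinct nodes and so evaluates to 0.5. Swapping in a different coefficient only requires passing another function as `similarity`.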

Representative claims

1. A method for clustering a plurality of sub-networks comprising:
    receiving as input in a computing device, one or more expression data sets of one or more samples;
    preprocessing the input expression data for obtaining a plurality of seed markers; wherein the seed markers are biomarkers or input marker genes, and obtained by methods such but not limited to thresholding normalized expression data based on a predefined threshold value;
    extracting a set of sub-networks for the input samples using expression values, set of seed markers obtained as above and the interaction network;
    selecting sub networks among the plurality of the extracted or input sub-networks;
    building a plurality of local heaps for each cluster among a plurality of clusters by computing a first link between each cluster and remaining clusters of the plurality of clusters, wherein each of the plurality of clusters correspond to the selected sub-networks;
    building a global heap by computing a second link between each cluster among the plurality of clusters and a highest ranked cluster of each of the local heap among the plurality of local heaps;
    merging the highest ranked cluster of each local heap and a highest ranked cluster of the global heap to form a plurality of intermediate clusters;
    calculating a similarity coefficient between each intermediate cluster among the plurality of intermediate clusters and each cluster in the global heap and each cluster corresponding to one of the local heap; and
    returning each intermediate cluster as a final cluster, if each the calculated similarity coefficients are below a predefined link cutoff value.

2. The method as in claim 1, wherein a value of the link is based on a user defined customizable similarity coefficient used for computing a functional relationship quantifier.

3. The method as in claim 1, wherein the method further comprises pushing the each intermediate cluster into the global heap if the calculated link is above a predefined link cutoff value.

4. The method as in claim 1, wherein the method comprises building the local heap for the each cluster by adding each cluster from the remaining clusters to each local heap if the computed link for the cluster is above a predefined link cutoff value.

5. The method as in claim 1, wherein the method comprises ranking at least one cluster in each local heap and at least one cluster in the global heap to determine the highest ranked cluster in each local heap and the highest ranked cluster in the global heap based on a value of the computed link for the at least one cluster in the each local heap and the at least one cluster in the global heap.

6. The method as in claim 1, wherein the method further comprises performing grouping based on the enhanced hierarchical clustering algorithm by:
    generating the plurality of sub-networks for each sample among a plurality of samples;
    clustering sub-networks within the plurality of sub-networks of each sample using the enhanced hierarchical clustering algorithm based on the customizable similarity coefficient; and
    generating a data set of clusters by pooling clusters across the plurality of samples.

7. The method as in claim 5, wherein the method further comprises:
    initializing a plurality of clusters-of-interest from the data set of clusters;
    growing the clusters-of-interest using the data set of clusters;
    determining membership of each sample in each the plurality of cluster-of-interest based on the clustered sub-networks for the each sample; and
    grouping the plurality of samples into a group among a plurality of groups based on the determined membership of the sample, wherein samples in the group exhibit identical cluster memberships.

8. The method as in claim 5, wherein the method further comprises generating the plurality of sub-networks by:
    generating a set of first level sub-networks around a plurality of seed markers based on differential marker expression;
    growing the set of generated first level sub-networks based on a predefined scoring function; wherein the predefined scoring function is defined as but not limited to the first derivative of a log likelihood function, and
    merging the set of grown first level sub-networks based on one of: the enhanced clustering algorithm and a predefined similarity coefficient to generate the plurality of sub-networks; and
    merging of the sub-networks using a highest differentially expressed marker in a neighborhood.

9. The method as in claim 6, wherein the similarity coefficient can be based on similarity measures such as but not limited to Jaccard coefficient, Edge interaction coefficient (EIC) and Common neighborhood interaction coefficient (CNIC).

10. A device for clustering a plurality of sub-networks derived from a larger network using an enhanced hierarchical clustering algorithm, wherein the device comprises:
    an integrated circuit further comprising at least one processor;
    at least one memory having a computer program code within the circuit;
    the at least one memory and the computer program code with the at least one processor cause the device, when the computer program code is executed by the processor, to:
    receive a data set representing a plurality of sub-networks derived from a network;
    select sub networks among the plurality of sub-networks;
    build a plurality of local heaps for each cluster among a plurality of clusters by computing a first link between each cluster and remaining clusters of the plurality of clusters, wherein the plurality of clusters correspond to a plurality of selected sub-networks among the plurality of sub-networks;
    build a global heap by computing a second link between each cluster among the plurality of clusters and a highest ranked cluster of each the local heap among the plurality of local heaps;
    merge the highest ranked cluster of each local heap and a highest ranked cluster of the global heap to form a plurality of intermediate clusters;
    calculate a similarity coefficient between each intermediate cluster among the plurality of intermediate clusters and each cluster in the global heap, each cluster corresponding to one of the local heap; and
    return each intermediate cluster as a final cluster, if each the calculated link is below a predefined link cutoff value.

11. The device as in claim 10, wherein a value of the link is based on a user defined customizable similarity coefficient used for computing a functional relationship quantifier.

12. The device as in claim 10, wherein the device is further configured to push each intermediate cluster into the global heap if each the calculated link is above the predefined link cutoff value.

13. The device as in claim 10, wherein the device is configured to build the local heap for the each cluster by adding each cluster from the remaining clusters to each local heap if the computed link for the cluster is above the predefined link cutoff value.

14. The device as in claim 10, wherein the device is configured to rank at least one cluster in each local heap and at least one cluster in the global heap to determine the highest ranked cluster in each local heap and the highest ranked cluster in the global heap based on a value of the computed link for the at least one cluster in each local heap and the at least one cluster in the global heap.

15. The device as in claim 10, wherein the device is further configured to perform patient grouping based on the enhanced hierarchical clustering algorithm by:
    generating the plurality of sub-networks for each patient among a plurality of patients;
    clustering sub-networks within the plurality of sub-networks of each patient using the enhanced hierarchical clustering algorithm based on the customizable similarity coefficient; and
    generating a data set of clusters by pooling clusters across the plurality of patients.

16. The device as in claim 15, wherein the device is further configured to:
    initialize a plurality of clusters-of-interest from the data set of clusters;
    grow the clusters-of-interest using the data set of clusters;
    determine membership of the each patient in each the plurality of cluster-of-interest based on the clustered sub-networks for the each patient; and
    group the plurality of patients into a group among a plurality of groups based on the determined membership of the each patient, wherein patients in the group exhibit identical cluster membership.

17. The device as in claim 15, wherein the device is further configured to generate the plurality of sub-networks by:
    generating a set of first level sub-network around a plurality of seed markers based on differential marker expression;
    growing the set of generated first level sub-networks based on a predefined scoring function; and
    merging the set of grown first level sub-networks based on one of: the enhanced hierarchical clustering algorithm and the predefined scoring function to generate the plurality of sub-networks, wherein the predefined scoring function is defined as the first derivative of a log likelihood function.

18. The device as in claim 10, wherein the device is further configured to refine at least one biomarker by clustering sub-networks generated from an incomplete set of disease specific input marker genes, wherein the clustering is based on a customizable similarity coefficient.

19. A computer program that is implemented by hardware and is stored in a medium to execute the method of claim 1.
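The merge-until-cutoff idea running through claims 1 to 5 can be sketched, in heavily simplified form, as a greedy agglomerative loop driven by a max-heap of pairwise links. This is not the claimed algorithm: the per-cluster local heaps are collapsed into a single heap of cluster pairs for brevity, clusters are assumed to be plain sets of node names, merging is assumed to be set union, and the Jaccard coefficient stands in for the customizable link. The function names and the default cutoff are hypothetical.

```python
import heapq
from itertools import combinations
from typing import Callable, Dict, FrozenSet, List, Sequence, Tuple

Cluster = FrozenSet[str]  # assumption: a cluster is just a set of node names


def jaccard(a: Cluster, b: Cluster) -> float:
    """Stand-in link function; any (Cluster, Cluster) -> float would do."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_subnetworks(
    subnets: Sequence[Cluster],
    link: Callable[[Cluster, Cluster], float] = jaccard,
    cutoff: float = 0.3,
) -> List[Cluster]:
    """Greedily merge the most strongly linked pair until no link exceeds cutoff."""
    active: Dict[int, Cluster] = dict(enumerate(subnets))
    next_id = len(active)

    # heapq is a min-heap, so links are negated to pop the strongest pair first.
    heap: List[Tuple[float, int, int]] = []
    for (i, a), (j, b) in combinations(active.items(), 2):
        score = link(a, b)
        if score > cutoff:  # only links above the cutoff are ever queued
            heapq.heappush(heap, (-score, i, j))

    while heap:
        _, i, j = heapq.heappop(heap)
        if i not in active or j not in active:
            continue  # stale entry: one of the two clusters was already merged
        merged = active.pop(i) | active.pop(j)
        # Re-link the intermediate cluster against every remaining cluster.
        for k, other in active.items():
            score = link(merged, other)
            if score > cutoff:
                heapq.heappush(heap, (-score, k, next_id))
        active[next_id] = merged
        next_id += 1

    # Whatever remains has no link above the cutoff: these are the final clusters.
    return list(active.values())
```

With the hypothetical inputs `[frozenset("ABC"), frozenset("BCD"), frozenset("XY"), frozenset("YZ")]` and a cutoff of 0.3, the first pair (link 0.5) and the second pair (link 1/3) each merge, leaving two clusters. Raising the cutoff to 0.4 keeps the second pair apart. Stale heap entries are skipped lazily rather than deleted, which keeps the sketch short at the cost of some wasted pops.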

Patents cited by this patent (4)

Singh, Ambuj Kumar; He, Huahai, Graph querying, graph motif mining and the discovery of clusters.
Jin, Yaochu; Sendhoff, Bernhard, Reduction of fitness evaluations using clustering techniques and neural network ensembles.
Asgekar, Amogh; Tawari, Sandesh, Social network node clustering system and method.
Nucci, Antonio; Keralapura, Ram, System and method for content-aware co-clustering algorithm based on hourglass model.
