[특허]System and method for scientific information knowledge management

System and method for scientific information knowledge management 원문보기

IPC분류정보
국가/구분	United States(US) Patent 등록
국제특허분류(IPC7판)	G06F-017/00 G06N-005/02
출원번호	US-0641539 (2006-12-18)
등록번호	US-8275737 (2012-09-25)
발명자 / 주소	Kupershmidt, Ilya Su, Qiaojuan Jane Andry, Francois
출원인 / 주소	NextBio
대리인 / 주소	Weaver Austin Villeneuve & Sampson LLP
인용정보	피인용 횟수 : 4 인용 특허 : 12

초록 ▼

The present invention relates to methods, systems and apparatus for capturing, integrating, organizing, navigating and querying large-scale data from high-throughput biological and chemical assay platforms. It provides a highly efficient meta-analysis infrastructure for performing research queries across a large number of studies and experiments from different biological and chemical assays, data types and organisms, as well as systems to build and add to such an infrastructure.

대표청구항 ▼

1. A computer-implemented method of providing data to a knowledge base of scientific information, the knowledge base including a plurality of pre-existing feature sets and pre-existing feature groups, the pre-existing feature sets each including a list of features and associated statistics indicating one or more of: differential expression of said features, abundance of said features, responses of said features to a treatment or stimulus, and effects of said features on biological systems, and the pre-existing feature groups each including a list of features related by structure or function, wherein the features are biological or chemical entities or units of biological or chemical information, the method comprising: (a) correlating by one or more processors of a computer system an input feature set against a plurality or all of the pre-existing feature sets in the knowledge base, the input feature set including a list of features and associated statistics indicating one or more of: differential expression of said features, abundance of said features, responses of said features to a treatment or stimulus, and effects of said features on biological systems;(b) correlating by one or more processors of the computer system the input feature set against one or more pre-existing feature groups in the knowledge base;(c) storing on one or more storage devices correlation information generated in (a) and (b) for use in responding to queries involving feature groups or feature sets; andprior to (a), mapping by one or more processors of the computer system each feature in the input feature set to one or more mapping identifiers in the knowledge base, wherein each mapping identifier represents a globally unique feature in the knowledge base. 2. The method of claim 1, wherein the features comprise genes of an organism. 3. The method of claim 1, wherein the features comprise chemical compounds and indications of responses are effect(s) of chemical compounds on biological systems. 4. The method of claim 1, wherein the correlating in (b) comprises performing a rank-based statistical algorithm. 5. The method of claim 1, wherein the correlating in (c) comprises performing a rank-based statistical algorithm. 6. The method of claim 1, wherein the features comprise SNPs. 7. The method of claim 1, wherein mapping each feature in the input feature set comprises mapping one or more feature identifiers associated with each feature in the input feature set to one or more mapping identifiers. 8. The method of claim 7, wherein at least some features are mapped based on established relationships between features and feature identifiers. 9. The method of claim 1, wherein at least some features are mapped based on genomic coordinates of the features. 10. The method of claim 1, wherein at least some features are mapped based on an indirect association of a feature to one or more pre-existing features in the knowledge base. 11. The method of claim 1, further comprising tagging the input feature set with terms in the knowledge base that are related to the input feature set. 12. The method of claim 11, wherein tagging comprises automatically associating terms in the knowledge base to the input feature set by one or more processors of the computer system. 13. The method of claim 1, further comprising, prior to (a), ranking the features in the input feature set by one or more processors of the computer system. 14. A computer program product comprising a machine readable non-transitory medium on which is provided program instructions for providing data to a knowledge base of scientific information, the knowledge base including a plurality of pre-existing feature sets and pre-existing feature groups, the pre-existing feature sets each including a list of features and associated statistics indicating one or more of: differential expression of said features, abundance of said features, responses of said features to a treatment or stimulus, and effects of said features on biological systems, wherein the features are biological or chemical entities or units of biological or chemical information, the program instructions comprising: (a) code for receiving an input feature set, the input feature set including a list of features and associated statistics indicating one or more of: differential expression, abundance of said features, responses of said features to a treatment or stimulus, and effects of said features on biological systems;(b) code for mapping each feature in the input feature set to one or more mapping identifiers in the knowledge base, wherein each mapping identifier represents a globally unique feature in the knowledge base;(c) code for correlating the input feature set against a plurality or all of the pre-existing feature sets in the knowledge base;(d) code for correlating the input feature set against one or more pre-existing feature groups in the knowledge base, wherein the feature groups provide collections of features having structural and/or functional characteristics in common; and(e) code for storing correlation information generated in (c) and (d) for use in responding to queries involving feature groups or feature sets. 15. The computer program product of claim 14, wherein the code correlating in (c) or (d) comprises code for a rank-based statistical algorithm. 16. The computer program product of claim 14, further comprising code for tagging the input feature set with terms in the knowledge base that are related to the feature set. 17. The computer program product of claim 14, further comprising code for ranking the features in the input feature set. 18. The method of claim 1 wherein the feature groups include at least one group of genes and/or proteins that all belong to the same signaling pathway. 19. The method of claim 1 wherein the feature groups provide collections of features having structural and/or functional characteristics in common without associated statistics. 20. A computer-implemented method of providing data to a knowledge base of scientific information, the knowledge base including a plurality of pre-existing feature sets and pre-existing feature groups, the pre-existing feature sets each including a list of features and information about one or more of: differential expression of said features, abundance of said features, responses of said features to a treatment or stimulus, and effects of said features on biological systems, and each feature group providing a list of related features without associated statistics, wherein the features are biological or chemical entities or units of biological or chemical information, the method comprising: (a) correlating by one or more processors of a computer system an input feature set against a plurality pre-existing feature sets in the knowledge base, the input feature set including a list of features and information about one or more of: differential expression of said features, abundance of said features, responses of said features to a treatment or stimulus, and effects of said features on biological systems;(b) correlating by one or more processors of a computer system the input feature set against a plurality of pre-existing feature groups in the knowledge base, and(c) storing on one or more storage devices correlation information generated in (a) and (b) for use in responding to queries involving feature groups or feature sets. 21. The method of claim 20, wherein the input feature set includes gene expression profile information of a patient. 22. The method of claim 20, wherein the features in the input feature set are SNPs. 23. The method of claim 20, wherein the features in the input feature set are chemical compounds. 24. The method of claim 20, wherein the features in the input feature set are proteins. 25. The method of claim 20, wherein the features in the input feature set are genes. 26. The method of claim 20, wherein (b) comprises correlating the input feature set to a feature group having gene features. 27. The method of claim 20, wherein at least one of (b) and (c) comprises performing a rank-based statistical algorithm. 28. A computer program product comprising a machine readable non-transitory medium on which is provided program instructions for providing data to a knowledge base of scientific information, the knowledge base including a plurality of pre-existing feature sets and pre-existing feature groups, the pre-existing feature sets each including a list of features and information about one or more of: differential expression of said features, abundance of said features, responses of said features to a treatment or stimulus, and effects of said features on biological systems, and each feature group providing a list of related features without associated statistics, wherein the features are biological or chemical entities or units of biological or chemical information, the program instructions comprising: (a) code for correlating an input feature set against a plurality pre-existing feature sets in the knowledge base, the input feature set including a list of features and information about one or more of: differential expression of said features, abundance of said features, responses of said features to a treatment or stimulus, and effects of said features on biological systems;(b) code for correlating the input feature set against a plurality of pre-existing feature groups in the knowledge base, and(c) code for storing correlation information generated in (a) and (b) for use in responding to queries involving feature groups or feature sets. 29. The computer program product of claim 28, wherein the input feature set includes gene expression profile information of a patient. 30. The computer program product of claim 28, wherein the features in the input feature set are SNPs. 31. The computer program product of claim 28, wherein the features in the input feature set are chemical compounds. 32. The computer program product of claim 28, wherein the features in the input feature set are proteins. 33. The computer program product of claim 28, wherein the features in the input feature set are genes. 34. The computer program product of claim 28, wherein (b) comprises code for correlating the input feature set to a feature group having gene features. 35. The computer program product of claim 28, wherein at least one of (b) and (c) comprises code for performing a rank-based statistical algorithm. 36. The method of claim of claim 1, wherein the features of the input feature set and one or more of the pre-existing feature sets are units of genetic or phenotypic information. 37. The method of claim of claim 20, wherein the features of the input feature set and one or more of the pre-existing feature sets are units of genetic or phenotypic information. 38. The computer program product of claim of claim 14, wherein the features of the input feature set and one or more of the pre-existing feature sets are units of genetic or phenotypic information. 39. The computer program product of claim of claim 29, wherein the features of the input feature set and one or more of the pre-existing feature sets are units of genetic or phenotypic information.

이 특허에 인용된 특허 (12)

Kincaid,Robert, Biotechnology information naming system.
상세보기
Maroko Peter R. (1765 Garwood Dr. Cherry Hill NJ 08003), Compositions and method of treatment for improving circulatory performance.
상세보기
Papierniak Karen A. ; Thaisz James E. ; Diwekar Anjali M. ; Chiang Luo-Jen, Computer architecture and method for collecting, analyzing and/or transforming internet and/or electronic commerce data for storage into a data storage area.
상세보기
Gong, Yihong; Liu, Xin, Creating audio-centric, image-centric, and integrated audio-visual summaries.
상세보기
Paul K. Wolber, Multidentate arrays.
상세보기
Qu,Kunbin; Lin,Nan; Lu,Yanmei; Payan,Donald G., Multidimensional biodata integration and relationship inference.
상세보기
Quake, Stephen R.; van Dam, R. Michael; Brody, James P.; Shafee, Rebecca, Non-metric tool for predicting gene relationships from expression data.
상세보기
Gardner,Steve, Ontology-based information management system and method.
상세보기
Blumberg,Brad W.; Blumberg,Eric M., Position-based information access device and method of searching.
상세보기
Timothy J. Maslyn ; Srikar D. Rao ; Benjamin G. Cocks ; Rachel J. Cheng ; Timothy M. Engler ; James R. Kerr ; Steven M. Lassagne ; Jeffrey J. Seilhamer, System and method for generating, analyzing and storing normalized expression datasets from raw expression datasets derived from microarray includes nucleic acid probe sequences.
상세보기
Axaopoulos Jack ; Carpenter ; Jr. James F. ; Peckover Douglas L., System and method for storing and searching buy and sell information of a marketplace.
상세보기
Singarajan,Kumar; Dahl,Michael A; Aldrich,Gary L; George,Robert A, Virtual manufacturing system.
상세보기

이 특허를 인용한 특허 (4)

Kupershmidt, Ilya; Su, Qiaojuan Jane; Liu, Qingdi; Alag, Satnam; Sundaresh, Suman, Categorization and filtering of scientific data.
상세보기
Kupershmidt, Ilya; Su, Qiaojuan Jane, Method and systems for querying sequence-centric scientific information.
상세보기
Kupershmidt, Ilya; Su, Qiaojuan Jane, Sequence-centric scientific information management.
상세보기
Kupershmidt, Ilya; Su, Qiaojuan Jane, Sequence-centric scientific information management.
상세보기

IPC	Description
A	생활필수품
A62	인명구조; 소방(사다리 E06C)
A62B	인명구조용의 기구, 장치 또는 방법(특히 의료용에 사용되는 밸브 A61M 39/00; 특히 물에서 쓰이는 인명구조 장치 또는 방법 B63C 9/00; 잠수장비 B63C 11/00; 특히 항공기에 쓰는 것, 예. 낙하산, 투출좌석 B64D; 특히 광산에서 쓰이는 구조장치 E21F 11/00)
A62B-1/08	.. 윈치 또는 풀리에 제동기구가 있는 것

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표IPC 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 공고번호, 공고일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표출원인, 출원인국적, 출원인주소, 발명자, 발명자E, 발명자코드, 발명자주소, 발명자 우편번호, 발명자국적, 대표IPC, IPC코드, 요약, 미국특허분류, 대리인주소, 대리인코드, 대리인(한글), 대리인(영문), 국제공개일자, 국제공개번호, 국제출원일자, 국제출원번호, 우선권, 우선권주장일, 우선권국가, 우선권출원번호, 원출원일자, 원출원번호, 지정국, Citing Patents, Cited Patents
저장형식	Text(ASCII format) Excel format PIAS분석(.xls)
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

System and method for scientific information knowledge management 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

이 특허에 인용된 특허 (12)

이 특허를 인용한 특허 (4)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

이 특허와 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

System and method for scientific information knowledge management 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

전체(0) 논문(0) 특허(0) 보고서(0)

전체(0) 논문(0) 특허(0) 보고서(0)

이 특허에 인용된 특허 (12)

이 특허를 인용한 특허 (4)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

이 특허와 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트