Combinatorial approach for supervised neural network learning
IPC classification
Country / Type
United States (US) Patent
Granted
International Patent Classification (IPC, 7th edition)
G06F-015/18
G06F-003/00
G06E-001/00
Application number
US-0942155
(2001-08-29)
Inventor / Address
Shetty, Ravindra K.
Thyagarajan, Venkatesan
Applicant / Address
Honeywell International, Inc.
Agent / Address
Schwegman, Lundberg, Woessner &
Citation information
Times cited: 1
Patents cited: 8
Abstract
A technique for machine learning, such as supervised artificial neural network learning, includes receiving data, checking the dimensionality of the read data, and reducing that dimensionality using the Principal Component Analysis methodology to enhance machine learning performance. The technique further includes specifying the neural network architecture and initializing weights to establish a connection between the read data, including the reduced dimensionality, and the predicted values. The technique also includes performing supervised machine learning using the specified neural network architecture, the initialized weights, and the read data including the reduced dimensionality to predict values. The predicted values are then compared to a normalized system error threshold value, and the initialized weights are revised based on the outcome of the comparison to generate a learnt neural network having a reduced error in weight space. The learnt neural network is validated using known values and is then used for predicting values.
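Read as an algorithm, the abstract describes a PCA-plus-backpropagation loop: reduce dimensionality when the attribute count warrants it, initialize random weights, train until the normalized system error falls below a threshold. A minimal sketch follows; the hidden-layer size, learning rate, sigmoid activation, and 10-component cap are illustrative assumptions, not details taken from the patent:

```python
import numpy as np

def train_supervised(X, y, attr_threshold=25, err_thresh=1e-3, lr=0.5, max_iter=5000):
    """Sketch of the claimed flow: PCA reduction when the number of
    attributes meets the threshold, then supervised learning with random
    initial weights, revised by gradient descent until the normalized
    system error is at or below the threshold value."""
    # Dimensionality check and reduction (claims 4-5 analogue).
    if X.shape[1] >= attr_threshold:
        Xc = X - X.mean(axis=0)
        _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
        X = Xc @ Vt[:10].T  # keep the leading principal components (assumed cap)

    rng = np.random.default_rng(0)
    n_in, n_hid = X.shape[1], 8
    W1 = rng.uniform(-0.5, 0.5, (n_in, n_hid))  # random weights in [-0.5, 0.5]
    W2 = rng.uniform(-0.5, 0.5, (n_hid, 1))

    for _ in range(max_iter):
        # Forward pass: sigmoid hidden layer, linear output.
        H = 1.0 / (1.0 + np.exp(-X @ W1))
        pred = H @ W2
        err = y.reshape(-1, 1) - pred
        nse = 0.5 * np.mean(err ** 2)   # normalized system error
        if nse <= err_thresh:           # compare to the threshold value
            break
        # Revise weights by gradient descent (claim 13 analogue).
        W2 += lr * H.T @ err / len(X)
        dH = (err @ W2.T) * H * (1.0 - H)
        W1 += lr * X.T @ dH / len(X)
    return W1, W2, nse
```

Validation against held-out known values, as the abstract's final step describes, would simply rerun the forward pass on data withheld from training and compare the predicted values against the known ones.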
Representative claims
1. A computer-implemented method for supervised artificial neural network machine learning, comprising: reducing the dimensionality of the received data to enhance machine learning performance based on the dimensionality; specifying the supervised neural network architecture; initializing weights to establish connection strengths between the received data and predicted values; performing supervised machine learning using the specified architecture, initialized weights, and the received data including the reduced dimensionality to predict values; and revising the initialized weights of the network based on a normalized system error threshold value to generate a learnt neural network having a reduced error in weight space.
2. The method of claim 1, wherein the data comprises: data selected from the group consisting of static data and real-time data.
3. The method of claim 2, wherein reducing the dimensionality of the received data to enhance machine learning performance based on the dimensionality further comprises: receiving data; checking dimensionality of the received data; and reducing the dimensionality of the received data to enhance machine learning performance based on the outcome of the checking.
4. The method of claim 3, wherein checking the dimensionality of the received data to improve performance during the machine learning further comprises: comparing the dimensionality of the received data to a threshold value; and reducing the dimensionality of the received data to increase machine learning performance based on the outcome of the comparison.
5. The method of claim 4, wherein reducing the dimensionality of the received data comprises: reducing the number of attributes in the received data using Principal Component Analysis.
6. The method of claim 4, wherein comparing the dimensionality of the received data to the threshold value comprises: comparing the number of attributes in the received data to the threshold value.
7. The method of claim 6, wherein the threshold value comprises: greater than or equal to 25 attributes.
8. The method of claim 2, further comprising: if the received data is static data, then reading a sample of the received static data using a predetermined window length; and if the received data is real-time data, then reading a sample of the received real-time data using a dynamically varying window of predetermined window length.
9. The method of claim 8, further comprising: if the received data is real-time data, then repeating the receiving of the real-time data using a dynamically varying window of predetermined window length.
10. The method of claim 2, wherein specifying the supervised neural network architecture comprises: specifying the learning parameters for the neural network selected from the group consisting of number of input nodes, number of hidden layers, number of nodes in each of the layers, number of nodes at the output layer, and learning rate; and allocating dynamic storage for updating the initialized weights and to store the trend between input and output nodes during each iteration using the specified neural network architecture.
11. The method of claim 2, wherein initializing the weights further comprises: initializing the weights using random weights.
12. The method of claim 2, wherein revising the initialized weights of the network based on a normalized system error threshold value further comprises: comparing the predicted values to a normalized system error threshold value; and revising the initialized weights of the neural network based on the outcome of the comparison to generate a learnt neural network having a reduced error in weight space.
13. The method of claim 12, wherein comparing the predicted values to the normalized system error threshold value further comprises: computing the normalized system error by using desired values and the predicted values to reduce error in the weight space using a gradient descent technique; and comparing the computed normalized system error with the normalized system error threshold value.
14. The method of claim 13, further comprising: repeating the performing and comparing steps until the computed normalized difference error is less than or equal to the normalized system error threshold value.
15. The method of claim 1, further comprising: using a Hessian matrix to enhance the learning rate of the neural network; and using a function approximation neighborhood technique to perturb the learning parameters of the neural network to further enhance the learning rate of the neural network.
16. The method of claim 1, further comprising: validating the learnt neural network to verify the reliability of the learnt neural network.
17. The method of claim 16, wherein validating the neural network further comprises: performing supervised learning using the learnt neural network to predict the values; computing accuracy of the predicted values by comparing the predicted values with the known values; comparing the computed accuracy with an accuracy value; and repeating the varying learning rate, performing, comparing, and validating steps based on the outcome of the comparison to further enhance the reliability of the learnt neural network.
18. The method of claim 16, further comprising: inputting unknown values into the validated neural network; and predicting the values by performing supervised learning on the validated neural network.
19.
A computer readable medium having computer-executable instructions for supervised artificial neural network learning, comprising: receiving data; checking dimensionality of the received data; reducing the dimensionality of the received data to enhance machine learning performance based on the outcome of the checking; specifying the supervised neural network architecture; initializing weights to establish connection strengths between the received data and predicted values; performing supervised machine learning using the specified architecture, initialized weights, and the received data including the reduced dimensionality to predict values; comparing the predicted values to a normalized system error threshold value; and revising the initialized weights of the neural network based on the outcome of the comparison to generate a learnt neural network having reduced error in weight space.
20. The computer readable medium of claim 19, wherein the data comprises: data selected from the group consisting of static data and real-time data.
21. The computer readable medium of claim 20, wherein checking the dimensionality of the received data to improve performance during the machine learning further comprises: comparing the dimensionality of the received data to a threshold value; and reducing the dimensionality of the received data to increase machine learning performance based on the outcome of the comparison.
22. The computer readable medium of claim 21, wherein reducing the dimensionality of the received data comprises: reducing the number of attributes in the received data using Principal Component Analysis.
23. The computer readable medium of claim 21, wherein comparing the dimensionality of the received data to the threshold value comprises: comparing the number of attributes in the received data to the threshold value.
24. The computer readable medium of claim 23, wherein the threshold value comprises: greater than or equal to 25 attributes.
25. The computer readable medium of claim 20, further comprising: if the received data is static data, then reading a sample of the received static data using a predetermined window length; and if the received data is real-time data, then reading a sample of the received real-time data using a dynamically varying window of predetermined window length.
26. The computer readable medium of claim 25, further comprising: if the received data is real-time data, then repeating the receiving of the real-time data using a dynamically varying window of predetermined window length.
27. The computer readable medium of claim 20, wherein specifying the supervised neural network architecture comprises: specifying the learning parameters for the neural network selected from the group consisting of number of input nodes, number of hidden layers, number of nodes in each of the layers, number of nodes at the output layer, and learning rate; and allocating dynamic storage for updating the initialized weights and to store the trend between input and output nodes during each iteration using the specified neural network architecture.
28. The computer readable medium of claim 20, wherein initializing the weights further comprises: initializing the weights using random weights.
29. The computer readable medium of claim 20, wherein comparing the predicted values to the normalized system error threshold value further comprises: computing the normalized system error by using desired values and the predicted values to reduce error in the weight space using a gradient descent technique; and comparing the computed normalized system error with the normalized system error threshold value.
30. The computer readable medium of claim 29, further comprising: repeating the performing and comparing steps until the computed normalized difference error is less than or equal to the normalized system error threshold value.
31. The computer readable medium of claim 19, further comprising: using a Hessian matrix to enhance learning rate of the neural network; and using a function approximation neighborhood technique to perturb the learning parameters of the neural network to further enhance the learning rate of the neural network.
32. The computer readable medium of claim 19, further comprising: validating the learnt neural network to verify the reliability of the learnt neural network.
33. The computer readable medium of claim 32, wherein validating the neural network further comprises: performing supervised learning using the learnt neural network to predict the values; computing accuracy of the predicted values by comparing the predicted values with the known values; comparing the computed accuracy with an accuracy value; and repeating the varying learning rate, performing, and comparing steps based on the outcome of the comparison.
34. The computer readable medium of claim 32, further comprising: using the validated neural network for predicting the values.
35.
A computer system for machine learning in a sparse data environment, comprising: a storage device; an output device; and a processor programmed to repeatedly perform a method comprising: receiving the data; checking dimensionality of the received data; reducing the dimensionality of the received data to enhance machine learning performance based on the outcome of the checking; specifying the supervised neural network architecture; initializing weights to establish connection strengths between the received data and predicted values; performing supervised machine learning using the specified architecture, initialized weights, and the received data including the reduced dimensionality to predict the values; comparing the predicted values to a normalized system error threshold value; and revising the initialized weights of the neural network based on the outcome of the comparison to generate a learnt neural network having reduced error in weight space.
36. The system of claim 35, wherein the data comprises: data selected from the group consisting of static data and real-time data.
37. The system of claim 36, wherein checking the dimensionality of the received data to improve performance during the machine learning further comprises: comparing the dimensionality of the received data to a threshold value; and reducing the dimensionality of the received data to increase machine learning performance based on the outcome of the comparison.
38. The system of claim 37, wherein reducing the dimensionality of the received data comprises: reducing the number of attributes in the received data using Principal Component Analysis.
39. The system of claim 37, wherein comparing the dimensionality of the received data to the threshold value comprises: comparing the number of attributes in the received data to the threshold value.
40. The system of claim 39, wherein the threshold value comprises: greater than or equal to 25 attributes.
41. The system of claim 36, further comprising: if the received data is static data, then reading a sample of the received static data using a predetermined window length; and if the received data is real-time data, then reading a sample of the received real-time data using a dynamically varying window of predetermined window length.
42. The system of claim 41, further comprising: if the received data is real-time data, then repeating the receiving of the real-time data using a dynamically varying window of predetermined window length.
43. The system of claim 36, wherein specifying the supervised neural network architecture comprises: specifying the learning parameters for the neural network selected from the group consisting of number of input nodes, number of hidden layers, number of nodes in each of the layers, number of nodes at the output layer, and learning rate; and allocating dynamic storage for updating the initialized weights and to store the trend between input and output nodes during each iteration using the specified neural network architecture.
44. The system of claim 36, wherein initializing the weights further comprises: initializing the weights using random weights.
45. The system of claim 36, wherein comparing the predicted values to the normalized system error threshold value further comprises: computing the normalized system error by using desired values and the predicted values to reduce error in the weight space using a gradient descent technique; and comparing the computed normalized system error with the normalized system error threshold value.
46. The system of claim 45, further comprising: repeating the performing and comparing steps until the computed normalized difference error is less than or equal to the normalized system error threshold value.
47. The system of claim 35, further comprising: using a Hessian matrix to enhance learning rate of the neural network; and using a function approximation neighborhood technique to perturb the learning parameters of the neural network to further enhance the learning rate of the neural network.
48. The system of claim 35, further comprising: validating the learnt neural network to verify the reliability of the learnt neural network.
49. The system of claim 48, wherein validating the neural network further comprises: performing supervised learning using the learnt neural network to predict values; computing accuracy of the predicted values by comparing the predicted values with the known values; comparing the computed accuracy with an accuracy value; and repeating the performing and comparing steps based on the outcome of the comparison.
50. The system of claim 48, further comprising: using the validated neural network for predicting the values.
51. A computer-implemented system for supervised artificial neural network learning, comprising: a receive module to receive data; a reading module to read the received data; an analyzer to check dimensionality of the read data and reduce the dimensionality of the read data to enhance machine learning performance based on the outcome of the checking, wherein the analyzer specifies the neural network architecture and initializes weights to establish connection strengths between the read data and predicted values obtained using the neural network, and wherein the analyzer performs supervised learning using the specified architecture, initialized weights, and the read data including the reduced dimensionality to predict the values; and a comparator, coupled to the analyzer, that compares the predicted values to a normalized system error threshold value, wherein the analyzer revises the initialized weights of the neural network based on the outcome of the comparison to generate a learnt neural network having a reduced error in weight space.
52.
The system of claim 51, further comprising: a database coupled to the receive module to receive and store data.
53. The system of claim 52, wherein the data comprises: data selected from the group consisting of static data and real-time data.
54. The system of claim 53, wherein the comparator compares the dimensionality of the read data to a threshold value, and the analyzer reduces the dimensionality of the read data to increase neural network learning performance based on the outcome of the comparison by the comparator.
55. The system of claim 54, wherein the threshold value is greater than or equal to 25.
56. The system of claim 55, wherein the comparator compares the number of attributes in the read data to the threshold value.
57. The system of claim 54, wherein the analyzer reduces the dimensionality of the read data by reducing the number of attributes in the read data using Principal Component Analysis.
58. The system of claim 57, wherein the analyzer reduces the dimensionality of the read data by forming a Covariance Matrix using the equation C(n×n) = X^T * X, wherein the received data is inputted in matrix form (say X(m×n)); eigen values and eigen vectors are computed from the formed Covariance Matrix using the equation (C − λI)Ui = 0 (1), wherein λ = (λ1, λ2, ..., λn) are the roots of the equation, solving equation (1) gives the eigen values, and Ui = (ui1, ui2, ..., uin) gives the corresponding eigen vectors; principal components are selected using the cumulative eigenvalue cutoff (λ1 + λ2 + ... + λp) / (λ1 + λ2 + ... + λn) ≥ β, where β is the cutoff percentage (≈85%); and features are further selected to reduce the dimensionality of the received data.
59. The system of claim 54, wherein the reading module reads a sample of the received data using a predetermined window length.
60. The system of claim 59, wherein the reading module reads the sample of the received data using a predetermined window length when the read data is static data, and the reading module reads a sample of the received data using a dynamically varying window of predetermined length when the read data is real-time data.
61. The system of claim 60, wherein the reading module repeats the reading of the sample of the received data using a dynamically varying window of predetermined window length when the received data is real-time data.
62. The system of claim 51, wherein the analyzer specifies the learning parameters for the neural network using the learning parameters selected from the group consisting of number of input nodes, number of hidden layers, number of nodes in each of the layers, number of nodes at the output layer, learning rate, and dynamic storage for updating the initialized weights.
63. The system of claim 51, wherein the analyzer initializes weights using random values approximately in the range of −0.5 to 0.5.
64. The system of claim 51, wherein the analyzer computes the normalized system error by using desired values and the predicted values, and the comparator compares the computed normalized system error with the normalized system error threshold value to reduce the error in the weight space using a gradient descent technique based on the outcome of the comparison.
65. The system of claim 64, wherein the gradient descent technique uses the equation Wij(n+1) = Wij(n) + η(δj·oi), wherein Wij are the weights in a space of i rows and j columns, oi is the actual output, δj is the desired output, and η is the learning rate.
66. The system of claim 65, wherein the analyzer enhances the learning rate of the neural network during updating of weights using a Hessian Matrix, wherein H[i, k] are diagonal elements of second-order derivatives, wherein i, j, and k are an architecture-dependent number of nodes and hidden layers, and wherein G[k] is a gradient of the previous iteration error with respect to the weight space.
67. The system of claim 66, wherein the analyzer further enhances the learning rate of the neural network using a function approximation neighborhood technique to update weights using the equation W(t+1) = f(n, m, δr(t)), wherein n is a number of nodes in the input layer, m is a number of nodes in the next layer, and δr(t) is a parameter based on a function of time.
68. The system of claim 67, further comprising: repeating the varying learning rate, performing, and comparing to reduce the normalized system error.
69. The system of claim 51, wherein the analyzer validates the neural network to verify reliability of the learnt neural network by performing supervised learning using the learnt neural network to predict values, wherein the analyzer computes the accuracy of the predicted values by comparing the predicted values with known values, wherein the comparator compares the computed accuracy with an accuracy value, and the analyzer repeats the supervised learning based on the outcome of the comparison by the comparator.
70. The system of claim 69, further comprising: repeating the varying of the learning rate of the neural network using techniques selected from the group consisting of Hessian matrix and function approximation neighborhood.
71. The system of claim 70, further comprising: repeating the performing and validating of the neural network to enhance the reliability of the learnt neural network.
72.
The system of claim 71, further comprising: an output module coupled to the analyzer to use the validated neural network to predict values.
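Claim 58's Principal Component Analysis equations are only partly legible in this record; the covariance matrix and eigen equation survive, while the component-selection rule appears to be the standard cumulative eigenvalue share compared against the cutoff β. A sketch under that reading (the mean-centering step and the β-selection rule as written here are assumptions):

```python
import numpy as np

def principal_components(X, beta=0.85):
    """Claim-58 style reduction: form the covariance matrix C = X^T X,
    solve (C - lambda*I)U = 0 for eigenvalues and eigenvectors, and keep
    the leading components whose cumulative eigenvalue share reaches beta."""
    Xc = X - X.mean(axis=0)             # center the m x n data matrix (assumed)
    C = Xc.T @ Xc                       # covariance matrix, up to a 1/m factor
    lam, U = np.linalg.eigh(C)          # eigh: ascending eigenvalues of symmetric C
    order = np.argsort(lam)[::-1]       # reorder descending
    lam, U = lam[order], U[:, order]
    share = np.cumsum(lam) / lam.sum()  # cumulative eigenvalue share
    k = int(np.searchsorted(share, beta)) + 1  # smallest k with share >= beta
    return Xc @ U[:, :k], lam, k        # projected data, spectrum, dimension kept
```

With β ≈ 85% as the claim suggests, attributes that are near-linear combinations of others contribute negligible eigenvalue share and are dropped, which is the mechanism the claims rely on for reducing data with 25 or more attributes.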
Patents cited by this patent (8)
Francisco Jose Ayala, Adaptive neural learning system.
Dimitris K. Agrafiotis ; Roger F. Bone ; Francis R. Salemme ; Richard M. Soll, System, method and computer program product for identifying chemical compounds having desired properties.