System for providing data analysis services using a support vector machine for processing data received from a remote source
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
G06F-015/18
G06N-003/00
G06N-003/12
출원번호
US-0814431
(2010-06-11)
등록번호
US-8275723
(2012-09-25)
발명자
/ 주소
Barnhill, Stephen D.
Guyon, Isabelle
Weston, Jason
출원인 / 주소
Health Discovery Corporation
대리인 / 주소
Musick, Eleanor M.
인용정보
피인용 횟수 :
1인용 특허 :
39
초록▼
A network-based system is provided for performing data analysis services using a support vector machine for analyzing data received from a remote user connected to the network. The user transmits a data set to be analyzed and along with an account identifier that allows the analysis service provider
A network-based system is provided for performing data analysis services using a support vector machine for analyzing data received from a remote user connected to the network. The user transmits a data set to be analyzed and along with an account identifier that allows the analysis service provider to collect payment for the processing services. Once payment has been confirmed, the service provider's server transmits the analysis results to the remote user.
대표청구항▼
1. A computer system for providing data analysis services accessible via a distributed network, the system comprising: a remote source in communication with the distributed network;a server in communication with the distributed network for receiving a data set having unknown patterns and an account
1. A computer system for providing data analysis services accessible via a distributed network, the system comprising: a remote source in communication with the distributed network;a server in communication with the distributed network for receiving a data set having unknown patterns and an account identifier from the remote source, wherein the server is further operable for communicating over the distributed network with an institution associated with the account identifier in order to provide for payment for the data analysis services from an account identified by the account identifier;one or more storage devices in communication with the server for storing a training and testing dataset having known patterns and the data set from the remote source;a processor in communication with the one or more storage devices for executing a support vector machine, the processor further operable for: pre-processing the training and testing data to clean, transform or expand the data;training and testing the support vector machine until an optimal solution is achieved;collecting the data set from the remote source;pre-processing the data set from the remote source;inputting the pre-processed data set from the remote source into the tested and trained support vector machine to produce an output comprising a recognized pattern within the data set from the remote source; andtransmitting the recognized pattern to the server;wherein the server is further operable for transmitting the recognized pattern to the remote source or another remote source after ensuring that payment from the account has been secured. 2. The system of claim 1, wherein the financial institution is a bank. 3. The system of claim 1, wherein the financial institution is a credit or debit card company. 4. The system of claim 1, wherein the data set from the remote source comprises inventory and audit data. 5. The system of claim 1, wherein the data set from the remote source comprises insurance data. 6. The system of claim 1, wherein the data set from the remote source comprises stock market or commodity market data. 7. The system of claim 1, wherein the data set from the remote source comprises medical data. 8. The system of claim 7, wherein the remote source comprises a medical laboratory. 9. The system of claim 7, wherein the medical data comprises a mammogram. 10. The system of claim 7, wherein the medical data comprises genomic or proteomic data. 11. The system of claim 7, wherein the medical data comprises clinical case information. 12. The system of claim 7, wherein the medical data comprises data from clinical samples. 13. The system of claim 1, wherein the processor is further operable for executing a feature selection algorithm on the training and testing dataset to identify a subset of determinative features within a large number of features that describe the training and testing dataset and generating a ranked listing of determinative features. 14. The system of claim 13, wherein the feature selection algorithm is recursive feature elimination. 15. The system of claim 1, wherein the distributed network is the Internet. 16. The system of claim 1, wherein the distributed network is an on-demand communications link. 17. A system for providing data analysis services using a support vector machine for analyzing data received from a remote source, the system comprising: a server in communication with a distributed network for receiving a data set and an account identifier from a remote source, the remote source also in communication with the distributed network, wherein the server is further operable for communicating with an institution for conducting a financial transaction in order to receive funds from an account identified by the account identifier;one or more storage devices in communication with the server for storing the dataset;a processor for executing a support vector machine, the processor further operable for: training and testing the support vector machine using a training dataset to provide a trained support vector machine;collecting the dataset received from the remote source;pre-processing the dataset from the remote source;inputting the dataset from the remote source into the trained support vector machine to produce an output comprising a recognized pattern within the dataset from the remote source; andtransmitting the output to the remote source or another remote source after ensuring that funds from the account have been secured. 18. The system of claim 17, wherein the processor is further operable for post-processing the output to generate an alphanumerical classifier corresponding to the recognized pattern, wherein the alphanumerical classifier is transmitted to the remote source after ensuring that funds from the account have been secured. 19. The system of claim 17, wherein the institution for conducting a financial transaction is a bank. 20. The system of claim 17, wherein the financial institution is a credit or debit card company. 21. The system of claim 17, wherein the data set from the remote source comprises inventory and audit data. 22. The system of claim 17, wherein the data set from the remote source comprises insurance data. 23. The system of claim 17, wherein the data set from the remote source comprises stock market or commodity market data. 24. The system of claim 17, wherein the data set from the remote source comprises medical data. 25. The system of claim 24, wherein the remote source comprises a medical laboratory. 26. The system of claim 24, wherein the medical data comprises a mammogram. 27. The system of claim 24, wherein the medical data comprises genomic or proteomic data. 28. The system of claim 24, wherein the medical data comprises clinical case information. 29. The system of claim 24, wherein the medical data comprises data from clinical samples. 30. The system of claim 17, wherein the processor is further operable for executing a feature selection algorithm on the training dataset to identify a subset of determinative features within a large number of features that describe the training and testing dataset and generating a ranked listing of determinative features. 31. The system of claim 30, wherein the feature selection algorithm is recursive feature elimination. 32. The system of claim 17, wherein the distributed network is the Internet. 33. The system of claim 17, wherein the distributed network is an on-demand communications link. 34. A system for providing data analysis services using a support vector machine for analyzing data received from a remote source, the system comprising: a server in communication with a distributed network for receiving a data set and an account identifier from a remote source, the remote source also in communication with the distributed network, wherein the server is further operable for communicating with an institution for conducting a financial transaction in order to receive funds from an account identified by the account identifier;one or more storage devices in communication with the server for storing the dataset;a processor for executing, training and testing a support vector machine, the processor further operable for: collecting the dataset received from the remote source;pre-processing the dataset from the remote source;inputting the dataset from the remote source into a trained support vector machine to produce an output comprising a recognized pattern within the dataset from the remote source; andtransmitting the output to the remote source or another remote source after confirming that funds from the account have been secured. 35. The system of claim 34, wherein the processor is further operable for post-processing the output to generate an alphanumerical classifier corresponding to the recognized pattern, wherein the alphanumerical classifier is transmitted to the remote source after ensuring that funds from the account have been secured. 36. The system of claim 34, wherein the institution for conducting a financial transaction is a bank. 37. The system of claim 34, wherein the financial institution is a credit or debit card company. 38. The system of claim 34, wherein the data set from the remote source comprises inventory and audit data. 39. The system of claim 34, wherein the data set from the remote source comprises insurance data. 40. The system of claim 34, wherein the data set from the remote source comprises stock market or commodity market data. 41. The system of claim 34, wherein the data set from the remote source comprises medical data. 42. The system of claim 41, wherein the remote source comprises a medical laboratory. 43. The system of claim 41, wherein the medical data comprises a mammogram. 44. The system of claim 41, wherein the medical data comprises genomic or proteomic data. 45. The system of claim 41, wherein the medical data comprises clinical case information. 46. The system of claim 41, wherein the medical data comprises data from clinical samples. 47. The system of claim 34, wherein the processor is further operable for executing a feature selection algorithm to identify a subset of determinative features within a large number of features that describe the dataset used to train the support vector machine and generating a ranked listing of determinative features for separating the dataset according to the recognized pattern. 48. The system of claim 47, wherein the feature selection algorithm is recursive feature elimination. 49. The system of claim 34, wherein the distributed network is the Internet. 50. The system of claim 34, wherein the distributed network is an on-demand communications link.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (39)
Hull, Jonathan J.; Erol, Berna; Graham, Jamey; Hart, Peter E.; Piersol, Kurt, Authoring tools using a mixed media environment.
Guyon, Isabelle; Reiss, Edward P.; Doursat, René; Weston, Jason Aaron Edward; Lewis, David D., Data mining platform for bioinformatics and other knowledge discovery.
Kamath, Vivek P.; Brown, Craig S.; Pence, John B.; Shekaran, M. Chandra; Lorimor, Thomas G.; Firman, Thomas R.; Gentile, Elizabeth J.; Toussaint, Keith M., Extended file system.
Schuetze,Hinrich H.; Yu,Chia Hao; Velipasaoglu,Omer Emre; Stukov,Stan, System and method for processing semi-structured business data using selected template designs.
Teper Jeffrey A. ; Koneru Sudheer ; Mangione Gordon ; Balaz Rudolph ; Contorer Aaron M. ; Chao Lucy, System and method for providing trusted brokering services over a distributed network.
Nudd, Geoffrey H.; Weyl, Stephen; Graham, Jamey; Erol, Berna; Hart, Peter E.; Hull, Jonathan J., System and method for using individualized mixed document.
Hull, Jonathan J.; Erol, Berna; Graham, Jamey; Hart, Peter E.; Lee, Dar-Shyang; Piersol, Kurt Wesley, System and methods for creation and use of a mixed media environment.
Barnhill, Stephen; Guyon, Isabelle; Weston, Jason, System for providing data analysis services using a support vector machine for processing data received from a remote source.
Horvitz Eric ; Heckerman David E. ; Dumais Susan T. ; Sahami Mehran ; Platt John C., Technique which utilizes a probabilistic classifier to detect "junk" e-mail by automatically updating a training and re-training the classifier based on the updated training set.
Hull, Jonathan J.; Erol, Berna; Graham, Jamey; Hart, Peter E.; Lee, Dar-Shyang; Piersol, Kurt, Triggering applications based on a captured text in a mixed media environment.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.