Methods, systems, and apparatus, including computer programs encoded on one or more computer storage devices, for training a predictive model. In one aspect, a method includes receiving over a network predictive modeling training data from a client computing system. The training data and multiple tr
Methods, systems, and apparatus, including computer programs encoded on one or more computer storage devices, for training a predictive model. In one aspect, a method includes receiving over a network predictive modeling training data from a client computing system. The training data and multiple training functions obtained from a repository of training functions are used to train multiple predictive models. A score is generated for each of the trained predictive models, where each score represents an estimation of the effectiveness of the respective trained predictive model. A first trained predictive model is selected from among the trained predictive models based on the generated scores. Access to the first trained predictive model is provided over the network.
대표청구항▼
1. A computer-implemented method comprising: training each of a plurality of predictive models using training data, wherein the predictive models include two or more predictive models of a same type that are trained with different combinations of features of the training data;generating, for each of
1. A computer-implemented method comprising: training each of a plurality of predictive models using training data, wherein the predictive models include two or more predictive models of a same type that are trained with different combinations of features of the training data;generating, for each of the plurality of trained predictive models, a respective score that represents an estimation of an effectiveness of the respective trained predictive model;receiving a request for a prediction that includes input data from a client system;in response to receiving the request for the prediction, selecting a first subset of the plurality of trained predictive models based on the respective scores of the trained predictive models in the first subset, wherein the plurality of trained predictive models includes the first subset and a second subset, each subset comprises at least one trained predictive model, the first subset and the second subset are disjoint sets, and the predictive models in the first subset have higher respective scores than predictive models that were not selected;obtaining a respective predictive output from only each of the selected predictive models in the first subset based on the request and using the input data;combining the predictive outputs to generate a result; andproviding the result to the client system. 2. The method of claim 1 wherein training each of the plurality of predictive models using the training data further comprising: partitioning the training data into k partitions where k is an integer greater than 1;performing k-fold cross-validation of the predictive model using the k partitions; andusing a cross-validation score for the predictive model as the respective score for the predictive model. 3. The method of claim 1 wherein the respective score is based on, at least, an estimate of resource usage costs for the trained model. 4. The method of claim 1 wherein training each of the plurality of predictive models using the training data comprises training the predictive model using all of the training data. 5. The method of claim 1, further comprising receiving the training data from a client computing system. 6. The method of claim 1, further comprising allocating a fee obtained for use of the predictive models in the first subset to one or more respective parties associated with the predictive models in the first subset. 7. The method of claim 1 wherein combining the predictive outputs to generate the result further comprises averaging the predictive outputs to generate the result. 8. The method of claim 1 wherein the plurality of trained predictive models includes a Naïve Bayes model, a Perceptron model, a Support Vector Machine model, a linear regression model, a k-nearest neighbor model, or a logistic regression model. 9. The method of claim 1, wherein: receiving the request for the prediction that includes the input data comprises receiving the request for a predictive output of a certain type; andselecting the first subset of the plurality of trained predictive models based on the respective scores of the trained predictive models in the first subset comprises selecting the first subset of the plurality of trained predictive models using the certain type identified in the request. 10. A system comprising: one or more computers; andone or more data storage devices having instructions stored thereon that, when executed by the computers, cause the computers to perform operations comprising: training each of a plurality of predictive models using training data, wherein the predictive models include two or more predictive models of a same type that are trained with different combinations of features of the training data;generating, for each of the plurality of trained predictive models, a respective score that represents an estimation of an effectiveness of the respective trained predictive model;receiving a request for a prediction that includes input data from a client system;in response to receiving the request for the prediction, selecting a first subset of the plurality of trained predictive models based on the respective scores of the trained predictive models in the first subset, wherein the plurality of trained predictive models includes the first subset and a second subset, each subset comprises at least one trained predictive model, the first subset and the second subset are disjoint sets, and the predictive models in the first subset have higher respective scores than predictive models that were not selected;obtaining a respective predictive output from only each of the selected predictive models in the first subset based on the request and using the input data;combining the predictive outputs to generate a result; andproviding the result to the client system. 11. The system of claim 10 wherein training each of the plurality of predictive models using the training data further comprising: partitioning the training data into k partitions where k is an integer greater than 1;performing k-fold cross-validation of the predictive model using the k partitions; andusing a cross-validation score for the predictive model as the respective score for the predictive model. 12. The system of claim 10 wherein the respective score is based on, at least, an estimate of resource usage costs for the trained model. 13. The system of claim 10 wherein training each of the plurality of predictive models using the training data comprises training the predictive model using all of the training data. 14. The system of claim 10, further comprising receiving the training data from a client computing system. 15. The system of claim 10, further comprising allocating a fee obtained for use of the predictive models in the first subset to one or more respective parties associated with the predictive models in the first subset. 16. The system of claim 10 wherein combining the predictive outputs to generate the result further comprises averaging the predictive outputs to generate the result. 17. The system of claim 10 wherein the plurality of trained predictive models includes a Naïve Bayes model, a Perceptron model, a Support Vector Machine model, a linear regression model, a k-nearest neighbor model, or a logistic regression model. 18. The system of claim 10, wherein: receiving the request for the prediction that includes the input data comprises receiving the request for a predictive output of a certain type; andselecting the first subset of the plurality of trained predictive models based on the respective scores of the trained predictive models in the first subset comprises selecting the first subset of the plurality of trained predictive models using the certain type identified in the request. 19. A non-transitory computer-readable storage device encoded with instructions which, when executed by one or more computers, cause the one or more computers to perform operations comprising: training each of a plurality of predictive models using training data, wherein the predictive models include two or more predictive models of a same type that are trained with different combinations of features of the training data;generating, for each of the plurality of trained predictive models, a respective score that represents an estimation of an effectiveness of the respective trained predictive model;receiving a request for a prediction that includes input data from a client system;in response to receiving the request for the prediction, selecting a first subset of the plurality of trained predictive models based on the respective scores of the trained predictive models in the first subset, wherein the plurality of trained predictive models includes the first subset and a second subset, each subset comprises at least one trained predictive model, the first subset and the second subset are disjoint sets, and the predictive models in the first subset have higher respective scores than predictive models that were not selected;obtaining a respective predictive output from only each of the selected predictive models in the first subset based on the request and using the input data;combining the predictive outputs to generate a result; andproviding the result to the client system. 20. The storage device of claim 19 wherein training each of the plurality of predictive models using the training data further comprising: partitioning the training data into k partitions where k is an integer greater than 1;performing k-fold cross-validation of the predictive model using the k partitions; andusing a cross-validation score for the predictive model as the respective score for the predictive model. 21. The storage device of claim 19 wherein the respective score is based on, at least, an estimate of resource usage costs for the trained model. 22. The storage device of claim 19 wherein training each of the plurality of predictive models using the training data comprises training the predictive model using all of the training data. 23. The storage device of claim 19, further comprising receiving the training data from a client computing storage device. 24. The storage device of claim 19, further comprising allocating a fee obtained for use of the predictive models in the first subset to one or more respective parties associated with the predictive models in the first subset. 25. The storage device of claim 19 wherein combining the predictive outputs to generate the result further comprises averaging the predictive outputs to generate the result. 26. The storage device of claim 19 wherein the plurality of trained predictive models includes a Naïve Bayes model, a Perceptron model, a Support Vector Machine model, a linear regression model, a k-nearest neighbor model, or a logistic regression model. 27. The storage device of claim 19, wherein: receiving the request for the prediction that includes the input data comprises receiving the request for a predictive output of a certain type; andselecting the first subset of the plurality of trained predictive models based on the respective scores of the trained predictive models in the first subset comprises selecting the first subset of the plurality of trained predictive models using the certain type identified in the request.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (87)
Hales Lynn B. ; Ynchausti Randy A. ; Foot ; Jr. Donald G., Adaptive object-oriented optimization software system.
Dailey, Matthew N.; Pathria, Anu K.; Laffoon, Mark A.; Crooks, Theodore J., Application-specific method and apparatus for assessing similarity between two data objects.
Keeler James D. ; Hartman Eric J. ; Godbole Devendra B. ; Piche Steve ; Arbila Laura ; Ellinger Joshua ; Ferguson ; II R. Bruce ; Krauskop John ; Kempf Jill L. ; O'Hara Steven A. ; Strauss Audrey ; T, Automated method for building a model.
Keeler, James D.; Hartman, Eric J.; Godbole, Devendra B.; Piche, Steve; Arbila, Laura; Ellinger, Joshua; Ferguson, II, R. Bruce; Krauskop, John; Kempf, Jill L.; O'Hara, Steven A.; Strauss, Audrey; Te, Automated method for building a model.
Subramanian, Revathi; Drossu, Radu; Chen, Kevin Chaowen; Dulany, Paul C., Computer-implemented data storage systems and methods for use with predictive model systems.
Subramanian, Revathi; Drossu, Radu; Chen, Chao-Wen (Kevin); Dulany, Paul C.; Luk, Ho Ming, Computer-implemented predictive model generation systems and methods.
Russell,Feng wei Chen; Kini,Ameet M.; Medicke, Jr.,John A.; Plunket,Betsy M.; Sureka,Ashish, Computerized method, system and program product for generating a data mining model.
Sayyarrodsari, Bijan; Hartman, Eric; Axelrud, Celso; Liano, Kadir, Controlling a non-linear process with varying dynamics using non-linear model predictive control.
Luk, Ho Ming; Coates, Pamela E.; Deo, Arati S.; Downs, Sean M.; Friesen, Benjamin A.; Nies, Craig A.; Pathria, Anu K., Detection of insurance premium fraud or abuse using a predictive software system.
Chapman, Craig H.; Parikh, Kush G.; Downs, Oliver B.; Cahn, Robert C.; Hersch, Jesse S., Determining road traffic conditions using data from multiple data sources.
Chapman, Craig H.; Parikh, Kush G.; Downs, Oliver B.; Cahn, Robert C.; Hersch, Jesse S., Determining road traffic conditions using data from multiple data sources.
Tuv, Eugene; Shahapurkar, Somnath; Borisov, Alexander, Method for classifying microelectronic dies using die level cherry picking system based on dissimilarity matrix.
Ferguson, Ralph Bruce; Hartman, Eric J.; Johnson, William Douglas; Hurley, Eric S., System and method for historical database training of support vector machines.
Hofmann, Thomas; Puzicha, Jan Christian, System and method for personalized search, information filtering, and for generating recommendations utilizing statistical latent class models.
Sayyarrodsari,Bijan; Hartman,Eric; Axelrud,Celso; Liano,Kadir, System and method of applying adaptive control to the control of particle accelerators with varying dynamics behavioral characteristics using a nonlinear model predictive control technology.
Mezzatesta Alberto G. ; Jervis Michael A. ; Beard David R. ; Strack Kurt M. ; Tabarovsky Leonty A., Systems and methods for forward modeling of well logging tool responses.
Saidi,Olivier; Verbel,David A.; Teverovskiy,Mikhail, Systems and methods for treating, diagnosing and predicting the occurrence of a medical condition.
Teverovskiy,Mikhail; Verbel,David A.; Saidi,Olivier, Systems and methods for treating, diagnosing and predicting the occurrence of a medical condition.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.