Methods, systems, and apparatus, including computer programs encoded on one or more computer storage devices, for training a predictive model. In one aspect, a method includes receiving over a network predictive modeling training data from a client computing system. The training data and multiple tr
Methods, systems, and apparatus, including computer programs encoded on one or more computer storage devices, for training a predictive model. In one aspect, a method includes receiving over a network predictive modeling training data from a client computing system. The training data and multiple training functions obtained from a repository of training functions are used to train multiple predictive models. A score is generated for each of the trained predictive models, where each score represents an estimation of the effectiveness of the respective trained predictive model. A first trained predictive model is selected from among the trained predictive models based on the generated scores. Access to the first trained predictive model is provided to the client computing system.
대표청구항▼
1. A computer-implemented method comprising: receiving over a network predictive modeling training data from a client computing system;partitioning the training data into a plurality of subsamples;using the plurality of subsamples and a plurality of training functions obtained from a repository of t
1. A computer-implemented method comprising: receiving over a network predictive modeling training data from a client computing system;partitioning the training data into a plurality of subsamples;using the plurality of subsamples and a plurality of training functions obtained from a repository of training functions to train a plurality of predictive models using cross-validation;generating a cross-validation score for each of the plurality of trained predictive models, where each cross-validation score indicates the accuracy of the respective trained predictive model;selecting a first trained predictive model from among the plurality of trained predictive models using the generated cross-validation scores; andproviding access to the first trained predictive model over the network. 2. The method of claim 1, wherein the plurality of subsamples comprises k subsamples and providing access to the first trained predictive model comprises providing access to the first trained predictive model trained with all k subsamples from the plurality of subsamples. 3. The method of claim 2, wherein using the plurality of subsamples and the plurality of training functions obtained from the repository of training functions to train the plurality of predictive models using cross-validation comprises: for each iteration of training: training each of the plurality of predictive models using a different subset of subsamples from the plurality of subsamples comprising k−1 subsamples;determining, for each of the trained predictive models, a similarity between an output from the respective trained predictive model and an expected output identified in another subsample not included in the respective subset of subsamples; andgenerating the cross-validation score for each of the plurality of trained predictive models using the respective similarity. 4. The method of claim 3, comprising: averaging, for each of the trained predictive models, the cross-validation scores generated for the respective trained predictive model, wherein selecting the first trained predictive model from among the plurality of trained predictive models using the generated cross-validation scores comprises selecting the first trained predictive model from among the plurality of trained predictive models using the averages of the generated cross-validation scores. 5. The method of claim 1, comprising: receiving input data, data identifying the first trained predictive model, and a request for a predictive output; andgenerating the predictive output using the first predictive model and the input data. 6. The method of claim 1, wherein the plurality of training functions includes two or more training functions for training predictive models with a same type of predictive output and a same type of input data, where each predictive model is trained with a different training function. 7. The method of claim 1, wherein the plurality of training functions includes two or more training functions for training predictive models with a same type of predictive output and a same type of input data, where each predictive model is trained with a different hyper-parameter configuration. 8. The method of claim 1, wherein providing access to the first trained predictive model comprises providing a universal resource locator (URL) that identifies the first trained predictive model as an addressable resource. 9. The method of claim 1, wherein the cross-validation comprises k-fold cross-validation. 10. A system, comprising: a data processing apparatus; anda non-transitory computer readable storage medium in data communication with the data processing apparatus and storing instructions executable by the data processing apparatus and upon such execution cause the data processing to perform operations comprising: receiving over a network predictive modeling training data from a client computing system;partitioning the training data into a plurality of subsamples;using the plurality of subsamples and a plurality of training functions obtained from a repository of training functions to train a plurality of predictive models using cross-validation;generating a cross-validation score for each of the plurality of trained predictive models, where each cross-validation score indicates the accuracy of the respective trained predictive model;selecting a first trained predictive model from among the plurality of trained predictive models using the generated cross-validation scores; andproviding access to the first trained predictive model over the network. 11. The system of claim 10, wherein the plurality of subsamples comprises k subsamples and providing access to the first trained predictive model comprises providing access to the first trained predictive model trained with all k subsamples from the plurality of subsamples. 12. The system of claim 11, wherein using the plurality of subsamples and the plurality of training functions obtained from the repository of training functions to train the plurality of predictive models using cross-validation comprises: for each iteration of training: training each of the plurality of predictive models using a different subset of subsamples from the plurality of subsamples comprising k−1 subsamples;determining, for each of the trained predictive models, a similarity between an output from the respective trained predictive model and an expected output identified in another subsample not included in the respective subset of subsamples; andgenerating the cross-validation score for each of the plurality of trained predictive models using the respective similarity. 13. The system of claim 12, the operations comprising: averaging, for each of the trained predictive models, the cross-validation scores generated for the respective trained predictive model, wherein selecting the first trained predictive model from among the plurality of trained predictive models using the generated cross-validation scores comprises selecting the first trained predictive model from among the plurality of trained predictive models using the averages of the generated cross-validation scores. 14. The system of claim 10, the operations comprising: receiving input data, data identifying the first trained predictive model, and a request for a predictive output; andgenerating the predictive output using the first predictive model and the input data. 15. The system of claim 10, wherein the plurality of training functions includes two or more training functions for training predictive models with a same type of predictive output and a same type of input data, where each predictive model is trained with a different training function. 16. The system of claim 10, wherein the plurality of training functions includes two or more training functions for training predictive models with a same type of predictive output and a same type of input data, where each predictive model is trained with a different hyper-parameter configuration. 17. The system of claim 10, wherein providing access to the first trained predictive model comprises providing a universal resource locator (URL) that identifies the first trained predictive model as an addressable resource. 18. The system of claim 10, wherein the cross-validation comprises k-fold cross-validation. 19. A non-transitory computer readable storage medium storing instructions executable by a data processing apparatus and upon such execution cause the data processing to perform operations comprising: receiving over a network predictive modeling training data from a client computing system;partitioning the training data into a plurality of subsamples;using the plurality of subsamples and a plurality of training functions obtained from a repository of training functions to train a plurality of predictive models using cross-validation;generating a cross-validation score for each of the plurality of trained predictive models, where each cross-validation score indicates the accuracy of the respective trained predictive model;selecting a first trained predictive model from among the plurality of trained predictive models using the generated cross-validation scores; andproviding access to the first trained predictive model over the network. 20. The computer readable storage medium of claim 19, wherein the plurality of training functions includes two or more training functions for training predictive models with a same type of predictive output and a same type of input data, where each predictive model is trained with a different hyper-parameter configuration.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (85)
Hales Lynn B. ; Ynchausti Randy A. ; Foot ; Jr. Donald G., Adaptive object-oriented optimization software system.
Dailey, Matthew N.; Pathria, Anu K.; Laffoon, Mark A.; Crooks, Theodore J., Application-specific method and apparatus for assessing similarity between two data objects.
Keeler James D. ; Hartman Eric J. ; Godbole Devendra B. ; Piche Steve ; Arbila Laura ; Ellinger Joshua ; Ferguson ; II R. Bruce ; Krauskop John ; Kempf Jill L. ; O'Hara Steven A. ; Strauss Audrey ; T, Automated method for building a model.
Keeler, James D.; Hartman, Eric J.; Godbole, Devendra B.; Piche, Steve; Arbila, Laura; Ellinger, Joshua; Ferguson, II, R. Bruce; Krauskop, John; Kempf, Jill L.; O'Hara, Steven A.; Strauss, Audrey; Te, Automated method for building a model.
Subramanian, Revathi; Drossu, Radu; Chen, Kevin Chaowen; Dulany, Paul C., Computer-implemented data storage systems and methods for use with predictive model systems.
Subramanian, Revathi; Drossu, Radu; Chen, Chao-Wen (Kevin); Dulany, Paul C.; Luk, Ho Ming, Computer-implemented predictive model generation systems and methods.
Russell,Feng wei Chen; Kini,Ameet M.; Medicke, Jr.,John A.; Plunket,Betsy M.; Sureka,Ashish, Computerized method, system and program product for generating a data mining model.
Sayyarrodsari, Bijan; Hartman, Eric; Axelrud, Celso; Liano, Kadir, Controlling a non-linear process with varying dynamics using non-linear model predictive control.
Luk, Ho Ming; Coates, Pamela E.; Deo, Arati S.; Downs, Sean M.; Friesen, Benjamin A.; Nies, Craig A.; Pathria, Anu K., Detection of insurance premium fraud or abuse using a predictive software system.
Chapman, Craig H.; Parikh, Kush G.; Downs, Oliver B.; Cahn, Robert C.; Hersch, Jesse S., Determining road traffic conditions using data from multiple data sources.
Chapman, Craig H.; Parikh, Kush G.; Downs, Oliver B.; Cahn, Robert C.; Hersch, Jesse S., Determining road traffic conditions using data from multiple data sources.
Tuv, Eugene; Shahapurkar, Somnath; Borisov, Alexander, Method for classifying microelectronic dies using die level cherry picking system based on dissimilarity matrix.
Ferguson, Ralph Bruce; Hartman, Eric J.; Johnson, William Douglas; Hurley, Eric S., System and method for historical database training of support vector machines.
Hofmann, Thomas; Puzicha, Jan Christian, System and method for personalized search, information filtering, and for generating recommendations utilizing statistical latent class models.
Sayyarrodsari,Bijan; Hartman,Eric; Axelrud,Celso; Liano,Kadir, System and method of applying adaptive control to the control of particle accelerators with varying dynamics behavioral characteristics using a nonlinear model predictive control technology.
Mezzatesta Alberto G. ; Jervis Michael A. ; Beard David R. ; Strack Kurt M. ; Tabarovsky Leonty A., Systems and methods for forward modeling of well logging tool responses.
Saidi,Olivier; Verbel,David A.; Teverovskiy,Mikhail, Systems and methods for treating, diagnosing and predicting the occurrence of a medical condition.
Teverovskiy,Mikhail; Verbel,David A.; Saidi,Olivier, Systems and methods for treating, diagnosing and predicting the occurrence of a medical condition.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.