[Patent] System and method for efficient large-scale data processing

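IPC classification information
Country / Type	United States (US) Patent, granted
International Patent Classification (IPC, 7th edition)	G06F-007/00
Application number	UP-0871244 (2004-06-18)
Registration number	US-7650331 (2010-02-22)
Inventor / Address	Dean, Jeffrey; Ghemawat, Sanjay
Applicant / Address	Google Inc.
Agent / Address	Morgan, Lewis & Bockius LLP
Citation information	Times cited: 172; Patents cited: 9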

Abstract

A large-scale data processing system and method includes one or more application-independent map modules configured to read input data and to apply at least one application-specific map operation to the input data to produce intermediate data values, wherein the map operation is automatically parallelized across multiple processors in the parallel processing environment. A plurality of intermediate data structures are used to store the intermediate data values. One or more application-independent reduce modules are configured to retrieve the intermediate data values and to apply at least one application-specific reduce operation to the intermediate data values to provide output data.
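The split described in the abstract, an application-independent framework that calls application-specific map and reduce operations, can be illustrated with a minimal Python sketch. This is not the patented implementation: the names `run_mapreduce`, `word_count_map`, and `word_count_reduce` are hypothetical, a thread pool stands in for multiple processors, and an in-memory dictionary stands in for the intermediate data structures.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def run_mapreduce(blocks, map_fn, reduce_fn, workers=4):
    """Application-independent driver (hypothetical): it only moves data
    and never inspects what map_fn or reduce_fn compute."""
    # Map phase: apply the application-specific map operation to each
    # input block, with the blocks spread across the pool.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        mapped = list(pool.map(lambda block: list(map_fn(block)), blocks))

    # Intermediate data structure: values grouped by key.
    groups = defaultdict(list)
    for pairs in mapped:
        for key, value in pairs:
            groups[key].append(value)

    # Reduce phase: apply the application-specific reduce operation to
    # each key's values, again spread across the pool.
    keys = sorted(groups)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(lambda k: reduce_fn(k, groups[k]), keys))
    return dict(zip(keys, results))


def word_count_map(block):
    # Application-specific map: emit one (word, 1) pair per word.
    for word in block.split():
        yield word.lower(), 1


def word_count_reduce(key, values):
    # Application-specific reduce: sum the counts for one word.
    return sum(values)


blocks = ["the quick brown fox", "the lazy dog", "The fox"]
print(run_mapreduce(blocks, word_count_map, word_count_reduce))
# {'brown': 1, 'dog': 1, 'fox': 2, 'lazy': 1, 'quick': 1, 'the': 3}
```

Swapping in a different pair of map and reduce functions changes the application without touching `run_mapreduce`, which is the point of keeping the framework application-independent.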

Representative claims

What is claimed is:

1. A system for large-scale processing of data, comprising: a plurality of processes executing on a plurality of interconnected processors; the plurality of processes including a master process, for coordinating a data processing job for processing a set of input data, and worker processes; the master process, in response to a request to perform the data processing job, assigning input data blocks of the set of input data to respective ones of the worker processes; each of a first plurality of the worker processes including an application-independent map module for retrieving a respective input data block assigned to the worker process by the master process and applying an application-specific map operation to the respective input data block to produce intermediate data values, wherein at least a subset of the intermediate data values each comprises a key/value pair, and wherein at least two of the first plurality of the worker processes operate simultaneously so as to perform the application-specific map operation in parallel on distinct, respective input data blocks; a partition operator for processing the produced intermediate data values to produce a plurality of intermediate data sets, wherein each respective intermediate data set includes all key/value pairs for a distinct set of respective keys, and wherein at least one of the respective intermediate data sets includes respective ones of the key/value pairs produced by a plurality of the first plurality of the worker processes; and each of a second plurality of the worker processes including an application-independent reduce module for retrieving data, the retrieved data comprising at least a subset of the key/value pairs from a respective intermediate data set of the plurality of intermediate data sets and applying an application-specific reduce operation to the retrieved data to produce final output data corresponding to the distinct set of respective keys in the respective intermediate data set of the plurality of intermediate data sets, and wherein at least two of the second plurality of the worker processes operate simultaneously so as to perform the application-specific reduce operation in parallel on multiple respective subsets of the produced intermediate data values.

2. The system of claim 1, wherein each of the worker processes includes a map process thread and a reduce process thread, the map process thread configured to execute the application-independent map module and the reduce process thread configured to execute the application-independent reduce module.

3. The system of claim 1, wherein the master process is configured to automatically determine a number of distinct map tasks and a number of distinct reduce tasks to perform the data processing job, and to automatically assign the distinct map tasks and the distinct reduce tasks to the worker processes in accordance with availability of the worker processes executing on the interconnected processors such that some of the distinct map tasks and some of the distinct reduce tasks are assigned to worker processes later, during performance of the data processing job, than other ones of the distinct map tasks and the distinct reduce tasks.

4. The system of claim 3, wherein the number of distinct map tasks exceeds in number the first plurality of the worker processes to which the master process can assign the distinct map tasks, and wherein the master process maintains status information with respect to the distinct map tasks awaiting assignment to one of the first plurality of worker processes.

5. The system of claim 3, wherein the master process is configured to maintain a task status table, denoting for each of the distinct map tasks and each of the distinct reduce tasks one of the worker processes, if any, to which each of the distinct map tasks or each of the distinct reduce tasks has been assigned, and a status of each of the distinct map tasks or each of the distinct reduce tasks.

6. The system of claim 5, wherein the master process is further configured to maintain a process status table, denoting which of the worker processes has been assigned one of the distinct map tasks, and which of the worker processes has been assigned one of the distinct reduce tasks, and a status of each of the worker processes.

7. The system of claim 5, wherein the application-specific map operation includes an application-specific combiner operation for combining initial data values produced by the application-specific map operation so as to produce the intermediate data values.

8. The system of claim 5, wherein the application-specific map operation includes an application-specific combiner operation for combining initial data values produced by the application-specific map operation having shared keys so as to produce the intermediate data values.

9. A method of performing a large-scale data processing job, comprising: executing a plurality of processes on a plurality of interconnected processors, the plurality of processes including a master process for coordinating the large-scale data processing job for processing a set of input data, and worker processes; in the master process, in response to a request to perform the large-scale data processing job, assigning input data blocks of the set of input data to respective ones of the worker processes; in each of a first plurality of the worker processes, executing an application-independent map module to retrieve a respective input data block assigned to the worker process by the master process and to apply an application-specific map operation to the respective input data block to produce intermediate data values, wherein at least a subset of the intermediate data values each comprises a key/value pair, and wherein at least two of the first plurality of the worker processes operate simultaneously so as to perform the application-specific map operation in parallel on distinct, respective input data blocks; using a partition operator to process the produced intermediate data values to produce a plurality of intermediate data sets, wherein each respective intermediate data set includes all key/value pairs for a distinct set of respective keys, and wherein at least one of the respective intermediate data sets includes respective ones of the key/value pairs produced by a plurality of the first plurality of the worker processes; and in each of a second plurality of the worker processes, executing an application-independent reduce module to retrieve data, the retrieved data comprising at least a subset of the key/value pairs from a respective intermediate data set of the plurality of intermediate data sets and applying an application-specific reduce operation to the retrieved data to produce final output data corresponding to the distinct set of respective keys in the respective intermediate data set of the plurality of intermediate data sets, and wherein at least two of the second plurality of the worker processes operate simultaneously so as to perform the application-specific reduce operation in parallel on multiple respective subsets of the produced intermediate data values.

10. The method of claim 9, wherein each of the worker processes includes a map process thread and a reduce process thread, the map process for executing the application-independent map module and the reduce process thread for executing the application-independent reduce module.

11. The method of claim 9, wherein the master process maintains a task status table, denoting for each of the distinct map tasks and each of the distinct reduce tasks the respective ones of the worker processes, if any, to which each of the distinct map tasks or each of the distinct reduce tasks has been assigned, and a status of each of the distinct map tasks or each of the distinct reduce tasks.

12. The method of claim 11, wherein the master process maintains a process status table, denoting which of the worker processes has been assigned one of the distinct map tasks, and which of the worker processes has been assigned one of the distinct reduce tasks, and a status of each of the worker processes.

13. The method of claim 9, wherein the application-specific map operation includes an application-specific combiner operation for combining initial data values produced by the application-specific map operation so as to produce the intermediate data values.

14. The method of claim 9, wherein the application-specific map operation includes an application-specific combiner operation for combining initial data values produced by the application-specific map operation having shared keys so as to produce the intermediate data values.

15. The method of claim 9, including, in the master process, automatically determining a number of distinct map tasks and a number of distinct reduce tasks to perform the data processing job, and automatically assigning the distinct map tasks and the distinct reduce tasks to the worker processes in accordance with availability of the worker processes such that some of the distinct map tasks and some of the distinct reduce tasks are assigned to worker processes later, during performance of the large-scale data processing job, than other ones of the distinct map tasks and the distinct reduce tasks.

16. The method of claim 15, wherein the number of distinct map tasks exceeds in number the first plurality of the worker processes to which the master process can assign the distinct map tasks, and wherein the method includes, in the master process, maintaining status information with respect to the distinct map tasks awaiting assignment to one of the first plurality of worker processes.
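Several elements recited in the claims (the task status table kept by the master, the partition operator, and a combiner that merges values sharing a key) can be sketched together in Python. Everything here is an illustrative assumption rather than the patent's implementation: `Task`, `partition`, `map_with_combiner`, and `run_job` are hypothetical names, the CRC32-modulo partitioning rule is one possible choice, and the tasks in each wave run one after another in a single process instead of simultaneously on separate worker processes.

```python
import zlib
from collections import Counter, defaultdict
from dataclasses import dataclass
from typing import Optional


@dataclass
class Task:
    """One row of a hypothetical task status table kept by the master."""
    kind: str                     # "map" or "reduce"
    payload: object               # input block (map) or data set index (reduce)
    worker: Optional[str] = None  # worker the task is assigned to, if any
    status: str = "idle"          # idle -> in_progress -> completed


def partition(key: str, num_reduce_tasks: int) -> int:
    """Partition operator (assumed rule): equal keys always map to the
    same intermediate data set."""
    return zlib.crc32(key.encode("utf-8")) % num_reduce_tasks


def map_with_combiner(block: str) -> dict:
    """Word-count map whose combiner pre-sums values that share a key."""
    return dict(Counter(block.lower().split()))


def run_job(blocks, workers, num_reduce_tasks=2):
    table = [Task("map", block) for block in blocks]
    table += [Task("reduce", r) for r in range(num_reduce_tasks)]
    datasets = [defaultdict(list) for _ in range(num_reduce_tasks)]
    output = {}

    for phase in ("map", "reduce"):
        pending = [t for t in table if t.kind == phase]
        # Tasks may outnumber workers, so they are handed out in waves;
        # rows still marked "idle" are the ones awaiting assignment.
        while pending:
            wave, pending = pending[:len(workers)], pending[len(workers):]
            for worker, task in zip(workers, wave):
                task.worker, task.status = worker, "in_progress"
            for task in wave:  # simulated sequentially in this sketch
                if phase == "map":
                    for key, value in map_with_combiner(task.payload).items():
                        index = partition(key, num_reduce_tasks)
                        datasets[index][key].append(value)
                else:
                    for key, values in datasets[task.payload].items():
                        output[key] = sum(values)
                task.status = "completed"
    return output, table, datasets


output, table, datasets = run_job(["a b a", "b c", "a c c", "d"], ["w1", "w2"])
print(output == {"a": 3, "b": 2, "c": 3, "d": 1})  # True
```

With four map tasks and two workers, the map phase takes two waves, which mirrors the case in claims 4 and 16 where map tasks outnumber the workers available to run them.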

Patents cited by this patent (9)

Hardwick, Jonathan C. (GB), Dynamic load balancing among processors in a parallel computer.
Dageville, Benoit; Amor, Patrick A., Managing parallel execution of work granules according to their affinity.
Tsuchida, Masashi (JP); Masai, Kazuo (JP); Torii, Shunichi (JP), Method and system of database divisional management for parallel database system.
Matsuzawa, Hirofumi (JP); Fukuda, Takeshi (JP), Method for executing aggregate queries, and computer system.
Waddington, William H.; Tan, Leng Leng; Grewell, Patricia, Method for managing shared resources in a multiprocessing computer system.
Waddington, William H.; Tan, Leng Leng; Grewell, Patricia, Method for managing termination of a lock-holding process using a waiting lock.
Allen, Terry Dennis; Desai, Paramesh S.; Shibamiya, Akira; Tie, Hong Sang; Tsang, Annie S., Method, system, and program for optimizing database query execution.
Brown, Douglas P.; Diaz, Allen N.; Pederson, Donald R., Multi-threading, multi-tasking architecture for a relational database management system.
Hardwick, Jonathan C. (GB), Nested parallel 2D Delaunay triangulation method.

Patents citing this patent (172)

Beda, III, Joseph S.; Kadatch, Andrew, Adjustable virtual network performance.
Beda, III, Joseph S.; McLuckie, Craig I., Advertising auction system.
Jacobs, Michael N., Apparatuses and methods for parallel analytics.
Lin, Wei-Hao; Green, Travis H. K.; Kaplow, Robert; Fu, Gang; Mann, Gideon S., Assessing accuracy of trained predictive models.
Lin, Wei-Hao; Green, Travis; Kaplow, Robert; Fu, Gang; Mann, Gideon S., Assessing accuracy of trained predictive models.
Queru, Jean Baptiste Maurice, Authentication based on proximity to mobile device.
Queru, Jean Baptiste Maurice, Authentication based on proximity to mobile device.
Queru, Jean Baptiste Maurice, Authentication based on proximity to mobile device.
Kishore, Shaunak; Dray, Karl, Automated invalidation of job output data in a job processing system.
Kishore, Shaunak; Dray, Karl, Automated invalidation of job output data in a job-processing system.
Rus, Silvius V.; Jiang, Wei, Automated load-balancing of partitions in arbitrarily imbalanced distributed mapreduce computations.
Risbood, Pankaj; Sarda, Parag Kacharulal; Kulkarni, Rahul S.; Jain, Rohit; Shenoy, Vittaldas Sachin; Sahasranaman, Vivek, Automated software updating based on prior activity.
Kadatch, Andrew; Khorun, Sergey, Bandwidth throttling of virtual disks.
Dua, Swaranjit Singh, Bulk matching with update.
Weinstein, Eugene; Kumar, Sanjiv; Moreno, Ignacio L.; Senior, Andrew W.; Bhat, Nikhil Prasad, Caching speech recognition scores.
Risbood, Pankaj; Sarda, Parag Kacharulal; Kulkarni, Rahul S.; Jain, Rohit; Shenoy, Vittaldas Sachin; Sahasranaman, Vivek, Cloud-based deployment using object-oriented classes.
Beda, III, Joseph S.; Czajkowski, Grzegorz J.; Zhao, Jerry, Clustering for parallel processing.
Beda, III, Joseph S.; Czajkowski, Grzegorz J.; Zhao, Yonggang, Clustering for parallel processing.
Lin, Wei-Hao; Green, Travis H.; Kaplow, Robert; Fu, Gang; Mann, Gideon S., Combining predictive models in predictive analytical modeling.
Nakadai, Shinji; Asahara, Masato, Communication control device communication control system, communication control method and program.
Konno, Kazuya; Watanabe, Kazuhiko, Computer system and divided job processing method and program.
Kulkarni, Rahul S.; Sahasranaman, Vivek; Jain, Rohit; Shenoy, Vittaldas Sachin; Risbood, Pankaj; Sarda, Parag Kacharulal, Correlating status information generated in a computer network.
Kulkarni, Rahul S.; Sahasranaman, Vivek; Jain, Rohit; Shenoy, Vittaldas Sachin; Risbood, Pankaj; Sarda, Parag Kacharulal, Correlating status information generated in a computer network.
Natanzon, Assaf; Shemer, Jehuda; Baruch, Leehod; Bigman, Ron; Lieberman, Amit, Creating a virtual access point in time on an object based journal replication.
Sylves, Eric F., Customer sentiment analysis using recorded conversation.
Wiggins, Zachary J., Data anonymity and separation for user computation.
Wiggins, Zachary J., Data anonymity and separation for user computation.
Wang, Yuewei; Rudys, Algis P.; Yang, Stewart, Deterministic data processing.
Chandramouli, Badrish; Goldstein, Jonathan; Quamar, Abdul Hussain, Deterministic progressive big data analytics.
Odom, Jeffrey M.; Fikes, Michael P.; Swift, John, Distributed complex event processing.
Narang, Ankur; Soman, Jyothish, Distributed data scalable adaptive map-reduce framework.
Narang, Ankur; Soman, Jyothish, Distributed data scalable adaptive map-reduce framework.
Beda, III, Joseph S.; Baker, Brandon S., Distribution of cryptographic host keys in a cloud computing environment.
Matsuzawa, Keiichi; Yamamoto, Akira, Distribution processing unit of shared storage.
Jarjur, Omar S.; Anderson, Evan K., Dynamic key management.
Breckenridge, Jordan M.; Green, Travis; Kaplow, Robert; Lin, Wei-Hao; Mann, Gideon S., Dynamic predictive modeling platform.
Beda, III, Joseph S., Exposing data to virtual machines.
Chang, Lei; Yang, Ziye; Mao, Wenbo; He, Ying; Du, Junping, File system for storage area network.
Li, Jin; Mehrotra, Sanjeev, Functional programming in distributed computing.
Yu, Yuan; Gunda, Pradeep Kumar; Isard, Michael A, General distributed reduction for data parallel computing.
Cai, Bin; Xiang, Zhe; Xue, Wei; Yang, Bo; Yu, Qi, Generating map task output with version information during map task execution and executing reduce tasks using the output including version information.
Sagiraju, Krishna C., Generation and deployment of scripts for large scale processing framework services.
Goldman, Seth; Eustis, David; Hulubei, Tudor; Danaher, John, Handling bulk and incremental updates while maintaining consistency.
Risbood, Pankaj; Sarda, Parag Kacharulal; Kulkarni, Rahul S.; Jain, Rohit; Shenoy, Vittaldas Sachin; Sahasranaman, Vivek, High-level language for specifying configurations of cloud-based deployments.
Risbood, Pankaj; Sarda, Parag Kacharulal; Kulkarni, Rahul S.; Jain, Rohit; Shenoy, Vittaldas Sachin; Sahasranaman, Vivek, High-level language for specifying configurations of cloud-based deployments.
Lin, Wei-Hao; Green, Travis H.; Kaplow, Robert; Fu, Gang; Mann, Gideon S., Hosting predictive models.
Peretz, Ervin, Hybrid local/remote infrastructure for data processing with lightweight setup, powerful debuggability, controllability, integration, and productivity features.
Zhang, Xiong; Yang, Hung-chih; Lange, Danny, Identifying collocations in a corpus of text in a distributed computing environment.
Danaher, John, Incremental schema consistency validation on geographic features.
Danaher, John, Incremental schema consistency validation on geographic features.
Cohen, Jeffrey Ira; Lonergan, Luke; Welton, Caleb E., Integrating map-reduce into a distributed relational database.
Cohen, Jeffrey Ira; Lonergan, Luke; Welton, Caleb E., Integrating map-reduce into a distributed relational database.
Chattopadhyay, Biswapesh; Lin, Liang, Joining tables in a mapreduce procedure.
Palanisamy, Balaji; Singh, Aameek, Locality-aware resource allocation for cloud computing.
Kadatch, Andrew; Halcrow, Michael A., Log structured volume encryption for virtual machines.
Beda, III, Joseph S.; Mehat, Sanjeet Singh; Earhart, III, Robert H.; Thornton, Andrew; McWherter, David T.; Anderson, Evan K.; Berreth, Frank, Managed boot in a cloud system.
Shao, Wei; Lin, Zhen, Managing system resources.
Srivas, Mandayam C.; Ravindra, Pindikura; Saradhi, Uppaluri Vijaya; Pande, Arvind Arun; Sanapala, Chandra Guru Kiran Babu; Renu, Lohit Vijaya; Vellanki, Vivekanand; Kavacheri, Sathya; Hadke, Amit, Map-reduce ready distributed file system.
Srivas, Mandayam C.; Ravindra, Pindikura; Saradhi, Uppaluri Vijaya; Pande, Arvind Arun; Sanapala, Chandra Guru Kiran Babu; Renu, Lohit Vijaya; Vellanki, Vivekanand; Kavacheri, Sathya; Hadke, Amit, Map-reduce ready distributed file system.
Srivas, Mandayam C.; Ravindra, Pindikura; Saradhi, Uppaluri Vijaya; Pande, Arvind Arun; Sanapala, Chandra Guru Kiran Babu; Renu, Lohit Vijaya; Vellanki, Vivekanand; Kavacheri, Sathya; Hadke, Amit Ashoke, Map-reduce ready distributed file system.
Srivas, Mandayam C.; Ravindra, Pindikura; Saradhi, Uppaluri Vijaya; Pande, Arvind Arun; Sanapala, Chandra Guru Kiran Babu; Renu, Lohit Vijaya; Vellanki, Vivekanand; Kavacheri, Sathya; Hadke, Amit Ashoke, Map-reduce ready distributed file system.
Srivas, Mandayam C.; Ravindra, Pindikura; Saradhi, Uppaluri Vijaya; Pande, Arvind Arun; Sanapala, Chandra Guru Kiran Babu; Renu, Lohit Vijaya; Vellanki, Vivekanand; Kavacheri, Sathya; Hadke, Amit Ashoke, Map-reduce ready distributed file system.
Srivas, Mandayam C.; Ravindra, Pindikura; Saradhi, Uppaluri Vijaya; Pande, Arvind Arun; Sanapala, Chandra Guru Kiran Babu; Renu, Lohit Vijaya; Vellanki, Vivekanand; Kavacheri, Sathya; Hadke, Amit Ashoke, Map-reduce ready distributed file system.
Reddy Byreddy, Bhaskar; Ramaiah, Ramu; Punnoose, Vinay, Method and server cluster for map reducing flow services and large documents.
Majeed, Basim; Afzal, Ali; Leida, Marcello; Colombo, Maurizio, Method and system for continuous query processing.
Dua, Swaranjit Singh, Method and system for distributed bulk matching and loading.
Cai, Bin; Xiang, Zhe; Xue, Wei; Yang, Bo; Yu, Qi, Method and system for operating a data center by reducing an amount of data to be processed.
Bogrett, Steven, Method and system for performing transactional updates in a key-value store.
Leida, Marcello; Afzal, Ali; Taylor, Paul; Majeed, Basim, Method and system for processing data queries.
Li, Yan; Lin, Hai Bo; Zhang, Yue; Zheng, Kai, Method and system of network transfer adaptive optimization in large-scale parallel computing system.
Li, Yan; Lin, Hai Bo; Zhang, Yue; Zheng, Kai, Method and system of network transfer adaptive optimization in large-scale parallel computing system.
McLennan, Christopher S.; Kramer, Joseph T.; Taylor, James P., Method for horizontal scale delta encoding.
Pjesivac-Grbovic, Jelena; Goldman, Kenneth Jerome; Faulkner, Matthew; Kendall, Wesley, Method for learning backup policies for large-scale distributed computing.
McLennan, Christopher S.; Kramer, Joseph T.; Taylor, James P., Method for the preemptive creation of binary delta information within a computer network.
Kreidenko, Valery, Methods for distributed application visibility and reporting and devices thereof.
Sahasranaman, Vivek; Risbood, Pankaj; Sarda, Parag Kacharulal; Shenoy, Vittaldas Sachin; Jain, Rohit, Monitoring and automatically managing applications.
Lin, Wei-Hao; Green, Travis H. K.; Kaplow, Robert; Fu, Gang; Mann, Gideon S., Multi-label modeling using a plurality of classifiers.
Anderson, Evan K., Network address translation for virtual machines.
Lin, Wei-Hao; Green, Travis H. K.; Kaplow, Robert; Fu, Gang; Mann, Gideon S., Normalization of predictive model scores.
Lin, Wei-Hao; Green, Travis H. K.; Kaplow, Robert; Fu, Gang; Mann, Gideon S., Normalization of predictive model scores.
Piovanelli, Matteo; Levi, Alessandro; Furlan, Silvano, Optimization methods for feature detection.
Singh, Gyanit; Chiu, Chi-Hsien; Sundaresan, Neelakantan, Parallel data stream processing system.
Chambers, Craig D.; Raniwala, Ashish; Perry, Frances J.; Adams, Stephen R.; Henry, Robert R.; Bradshaw, Robert; Weizenbaum, Nathan, Parallel processing of data.
Chambers, Craig D.; Raniwala, Ashish; Perry, Frances J.; Adams, Stephen R.; Henry, Robert R.; Bradshaw, Robert; Weizenbaum, Nathan, Parallel processing of data.
Chambers, Craig D.; Raniwala, Ashish; Perry, Frances J.; Adams, Stephen R.; Henry, Robert R.; Bradshaw, Robert; Weizenbaum, Nathan, Parallel processing of data.
Chambers, Craig D.; Raniwala, Ashish; Perry, Frances J.; Adams, Stephen R.; Henry, Robert R.; Bradshaw, Robert; Weizenbaum, Nathan, Parallel processing of data.
Chambers, Craig D.; Raniwala, Ashish; Perry, Frances J.; Henry, Robert R.; Tigani, Jordan, Parallel processing of data.
Goldman, Kenneth J.; Chandra, Tushar Deepak; Shaked, Tal; Zhao, Yonggang, Parallel processing of data.
Goldman, Kenneth J.; Chandra, Tushar Deepak; Shaked, Tal; Zhao, Yonggang, Parallel processing of data.
Goldman, Kenneth J.; Chandra, Tushar; Shaked, Tal; Zhao, Jerry, Parallel processing of data.
Chambers, Craig D.; Raniwala, Ashish; Perry, Frances J.; Henry, Robert R.; Tigani, Jordan, Parallel processing of data for an untrusted application.
Chambers, Craig D.; Raniwala, Ashish; Perry, Frances J.; Henry, Robert R.; Tigani, Jordan, Parallel processing of data for an untrusted application.
Chambers, Craig D.; Raniwala, Ashish; Perry, Frances J.; Henry, Robert R.; Tigani, Jordan, Parallel processing of data for an untrusted application.
Dorin, Dov Yaron; Goldshuv, Alon, Parallel streaming of external data.
Dorin, Dov Yaron; Goldshuv, Alon; Horn, Noa; Shacked, Alex, Parallel streaming of external data.
Dorin, Dov Yaron; Goldshuv, Alon; Shacked, Alex, Parallel streaming of external data.
Dorin, Dov Yaron; Goldshuv, Alon; Shacked, Alex; Lonergan, Luke, Parallel streaming of external data.
Blanchflower, Sean Mark; Gallagher, Darren John, Performance and scalability in an intelligent data operating layer system.
Lipton, Daniel; Weiss, Samuel L., Post-processing phase in a distributed processing system using assignment information.
Mann, Gideon S.; Breckenridge, Jordan M.; Lin, Wei-Hao, Predictive analytic modeling platform.
Mann, Gideon S.; Breckenridge, Jordan M.; Lin, Wei-Hao, Predictive analytic modeling platform.
Mann, Gideon S.; Breckenridge, Jordan M.; Lin, Wei-Hao, Predictive analytic modeling platform.
Mann, Gideon S.; Breckenridge, Jordan M.; Lin, Wei-Hao, Predictive analytic modeling platform.
Mann, Gideon S.; Breckenridge, Jordan M.; Lin, Wei-Hao, Predictive analytic modeling platform.
Lin, Wei-Hao; Green, Travis H. K.; Kaplow, Robert; Fu, Gang; Mann, Gideon S., Predictive analytical model matching.
Lin, Wei-Hao; Green, Travis H. K.; Kaplow, Robert; Fu, Gang; Mann, Gideon S., Predictive analytical model matching.
Lin, Wei-Hao; Green, Travis H. K.; Kaplow, Robert; Fu, Gang; Mann, Gideon S., Predictive analytical model selection.
Lin, Wei-Hao; Green, Travis H. K.; Kaplow, Robert; Fu, Gang; Mann, Gideon S., Predictive analytical modeling for databases.
Lin, Wei-Hao; Green, Travis H. K.; Kaplow, Robert; Fu, Gang; Mann, Gideon S., Predictive model application programming interface.
Lin, Wei-Hao; Green, Travis H. K.; Kaplow, Robert; Fu, Gang; Mann, Gideon S., Predictive model application programming interface.
Mozolewski, Mark Brian; Villarreal, Carlos; Sathyanarayana, Sumanth M; Smith, Michael R, Prioritization of network traffic in a distributed processing system.
Chattopadhyay, Biswapesh; Lin, Liang; Liu, Weiran; Dvorský, Marián, Processing data in a MapReduce framework.
Fisher, Danyel A.; Drucker, Steven M.; Goldstein, Jonathan D.; Chandramouli, Badrish; DeLine, Robert A.; Platt, John C.; Barnett, Mike, Progressive query computation using streaming architectures.
Fisher, Danyel A.; Drucker, Steven M.; Goldstein, Jonathan D.; Chandramouli, Badrish; DeLine, Robert A.; Platt, John C.; Barnett, Mike, Progressive query computation using streaming architectures.
Shenoy, Vittaldas Sachin; Risbood, Pankaj; Sahasranaman, Vivek; Kern, Christoph; Anderson, Evan K., Providing application programs with access to secured resources.
Shenoy, Vittaldas Sachin; Risbood, Pankaj; Sahasranaman, Vivek; Kern, Christoph; Anderson, Evan K., Providing application programs with access to secured resources.
Harris, Matthew S.; Kadatch, Andrew; Khorun, Sergey; Hamilton, Carl, Providing snapshots of virtual storage devices.
Harris, Matthew S.; Kadatch, Andrew; Khorun, Sergey; Hamilton, Carl, Providing snapshots of virtual storage devices.
Harris, Matthew S.; Kadatch, Andrew; Khorun, Sergey; Hamilton, Carl, Providing snapshots of virtual storage devices.
Hildrum, Kirsten W.; Khandekar, Rohit M.; Kumar, Vibhore; Parekh, Sujay S.; Rajan, Deepak; Wolf, Joel L.; Wu, Kun-Lung, Reducing the response time of flexible highly data parallel task by assigning task sets using dynamic combined longest processing time scheme.
Vaidyanathan, Kalyanaraman; Gross, Kenny C., Repartitioning parallel SVM computations using dynamic timeout.
Hildrum, Kirsten W.; Khandekar, Rohit M.; Kumar, Vibhore; Parekh, Sujay S.; Rajan, Deepak; Wolf, Joel L.; Wu, Kun-Lung, Scheduling parallel data tasks.
Hildrum, Kirsten W.; Khandekar, Rohit M.; Kumar, Vibhore; Parekh, Sujay S.; Rajan, Deepak; Wolf, Joel L.; Wu, Kun-Lung, Scheduling parallel data tasks.
Berreth, Frank; Moon, Eric A.; Henry, Robert R., Secure inter-process communication.
Risbood, Pankaj; Sarda, Parag Kacharulal; Kulkarni, Rahul S.; Jain, Rohit; Shenoy, Vittaldas Sachin; Sahasranaman, Vivek, Selection of ranked configurations.
Liu, Liang; Qu, Junmei; Zhu, Chao Qiang; Zhuang, Wei, Shuffle optimization in map-reduce processing.
Liu, Liang; Qu, Junmei; Zhu, Chao Qiang; Zhuang, Wei, Shuffle optimization in map-reduce processing.
Gera, Bharat K.; Kulkarni, Shrinivas S., Smarter big data processing using collaborative map reduce frameworks.
Siohan, Olivier; Moreno Mengibar, Pedro J., Speech recognition using associative mapping.
Chelba, Ciprian I., Speech recognition using non-parametric models.
Chelba, Ciprian I.; Xu, Peng; Pereira, Fernando, Speech recognition using variable-length context.
Kadatch, Andrew; Greenfield, Lawrence E., Storing data across a plurality of storage nodes.
Kadatch, Andrew; Greenfield, Lawrence E., Storing data across a plurality of storage nodes.
Rowstron, Antony; Costa, Paolo; O'Shea, Gregory Francis; Donnelly, Austin, Supporting distributed key-based processes.
Kim, Jin Cheol, System and method for accelerating mapreduce operation.
Pike, Robert C.; Quinlan, Sean; Dorward, Sean M.; Dean, Jeffrey; Ghemawat, Sanjay, System and method for analyzing data records.
Alcantara, Joao; Alves, Vladimir; Cassia, Ricardo; Lazo, Vincent, System and method for executing data processing tasks using resilient distributed datasets (RDDs) in a storage device.
Salessi, Nader; Alcantara, Joao, System and method for executing map-reduce tasks in a storage device.
Dean, Jeffrey; Ghemawat, Sanjay, System and method for large-scale data processing using an application-independent framework.
Malewicz, Grzegorz; Dvorsky, Marian; Colohan, Christopher B.; Thomson, Derek P.; Levenberg, Joshua Louis, System and method for limiting the impact of stragglers in large-scale parallel data processing.
Malewicz, Grzegorz; Dvorsky, Marian; Colohan, Christopher B.; Thomson, Derek P.; Levenberg, Joshua Louis, System and method for limiting the impact of stragglers in large-scale parallel data processing.
Malewicz, Grzegorz; Dvorsky, Marian; Colohan, Christopher B.; Thomson, Derek P.; Levenberg, Joshua Louis, System and method for limiting the impact of stragglers in large-scale parallel data processing.
Malewicz, Grzegorz; Dvorsky, Marian; Colohan, Christopher B.; Thomson, Derek P.; Levenberg, Joshua Louis, System and method for limiting the impact of stragglers in large-scale parallel data processing.
Jennings, Terry Don, System and method for search-based work assignments in a contact center.
Gupta, Rajeev; Ravindra, Padmashree; Roy, Prasan, System and method for shared execution of mixed data flows.
Gupta, Rajeev; Ravindra, Padmashree; Roy, Prasan, System and method for shared execution of mixed data flows.
Goldman, Kenneth Jerome; Mancuso, Anthony, System and method for variable aggregation in order for workers in a data processing to share information.
Maddhirala, Anil K.; Subbarayan, Ravikumar; Chinnathambi, Senthil K., System and method of multithreaded processing across multiple servers.
Dasdan, Ali, System and/or method for balancing allocation of data among reduce processes by reallocation.
Madhavan, Raghavachari K.; Johnson-Laird, Russell E. (Ben), Systems and methods for compiling and analyzing bids in an auction of securities.
Garg, Vikas K.; Gupta, Raj; Narang, Ankur, Systems and methods for performing parallel multi-level data computations.
Burdick, Douglas Ronald; Ghoting, Amol; Krishnamurthy, Rajasekar; Pednault, Edwin Peter Dawson; Reinwald, Berthold; Sindhwani, Vikas; Tatikonda, Shirish; Tian, Yuanyuan; Vaithyanathan, Shivakumar, Systems and methods for processing machine learning algorithms in a MapReduce environment.
Wong, Laura; Munamala, Srikala; Pereshyvaylo, Sergiy; Tamhankar, Hemant; Zou, Ping, Systems and methods to process a request received at an application program interface.
Wong, Laura; Munamala, Srikala; Pereshyvaylo, Sergiy; Tamhankar, Hemant; Zou, Ping, Systems and methods to process a request received at an application program interface.
Xu, Peng; Pereira, Fernando; Chelba, Ciprian I., Training acoustic models using distributed computing techniques.
Cai, Bin; Li, Li; Xiang, Zhe; Xue, Wei; Yang, Bo, Transmission of Map/Reduce data in a data center.
Anderson, Evan K., Transparent load-balancing for cloud computing services.
Anderson, Penelope; Amos, Richard; Annapureddy, Yashwanth; Haddad, Nicholas; Kalsi, Aaditya; Lane, Thomas; Martin, Jocelyn; Procopio, Michael; Rangasamy, Anandan; Stewart, James; Wang, Wei, Unified mapreduce framework for large-scale data processing.
Breckenridge, Jordan M.; Green, Travis H. K.; Kaplow, Robert; Lin, Wei-Hao; Mann, Gideon S., Updateable predictive analytical modeling.
Breckenridge, Jordan M.; Green, Travis; Kaplow, Robert; Lin, Wei-Hao; Mann, Gideon S., Updateable predictive analytical modeling.
Beda, III, Joseph S.; McLuckie, Craig I.; Eck, Christopher L., Updating virtual machine generated metadata to a distribution service for sharing and backup.
Chandra, Tushar Deepak; Shaked, Tal; Ie, Tze Way Eugene; Singer, Yoram; Redstone, Joshua, Using specialized workers to improve performance in machine learning.
Kadatch, Andrew; Khorun, Sergey, Virtual block devices.
Kadatch, Andrew; Khorun, Sergey, Virtual block devices.
Risbood, Pankaj; Sahasranaman, Vivek, Virtual machine name resolution.
Beda, III, Joseph S.; Kedia, Ridhima, Virtual machine service access.
Anderson, Evan K.; Petrescu-Prahova, Cristian; Beda, III, Joseph S., Virtual network for virtual machine communication and migration.
Beda, III, Joseph S.; Petrescu-Prahova, Cristian; Kern, Christoph, Virtual network pairs.
Beda, III, Joseph S.; Petrescu-Prahova, Cristian; Kern, Christoph; Anderson, Evan K., Virtual network pairs.
Petrescu-Prahova, Cristian; Kern, Christoph; Anderson, Evan K.; Beda, III, Joseph S., Virtual network protocol.
Petrescu-Prahova, Cristian; Kern, Christoph; Anderson, Evan K.; Beda, III, Joseph S., Virtual network protocol.
Fisher, Danyel A.; König, Arnd Christian; Drucker, Steven, Visualization of changing confidence intervals.
