Entity data attribution using disparate data sets
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
G06F-017/30
G06N-007/00
출원번호
US-0209544
(2016-07-13)
등록번호
US-10223429
(2019-03-05)
발명자
/ 주소
Michel, Jean-Baptiste
Hampton, Alan
Shukla, Ananya
Sivakumar, I. K. Ashok
출원인 / 주소
PALANTIR TECHNOLOGIES INC.
대리인 / 주소
Schwegman Lundberg & Woessner, P.A.
인용정보
피인용 횟수 :
0인용 특허 :
36
초록▼
Systems and methods for using disparate data sets to attribute data to an entity are disclosed. Disparate data sets can be obtained from a variety of data sources. The disclosed systems and methods can obtain a first and second data set. Trajectories can represent multiple data records in a data set
Systems and methods for using disparate data sets to attribute data to an entity are disclosed. Disparate data sets can be obtained from a variety of data sources. The disclosed systems and methods can obtain a first and second data set. Trajectories can represent multiple data records in a data set associated with an entity. Trajectories from the obtained data sets can be used to associate data stored among the various data sets. The association can be based on the agreement between the trajectories. The associated data records can further be used to associate the entities related to the associated data records.
대표청구항▼
1. A system for attributing data to entities using disparate data sets, the system comprising: a memory device configured to store a set of instructions; andone or more processing devices configured to execute the set of instructions to: receive, at a database, a first data set that comprises a firs
1. A system for attributing data to entities using disparate data sets, the system comprising: a memory device configured to store a set of instructions; andone or more processing devices configured to execute the set of instructions to: receive, at a database, a first data set that comprises a first set of records associated with a first entity, the first set of records comprising a first set of location data and a first set of temporal data;receive, at the database, a second data set that comprises a second set of records associated with a second entity, the second set of records comprising a second set of location data and a second set of temporal data;determine a first trajectory of the first entity based on the first set of location data and the first set of temporal data from the first set of records, the first trajectory defining a first order of the first set of records;determine a second trajectory of the second entity based on the second set of location data and the second set of temporal data of the second set of records, the second trajectory defining a second order of the second set of records;perform a comparison of the first order of the first trajectory that corresponds with the first entity, and the second order of the second trajectory that corresponds with the second entity; andattribute the second set of records of the second entity to the first entity within the database based on the comparison. 2. The system of claim 1 wherein the first entity is the same as the second entity. 3. The system of claim 1 wherein the first data set or the second data includes data associated with at least one of transaction data, social network data, consumer data, provisioning data, and product data. 4. The system of claim 1 wherein the first trajectory and the second trajectory are based on location information. 5. The system of claim 4 wherein the location information includes an area. 6. The system of claim 1 wherein the first trajectory and the second trajectory are based on at least one of time and date information. 7. The system of claim 1 wherein the first trajectory and the second trajectory are based on at least a threshold number of related records. 8. The system of claim 1 wherein criteria for determining agreement between the first trajectory and the second trajectory is partially based on a type of data in the first data set and a type of data in the second data set. 9. The system of claim 1 wherein criteria for determining agreement between the first trajectory and the second trajectory is partially based on a unicity of the first data set and a unicity of the second data set. 10. The system of claim 1 wherein the one or more processing devices are further configured to execute the set of instructions to: analyze the resolution using a probabilistic model. 11. A method for attributing data to entities using disparate data sets, the method comprising: receiving, at a database, a first data set that comprises a first set of records associated with a first entity, the first set of records comprising a first set of location data and a first set of temporal data;receiving, at the database, a second data set that comprises a second set of records associated with a second entity, the second set of records comprising a second set of location data and a second set of temporal data;determining a first trajectory of the first entity based on the first set of location data and the first set of temporal data from the first set of records, the first trajectory defining a first order of the first set of records;determining a second trajectory of the second entity based on the second set of location data and the second set of temporal data of the second set of records, the second trajectory defining a second order of the second set of records;performing a comparison of the first order of the first trajectory that corresponds with the first entity, and the second order of the second trajectory that corresponds with the second entity; andattributing the second set of records of the second entity to the first entity within the database based on the comparison. 12. The method of claim 11 wherein the first entity is the same as the second entity. 13. The method of claim 11 wherein the first trajectory and the second trajectory are based on location information. 14. The method of claim 13 wherein the location information includes an area. 15. The method of claim 11 wherein the first trajectory and the second trajectory are based on at least a threshold number of related records. 16. A non-transitory computer-readable medium storing a set of instructions that are executable by one or more processing device to the one or more processing devices to perform a method to attribute data to entities using disparate data sets, the method comprising: receiving, at a database, a first data set that comprises a first set of records associated with a first entity, the first set of records comprising a first set of location data and a first set of temporal data;receiving, at the database, a second data set that comprises a second set of records associated with a second entity, the second set of records comprising a second set of location data and a second set of temporal data;determining a first trajectory of the first entity based on the first set of location data and the first set of temporal data from the first set of records, the first trajectory defining a first order of the first set of records;determining a second trajectory of the second entity based on the second set of location data and the second set of temporal data of the second set of records, the second trajectory defining a second order of the second set of records;performing a comparison of the first order of the first trajectory that corresponds with the first entity, and the second order of the second trajectory that corresponds with the second entity; andattributing the second set of records of the second entity to the first entity within the database based on the comparison. 17. The non-transitory computer readable medium of claim 16 wherein the first entity is the same as the second entity. 18. The non-transitory computer readable medium of claim 16 wherein the first trajectory and the second trajectory are based on location information. 19. The non-transitory computer readable medium of claim 18 wherein the location information includes an area. 20. The non-transitory computer readable medium of claim 16 wherein the first trajectory and the second trajectory are based on at least a threshold number of related records.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (36)
Ma, Songtao; Huria, Sangeeta; Klein, Eric; Crews, Tim, Banking system controlled responsive to data bearing records.
Gabbert, Charles Keith; Robbins, Mark Wayne; Lombard, Robin J.; Woolums, Thomas Michael; Moiceanu, Corneliu, Centralized terminology and glossary development.
Greenstein, Paul G.; Grunin, Galina; Nguyen, Luu Q., Facilitating management of service elements usable in providing information technology service offerings.
Gopinathan Krishna M. ; Biafore Louis S. ; Ferguson William M. ; Lazarus Michael A. ; Pathria Anu K. ; Jost Allen, Fraud detection using predictive modeling.
Vishniac, Ephraim Meriwether; Isman, Marshall A.; Bay, Paul; Bromley, H. Mark; Richardson, John L., Managing storage of individually accessible data units.
Kantrowitz, Mark, Method and apparatus for efficient identification of duplicate and near-duplicate documents and text spans using high-discriminability text fragments.
Bunzel, Breeana D.; Rangara, Akbar A.; Chan, Kai Chun, Method and system for automatic correlation of check-based payments to customer accounts and/or invoices.
Creeden, Denis Michael; Glionna, Jesse; Poulter, Martha Cecilia; Kaptinski, John Stephen; Persico, James Robert; Doolittle, William Roy; Cascade, Ryan Stuart; van Heyst, Amanda Jenks; Ernst, David Andrew; Chomienne, Kathleen Mary; Bellish, Robert Wayne; Crowley, Robert Francis, Methods and systems for managing risk management information.
Burns, Michael J.; West, Robert A.; Brumfield, Harris; Ziemkiewicz, Peter F., System and method for money management in electronic trading environment.
Evanitsky, Eugene Stephen; Moore, John A.; Coene, Matthew Dylan; Schlonski, Steve; Chlebove, Wilma Wandersleben, System and method of on-demand document processing.
Ginter Karl L. ; Shear Victor H. ; Sibert W. Olin ; Spahn Francis J. ; Van Wie David M., Systems and methods for secure transaction management and electronic rights protection.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.