Institute of Automation, Chinese Academy of Sciences
Agent / Address
Howard, Jeremy
Citation information
Times cited: 0
Patents cited: 8
Abstract
The present invention relates to a large-range-first cross-camera visual target re-identification method. The method comprises: step S1, obtaining initial single-camera tracks of targets; step S2, calculating a piecewise major color spectrum histogram feature of each track, and obtaining a track feature representation; step S3, obtaining a calculation formula of the similarity between any two tracks by using a minimum uncertainty method, so as to obtain the similarity between any two tracks; and step S4, performing global data association on all the tracks by using a maximum posterior probability method, so as to obtain a cross-camera tracking result. The target re-identification method of the present invention achieves high correct identification accuracy.
Representative Claims
1. A large-range-first cross-camera visual target re-identification method, characterized in that said method comprises: step S1: obtaining initial single-camera tracks of targets; step S2: calculating a piecewise major color spectrum histogram feature of each track and obtaining a track feature representation; step S3: obtaining a calculation formula for the similarity between any two tracks by using a minimum uncertainty method so as to obtain the similarity between any two tracks; and step S4: performing global data association on all the tracks by using a maximum posterior probability method to obtain a cross-camera tracking result.

2. The method according to claim 1, characterized in that in said step S1, for each track, the mean confidence over all frames is used to represent the track accuracy of the track:

c = ( Σ_{j=t_s}^{t_e} α_j ) / (t_e - t_s)    (1)

wherein the confidence α_j represents the tracking result of each frame, α < 0.2 means that the tracked target is lost, and t_s and t_e are respectively the start frame and end frame of the track; the finally formed set of tracks of all targets is L = {l_1, l_2, . . . , l_N}, wherein N is the total number of tracks, and each track l_i = [x_i, c_i, s_i, t_i, a_i] represents the position, accuracy, scene, time and appearance features of the track, respectively.
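As a minimal sketch of Eq. (1) (the function name is illustrative, not from the patent), the track accuracy of claim 2 is the mean per-frame confidence:

```python
def track_accuracy(confidences):
    """Mean per-frame tracking confidence, as in Eq. (1).

    `confidences` holds the confidence alpha_j for each frame of the
    track; a value below 0.2 indicates the target was lost in that
    frame.  Eq. (1) divides by (t_e - t_s); for a track stored as a
    dense list of per-frame values this is taken here as the number
    of frames.
    """
    if not confidences:
        raise ValueError("empty track")
    return sum(confidences) / len(confidences)

# Example: a track observed over five frames.
c = track_accuracy([0.9, 0.8, 0.85, 0.7, 0.75])  # 0.8
```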
3. A large-range-first cross-camera visual target re-identification method, the method comprising: step S1: obtaining initial single-camera tracks of targets; step S2: calculating a piecewise major color spectrum histogram feature of each track and obtaining a track feature representation; step S3: obtaining a calculation formula for the similarity between any two tracks by using a minimum uncertainty method so as to obtain the similarity between any two tracks; and step S4: performing global data association on all the tracks by using a maximum posterior probability method to obtain a cross-camera tracking result, wherein step S2 includes calculating color histograms of targets of each frame, then dividing the color space into 16*2 colors according to the values of H and S, and selecting the first n color values as the features of said targets in said frame:

h = {C_1, C_2, . . . , C_n}    (2)

wherein C_i is one of the first n colors whose pixel counts together account for above 90% of the total pixel count, and a general feature of each track is:

H = Σ_{i=1}^{m_k} h_i    (3)

wherein m_k is the length of track k; calculating the similarities Λ = Sim(h_i, h_j) between all features h_i in the general feature H, and finding a movement period from the similarities between the frames of the track, then re-segmenting the original track feature H according to the period, wherein the periodic information p that might exist in the general feature H is obtained by:

p = argmax_t ( 1/(m_k - t) ) Σ_{j=1}^{m_k - t} Λ_{j,j+t}    (4)

and the track is re-segmented uniformly according to the periodic information p so as to obtain the piecewise major color spectrum histogram feature of the track:

H = {H_1, H_2, . . . , H_d}    (5)

in which d = ⌈m_k/p⌉ represents the number of segments into which the track is segmented.
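Step S2 can be sketched as follows; the helper names, and the choice of histogram intersection for Sim, are assumptions, since the claim does not fix a particular Sim:

```python
import math

def major_colors(hist, coverage=0.9):
    """Eq. (2): keep the most frequent color bins whose pixel counts
    together cover at least `coverage` of all pixels.
    `hist` maps a color-bin index to a pixel count."""
    total = sum(hist.values())
    kept, acc = {}, 0
    for color, count in sorted(hist.items(), key=lambda kv: -kv[1]):
        kept[color] = count
        acc += count
        if acc >= coverage * total:
            break
    return kept

def histogram_similarity(h1, h2):
    """Illustrative Sim(): normalized histogram intersection
    (one common choice; the claim leaves Sim open)."""
    total = max(sum(h1.values()), sum(h2.values()), 1)
    return sum(min(h1.get(c, 0), h2.get(c, 0))
               for c in set(h1) | set(h2)) / total

def find_period(frame_features):
    """Eq. (4): pick the lag t maximizing the mean similarity
    between frames j and j+t over the whole track."""
    m = len(frame_features)
    best_p, best_score = 1, -math.inf
    for t in range(1, m):
        score = sum(histogram_similarity(frame_features[j], frame_features[j + t])
                    for j in range(m - t)) / (m - t)
        if score > best_score:
            best_p, best_score = t, score
    return best_p

def segment(frame_features, p):
    """Eq. (5): cut the track into d = ceil(m_k / p) pieces of length p."""
    return [frame_features[i:i + p] for i in range(0, len(frame_features), p)]
```

For a track whose per-frame major-color features alternate with period 2, `find_period` returns 2 and `segment` yields d = ⌈6/2⌉ = 3 segments.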
4. The method according to claim 3, characterized in that said step S3 specifically includes: calculating a similarity between two tracks to guide matching between the tracks, and maximizing the similarity while minimizing the uncertainty, so that the obtained similarity match value can reflect the real similarity relation between the two tracks, wherein the matching formula is:

Dis(H^A, H^B) = 1 - [ max Sim(H_i^A, H_j^B) - min Sim(H_u^A, H_v^B) ] / [ max Sim(H_i^A, H_j^B) + min Sim(H_u^A, H_v^B) ]    (6)

in which H^A and H^B are the piecewise major color spectrum histogram features of the two tracks, and H_i^A and H_j^B are segments thereof, i = {1, 2, . . . , d_A}, j = {1, 2, . . . , d_B}.

5. The method according to claim 4, characterized in that said step S4 specifically includes: step S4-1: obtaining each globally associated track T_i = {l_{i1}, l_{i2}, . . . , l_{ik}}, and obtaining the general set of associated tracks T = {T_1, T_2, . . . , T_m}, m being the number of associated tracks; then obtaining the maximum posterior probability of the set T given the set L of tracks, under the constraint that the associated tracks do not overlap:

T* = argmax_T Π_i P(l_i | T) Π_{T_k ∈ T} P(T_k),  T_i ∩ T_j = ∅, ∀ i ≠ j    (7)

wherein P(l_i | T) is the similarity of track l_i, and P(T_k) is the prior probability of an associated track, which can be represented by a Markov chain containing transition probabilities Π P(l_{k_{i+1}} | l_{k_i});

step S4-2: building a graph structure, wherein each node represents a track l_i and its value is c_i, and each edge represents a prior probability P(l_i → l_j); and obtaining the set that maximizes T* from the minimum-cost flow of the entire graph, wherein the cost energy e_ij of each flow is represented by a negative logarithmic function as:

e_ij = -log P(L | l_i → l_j) P(l_i → l_j) = -log(P_m · P_t · P_a)    (8)

in which P_m and P_t respectively represent the match probabilities of motion information and time information between tracks, and P_a represents the match probability of appearance features of tracks, whose matching similarity formula is:

P_a = Dis(H^A, H^B) if s_i = s_j;  λ · Dis(H^A, H^B) if s_i ≠ s_j    (9)

and once the cost energy of each flow is obtained, a traversal is performed to finally obtain the set T that maximizes the posterior probability, which is the result of multi-camera target tracking and re-identification.

6. The method according to claim 1, wherein step S2 includes finding a movement period from the similarities of an original track feature between the frames of the track, then re-segmenting the original track feature according to the period.

7. A non-transitory storage medium tangibly embodying a program of machine-readable instructions executable by a digital processing apparatus to perform a method, said method comprising: step S1: obtaining initial single-camera tracks of targets; step S2: calculating a piecewise major color spectrum histogram feature of each track and obtaining a track feature representation; step S3: obtaining a calculation formula for the similarity between any two tracks by using a minimum uncertainty method so as to obtain the similarity between any two tracks; and step S4: performing global data association on all the tracks by using a maximum posterior probability method to obtain a cross-camera tracking result.

8. The non-transitory storage medium according to claim 7, characterized in that in said step S1, for each track, the mean confidence over all frames is used to represent the track accuracy of the track:

c = ( Σ_{j=t_s}^{t_e} α_j ) / (t_e - t_s)    (1)

wherein the confidence α_j represents the tracking result of each frame, α < 0.2 means that the tracked target is lost, and t_s and t_e are respectively the start frame and end frame of the track; the finally formed set of tracks of all targets is L = {l_1, l_2, . . . , l_N}, wherein N is the total number of tracks, and each track l_i = [x_i, c_i, s_i, t_i, a_i] represents the position, accuracy, scene, time and appearance features of the track, respectively.

9. The non-transitory storage medium according to claim 8, characterized in that said step S2 specifically includes: calculating color histograms of targets of each frame, then dividing the color space into 16*2 colors according to the values of H and S, and selecting the first n color values as the features of said targets in said frame:

h = {C_1, C_2, . . . , C_n}    (2)

wherein C_i is one of the first n colors whose pixel counts together account for above 90% of the total pixel count, and a general feature of each track is:

H = Σ_{i=1}^{m_k} h_i    (3)

wherein m_k is the length of track k; calculating the similarities Λ = Sim(h_i, h_j) between all features h_i in the general feature H, and finding a movement period from the similarities between the frames of the track, then re-segmenting the original track feature H according to the period, wherein the periodic information p that might exist in the general feature H is obtained by:

p = argmax_t ( 1/(m_k - t) ) Σ_{j=1}^{m_k - t} Λ_{j,j+t}    (4)

and the track is re-segmented uniformly according to the periodic information p so as to obtain the piecewise major color spectrum histogram feature of the track:

H = {H_1, H_2, . . . , H_d}    (5)

in which d = ⌈m_k/p⌉ represents the number of segments into which the track is segmented.
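The minimum-uncertainty match of Eq. (6) can be sketched directly; `sim` is any segment-level similarity with positive values, left as a parameter since the claims do not fix it:

```python
from itertools import product

def min_uncertainty_distance(segs_a, segs_b, sim):
    """Eq. (6): 1 - (max Sim - min Sim) / (max Sim + min Sim) over all
    pairs of segments of the two piecewise features.  The value is
    high when the segment-pair similarities are consistent (low
    uncertainty) and low when they are widely spread."""
    sims = [sim(a, b) for a, b in product(segs_a, segs_b)]
    hi, lo = max(sims), min(sims)
    return 1 - (hi - lo) / (hi + lo)

# Toy example with scalar "segments" and an illustrative similarity.
d = min_uncertainty_distance([1, 2], [1], sim=lambda a, b: a * b / 2)
# sims are 0.5 and 1.0, so d = 1 - 0.5/1.5 = 2/3
```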
10. The non-transitory storage medium according to claim 9, characterized in that said step S3 specifically includes: calculating a similarity between two tracks to guide matching between the tracks, and maximizing the similarity while minimizing the uncertainty, so that the obtained similarity match value can reflect the real similarity relation between the two tracks, wherein the matching formula is:

Dis(H^A, H^B) = 1 - [ max Sim(H_i^A, H_j^B) - min Sim(H_u^A, H_v^B) ] / [ max Sim(H_i^A, H_j^B) + min Sim(H_u^A, H_v^B) ]    (6)

in which H^A and H^B are the piecewise major color spectrum histogram features of the two tracks, and H_i^A and H_j^B are segments thereof, i = {1, 2, . . . , d_A}, j = {1, 2, . . . , d_B}.

11. The non-transitory storage medium according to claim 10, characterized in that said step S4 specifically includes: step S4-1: obtaining each globally associated track T_i = {l_{i1}, l_{i2}, . . . , l_{ik}}, and obtaining the general set of associated tracks T = {T_1, T_2, . . . , T_m}, m being the number of associated tracks; then obtaining the maximum posterior probability of the set T given the set L of tracks, under the constraint that the associated tracks do not overlap:

T* = argmax_T Π_i P(l_i | T) Π_{T_k ∈ T} P(T_k),  T_i ∩ T_j = ∅, ∀ i ≠ j    (7)

wherein P(l_i | T) is the similarity of track l_i, and P(T_k) is the prior probability of an associated track, which can be represented by a Markov chain containing transition probabilities Π P(l_{k_{i+1}} | l_{k_i});

step S4-2: building a graph structure, wherein each node represents a track l_i and its value is c_i, and each edge represents a prior probability P(l_i → l_j); and obtaining the set that maximizes T* from the minimum-cost flow of the entire graph, wherein the cost energy e_ij of each flow is represented by a negative logarithmic function as:

e_ij = -log P(L | l_i → l_j) P(l_i → l_j) = -log(P_m · P_t · P_a)    (8)

in which P_m and P_t respectively represent the match probabilities of motion information and time information between tracks, and P_a represents the match probability of appearance features of tracks, whose matching similarity formula is:

P_a = Dis(H^A, H^B) if s_i = s_j;  λ · Dis(H^A, H^B) if s_i ≠ s_j    (9)

and once the cost energy of each flow is obtained, a traversal is performed to finally obtain the set T that maximizes the posterior probability, which is the result of multi-camera target tracking and re-identification.
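The edge cost of Eqs. (8) and (9) combines three match probabilities; λ and the probabilities themselves are inputs here, since the claims define only how they combine:

```python
import math

def appearance_probability(dis, same_scene, lam=0.5):
    """Eq. (9): the appearance match probability is Dis(H^A, H^B)
    within one scene, and lambda * Dis(H^A, H^B) across scenes.
    lam=0.5 is an illustrative value; the patent leaves lambda open."""
    return dis if same_scene else lam * dis

def edge_cost(p_motion, p_time, p_appearance):
    """Eq. (8): the cost energy of a flow edge is the negative log of
    the product of the motion, time and appearance probabilities."""
    return -math.log(p_motion * p_time * p_appearance)

# Example: an edge between tracks from different scenes.
pa = appearance_probability(0.8, same_scene=False)  # 0.4
e = edge_cost(0.5, 0.5, pa)                         # -log(0.1)
```

Lower cost means a more probable association, so a minimum-cost flow over these edges maximizes the posterior of Eq. (7).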
Patents cited by this patent (8)
Lv, Fengjun; Xu, Wei; Gong, Yihong, Efficient multi-hypothesis multi-human 3D tracking in crowded scenes.
Zhao, Tao; Aggarwal, Manoj; Kumar, Rakesh; Sawhney, Harpreet, Method and apparatus for tracking objects over a wide area using a network of stereo sensors.
Center, Jr., Julian L.; Wren, Christopher R.; Basu, Sumit; Gusyatin, Evgeniy, Methods of establishing a communications link using perceptual sensing of a user's presence.
Lipton, Alan J.; Strat, Thomas M.; Venetianer, Péter L.; Allmen, Mark C.; Severson, William E.; Haering, Niels; Chosak, Andrew J.; Zhang, Zhong; Frazier, Matthew F.; Seekas, James S.; Hirata, Tasuki; Clark, John, Video surveillance system employing video primitives.