최소 단어 이상 선택하여야 합니다.
최대 10 단어까지만 선택 가능합니다.
다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
NTIS 바로가기다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
DataON 바로가기다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
Edison 바로가기다음과 같은 기능을 한번의 로그인으로 사용 할 수 있습니다.
Kafe 바로가기국가/구분 | United States(US) Patent 등록 |
---|---|
국제특허분류(IPC7판) |
|
출원번호 | US-0870542 (2010-08-27) |
등록번호 | US-8719006 (2014-05-06) |
발명자 / 주소 |
|
출원인 / 주소 |
|
대리인 / 주소 |
|
인용정보 | 피인용 횟수 : 67 인용 특허 : 512 |
In response to a word of a text sequence, a first part-of-speech (POS) tag is generated using a statistical part-of-speech (POS) tagger based on a corpus of trained text sequences, each representing a likely POS of a word for a given text sequence. A second POS tag is generated using a rule-based PO
In response to a word of a text sequence, a first part-of-speech (POS) tag is generated using a statistical part-of-speech (POS) tagger based on a corpus of trained text sequences, each representing a likely POS of a word for a given text sequence. A second POS tag is generated using a rule-based POS tagger based on a set of one or more rules associated with a type of an application associated with the text sequence. A final POS tag is assigned to the word of the text sequence for TTS synthesis based on the first POS tag and the second POS tag.
1. A computer-implemented method for text-to-speech (TTS) synthesis, comprising: in response to a word of a text sequence, generating a first part-of-speech POS tag using a statistical POS tagger based on a corpus of trained text sequences, each representing a likely POS of a word for a given text s
1. A computer-implemented method for text-to-speech (TTS) synthesis, comprising: in response to a word of a text sequence, generating a first part-of-speech POS tag using a statistical POS tagger based on a corpus of trained text sequences, each representing a likely POS of a word for a given text sequence, wherein the first POS tag is selected from a first POS tag set;generating a second POS tag using a rule-based POS tagger based on a set of one or more rules associated with a type of an application associated with the text sequence, wherein the second POS tag is selected from a second POS tag set that is different from the first POS tag set;calculating a first confidence score for the second POS tag based on a statistic data of applying a rule associated with the second POS tag, wherein the first confidence score is calculated based on a percentage of successful applications of the rule in previous TTS synthesis;designating the second POS tag as the final POS tag if the first confidence score is greater than or equal to a first predetermined threshold;designating the first POS tag as the final POS tag if the first confidence score is less than the first predetermined threshold;assigning a final POS tag to the word of the text sequence for TTS synthesis based on the first POS tag and the second POS tag;adjusting the first confidence score for the rule for future TTS synthesis based on whether the second POS tag has been selected as the final POS tag; andremoving the rule from the set of one or more rules if the first confidence score is below a second predetermined threshold. 2. The method of claim 1, wherein assigning a final POS tag comprises assigning either the first POS tag or the second POS tag as the final POS tag if the first POS tag and the second POS tag are identical. 3. The method of claim 1, wherein assigning a final POS tag comprises assigning the first POS tag as the final POS tag if the set of one or more rules do not contain a suitable rule corresponding to the text sequence. 4. The method of claim 1, further comprising: calculating a second confidence score for the first POS tag based on a successful rate of application of the first POS tag using the statistical POS tagger;designating the second POS tag as the final POS tag if the first confidence score is greater than or equal to the second confidence score; anddesignating the first POS tag as the final POS tag if the first confidence score is less than the second confidence score. 5. The method of claim 4, further comprising adjusting one or more parameters of the statistical POS tagger for future usage based on whether the first POS tag has been selected as the final POS tag. 6. A non-transitory machine-readable storage medium having instructions stored therein, which when executed by a machine, cause the machine to perform a method for text-to-speech (TTS) synthesis, the method comprising: in response to a word of a text sequence, generating a first part-of-speech (POS) tag using a statistical POS tagger based on a corpus of trained text sequences, each representing a likely POS of a word for a given text sequence, wherein the first POS tag is selected from a first POS tag set;generating a second POS tag using a rule-based POS tagger based on a set of one or more rules associated with a type of an application associated with the text sequence, wherein the second POS tag is selected from a second POS tag set that is different from the first POS tag set;calculating a first confidence score for the second POS tag based on a statistic data of applying a rule associated with the second POS tag, wherein the first confidence score is calculated based on a percentage or successful applications of the rule in previous TTS synthesis;designating the second POS tag as the final POS tag if the first confidence score is greater than or equal to a first predetermined threshold;designating the first POS tag as the final POS tag if the first confidence score is less than the first predetermined threshold;assigning a final POS tag to the word of the text sequence for TTS synthesis based on the first POS tag and the second POS tag;adjusting the first confidence score for the rule for future TTS synthesis based on whether the second POS tag has been selected as the final POS tag; andremoving the rule from the set of one or more rules if the first confidence score is below a second predetermined threshold. 7. The machine-readable storage medium of claim 6, wherein assigning a final POS tag comprises assigning either the first POS tag or the second POS tag as the final POS tag if the first POS tag and the second POS tag are identical. 8. The machine-readable storage medium of claim 6, wherein assigning a final POS tag comprises assigning the first POS tag as the final POS tag if the set of one or more rules do not contain a suitable rule corresponding to the text sequence. 9. The machine-readable storage medium of claim 6, wherein the method further comprises: calculating a second confidence score for the first POS tag based on a successful rate of application of the first POS tag using the statistical POS tagger;designating the second POS tag as the final POS tag if the first confidence score is greater than or equal to the second confidence score; anddesignating the first POS tag as the final POS tag if the first confidence score is less than the second confidence score. 10. The machine-readable storage medium of claim 9, wherein the method further comprises adjusting one or more parameters of the statistical POS tagger for future usage based on whether the first POS tag has been selected as the final POS tag. 11. A computer-implemented method for text-to-speech (TTS) synthesis, the method comprising: in response to a word of a text sequence, generating a first part-of-speech (POS) tag using a statistical POS tagger based on a corpus of trained text sequences, each representing a likely POS of a word for a given text sequence, wherein the first POS tag is selected from a first POS tag set;generating a second POS tag using a rule-based POS tagger based on a set of one or more rules associated with a type of an application associated with the text sequence, wherein the second POS tag is selected from a second POS tag set that is different from the first POS tag set;converting the second POS tag to a corresponding tag in the first POS tag set; andassigning a final POS tag to the word of the text sequence for TTS synthesis based on the first POS tag and the second POS tag. 12. The method of claim 11, wherein converting the second POS tag includes using a table that translates tags between the first POS tag set and the second POS tag set. 13. A computer-implemented method for text-to-speech (TTS) synthesis, the method comprising: in response to a word of a text sequence, generating a first part-of-speech (POS) tag using a statistical POS tagger based on a corpus of trained text sequences, each representing a likely POS of a word for a given text sequence, wherein the first POS tag is selected from a first POS tag set;generating a second POS tag using a rule-based POS tagger based on a set of one or more rules associated with a type of an application associated with the text sequence, wherein the second POS tag is selected from a second POS tag set that is different from the first POS tag set;converting the first POS tag to a corresponding tag in the second POS tag set; andassigning a final POS tag to the word of the text sequence for TTS synthesis based on the first POS tag and the second POS tag. 14. A computer-implemented method for text-to-speech (TTS) synthesis, the method comprising: in response to a word of a text sequence, generating a first part-of-speech (POS) tag using a statistical POS tagger;generating a second POS tag using a rule-based POS tagger;calculating a confidence score for the second POS tag based on a statistic data of applying a rule associated with the second POS tag,assigning a final POS tag to the word of the text sequence for TTS synthesis, including: assigning the second POS tag as the final POS tag if the confidence score is greater than or equal to a first predetermined threshold; andassigning the first POS tag as the final POS tag if the confidence score is less than the first predetermined threshold;adjusting the confidence score for the rule for future TTS synthesis based on whether the second POS tag has been selected as the final POS tag; andremoving the rule from the set of one or more rules if the confidence score is below a second predetermined threshold. 15. The method of claim 14, wherein the confidence score is calculated based on a percentage of successful applications of the rule in previous TTS synthesis. 16. The method of claim 14, wherein the first POS tag is selected from a first POS tag set, and wherein the second POS tag is selected from a second POS tag set that is different from the first POS tag set. 17. A system, comprising: one or more processors; andmemory having instructions stored thereon, the instructions, when executed by the one or more processors, cause the processors to perform operations comprising: in response to a word of a text sequence, generating a first part-of-speech (POS) tag using a statistical POS tagger;generating a second POS tag using a rule-based POS tagger;calculating a confidence score for the second POS tag based on a statistic data of applying a rule associated with the second POS tag;assigning a final POS tag to the word of the text sequence for TTS synthesis, including: assigning the second POS tag as the final POS tag if the confidence score is greater than or equal to a first predetermined threshold; andassigning the first POS tag as the final POS tag if the confidence score is less than the first predetermined threshold;adjusting the confidence score for the rule for future TTS synthesis based on whether the second POS tag has been selected as the final POS tag; and removing the rule from the set of one or more rules if the confidence score is below a second predetermined threshold.
Copyright KISTI. All Rights Reserved.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.