G10L25/03

Device, system and method for identifying a scene based on an ordered sequence of sounds captured in an environment
11521626 · 2022-12-06

An identification device, method and system for identifying a scene in an environment. The environment includes at least one sound capture device. The identification device is configured to identify the scene based on at least two sounds captured in the environment. Each of the at least two sounds is associated with at least one sound class. The scene is identified by taking into account the chronological order in which the at least two sounds were captured.
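
The role of chronological order can be illustrated with a minimal sketch. It assumes each captured sound has already been labelled with one sound class and a timestamp, and it matches the time-ordered labels against ordered templates. The template table, the class names and the function names are all hypothetical; the abstract does not specify how the matching is done.

```python
from typing import Iterable, Optional, Sequence, Tuple

# Hypothetical scene templates: each scene is an ORDERED list of sound classes.
# Both scenes use the same three classes; only the order differs.
SCENE_TEMPLATES = {
    "arriving_home": ["key_in_lock", "door_open", "door_close"],
    "leaving_home": ["door_open", "door_close", "key_in_lock"],
}


def is_subsequence(pattern: Sequence[str], sequence: Iterable[str]) -> bool:
    """Return True if `pattern` occurs in `sequence` in the same order
    (other sounds may occur in between)."""
    it = iter(sequence)
    # `label in it` advances the iterator, so each match must come
    # after the previous one.
    return all(label in it for label in pattern)


def identify_scene(events: Iterable[Tuple[float, str]]) -> Optional[str]:
    """events: (timestamp_seconds, sound_class) pairs in any order."""
    ordered = [label for _, label in sorted(events)]  # chronological order
    for scene, pattern in SCENE_TEMPLATES.items():
        if is_subsequence(pattern, ordered):
            return scene
    return None
```

With these assumed templates, the same three sound classes map to different scenes depending only on the order in which they were captured.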

Whispering voice recovery method, apparatus and device, and readable storage medium

A method, an apparatus and a device for converting whispered speech, and a readable storage medium, are provided. The method is based on a whispered speech converting model. The model is trained in advance using recognition results and whispered speech training acoustic features of whispered speech training data as samples, and normal speech acoustic features of normal speech data parallel to the whispered speech training data as sample labels. A whispered speech acoustic feature and a preliminary recognition result of whispered speech data are acquired and then input into the preset whispered speech converting model to obtain a normal speech acoustic feature output by the model. In this way, the whispered speech can be converted to normal speech.

METHOD, DEVICE AND SYSTEM FOR DETERMINING RELATIVE ANGLE BETWEEN INTELLIGENT DEVICES
20220365166 · 2022-11-17

The present application provides a method, device and system for determining a relative angle between intelligent devices, and intelligent devices. The method is applicable to a first intelligent device. The first intelligent device includes a first sound detection module and a second sound detection module. The relative angle between intelligent devices can be determined quickly, simply, conveniently and accurately.
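
The abstract does not say how the angle is computed. One common way to obtain a bearing from two sound detection modules is the time difference of arrival (TDOA) under a far-field assumption, sketched below. The technique, the speed-of-sound constant, the function name and all parameters are assumptions made for illustration, not the method of the application.

```python
import math

SPEED_OF_SOUND_M_S = 343.0  # assumed value for dry air at about 20 °C


def relative_angle_deg(t1_s: float, t2_s: float, spacing_m: float,
                       speed_m_s: float = SPEED_OF_SOUND_M_S) -> float:
    """Estimate the bearing of a sound source from two detection modules.

    t1_s, t2_s: arrival times of the same sound at module 1 and module 2.
    spacing_m:  distance between the two modules.
    Returns the angle in degrees measured from the broadside direction
    (the normal to the line joining the modules), assuming a far-field source.
    """
    if spacing_m <= 0:
        raise ValueError("spacing_m must be positive")
    path_difference_m = speed_m_s * (t2_s - t1_s)
    # Clamp to [-1, 1] so timing noise cannot push asin out of its domain.
    ratio = max(-1.0, min(1.0, path_difference_m / spacing_m))
    return math.degrees(math.asin(ratio))
```

In this sketch, equal arrival times give 0 degrees (source straight ahead), and the sign of the result indicates which module the sound reached first.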

Determining corrections to be applied to a multichannel audio signal, associated coding and decoding
20220358937 · 2022-11-10

A method and device for determining a set of corrections to be made to a multichannel sound signal, in which the set of corrections is determined on the basis of an item of information representative of a spatial image of an original multichannel signal and an item of information representative of a spatial image of the original multichannel signal that has been coded and then decoded.

Dynamic vocabulary customization in automated voice systems

Techniques to dynamically customize a menu system presented to a user by a voice interaction system are provided. Audio data from a user that includes the speech of a user can be received. Features can be extracted from the received audio data, including a vocabulary of the speech of the user. The extracted features can be compared to features associated with a plurality of user group models. A user group model to assign to the user from the plurality of user group models can be determined based on the comparison. The user group models can cluster users together based on estimated characteristics of the users and can specify customized menu systems for each different user group. Audio data can then be generated and provided to the user in response to the received audio data based on the determined user group model assigned to the user.
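
The comparison step can be sketched as a nearest-centroid assignment: the extracted feature vector is compared with one representative vector per user group model, and the closest group determines the menu. The group names, feature values, menu entries and function names below are hypothetical, and Euclidean distance is only one possible comparison; the abstract does not name a metric.

```python
import math
from typing import Dict, List, Sequence

# Hypothetical user group models: a feature centroid plus a customized menu.
GROUP_MODELS: Dict[str, dict] = {
    "novice": {
        "centroid": [0.2, 0.1],
        "menu": ["Check balance", "Speak to an agent"],
    },
    "expert": {
        "centroid": [0.8, 0.9],
        "menu": ["Transfers", "Disputes", "Account settings"],
    },
}


def assign_group(features: Sequence[float]) -> str:
    """Return the name of the group whose centroid is closest to `features`."""
    return min(
        GROUP_MODELS,
        key=lambda name: math.dist(features, GROUP_MODELS[name]["centroid"]),
    )


def menu_for(features: Sequence[float]) -> List[str]:
    """Return the customized menu of the group assigned to the user."""
    return GROUP_MODELS[assign_group(features)]["menu"]
```

A real system would derive the feature vector from the caller's audio (for example, from vocabulary statistics); here it is simply passed in as a list of numbers.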

METHOD AND SYSTEM FOR IDENTIFYING RECIPIENTS OF A REWARD ASSOCIATED WITH A CONVERSION

The present teaching relates to a method and system for evaluating a conversion. The method extracts meta-information including a conversion parameter and a reward. The meta-information corresponds to a conversion associated with an advertisement displayed previously by a plurality of entities. The method receives a plurality of claims for the conversion from one or more entities, and selects a claim corresponding to an entity from the plurality of claims based on the conversion parameter and information included in the plurality of claims. Further, the method transmits information related to the selected claim.
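
The selection step can be illustrated with a minimal sketch. It assumes the conversion parameter is an attribution window in seconds and that each claim carries the time at which the claiming entity displayed the advertisement; the most recent display inside the window wins. This "last display within the window" rule, the `Claim` type and the function name are assumptions for illustration only; the abstract does not define the selection rule.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass
class Claim:
    entity: str          # hypothetical identifier of the claiming entity
    displayed_at: float  # time the entity displayed the ad, in seconds


def select_claim(claims: Iterable[Claim], conversion_time: float,
                 window_s: float) -> Optional[Claim]:
    """Pick the claim with the latest display time that falls within
    `window_s` seconds before the conversion; None if no claim qualifies."""
    eligible = [
        c for c in claims
        if 0 <= conversion_time - c.displayed_at <= window_s
    ]
    return max(eligible, key=lambda c: c.displayed_at, default=None)
```

Under these assumptions, a claim whose display happened after the conversion, or too long before it, is never selected.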
