G10L2025/906

Method and system for diagnosing coronary artery disease (CAD) using a voice signal
10796714 · 2020-10-06 · ·

The present invention extends to methods, systems, for diagnosing coronary artery disease (CAD) in patients by using their voice signal comprising receiving voice signal data indicative of speech from the patient.

System and method of social networks for dogs or other pets
10776615 · 2020-09-15 ·

A social network system for pets comprises a sensor for detecting or collecting a pet's voices or body signals, a translation unit for comparing the detected pet's voices or body signals with the stored sample patterns, determining the pet's emotions or feelings by choosing one or many sample patterns which are most closely matched to the detected pet's voices or body signals, and a processing unit for performing one or many actions based on the determined pet's emotions or feelings.

Apparatus, method, and non-transitory computer-readable storage medium for storing program for utterance section detection

A method for utterance section detection includes: executing pitch gain calculation processing that includes calculating a pitch gain indicating an intensity of periodicity of an audio signal expressing a voice of a speaker for each of frames that are obtained by dividing the audio signal and that each have a predetermined length; and executing utterance section detection processing that includes determining that an utterance section on the audio signal starts when the pitch gain becomes greater than or equal to a first threshold value after a non-utterance section on the audio signal lasts, wherein the utterance section detection processing further includes determining that the utterance section ends when the pitch gain becomes less than a second threshold value lower than the first threshold value after the utterance section lasts.

COGNITIVE FUNCTION EVALUATION DEVICE, COGNITIVE FUNCTION EVALUATION SYSTEM, COGNITIVE FUNCTION EVALUATION METHOD, AND NON-TRANSITORY COMPUTER-READABLE STORAGE MEDIUM

A cognitive function evaluation device includes: an instruction unit that instructs quick pronunciation of pseudoword in which a predetermined syllable is repeated; an obtainment unit that obtains voice data indicating a voice of an evaluatee who has received an instruction; a calculation unit that calculates a feature from the voice data obtained by the obtainment unit; an evaluation unit that evaluates a cognitive function of the evaluatee from the feature calculated by the calculation unit; and an output unit that outputs a result of the evaluation by the evaluation unit.

METHOD AND APPARATUS FOR DISPLAYING PITCH INFORMATION IN LIVE WEBCAST ROOM, AND STORAGE MEDIUM
20200194027 · 2020-06-18 ·

The present disclosure provides a method for displaying pitch information in a live webcast room. The method includes: determining first human voice pitch information based on a human voice of a streamer captured by a streamer terminal in a live webcast room; acquiring information of a song, the song being at least one of a song played by the streamer terminal and a song sung by the streamer; acquiring standard pitch information of the song based on the information of the song; and displaying the first human voice pitch information and the standard pitch information on an audience terminal in the live webcast room.

Predicting glottal insufficiency using frequency analysis

Systems and methods of predicting glottal insufficiency by at least one hardware processor including receiving a voice recording comprising a phonation by a subject, analysis of the voice recording to calculate a fundamental frequency contour curve of the phonation, and measurement of at least one of (i) a time period from a start of the phonation until the contour curve reaches a settled level, (ii) a slope of the contour curve during the time period, and (iii) an area under the contour curve during that time period. In certain embodiments, the processor subsequently, determines a glottal closure insufficiency in the subject based on these measurements.

Mechanism and instrumentation for metering conversations

A conversation meter comprises a memory storage comprising instructions and one or more processors in communication with the memory storage. The one or more processors execute the instructions to perform: accessing audio data representing a conversation among a plurality of people; analyzing the audio data to associate one or more portions of the audio data with each person of the plurality of people; analyzing the portions of the audio data to determine one or more conversation metrics for each person of the plurality of people; and causing presentation of at least one of the determined conversation metrics.

HARMONY GENERATION DEVICE AND STORAGE MEDIUM

A harmony generation device and a program for the same which can generate a natural harmony sound are provided. The harmony generation device (1) generates first and second harmony tones to which a voice input through a microphone (M) is shifted in pitch by first and second shift amounts calculated based on both the voice input through the microphone (M) and a chord determined from performance information of an electric guitar (G) input through an input device (34). That is, since the first and second harmony tones can be tones based on the chord of the electric guitar (G) that changes from moment to moment, the harmony sound obtained by mixing the first and second harmony tones with the voice input through the microphone (M) can be a natural harmony sound that is rich in variation according to the chord of the electric guitar (G).

Annoyance noise suppression

Personal audio systems and methods are disclosed. A personal audio system includes a class table storing processing parameters respectively associated with a plurality of annoyance noise classes, a controller, and a processor. The controller identifies an annoyance noise class of an annoyance noise included in an ambient audio stream and retrieves, from the class table, one or more processing parameters associated with the identified annoyance noise class. The processor to processes the ambient audio stream according to the one or more retrieved processing parameters class to provide a personal audio stream. The processor includes a pitch tracker to identify a fundamental frequency of the annoyance noise and a filter bank including a band reject filter tuned to the fundamental frequency.

System and Method for Relative Enhancement of Vocal Utterances in an Acoustically Cluttered Environment
20200074995 · 2020-03-05 ·

The invention discloses systems and methods for enhancing the sound of vocal utterances of interest in an acoustically cluttered environment. The system generates canceling signals (sound suppression signals) for an ambient audio environment and identifies and characterizes desired vocal signals and hence a vocal stream or multiple streams of interest. Each canceling signal, or collectively, the noise canceling stream, is processed so that signals associated with the desired audio stream or streams are dynamically removed from the canceling stream. This modified noise canceling stream is combined (electronically or acoustically) with the ambient to effectuate a destructive interference of all ambient sound except for the removed audio streams, thus enhancing the vocal streams with respect to the unwanted ambient sound. Cepstral analysis may be used to identify a fundamental frequency associated with a voiced human utterance. Filtering derived from that analysis removes the voiced utterance from the canceling signal.