H04N5/278

NON-TRANSITORY TANGIBLE STORAGE DEVICE, SUBTITLE DISPLAY PROCESSING DEVICE, AND SERVER
20220342623 · 2022-10-27 · ·

A subtitle display processing device, a server, and a non-transitory tangible storage medium that stores a program are provided. The subtitle display processing device includes: a data outputter configured to output text data; a display setter configured to set a display position and a display size of a text display area; an indicator configured to display the text data in the text display area; and a changer configured to change at least one of the display position and the display size of the text display area in accordance with a preset change condition.

NON-TRANSITORY TANGIBLE STORAGE DEVICE, SUBTITLE DISPLAY PROCESSING DEVICE, AND SERVER
20220342623 · 2022-10-27 · ·

A subtitle display processing device, a server, and a non-transitory tangible storage medium that stores a program are provided. The subtitle display processing device includes: a data outputter configured to output text data; a display setter configured to set a display position and a display size of a text display area; an indicator configured to display the text data in the text display area; and a changer configured to change at least one of the display position and the display size of the text display area in accordance with a preset change condition.

SUBTITLE GENERATION METHOD AND APPARATUS, AND DEVICE AND STORAGE MEDIUM
20230128946 · 2023-04-27 ·

The present disclosure provides a subtitle generation method and apparatus, a device, and a storage medium, and the method includes: in response to a subtitle generation triggering operation directed against at least one audio track in a target audio-video file, performing speech recognition on audio data on each audio track respectively to obtain text fragments corresponding to each audio track; and generating subtitles of the target audio-video file based on the text fragments corresponding to each audio track. Compared with a method of performing overall speech recognition on audio data on all audio tracks, in the present disclosure, independent speech recognition is performed on the audio data on each audio track, and thus, the influences of the audio tracks on each other are avoided, so that more accurate speech recognition results can be obtained, thereby improving the accuracy of subtitles generated based on the speech recognition results.

METHODS OF ADJUSTING A POSITION OF IMAGES, VIDEO, AND/OR TEXT ON A DISPLAY SCREEN OF A MOBILE ROBOT

Implementations of the disclosed subject matter provide a mobile robot that moves within an area and captures image data by an image sensor. A position of text, an image, and/or video on a display screen of a display mounted to the mobile robot may be adjusted based on the image data captured by the mobile robot that includes one or more persons that are within the area. The text, the image, and/or the video may be at the adjusted position in the display screen of the display and audio via a speaker of the mobile robot to the one or more persons based on their heights, eye level, whether they are seated, or the like.

METHODS OF ADJUSTING A POSITION OF IMAGES, VIDEO, AND/OR TEXT ON A DISPLAY SCREEN OF A MOBILE ROBOT

Implementations of the disclosed subject matter provide a mobile robot that moves within an area and captures image data by an image sensor. A position of text, an image, and/or video on a display screen of a display mounted to the mobile robot may be adjusted based on the image data captured by the mobile robot that includes one or more persons that are within the area. The text, the image, and/or the video may be at the adjusted position in the display screen of the display and audio via a speaker of the mobile robot to the one or more persons based on their heights, eye level, whether they are seated, or the like.

CAPTIONING COMMUNICATION SYSTEMS

A method to generate a contact list may include receiving an identifier of a first communication device at a captioning system. The first communication device may be configured to provide first audio data to a second communication device. The second communication device may be configured to receive first text data of the first audio data from the captioning system. The method may further include receiving and storing contact data from each of multiple communication devices at the captioning system. The method may further include selecting the contact data from the multiple communication devices that include the identifier of the first communication device as selected contact data and generating a contact list based on the selected contact data. The method may also include sending the contact list to the first communication device to provide the contact list as contacts for presentation on an electronic display of the first communication device.

CAPTIONING COMMUNICATION SYSTEMS

A method to generate a contact list may include receiving an identifier of a first communication device at a captioning system. The first communication device may be configured to provide first audio data to a second communication device. The second communication device may be configured to receive first text data of the first audio data from the captioning system. The method may further include receiving and storing contact data from each of multiple communication devices at the captioning system. The method may further include selecting the contact data from the multiple communication devices that include the identifier of the first communication device as selected contact data and generating a contact list based on the selected contact data. The method may also include sending the contact list to the first communication device to provide the contact list as contacts for presentation on an electronic display of the first communication device.

CAPTION MODIFICATION AND AUGMENTATION SYSTEMS AND METHODS FOR USE BY HEARING ASSISTED USER
20230066793 · 2023-03-02 ·

A system and method for facilitating communication between an assisted user (AU) and a hearing user (HU) includes receiving an HU voice signal as the AU and HU participate in a call using AU and HU communication devices, transcribing HU voice signal segments into verbatim caption segments, processing each verbatim caption segment to identify an intended communication (IC) intended by the HU upon uttering an associated one of the HU voice signal segments, for at least a portion of the HU voice signal segments (i) using an associated IC to generate an enhanced caption different than the associated verbatim caption, (ii) for each of a first subset of the HU voice signal segments, presenting the verbatim captions via the AU communication device display for consumption, and (iii) for each of a second subset of the HU voice signal segments, presenting enhanced captions via the AU communication device display for consumption.

CAPTION MODIFICATION AND AUGMENTATION SYSTEMS AND METHODS FOR USE BY HEARING ASSISTED USER
20230066793 · 2023-03-02 ·

A system and method for facilitating communication between an assisted user (AU) and a hearing user (HU) includes receiving an HU voice signal as the AU and HU participate in a call using AU and HU communication devices, transcribing HU voice signal segments into verbatim caption segments, processing each verbatim caption segment to identify an intended communication (IC) intended by the HU upon uttering an associated one of the HU voice signal segments, for at least a portion of the HU voice signal segments (i) using an associated IC to generate an enhanced caption different than the associated verbatim caption, (ii) for each of a first subset of the HU voice signal segments, presenting the verbatim captions via the AU communication device display for consumption, and (iii) for each of a second subset of the HU voice signal segments, presenting enhanced captions via the AU communication device display for consumption.

Invitation media overlays for private collections of media content items
11665116 · 2023-05-30 · ·

Method of generating invitation media overlays for private collections starts with processor receiving first media content item from first client device associated with first user. Processor receives from first client device a selection of invitation media overlay to be applied to media content item. Invitation media overlay is associated with private collection of media content items. Processor generates modified first media content item by overlaying invitation media overlay on first media content item. Processor generates the private collection of media content items including modified first media content item. Processor receives from first client device selection of second user associated with the second user and causes modified first media content item to be displayed by the second client device. Processor receives selection of invitation media overlay from second client device and causes the private collection of media content items to be displayed by second client device. Other embodiments are described herein.