G06F16/483

Systems and methods for improvements to user experience testing

Systems and methods for transcription analysis of a recording are provided. The recording includes an audio and screenshot/video portion. The audio portion is transcribed using a machine learned model. Models may be selected by the recording quality and potentially accents or other speech patterns that are present. The transcription is then linked to the video/screen capture chronology, so that automatic scrolling is enabled, clip selection from the transcription, and searching to a video time is possible. There is improvements to user experience question generation, review of study results, and in managing the study participants.

Systems and methods for improvements to user experience testing

Systems and methods for transcription analysis of a recording are provided. The recording includes an audio and screenshot/video portion. The audio portion is transcribed using a machine learned model. Models may be selected by the recording quality and potentially accents or other speech patterns that are present. The transcription is then linked to the video/screen capture chronology, so that automatic scrolling is enabled, clip selection from the transcription, and searching to a video time is possible. There is improvements to user experience question generation, review of study results, and in managing the study participants.

Generating visual media collections for a dynamic social networking account
11562014 · 2023-01-24 · ·

The present disclosure describes systems, non-transitory computer-readable media, and methods for generating a visual media collection for a social networking account and provide access to (or distribute) images, videos, or other visual media items from the visual media collection separate from social networking posts uncategorized within such a collection for the social networking account. For example, based on follow requests specific to a visual media collection, the disclosed systems can further distribute visual media items in collections posts from a particular visual media collection differing from other visual media collections and from social networking posts uncategorized within such a collection of a social networking account. In certain implementations, the disclosed systems further provide search results comprising a visual media item from a visual media collection based on a description or annotation for the visual media collection or a shared visual media collection with another visual media item.

Generating visual media collections for a dynamic social networking account
11562014 · 2023-01-24 · ·

The present disclosure describes systems, non-transitory computer-readable media, and methods for generating a visual media collection for a social networking account and provide access to (or distribute) images, videos, or other visual media items from the visual media collection separate from social networking posts uncategorized within such a collection for the social networking account. For example, based on follow requests specific to a visual media collection, the disclosed systems can further distribute visual media items in collections posts from a particular visual media collection differing from other visual media collections and from social networking posts uncategorized within such a collection of a social networking account. In certain implementations, the disclosed systems further provide search results comprising a visual media item from a visual media collection based on a description or annotation for the visual media collection or a shared visual media collection with another visual media item.

System and method for enriching a concept database

A system and method for enriching a concept database. The method includes determining, based on at least one signature of a first multimedia data element (MMDE) and signatures of a plurality of existing concepts in the concept database, at least one first concept among the plurality of existing concepts, wherein each of the at least one first concept matches a portion of the at least one signature of the first MMDE; generating a reduced representation of the first MMDE, wherein generating the reduced representation further comprises removing the portion of the first MMDE matching the at least one first concept; comparing the reduced representation of the first MMDE to signatures representing a plurality of second MMDEs to determine a plurality of matching second MMDEs; generating, based on the reduced representation of the first MMDE and the plurality of matching second MMDEs, at least one second concept; and adding the generated at least one second concept to the concept database.

System and method for enriching a concept database

A system and method for enriching a concept database. The method includes determining, based on at least one signature of a first multimedia data element (MMDE) and signatures of a plurality of existing concepts in the concept database, at least one first concept among the plurality of existing concepts, wherein each of the at least one first concept matches a portion of the at least one signature of the first MMDE; generating a reduced representation of the first MMDE, wherein generating the reduced representation further comprises removing the portion of the first MMDE matching the at least one first concept; comparing the reduced representation of the first MMDE to signatures representing a plurality of second MMDEs to determine a plurality of matching second MMDEs; generating, based on the reduced representation of the first MMDE and the plurality of matching second MMDEs, at least one second concept; and adding the generated at least one second concept to the concept database.

Layout-Aware Multimodal Pretraining for Multimodal Document Understanding

Systems and methods for document processing that can process and understand the layout, text size, text style, and multimedia of a document can generate more accurate and informed document representations. The layout of a document paired with text size and style can indicate what portions of a document are possibly more important, and the understanding of that importance can help with understanding of the document. Systems and methods utilizing a hierarchical framework that processes the block-level and the document-level of a document can capitalize on these indicators to generate a better document representation.

Guided information viewing and storage features within web browsers
11704475 · 2023-07-18 ·

The present disclosure relates to non-transitory computer readable mediums (CRMs) for guided-viewing of annotations and the process or organizing and connecting annotations of web documents within web browsers. The rationale for creating and using such computer readable medium is discussed in detail within this disclosure. Throughout the course of this explanation, various steps are dissected and explained in detail in the context of exemplary embodiments to elaborate on the relevant data structures and the architectures, messaging patterns, and use cases that provide the context for these data structures.

Guided information viewing and storage features within web browsers
11704475 · 2023-07-18 ·

The present disclosure relates to non-transitory computer readable mediums (CRMs) for guided-viewing of annotations and the process or organizing and connecting annotations of web documents within web browsers. The rationale for creating and using such computer readable medium is discussed in detail within this disclosure. Throughout the course of this explanation, various steps are dissected and explained in detail in the context of exemplary embodiments to elaborate on the relevant data structures and the architectures, messaging patterns, and use cases that provide the context for these data structures.

DETECTION DEVICE

A detection device detecting a scene related to a sponsor credit included in a commercial message from a target video is provided. The detection device comprises a detection unit that associates, from a preliminary video, a still image related to the sponsor credit with an audio signal related to the sponsor credit included other than in a frame or an audio signal configuring the commercial message so as to detect the scene related to the sponsor credit from the target video.