G06F16/686

SYSTEMS AND METHODS FOR TRANSFORMING DIGITIAL AUDIO CONTENT INTO VISUAL TOPIC-BASED SEGMENTS

A system for platform-independent visualization of audio content, in particular audio tracks utilizing a central computer system in communication with user devices via a computer network. The central system utilizes various algorithms to identify spoken content from audio tracks and selects visual assets associated with the identified content. Thereafter, a visualized audio track is available for users to listen and view. Audio tracks, for example Podcasts, may be segmented into topical audio segments based upon themes or topics, with segments from disparate podcasts combined into a single listening experience, based upon certain criteria, e.g., topics, themes, keywords, and the like.

Music recommendations from trending queries

A plurality of music playlists created on a content sharing platform and having rankings are identified. A plurality of popular external search queries submitted via one or more search engine platforms external to the content sharing platform are identified. A subset of the plurality of music playlists that matches any of the plurality of popular external search queries is determined, and rankings of the determined subset of music playlists are improved. The personalized music recommendations for the user are created based on rankings of the plurality of music playlists, and the personalized music recommendations are provided for presentation to the user.

TECHNOLOGIES FOR CREATING, ALTERING, AND PRESENTING MEDIA CONTENT

Different types of media experiences can be developed based on characteristics of the consumer. “Linear” experiences may require execution of a pre-built script, although the script could be dynamically modified by a media production platform. Linear experiences can include guided audio tours that are modified or updated based on the location of the consumer. “Enhanced” experiences include conventional media content that is supplemented with intelligent media content. For example, turn-by-turn directions could be supplemented with audio descriptions about the surrounding area. “Freeform” experiences, meanwhile, are those that can continually morph based on information gleaned from a consumer. For example, a radio station may modify what content is being presented based on the geographical metadata uploaded by a computing device associated with the consumer.

LIFELOG DEVICE UTILIZING AUDIO RECOGNITION, AND METHOD THEREFOR

The present invention relates to a lifelog device utilizing audio recognition and a method therefor, and to a device capable of recording and classifying audio lifelogs by means of an artificial intelligence algorithm. To this end, the lifelog device of the present invention comprises: an input unit for inputting lifelog data including an audio signal; and analysis unit for analyzing the inputted data; a determination unit for classifying the class of the data on the basis of the analyzed analysis value; and a recording unit for recording the inputted data and the classified class of the data.

SYSTEMS AND METHODS FOR PHONETIC-BASED NATURAL LANGUAGE UNDERSTANDING
20230017352 · 2023-01-19 ·

Systems and methods are described for modifying a phonetic search index based on a use frequency associated with phonetic representations of text terms included in metadata of a media item. A first phonetic representation of a text term of the metadata, pronounced as a word, may be generated. A second phonetic representation of the text term may be generated by concatenating a phonetic representation of each letter in the text term. A database may be queried to determine use frequencies of the first and second phonetic representations, one of which may be selected based on a comparison of the use frequencies. A phonetic search index may be modified by including an entry for the selected phonetic representation. A voice query related to the media item may be received, and a reply to the voice query may be generated for output by performing a lookup in the modified phonetic search index.

GEOLOCATION BASED PLAYLISTS
20230222160 · 2023-07-13 ·

A data package is received from a plurality of devices. Each data package comprises audio content captured by a respective device from the plurality of devices. Each data package further comprises metadata including a location of the respective device when the audio content was captured and a time at which the audio content was captured. A subset of the data packages that include audio content captured within a specified geographic area and within a specified time period is identified based on the metadata. A playlist for the specified area and the specified time period is generated based on the subset of data packages. The playlist may be provided to at least a first device.

Proximity based audio collaboration

A method includes: defining, by a computer device, an audio collaborative environment; defining, by the computer device, an access control of the audio collaborative environment, wherein the access control includes a geofence; receiving, by the computer device, a request from at least one user device to connect to the audio collaborative environment; determining, by the computer device, the at least one user device satisfies the access control; connecting, by the computer device, the at least one user device to an audio channel of the audio collaborative environment; recording, by the computer device, audio data transmitted on the audio channel by the at least one user device; storing, by the computer device, the audio data in a record; tagging, by the computer device, respective portions of the audio data in the record; and presenting one of the respective portions of the audio data to a user based on the tagging.

Systems and methods for determining descriptors for media content items

An electronic device obtains a plurality of collections of media content items, each collection of media content items being associated with text generated by one or more users of the media-providing service. The electronic device determines a coincidence metric for a first descriptor and a first media content item, the coincidence metric corresponding to a likelihood that the first descriptor appears in the text associated with a respective collection of media content items that includes the first media content item. Based on the coincidence metric, the electronic device generates a new collection of media content items for a first user. The new collection of media content items corresponds to the first descriptor and includes the first media content item.

STRUCTURING AUDIO SESSION DATA WITH INDEPENDENTLY QUERYABLE SEGMENTS FOR EFFICIENT DETERMINATION OF HIGH VALUE CONTENT AND/OR GENERATION OF RECOMBINANT CONTENT
20230222159 · 2023-07-13 ·

This disclosure relates generally to data processing devices and, more particularly, to a method, a device, and/or a system of structuring audio session data with independently queryable segments for efficient determination of high value content and/or generation of recombinant content. In one embodiment, a system for analyzing use of audio files to determine high value content includes a database server storing a data container referencing a segment data comprising an audio data and a segment UID that is independently addressable with a database query. A playback manager receives a playback request and streams the audio data to a device of a user. An interest marker engine receives an interest notification including a first audio time point and a second audio time point and generates an interest marker. An analytics server then generates an insight data from the interest marker and stores the insight data in association with the segment data.

MEDIA COMPOSITION USING NON-FUNGIBLE TOKEN (NFT) CONFIGURABLE PIECES

A system and method for receiving one or more non-fungible tokens (NFTs) that are associated with links to digital assets, ownership information, NFT metadata, media content metadata, or other media content information. The system may associate the one or more NFTs with other NFTs to create a collection of NFTs that form a song, album, video, or other collection/combination of media content. The system may provide a NFT collectible player to interact with the combination of NFTs in a particular order or for a particular duration.