SYSTEM FOR AUTOMATIC, CUSTOM FOREIGN CONTENT INSERTION INTO DIGITAL CONTENT STREAMS
20170195373 ยท 2017-07-06
Inventors
- Andy Toh (Brooklyn, NY, US)
- Enrique Allegretta (Buenos Aires, AR)
- Pablo Muro (Buenos Aires, AR)
- Benjamin Eidelman (Buenos Aires, AR)
- Brian McConnell (Brooklyn, NY, US)
- Avrohom Y. Charish (New York, NY, US)
Cpc classification
H04L12/1822
ELECTRICITY
H04L67/06
ELECTRICITY
H04M3/56
ELECTRICITY
H04L12/1827
ELECTRICITY
H04L65/403
ELECTRICITY
H04M7/123
ELECTRICITY
H04M2203/5063
ELECTRICITY
International classification
Abstract
Embodiments of the systems and methods disclosed herein enable automated and customized insertion of advertisements in real-time and in response to a request for a content file. Purveyors of content, such as audio and video, often use advertisements to ensure profitability. Embodiments enable users to customize the location(s) for advertisement insertion, and further embodiments allow for real-time insertion of ads, the ads can further be customized to the ad recipient. Real-time, automated delivery improves current technology in several ways by, for example, eliminating the need to manually inserting or choosing advertisements, customizing advertisement delivery to the content recipient, improving the speed of delivery of ad-supported content, giving users the ability to customize the location of advertisement insertion, thereby improving the final ad-supported product by inserting ads in logical, non-pre-planned locations of content files.
Claims
1. A method comprising: receiving, at a audio server system, a content file; placing, via a content marking tool, insertion markers; storing, via a content storage device, the content file and the insertion markers; receiving, at the audio server system a request for the content file; and delivering, via the audio server system, the content file to one or more listener computers, wherein delivering the content file comprises delivering advertisements during playback of the content file when playback reaches an insertion marker.
2. The method of claim 1, wherein receiving the content file comprises receiving a live stream.
3. The method of claim 1, wherein receiving the content file comprises receiving a previously recorded live stream.
4. The method of claim 1 further comprising identifying demographics of users of the one or more listener computers.
5. The method of claim 4 further comprising delivering advertisements based on the demographics of the users of the one or more listener computers.
6. The method of claim 1 further comprising automatically identifying locations for placing insertion markers.
7. The method of claim 6 further comprising providing an interface for moving or deleting the automatically placed insertion markers.
8. The method of claim 6 further comprising identifying moments of silence for automatically placing the insertion markers.
9. A method comprising: providing, by an audio server system, a host dashboard, the host dashboard configured to enable a moderator to invite one or more guests to participate in a conference call based on information provided by the audio server system, the audio server system enabling the moderator to engage in bi-directional communication with the invited one or more guests; recording, by the audio server system, the conference call to generate a content file; storing, by the audio server system, the content file in a database at the audio server system; transmitting, to a user, a user interface to display an image representative of the content file; receiving, at the audio server system via a network, one or more insertion markers; storing, in the database at the audio server system, the one or more insertion markers; receiving, at the audio server system, a request, from one or more listener computers, to download the content file; retrieving, from the database at the audio server system, the content file and the one or more insertion markers; splitting, via the audio server system, the content file into parts at locations indicated by the one or more insertion markers; inserting, via the audio server system, advertisements or foreign content between the parts of the content file; transmitting, via the audio server system, the content file and advertisements or foreign content to the one or more listener computers.
10. The method of claim 9 further comprising providing an interface for placing one or more deletion markers, where the audio server system will not stream content that exists between the one or more deletion markers.
11. The method of claim 9 further comprising receiving a previously recorded content file.
12. The method of claim 9 further comprising identifying demographics of users of the one or more listener computers.
13. The method of claim 12 further comprising delivering advertisements based on the demographics of the users of the one or more listener computers.
14. The method of claim 9 further comprising automatically identifying locations for placing insertion markers.
15. The method of claim 14 further comprising providing an interface for moving or deleting the automatically placed insertion markers.
16. The method of claim 14 further comprising identifying moments of silence for automatically placing the insertion markers.
17. A system comprising: an audio server system configured to provide a host dashboard and further configured to provide an audio stream; a database configured to store the audio stream in a content file for future playback; an ad server, in communication with the audio server system, configured to store advertisements or foreign content; the host dashboard including an insertion point button configured to identify an insertion marker in the content file, wherein the audio server system stores the insertion marker in the database, and the insertion marker represents a location for one or more advertisements; and a web server in communication with the database for streaming the content file to one or more users.
18. The system of claim 17, wherein the audio server system is further configured to automatically identify locations for placing insertion markers.
19. The system of claim 17, further comprising an interface configured to move or delete the automatically placed insertion markers.
20. The system of claim 17, wherein the audio server system is further configured to identify moments of silence to automatically place the insertion markers.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0031] So that the above recited features and advantages of example embodiments can be understood in detail, a more particular description of embodiments, briefly summarized above, may be had by reference to the embodiments thereof that are illustrated in the appended drawings. It is to be noted, however, that the appended drawings illustrate only exemplary embodiments and are therefore not to be considered limiting of its scope, for the invention may admit to other equally effective embodiments.
[0032]
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
[0039]
[0040]
[0041]
[0042]
[0043]
[0044]
DESCRIPTION
[0045] The following description of the exemplary embodiments of the invention is not intended to limit the invention to these exemplary embodiments, but rather to enable any person skilled in the art to make and use this invention. Presently, exemplary embodiments of the invention are shown in the above-identified figures and described in detail below. In describing the exemplary embodiments, like or identical reference numerals are used to identify common or similar elements. The figures are not necessarily to scale and certain features and certain views of the figures may be shown exaggerated in scale or in schematic in the interest of clarity and conciseness.
1. Hardware of the System
[0046] As shown in
[0047] Referring to
[0048] The moderator terminal 14 includes a computer terminal 22 and a two-way audio communication device 24, such as a landline telephone, mobile telephone, VOIP, soft phone or the like, indirectly connected to the audio server system 12 via the networks 20a and 20b. Although the two-way audio communication device 24 is shown separately, the two-way audio communication device 24 can be implemented as a part of the computer terminal 22 so long as such computer terminal 22 is adapted for audio communication. For example, the computer terminal 22 can be provided with a suitable microphone and speaker system. In addition, the two-way audio communication device 24 can be adapted to communicate with the audio server system 12 using either the network 20a or 20b. As discussed below, the computer terminal 22 can be provided with a web browser to permit the moderator to access a variety of information provided by the audio server system 12 regarding the network-based audio/visual conferences. Such information may include call-in telephone numbers, scheduling information or the like and can be provided on a web-page.
[0049] The network 20a may be a packetized or packet-switched network such as the world's public IP-based packet-switched networks, also known as the Internet or some other network-type, such as a wide area network (WAN) or local area network (LAN). The network 20b may be a circuit-switched network such as a public switched network typically used to make telephone calls, i.e., the network of the world's public circuit-switched telephone networks, also known as the PSTN. However, it should be understood that the networks 20a or 20b may be provided as other types of networks, such as a cellular telephone network. For purposes of clarity, the network 20a will be referred to hereinafter as a packetized or packet-switched network, and the network 20b will be referred to hereinafter as a switched network. In an exemplary embodiment, the two-way audio communication device 24 is a conventional telephone provided separately from the computer terminal 22 and communicates with the audio server system 12 via the switched network 20b.
[0050] The guest terminal 16 is also provided with a two-way audio communication device 30, which is shown by way of example as a telephone connected to the switched network 20b. However, it should be understood that the communication device 30 can be implemented in other manners, such as a computer terminal having suitable software and a microphone and speaker, or a landline telephone, mobile telephone, soft phone or voice over internet telephone. In addition, the guest terminal 16 may also be provided with a computer terminal (not shown) having access to the network 20a and also having a web browser to permit the guest to access a variety of information provided by the audio server system 12 regarding the network-based conferences. Such information may include call-in telephone numbers, scheduling information or the like and can be provided on a web-page.
[0051] The listener terminals 18 include a computer terminal 34 for accessing a variety of information provided by the audio server system 12, such as call-in telephone numbers, scheduling information, one-way audio streams of real-time or near real-time network-based audio/video conferences, or stored audio streams of past (not real-time) audio/video conferences. The listener terminals 18 may also include a separate one-way communication device 36 permitting the listener to listen to audio streams of real-time or near real-time network-based audio/video conferences. The one-way communication device 36 can be implemented, by way of example, as a two-way communication device, such as a landline telephone, mobile telephone, soft phone or voice over internet telephone, only allowing the listener to listen to the audio streams of real-time or near real-time or past network-based audio/video conferences.
[0052] The computer terminal 22 or 34 may be a computer having an Internet connection, for example through a direct Internet connection, a LAN, or through an Internet service provider. The computer terminals 22 or 34 may be a windows-based PC, a Macintosh, a cellular telephone or a personal data assistant for example. The computer terminals 22 or 34 can include speakers and web-browser software, such as Microsoft's Internet Explorer or Netscape's Navigator, having audio/video player software such as Real Network's Real Player or Windows Media Player for receiving media streams. The computer terminal 22 may also include a microphone and software for audio output/input to permit two-way audio communication with the audio server system 12.
[0053] One embodiment of the audio server system 12 is shown in more detail in
[0054] The audio server system 12 is also provided with a conferencing system 44, a web server 46, one or more content storage devices, e.g., database or NFS servers 47, one or more real-time media encoder 48a, one or more archive media encoder 48b, a time sync server 49, and a streaming server 50. The moderator terminal 14, and the guest terminal(s) 16 communicate with the conferencing system 44 via the networks 20a, 20b and interface devices 40a and 40b to provide a telephone conference connection for two-way audio communication during the network-based audio/video conference. The listener terminal(s) 18 communicate with the conferencing system 44, or the streaming server 50 to receive one-way or two-way communication during the network-based audio/video conference. When the listener terminal(s) 18 communicate with the conferencing system 44 in a two-way manner, i.e., unmuted, such listener terminal(s) 18 function similarly to guest terminal(s) 16.
[0055] The real-time media encoder 48a receives, in real-time or near real-time, the audio data (or a representation thereof) of the network-based audio/video conference and converts such audio data (or a representation thereof) into a streaming media format. Such audio data is then passed to a time sync server 49.
[0056] During the audio/video conference, the conferencing system 44 outputs a representation of the audio data to the content storage device, e.g., database or NFS server 47, to record the representation of the audio data and save such representation as a content file. Once the audio/video conference is over, the file is output to the archive media encoder 48b, which encodes the representation of the audio data into a streaming format. This encoded data is input to the time sync server 49, described previously. The time sync server 49 then provides the representation of the audio data to the streaming server 50. A hyperlink or button may then be provided on a web page provided by the web server 46 containing a URL directing a listener terminal 18 to the representation of the audio data in the streaming format hosted by the streaming server 50. It should be understood that the real-time media encoder 48a and the archive media encoder 48b can be implemented as a same media encoder, or separately.
[0057] The audio server system 12 also includes the web server 46. The web server 46 functions as an interface between the conferencing system 44 and the streaming server 50 of the audio server system 12 and the network 20a, and runs web server software (stored on one or more computer readable medium) to generate and deliver various web pages for display at the moderator, guest and listener terminals 14, 16 and 18. As discussed in detail below, such web pages delivered by the web server 46 include various input sections and graphical user interfaces (GUIs) that enable (1) remote moderator users to interactively schedule, setup, and control two-way communication access to the network-based audio/video conference, (2) remote guest users to interactively join, communicate with the moderator and listen to the network-based audio/video conference, and (3) remote listeners to listen to the network-based audio/video conference or become guests. The web server 46 enables remote listeners to listen to the real-time or near real-time network-based audio/video conference by connecting the listener terminals 18 to the streaming server 50. In one embodiment, the web server 46 can also connect the listener terminal 18 to the conferencing system 44. This feature is described in more detail below.
[0058] In an exemplary embodiment, the various web pages provided by the web server 46 are available to the public via the network 20a and the web server 46 connects listener terminals 18 to the streaming server 50 without typically requiring any authentication, invitation or verification (in certain instances authentication or verification may be required, such as when the show includes explicit material, and in certain instances the moderator can send out invitations to promote their show). So, the network-based audio/video conference is made available for essentially any listener having a listener terminal 18 capable of accessing the web server 46 and having streaming media software loaded on their listener terminal 18 for converting the representation of the audio data in the streaming media format into sound. As will be discussed in more detail below, due to the compatibility of the audio server system 12 with the packetized network 20a and the switched network 20b the moderator(s), guest(s) and listener(s) can setup, schedule, participate and/or listen to the network-based audio/video conference utilizing conventional telephones and computers having web browsers.
2. Overview of function of the Audio Conferencing System 10
[0059] During a network-based audio/video conference, audio is received by the conferencing system 44 from the two-way communication device 24, e.g., the telephone, of the moderator terminal 14 via the network 20b. The conferencing system 44 transmits a representation of the audio to guest terminal(s) 16 or listener terminal(s) 18 in a first listener group via the network 20b, and also transmits (or at least makes available) a representation of the audio to guest terminal(s) 16 or listener terminal(s) 18 in a second listener group via the packetized network 20a. The audio/video conference can be transmitted to the first listener group and the second listener group in real-time or near real-time (e.g., within a time delay of a few seconds).
[0060] A moderator can be a person who wishes to transmit voice, music, or any other suitable audio for one or more talk shows or audio/visual conferences and utilizes the moderator terminal 14 to communicate with the audio server system 12 via the network 20a or 20b. From the standpoint of the system 10, the moderator can be identified by a password (such as a PIN), but may be identified by any suitable method, such as CallerID or voice signature. A talk show or audio/visual conference can be scheduled for a particular day and time and may be scheduled for a particular timeslot (including start time and end time). However, in other embodiments, the talk show or audio/visual conference can be unscheduled or spontaneous. The talk show or audio/visual conference can be associated with a particular moderator (or moderators). A talk show or audio/visual conference can be described as scheduled, pre-show, in progress, or completed. The web server 46 can be adapted to permit the moderator to invite guests or listeners to the audio/video conference. In this regard, the moderator can login to a computer system hosted by the web server 46 and customize and send e-mail invites to friends and colleagues.
[0061] A guest is a listener who wishes to listen to the talk show or audio/visual conference and also engage in two-way communication with the moderator(s) during the talk show or audio/visual conference. From the standpoint of the system 10, the guest may be identified by a password (such as a PIN), or any other suitable method, such as CallerID or voice signature.
[0062] A listener, or first listener, or second listener is a person who wishes to listen to or view the talk show or audio/visual conference and receive the voice, music, or other suitable audio from the moderator and/or guest. From the standpoint of the system 10, a listener can be authenticated or verified, or not, by any particular method such as a password (such as a PIN), callerID or voice signature, although certainly the telephone number, IP address or other identifier of the listener terminal 18 may be automatically provided to the audio server system 12 for an identification of the listener terminal 18.
[0063] A first listener group is one or more listeners or guests in separate locations. A second listener group is one or more listeners or guests in separate locations from the listeners or guests in the first listener group.
[0064] An audio/visual conference can be either unidirectional or bi-directional, for example, participants might only be able to receive information but not input information or communicate back in the conference. Audio/visual conferences can contain only audio, only video, or both audio and video. All combinations of these embodiments are contemplated.
[0065] A computer readable medium, as used herein, refers to a device capable of storing data in a format that can be read by a computer. Examples of computer readable mediums include a memory, a magnetic disk, an optical disk or a tape.
3. Receiving and Transmitting Audio
[0066] A person seeking to be a moderator typically visits the web server 46 utilizing their moderator computer 22 and signs up for a show and agrees to a password. Then, the web server 46 provides the moderator with a moderator telephone number. After the moderator signs up for a show and agrees to a password, the moderator can call the moderator telephone number and identifies themselves with the password, as shown in
[0067] Then, a representation of the audio is transmitted to the guest terminal 16 or listener terminal 18 of one or more listeners or guests in a first listener group via the telecom switch 40a and network 20b to deliver a representation of the voice, music, or any other suitable audio from the moderator to one or more guests and listeners. The representation of the audio may be an exact representation of the voice, music, or any other suitable audio transmitted from the moderator. The representation of the audio, however, is can be a compressed, filtered, censored, or otherwise processed version of the voice, music, or any other suitable audio transmitted from the moderator. The audio may be transmitted through the circuit-switched telephone network 20b using any suitable audio codec. The audio method and system can evaluate the caller ID of the moderator, and can use the G.729 audio codec for phone calls from an international (or remote) location and the PCMU audio codec for phone calls from a domestic (or nearby) location. However, other types of codecs could be used.
[0068] The audio server system 10 can provide the first listener group with a listener telephone number that corresponds to a particular moderator or show. The listener telephone number is typically provided to the guests or listeners in the first listener group by posting the listener telephone number on a web page associated with the particular moderator or show provided by the web server 46. However, the listener telephone number can be provided in other manners, such as by including the listener telephone number in advertisements for the talk show or audio/visual conference.
[0069] When a first listener calls the listener telephone number, the conferencing system 44 may be configured to play an audio clip (such as a greeting) associated with the particular moderator or show. If there is a show scheduled to start within a predetermined period (such as 15 minutes), the conferencing system 44 can transmit an audio signal to the first listener (as a pre-recorded voice) indicating the time until the start of the show. Although the audio/video conference system 10 might not require a password from the first listener, the system 10 may require a password from the first listener in certain situations (e.g., shows with explicit material).
[0070] The audio server system 12 can also transmit or pass a representation of the audio to a second listener group in real-time or near real-time via the network 20a. The representation of the audio is automatically provided to the media encoder 48a and the database, e.g., NFS server 47, from the conferencing system 44. The NFS server 47 records the representation of the audio (in real-time or near real-time) and saves the representation as a file. The real-time streaming of the representation of the audio can be accomplished by setting the media encoder 48a up as a listener of the audio/video conference. In one embodiment, this is accomplished by placing an inbound or outbound phone call to the media encoder 48a by the conferencing system 44 to connect the media encoder 48a as a listener of the audio/video conference. The connection between the conferencing system 44 and the media encoder 48a can utilize a high quality codec.
[0071] As discussed above, the audio stream can be provided to the listener or the guest utilizing either the network 20a or 20b. To listen to the audio stream utilizing the network 20a, the listener or guest utilizes their guest terminal 16 or listener terminal 18 to browse a web page associated with the moderator, talk show or audio/visual conference. The web page can be provided with suitable hyperlink(s) adapted to provide a listener URL, that corresponds to a particular moderator or show, to the listener terminal 18 upon activation by the listener. When the listener points their web browser to the particular URL, the audio server system 12 connects the listener terminal 18 to the streaming server 50 to connect the listener to the audio stream. This can be implemented by the web server 46 sending a signal via a signal path 53a to the streaming server 50 to activate the streaming server 50 to connect to the listener terminal 18 via a signal path 53b, or by the web server 46 providing the listener URL to the listener terminal 18 and then the listener terminal 18 connecting to the streaming server 50 via the signal path 53b. The signal paths 53a and 53b are shown separately for purposes of illustration, however, the signal paths 53a and 53b could be the same or different.
[0072] To connect to the audio/video conference via the network 20b, the moderator, guest or listener uses their terminal 14, 16 or 18 to dial into the audio/video conference utilizing a call-in number. Or, the moderator, guest or listener can utilize their terminal 14, 16 or 18 to view a web page from the web server 46 and actuate a hyperlink that actuates an outbound call to connect to the conferencing system 44 using Voice Over IP via the networks 20a and the media gateways, firewall 40b.
[0073] The streaming server 50 may be configured to play an audio and/or video clip (such as a greeting) associated with the particular moderator or show. If there is a show scheduled to start within a predetermined period (such as 15 minutes), the system 10 can include a step of transmitting an audio and/or video signal to the listener indicating the time until the start of the show. Although the system 10 might not require a password from the second listener, the system 10 may require a password from the second listener in certain situations (e.g., shows with explicit material).
3. Example Implementation
[0074]
[0075] Shown in
[0076] Referring to
[0077] Referring to
[0078]
[0079] While the embodiment described above allows users to choose insertion points, other embodiments include automatically identifying insertion points. These insertion points can be identified by, for example, identifying periods of silence, time of day, time of playback, beginning or end, etc.
[0080]
[0081] After the content file is constructed, as illustrated in step 6, embodiments can store the reconstructed content file on a CDN or other local or remote storage such that the content file need not be reconstructed. The reconstructed file can have an end time, e.g., the time that the content file should no longer be used, such as the soonest expiration date of one or more of the ads in the content file. The end time can also be a default time period, such as one week. After the end time, the content file can be deleted and reconstructed once again with new advertisements.
[0082] Advertisements can come directly from advertisers or be read by the content creators themselves, which is sometimes called native advertisements. Advertisements for video can be either full-screen or partial screen, the ads can either be audio, video or static images. The ads typically comprise multiple files to be inserted at the insertion markers. Ads can be chosen in real-time when needed while streaming content. Some embodiments stream content via traditional streaming methods, such as using a media streaming widget, e.g., windows media player or QuickTime. Other embodiments can utilize a custom player that can either pause content playback while delivering an advertisement, or alternatively playback can continue with the advertisement placed directly into the content being played back via a content stream or via a locally-stored file.
[0083] Additional embodiments include the feature of automatically recording an ad being delivered to a user, either via a log that the ad was downloaded or via a local log that the user actually viewed or listened to the ad, or whether the ad was displayed or played to the user. Such an event can decrement a counter of the total number of ads sold in a campaign. In addition, counters for the content creator can be incremented to record a number of ads delivered as the result of delivering the content creator's content. The incremented counter can then be used to determine payments due to the content creator as a result of advertisement delivery.
[0084] Embodiments can determine which ads to serve based on ad campaigns sold by the content distributor, or ads sold by third parties selling ad inventory owned by the content distributor. The ads can be stored on the Internet or delivered to the content distributors by the advertisers. It can be advantageous for advertisers to maintain their advertisements on the Internet to ensure that their advertisements are up-to-date, thereby allowing them to revise their advertisements without establishing a new contract or contacting the content distributor directly. In this way, ads can be continuously updated automatically.
[0085]
[0086]
[0087]
[0088] In some embodiments, sections of a content file might not actually be deleted, but instead may be skipped between deletion markers. For instance, when a player reaches a deletion marker, playback will skip until the next deletion marker. In streaming embodiments, packets between the deletion markers will not be streamed. In downloading embodiments, a content file is reconstructed with inserted content, based on insertion markers, and deleted sections are removed based on deletion markers.
[0089] In other embodiments, users can move insertion markers, even after the files have been delivered to recipients. In another embodiment, users can receive content files with unique placement of ad insertion markers. For instance, some recipients may pay a premium for fewer advertisements or no advertisements. This can be accomplished by storing several versions of insertion markers for a content file, or insertion markers can be skipped, depending on whether a recipient should receive more or fewer advertisements.
[0090]
[0091]
[0092]
[0093] As a person skilled in the art will recognize from the previous detailed description and from the figures and claims, modifications and changes can be made to the exemplary embodiments of the invention without departing from the spirit and scope of this invention defined in the following claims.