VIDEO PROCESSING APPARATUS AND VIDEO PROCESSING METHOD COOPERATING WITH TELEVISION BROADCASTING SYSTEM
20180199002 ยท 2018-07-12
Inventors
Cpc classification
H04N19/132
ELECTRICITY
H04N21/4622
ELECTRICITY
H04N21/42638
ELECTRICITY
H04N19/70
ELECTRICITY
H04N21/4316
ELECTRICITY
H04N21/234363
ELECTRICITY
H04N21/2353
ELECTRICITY
International classification
H04N21/431
ELECTRICITY
H04N21/462
ELECTRICITY
Abstract
A video processing apparatus includes a down-sampling circuit, a combining circuit, a metadata generating circuit, and an encoder. The down-sampling circuit down-samples P videos according to predetermined picture layout information of K picture layouts. Each of the videos corresponds to a television program. The combining circuit combines the P down-sampled videos according to the predetermined picture layout information to generate combined videos corresponding to the K picture layouts. The metadata generating circuit generates metadata that describes television program information corresponding to the picture layouts according to the predetermined picture layout information. The encoder encodes the combined videos and the metadata to image data that conforms to a predetermined broadcast format for a television broadcasting system to broadcast.
Claims
1. A video processing apparatus, operating with a television broadcasting system that broadcasts P videos in a predetermined broadcast format, each of the P videos corresponding to a television program, P being an integer greater than 1, the video processing apparatus comprising: a down-sampling circuit, receiving the P videos, down-sampling the P videos according to predetermined picture layout information corresponding to K types of picture layouts to generate P down-sampled videos, where K is a positive integer; a combining circuit, coupled to the down-sampling circuit, combining the P down-sampled videos according to the predetermined picture layout information to generate a combined video comprising a plurality of combined pictures, wherein the P down-sampled videos correspond to P sub-pictures and each of the combined pictures comprises at least one sub-picture; a metadata data generating circuit, generating metadata for the combined video according to the predetermined picture layout information, the metadata describing television program information of each of the K picture layouts and an encoder, coupled to the combining circuit and the metadata generating circuit, encoding the combined video and the metadata to a set of image data that conforms to the predetermined broadcast format of the television broadcasting system.
2. The video processing apparatus according to claim 1, wherein the down-sampling circuit down-samples in a space axis when one of the K picture layouts requires a combined picture to comprise a plurality of sub-pictures.
3. The video processing apparatus according to claim 1, wherein when the positive integer K is greater than 1, the down-sampling circuit down-samples in a time axis in a way that the combining circuit generates a first combined picture at a first time point and generates a second combined picture at a second time point, the first combined picture and the second combined picture respectively corresponding to different picture layouts in the K picture layouts.
4. The video processing apparatus according to claim 1, wherein when one of the K picture layouts requires a combined picture to comprise a plurality of sub-pictures, where the positive integer K is greater than 1, the down-sampling circuit down-samples in both a space axis and a time axis.
5. The video processing apparatus according to claim 1, wherein the predetermined broadcasting format is a transport stream, and the encoder encodes the combined video and the metadata into an elementary stream.
6. The video processing apparatus according to claim 1, wherein the metadata further comprises K index values each pointing to one of the K picture layouts; the encoder encodes the combined video and the metadata to have a bitstream structure, writes the television program information of each of the picture layouts into a first level of the bitstream structure and writes the K index values into a second level of the bitstream structure.
7. The video processing apparatus according to claim 6, wherein the first level of the bitstream structure corresponds to a plurality of consecutive combined pictures and the second level of the bitstream structure corresponds to a single combined picture; the encoder encodes such that each of the combined pictures in the combined video carries one of the K index values.
8. The video processing apparatus according to claim 1, wherein the television program information corresponding to each of the picture layouts described by the metadata comprises at least one of the following information: a program channel identification code corresponding to each sub-picture, a program provider identification code corresponding to each sub-picture and a program type identification code corresponding to each sub-picture.
9. A video processing method, operating with a television broadcasting system that broadcasts P videos in a predetermined broadcast format, each of the P videos corresponding to a television program, P being an integer greater than 1, the video processing method comprising: a) receiving the P videos and predetermined picture layout corresponding to K types of picture layouts, where K is a positive integer; b) down-sampling the P videos according to predetermined picture layout information to generate P down-sampled videos, c) combining the P down-sampled videos according to the predetermined picture layout information to generate a combined video comprising a plurality of combined pictures, wherein the P down-sampled videos correspond to P sub-pictures and each of the combined pictures comprises at least one sub-picture; d) generating metadata for the combined video according to the predetermined picture layout information, the metadata describing television program information of each of the K picture layouts; and e) encoding the combined video and the metadata to a set of image data that conforms to the predetermined broadcast format of the television broadcasting system.
10. The video processing method according to claim 9, wherein when one of the K picture layouts requires a combined picture to comprise a plurality of sub-pictures, step (b) is performed in a space axis.
11. The video processing method according to claim 9, wherein when the positive integer K is greater than 1, step (b) is performed in a time axis, and step (c) comprises: generating a first combined picture at a first time point and a second combined picture at a second time point, the first combined picture and the second combined picture respectively corresponding to different picture layouts in the K picture layouts.
12. The video processing method according to claim 9, wherein when one of the K picture layouts requires a combined picture to comprise a plurality of sub-pictures and the positive integer K is greater than 1, step (b) is performed in both a time axis and space axis.
13. The video processing method according to claim 9, wherein the metadata generated in step (d) comprises: K sets of picture layout information that describes television program information corresponding to each of the K picture layouts; and K index values, each pointing to one of the K picture layouts; wherein, step (e) comprises encoding the combined video and the metadata to have a bitstream structure, writing the television program information of each of the picture layouts into a first level of the bitstream structure and writing the K index values into a second level of the bitstream structure.
14. The video processing method according to claim 13, wherein the first level of the bitstream structure corresponds to a plurality of consecutive combined pictures and the second level of the bitstream structure corresponds to a single combined picture; step (e) comprises encoding such that each of the combined pictures in the combined video to carry one of the K index values.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0012]
[0013]
[0014]
[0015]
[0016]
[0017]
[0018]
[0019] It should be noted that, the drawings of the present invention are not detailed circuit diagrams, and connection lines therein are for indicating signal flows only. The interactions between the functional elements/or processes are not necessarily achieved through direct electrical connections. Further, functions of the individual elements are not necessarily distributed as depicted in the drawings, and separate blocks are not necessarily implemented by separate electronic elements.
DETAILED DESCRIPTION OF THE INVENTION
[0020]
[0021] In
[0022] The down-sampling circuit 12 of the video processing apparatus 100 also receives the P videos. In the example in
[0023] The combining circuit 14 combines the P down-sampled videos according to the same predetermined picture layout information to generate a combined video corresponding to the K picture layouts. Several examples of the predetermined picture layouts, as well as how the down-sampling circuit 12 and the combining circuit 14 operate in response to the predetermined picture layouts, are introduced below.
[0024] Refer to
[0025] In response to the predetermined picture layout information that requires a combined picture to include a plurality of sub-pictures, the down-sampling circuit 12 may down-sample each of the original pictures in the first video to the fourth video respectively along the directions of the length and width of a space axis, so as to scale down the picture size to one-quarter of the original picture sizes (reducing both of the length and width by one-half). For example, if the picture size of each original picture is 1920*1080 pixels, the picture size of each down-sampled video is 960*540 pixels.
[0026] In response to the above predetermined picture layout information, the combining circuit 14 combines four pictures of the four down-sampled videos to one single picture, where the above four pictures are sampled at the same time point.
[0027] In practice, the down-sampling circuit 12 may include multiple average calculating circuits to calculate average values, and may divide an original picture into multiple sets each including 2*2 pixels. The average calculating circuits determine one average value of image data (e.g., grayscale values) of four pixels in each set to generate a new set of pixel image data using that average value in order to achieve down-sampling in the space axis. The combining circuit 14 may include a frame buffer having a size of 1920*1080 pixels for the down-sampling circuit 12 to write the newly generated pixel image data therein. According to the predetermined picture layout information, the combining circuit 14 may determine an appropriate position for writing each set of new pixel image data to the frame buffer, such that the four down-sampled pictures(each in a size of 960*540 pixels) form a new combined picture in the frame buffer. Taking
[0028] The metadata generating circuit 18 generates metadata for the combined video according to the predetermined picture layout information, wherein the metadata describes television program information corresponding to each of the K picture layouts. The television system 300 may obtain the predetermined picture layout information and/or other associated information from the metadata. For example, in addition to the numbers and position allocations of the picture layouts, the television program information described by the metadata may further include at least one type of following information: a program channel identification code corresponding to each sub-picture in each picture layout, a program provider identification code corresponding to each sub-picture, and a program type (e.g., news, travel and sports) identification code corresponding to each sub-picture.
[0029] The encoder 16 encodes the combined video generated by the combining circuit 14 and the metadata generated by the metadata generating circuit 18 to image data that conforms to a predetermined broadcast format, and provides the image data to the television broadcasting system 200 to broadcast. Take an example where the television broadcasting system 200 adopts the high efficiency video coding (HEVC) specification for instance. The encoder 16 may encode the combined video and the metadata to an elementary stream, which is then packaged to a transport stream and broadcasted by the television broadcasting system 200. In other words, the format of the combined video may be the same as those of other common television programs, and may be considered as one television program and broadcasted. If the television system 300 receives, decodes and plays this television program, the associated effect conforms to the predetermined picture layout in
[0030] Refer to
[0031]
[0032] Next, in response to the current predetermined picture layout information, the combining circuit 14 combines four pictures of the first to fourth down-sampled video at the same time point to one single picture; combines four pictures of the fifth to the eighth down-sampled video to one single picture.
[0033] Similarly, the metadata generating circuit 18 generates metadata that describes television program information corresponding to the two picture layouts for the combined video according to the current predetermined picture layout information. The encoder 16 encodes the combined video and the metadata of the combined video to image data that conforms to the broadcast format of the television broadcasting system 200, and provides the image data to the television broadcasting system 200 to broadcast.
[0034] In practice, through the metadata, the television system 300 learns the predetermined picture layout information that the video processing apparatus 100 adopts, and manipulates the image data that the video processing apparatus 100 generates for desired applications. For example, the combined video data in
[0035] In the foregoing embodiments, each combined picture includes 2*2 sub-pictures, which is however not to be construed as a limitation to the scope of the present invention. The predetermined picture layout information is flexible regardless of whether down-sampling is performed in the space axis or the time axis. For example, one picture layout designated by the predetermined picture layout information may include 2*3 or 4*3 sub-pictures, which do not need to be entirely same-sized.
[0036] Through the above concept, the video processing apparatus 100 and the television broadcasting system 200 may provide pictures corresponding to down-sampled videos of tens or even hundreds of television programs to the television system 300 through merely one set or several sets of image data. The television system 300 may determine the down-sampled videos corresponding to which of the television programs are to be retrieved and recombined to one or multiple new dynamic television walls. Further, the picture layout actually displayed on the screen of the television system 300 may also be determined by a user.
[0037]
[0038]
[0039] It should be noted that, technical details of down-sampling a video in the space axis or the time axis according to a predetermined ratio are generally known to one person skilled in the art, and shall be omitted herein. The combining circuit 14 may be implemented by various types of circuits, e.g., a programmable logic gate array, an application-specific integrated circuit, a microcontroller, a microprocessor, and a digital signal processor. Further, the combining circuit 14 may be designed to complete its tasks through executing a processor command stored in a memory.
[0040] In one embodiment, the metadata from the metadata generating circuit 18 includes K index values, which respectively point to the K picture layouts. For example, the metadata generating circuit 18 may have the index value 1 point to the first picture layout, the second index value 2 point to the second picture layout, and so forth. Taking
[0041] One benefit of writing the television program information and the index values to different levels is, the video processing apparatus 100 is not required to record the associated television program information in the metadata of each picture. By obtaining the metadata in a higher level using the index value of each picture, the television system can obtain the detailed information of the picture. Thus, the data size of the data transmitted from the television broadcasting system 200 to the television system 300 may be effectively reduced.
[0042] A video processing method operating with a television broadcasting system is further provided according to another embodiment of the present invention.
[0043] One person skilled in the art can understand that, the operation variations in the description associated with the video processing apparatus 100 are applicable to the image processing method in
[0044] While the invention has been described by way of example and in terms of the preferred embodiments, it is to be understood that the invention is not limited thereto. On the contrary, it is intended to cover various modifications and similar arrangements and procedures, and the scope of the appended claims therefore should be accorded the broadest interpretation so as to encompass all such modifications and similar arrangements and procedures.