Reception apparatus, reception method, transmission apparatus, and transmission method
10469552 · 2019-11-05
Cpc classification
H04N21/472
ELECTRICITY
H04L67/02
ELECTRICITY
H04N21/8456
ELECTRICITY
H04N21/8543
ELECTRICITY
H04N21/26258
ELECTRICITY
H04N21/458
ELECTRICITY
International classification
G06F15/16
PHYSICS
H04N21/458
ELECTRICITY
H04N21/262
ELECTRICITY
H04N21/2343
ELECTRICITY
H04N21/845
ELECTRICITY
H04N21/8543
ELECTRICITY
H04N21/472
ELECTRICITY
Abstract
A reception apparatus includes circuitry configured to receive a digital broadcast signal. Based on a media presentation description (MPD), which conforms to the moving picture expert group-dynamic adaptive streaming over HTTP (MPEG-DASH) standard and is transmitted in the digital broadcast signal or via a communication network, the circuitry performs at least one of selection or grouping of components according to a level of a component hierarchy associated with the components, and controls playback of content of at least one of the components.
Claims
1. A reception apparatus, comprising: circuitry configured to receive a digital broadcast signal; and based on a media presentation description (MPD) that conforms to the moving picture expert group-dynamic adaptive streaming over HTTP (MPEG-DASH) standard and includes first information indicating that a plurality of AdaptationSet elements are associated and second information indicating that a plurality of AdaptationSet elements are grouped and belong to the same group to be selected, and is transmitted in the digital broadcast signal or via a communication network, control playback of content, wherein the MPD includes a first element for designating a pair of AdaptationSet elements, the first element is a Subset element, and a set of AdaptationSet elements is designated by a contains-attribute of the Subset element, wherein the MPD includes a second element for defining a relationship between the AdaptationSet elements of the MPD, a relationship between Representation elements, or a relationship between SubRepresentation elements, and wherein a relationship between Representation element groups under the AdaptationSet element is designated by the second element.
2. The reception apparatus according to claim 1, wherein the second element is an EssentialProperty element, and wherein a relationship between the Representation element groups under the AdaptationSet element is designated by a schemeIdUri attribute of the EssentialProperty element.
3. A reception method of a reception apparatus, the method comprising: receiving a digital broadcast signal; and based on a media presentation description (MPD) that conforms to the moving picture expert group-dynamic adaptive streaming over HTTP (MPEG-DASH) standard and includes first information indicating that a plurality of AdaptationSet elements are associated and second information indicating that a plurality of AdaptationSet elements are grouped and belong to the same group to be selected, and is transmitted in the digital broadcast signal or via a communication network, controlling, by the circuitry, playback of content, wherein the MPD includes a first element for designating a pair of AdaptationSet elements, the first element is a Subset element, and a set of AdaptationSet elements is designated by a contains-attribute of the Subset element, wherein the MPD includes a second element for defining a relationship between the AdaptationSet elements of the MPD, a relationship between Representation elements, or a relationship between SubRepresentation elements, and wherein a relationship between Representation element groups under the AdaptationSet element is designated by the second element.
4. A transmission apparatus, comprising: circuitry configured to generate a media presentation description (MPD) that conforms to the moving picture expert group-dynamic adaptive streaming over HTTP (MPEG-DASH) standard, and includes first information indicating that a plurality of AdaptationSet elements are associated and second information indicating that a plurality of AdaptationSet elements are grouped and belong to the same group to be selected; and transmit the generated MPD in a digital broadcast signal or via a communication network, wherein the MPD includes a first element for designating a pair of AdaptationSet elements, the first element is a Subset element, and a set of AdaptationSet elements is designated by a contains-attribute of the Subset element, wherein the MPD includes a second element for defining a relationship between the AdaptationSet elements of the MPD, a relationship between Representation elements, or a relationship between SubRepresentation elements, and wherein a relationship between Representation element groups under the AdaptationSet element is designated by the second element.
5. The transmission apparatus according to claim 4, wherein one or a plurality of components constituting a multiplexed stream are enumerated for each of the plurality of AdaptationSet elements, the one or the plurality of components include one or a plurality of audio components.
6. The transmission apparatus according to claim 5, wherein the one or a plurality of components are listed in one or a plurality of Representation elements under the AdaptationSet elements.
7. The transmission apparatus according to claim 4, wherein the first information and the second information are specified as attributes of elements described in the Period elements of the MPD.
8. The transmission apparatus according to claim 7, wherein the attributes of the elements described in the Period element are defined in the same Subset element.
9. A transmission method of a transmission apparatus, the method comprising: generating, by circuitry of the transmission apparatus, a media presentation description (MPD) that conforms to the moving picture expert group-dynamic adaptive streaming over HTTP (MPEG-DASH) standard and includes first information indicating that a plurality of AdaptationSet elements are associated and second information indicating that a plurality of AdaptationSet elements are grouped and belong to the same group to be selected; and transmitting, by the circuitry, the generated MPD in a digital broadcast signal or via a communication network, wherein the MPD includes a first element for designating a pair of AdaptationSet elements, the first element is a Subset element, and a set of AdaptationSet elements is designated by a contains-attribute of the Subset element, wherein the MPD includes a second element for defining a relationship between the AdaptationSet elements of the MPD, a relationship between Representation elements, or a relationship between SubRepresentation elements, and wherein a relationship between Representation element groups under the AdaptationSet element is designated by the second element.
Description
DESCRIPTION OF EMBODIMENTS
(31) Hereinafter, embodiments of the present technology will be described with reference to the drawings. In addition, a description will be made in the following order.
(32) 1. Structure of MPD
(33) 2. Structure of component layer
(34) 3. First Embodiment
(35) (1) Implementation method of function of layer of each level
(36) (1-1) Function of layer of the level 3
(37) (1-2) Function of layer of the level 2
(38) (1-3) Function of layer of the level 1
(39) (2) Specific operation example
(40) (2-1) Operation example 1: mixing playback of audio
(41) (2-2) Operation example 2: group selection of audio
(42) (2-3) Operation example 3: playback of video and audio
(43) 4. Second Embodiment
(44) (1) Implementation method of function of layer of each level
(45) (1-1) Function of layer of the level 3
(46) (1-2) Function of layer of the level 2
(47) (1-3) Function of layer of the level 1
(48) (2) Specific operation example
(49) (2-1) Operation example 4: mixing playback of audio
(50) (2-2) Operation example 5: group selection of audio
(51) 5. Third Embodiment
(52) (1) Example of other use of Subset element
(53) (2) Operation example 6: Grouping for simultaneous presentation of component
(54) 6. System configuration
(55) 7. Flow of specific process of each apparatus
(56) 8. Configuration of computer
1. Structure of MPD
(57) <Structure of MPD>
(58)
(59) Here, as illustrated in
(60) Specifically, the AdaptationSet element represents streams which are encoded from various sources. Representation elements are placed in the AdaptationSet element to list the streams available as choices that differ in parameters such as bit rate, so that the receiver side can select a stream according to such a parameter. Generally, the AdaptationSet element and the Representation element correspond to a single stream such as a stream of a video, audio, or subtitles.
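The receiver-side selection described above can be sketched as follows. This is a minimal illustration, not the method of the specification: the fragment omits the MPD XML namespace for brevity, and the Representation ids, bandwidth values, and the function name pick_representation are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical, namespace-free fragment; ids and bandwidths are illustrative only.
ADAPTATION_SET_XML = """
<AdaptationSet id="1" mimeType="video/mp4">
  <Representation id="v-low" bandwidth="1000000"/>
  <Representation id="v-mid" bandwidth="3000000"/>
  <Representation id="v-high" bandwidth="6000000"/>
</AdaptationSet>
"""


def pick_representation(adaptation_set, available_bps):
    """Return the id of the highest-bandwidth Representation that fits
    within available_bps, or the lowest-bandwidth one if none fits."""
    reps = sorted(adaptation_set.findall("Representation"),
                  key=lambda r: int(r.get("bandwidth")))
    fitting = [r for r in reps if int(r.get("bandwidth")) <= available_bps]
    chosen = fitting[-1] if fitting else reps[0]
    return chosen.get("id")


adaptation_set = ET.fromstring(ADAPTATION_SET_XML)
print(pick_representation(adaptation_set, 4_000_000))  # v-mid
```

The fallback to the lowest bit rate is one possible policy; a receiver could equally refuse playback when no Representation fits.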
(61) Further, when the AdaptationSet element represents a stream in which a plurality of streams such as a video stream, an audio stream, and a subtitle stream are multiplexed, the multiplexed streams which are a plurality of choices having different parameters such as, for example, bit rates are listed, by placing the Representation element in the AdaptationSet element. In other words, as illustrated in
(62) Further, in this case, it is possible to describe the attribute of the stream of each component constituting the multiplexed stream, by further placing the SubRepresentation element under the Representation element. In other words, as illustrated in
(63) Thus, in addition to a single stream such as a video stream or an audio stream, the AdaptationSet element can also correspond to a stream in which a plurality of streams are multiplexed. In the MPEG-DASH standard, the attributes and the elements of
2. Structure of Component Layer
(64) However, a broadcast service requires both the selection and the structuring of components. Here, the selection of a component refers to selecting the best component depending on various conditions, such as the capability of the codec supported by a receiver and the network environment. Further, the structuring of components refers to combining and simultaneously displaying a plurality of stream components. In order to satisfy such a requirement for the selection and structuring of the components, as illustrated in
(65) In
(66) The layer of the level 3 corresponds to Level 3 in
(67) In
(68) The layer of the level 2 corresponds to Level 2 in
(69) In
(70) The layer of the level 1 corresponds to Level 1 in
(71) In
(72) As described above, since components are represented as a plurality of layer structures, for example, the requirement for the selection of the component is satisfied by the layer of the level 1 and the layer of the level 3, and the requirement for the structuring of the component is satisfied by the layer of the level 2. Hereinafter, in a case of providing a broadcast service by using the MPEG-DASH method, three embodiments will be described for a specific method for realizing a component layer structure of
3. First Embodiment
(1) Implementation Method of Function of Layer of Each Level
(73) In a first embodiment, in a case of providing a broadcast service by using the MPEG-DASH method, it is possible to use elements which have already been defined in the standard specification of MPD, in order to satisfy the service requirement of the component layer structure of
(1-1) Function of Layer of the Level 3
(74) The function of the layer of the level 3 of
(1-2) Function of Layer of the Level 2
(75) The function of the layer of the level 2 of
(76) In the Subset element, the value of the id attribute of the AdaptationSet element is stored as a list, in the contains-attribute of the Subset element, in order to specify the set of AdaptationSet elements.
(77) In
(78) In this manner, the function of the layer of the level 2 can be implemented by performing the association of a plurality of AdaptationSet elements, by the contains-attribute of the Subset element.
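As a rough sketch of how a receiver might read this association, the following parses the contains-attribute of each Subset element and checks that every listed id refers to an AdaptationSet element in the same Period. The fragment omits the MPD XML namespace, and the id values and the function name subsets are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical, namespace-free Period; id values are illustrative only.
PERIOD_XML = """
<Period>
  <AdaptationSet id="11"/>
  <AdaptationSet id="12"/>
  <AdaptationSet id="13"/>
  <Subset contains="11 12"/>
</Period>
"""


def subsets(period):
    """Return, for each Subset element, the list of AdaptationSet ids
    named in its contains-attribute (a whitespace-separated list)."""
    known = {a.get("id") for a in period.findall("AdaptationSet")}
    result = []
    for subset in period.findall("Subset"):
        ids = subset.get("contains", "").split()
        missing = [i for i in ids if i not in known]
        if missing:
            raise ValueError(f"Subset references unknown ids: {missing}")
        result.append(ids)
    return result


period = ET.fromstring(PERIOD_XML)
print(subsets(period))  # [['11', '12']]
```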
(79) In
(80) In order to implement grouping of the layer of the level 2, the layer of the CompositeComponent is mapped to the Subset element of MPD. Here, the contains-attribute of the Subset element designates the pair of AdaptationSet elements as the set of components which are simultaneously played.
(81) In the example of
(82) Further, the relationship represented as 3 in
(83) Only one Representation element is disposed under the other AdaptationSet element, and one component is disposed corresponding thereto. In this case, the relationship represented as 2 in
(84) In other words, since the value of the id attribute of one AdaptationSet element and the value of the id attribute of the other AdaptationSet element are designated in the contains-attribute of the Subset element of the MPD, the components listed in the Representation elements under the AdaptationSet elements are simultaneously played. Thus, since the component selected in the layer of the PickOneComponent of the level 3 and the ElementaryComponent are grouped in the layer of the CompositeComponent, the function of the layer of the level 2 is implemented.
(1-3) Function of Layer of the Level 1
(85) Some of the functions of the layer of the level 1 of
(86)
(87) In
(88) Further, since the AdaptationSet element of id=12, the AdaptationSet element of id=74, and the AdaptationSet element of id=385 which are designated as group=2 have the same attribute value, only one component is played among components listed in the Representation elements under the AdaptationSet elements.
(89) In this manner, it is possible to implement the function of the layer of the level 1 by grouping a plurality of AdaptationSet elements by the group attribute of the AdaptationSet element.
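The grouping by the group attribute can be sketched as below. The three AdaptationSet elements of group=2 follow the ids given above; the AdaptationSet element of group=1, the function names, and the rule for choosing within a group are assumptions made for illustration, and the MPD XML namespace is omitted.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# ids 12, 74, 385 with group=2 follow the text; the group=1 entry is hypothetical.
PERIOD_XML = """
<Period>
  <AdaptationSet id="1" group="1"/>
  <AdaptationSet id="12" group="2"/>
  <AdaptationSet id="74" group="2"/>
  <AdaptationSet id="385" group="2"/>
</Period>
"""


def adaptation_sets_by_group(period):
    """Map each group value to the ids of the AdaptationSets carrying it."""
    groups = defaultdict(list)
    for aset in period.findall("AdaptationSet"):
        group = aset.get("group")
        if group is not None:
            groups[group].append(aset.get("id"))
    return dict(groups)


def pick_one_per_group(groups, preferred_ids=()):
    """Choose exactly one id per group: a preferred id when one is present,
    otherwise the first id listed (an arbitrary illustrative policy)."""
    choice = {}
    for group, ids in groups.items():
        preferred = [i for i in ids if i in preferred_ids]
        choice[group] = preferred[0] if preferred else ids[0]
    return choice


period = ET.fromstring(PERIOD_XML)
groups = adaptation_sets_by_group(period)
print(pick_one_per_group(groups, preferred_ids={"74"}))
```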
(90) In
(91) In order to implement the grouping of the layer of the level 1, the layer of the PickOneComponent of the level 1 is mapped to the group attribute of the AdaptationSet element of the MPD. Here, the same value of the group attribute is designated in the AdaptationSet elements belonging to the same group, and thus the group is shared.
(92) In
(93) Further, in the example of
(94) Further, in the example of
(95) In other words, since it is possible to group two AdaptationSet elements other than the AdaptationSet elements associated by the contains-attribute of the Subset element, by the group attribute of the AdaptationSet element of the MPD, one component is selected from the components listed in the Representation element under the AdaptationSet elements. Thus, in the layer of the PickOneComponent of the level 1, the component selected in the layer of the PickOneComponent of the level 3 and the ElementaryComponent are grouped.
(96) However, it is not possible to group the AdaptationSet elements associated by the contains-attribute of the Subset element, by the group attribute of the AdaptationSet element of the MPD. In other words, even if using the group attribute of the AdaptationSet element corresponding to the layer of the PickOneComponent of the level 1, it is not possible to associate the CompositeComponent of the level 2 denoted by 5 of the solid line in
(97) <Expansion of MPD>
(98) In this manner, in the standard specification of the current MPD, it is not possible to perform the association with the Subset element corresponding to the CompositeComponent of the level 2 belonging to the group designated by the group attribute of the AdaptationSet element corresponding to the PickOneComponent of the level 1. Thus, the expansion of the MPD is performed in order to enable the association between the values of such group attributes and the Subset elements, thereby allowing the function of the layer of the level 1 to be fully implemented.
(99) Specifically, a new group attribute is added in the Subset element by the expansion of the MPD.
(100) In
(101) Further, 12 74 is designated in the contains-attribute of a second Subset element, and this indicates that the AdaptationSet element of id=12 and the AdaptationSet element of id=74 are the set of the AdaptationSet elements to be simultaneously played. Further, 2 is designated in the group attribute of the second Subset element, and this indicates that the pair of the AdaptationSet element of id=12 and the AdaptationSet element of id=74 belongs to a group 2.
(102) Further, 65 385 is designated in the contains-attribute of a third Subset element, and this indicates that the AdaptationSet element of id=65 and the AdaptationSet element of id=385 are the set of the AdaptationSet elements to be simultaneously played. Further, 2 is designated in the group attribute of the third Subset element, and this indicates that the pair of the AdaptationSet element of id=65 and the AdaptationSet element of id=385 belongs to the group 2.
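The second and third Subset elements described above can be modeled as in the following sketch. Note that the group attribute on the Subset element is the expansion proposed here and is not part of the standard MPD schema; the function name subset_choices_by_group is hypothetical, the first Subset element is not modeled, and the MPD XML namespace is omitted.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Subset@group is the proposed expansion, not a standard MPD attribute.
PERIOD_XML = """
<Period>
  <AdaptationSet id="12"/>
  <AdaptationSet id="74"/>
  <AdaptationSet id="65"/>
  <AdaptationSet id="385"/>
  <Subset contains="12 74" group="2"/>
  <Subset contains="65 385" group="2"/>
</Period>
"""


def subset_choices_by_group(period):
    """Map each group value to the alternative sets of AdaptationSet ids,
    where the ids within one set are to be played simultaneously."""
    choices = defaultdict(list)
    for subset in period.findall("Subset"):
        group = subset.get("group")
        if group is not None:
            choices[group].append(tuple(subset.get("contains", "").split()))
    return dict(choices)


period = ET.fromstring(PERIOD_XML)
print(subset_choices_by_group(period))
# {'2': [('12', '74'), ('65', '385')]}
```

Under this reading, a receiver would pick one tuple per group and play the AdaptationSet elements in that tuple together.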
(103) In other words, in the example of
(104) In this manner, it is possible to represent the relationship represented as 5 of a dotted line in
(2) Specific Operation Example
(105) Next, specific operation examples of the first embodiment will be described with reference to
(2-1) Operation Example 1
(106)
(107) In the MPD of
(108) Two Representation elements are described under the AdaptationSet element of id=11. Dialogs (Dlg) are listed as audio components having different bit rates, in a range of the AdaptationSet element of id=11 which is a higher element, by the Representation elements. In this example, a dialog of a high bit rate and a dialog of a low bit rate are listed, and one audio component is adaptively selected, for example, depending on the environmental conditions of a network and the like, in the layer of the level 3 (PickOneComponent).
(109) Only one Representation element is described under the AdaptationSet element of id=12. Effect (Efct) is listed as an audio component, by the Representation elements. Accordingly, one audio component is typically selected. In addition, the audio component selected in this manner corresponds to the ElementaryComponent.
(110) Two Representation elements are described under the AdaptationSet element of id=13. A plurality of pieces of music are listed as audio components having different bit rates, in a range of the AdaptationSet element of id=13 which is a higher element, by the Representation elements. In this example, music of a high bit rate and music of a low bit rate are listed, and one audio component is adaptively selected, for example, depending on the environmental conditions of a network and the like, in the layer of the level 3 (PickOneComponent).
(111) In this manner, in the MPD of
(112) Further, in the MPD of
(113) Thus, in the Operation example 1 of
(2-2) Operation Example 2
(114)
(115) In the MPD of
(116) Two Representation elements are described under the AdaptationSet element of id=11. Stereos of a high bit rate and a low bit rate are listed as audio components having different bit rates, and one audio component is adaptively selected, for example, depending on the environmental conditions of a network and the like, in the layer of the level 3 (PickOneComponent).
(117) In this manner, the function of the layer of the level 3 in the component layer structure of
(118) Two Representation elements are described under the AdaptationSet element of id=21, two pieces of music of a high bit rate and a low bit rate are listed as audio components having different bit rates, and one audio component is adaptively selected, for example, depending on the environmental conditions of a network and the like, in the layer of the level 3 (PickOneComponent).
(119) Only one Representation element is described under the AdaptationSet element of id=22, Dialog (Dlg) is listed as an audio component, and one audio component is typically selected. The audio component selected in this manner corresponds to the ElementaryComponent.
(120) Only one Representation element is described under the AdaptationSet element of id=23, Effect (Efct) is listed as an audio component, and one audio component is typically selected. The audio component selected in this manner corresponds to the ElementaryComponent.
(121) In this manner, the function of the layer of the level 3 in the component layer structure of
(122) Further, in the MPD of
(123) Here, in the MPD of
(124) In this manner, the function of the layer of the level 1 in the component layer structure of
(125) Thus, in the Operation example 2 of
(2-3) Operation Example 3
(126)
(127) In the MPD of
(128) In the group 1 of the video, base, ext, SV1, and SV2 are listed as video components in respective Representation elements. Here, base represents a video component corresponding to a basic video signal that can be played alone, and ext represents a video component corresponding to a video signal for expansion. Further, in SV1 and SV2, SV is an abbreviation of SubView, and represents a subview which is an auxiliary area of a main view which is a main display area.
(129) Under the AdaptationSet element of id=11, only one Representation element is described, and one video component corresponding to the basic video signal that can be played alone is typically selected. The video component selected in this way corresponds to the ElementaryComponent. In addition, the video component is for the main view.
(130) Under the AdaptationSet element of id=12, four Representation elements are described, one video component corresponding to the basic video signal and three video components corresponding to the video signals for expansion are listed, and a relationship is shown in which the video components corresponding to the video signals for expansion are referred to in the video component corresponding to the basic video signal by the Representation element. Accordingly, the video components corresponding to the basic video signal and the video signal for expansion are selected, for example, depending on the environmental conditions of a network and the like, in the layer of the level 3 (PickOneComponent). In addition, the video component is for the main view.
(131) Under the AdaptationSet element of id=13, only one Representation element is described, and one video component for a subview 1 is typically selected. The video component selected in this way corresponds to the ElementaryComponent.
(132) Under the AdaptationSet element of id=14, three Representation elements are described, three video components for a subview 2 are listed, and one video component is adaptively selected, for example, depending on the environmental condition of a network and the like, in the layer of the level 3 (PickOneComponent).
(133) In this manner, the function of the layer of the level 3 in the component layer structure of
(134) Here, in the MPD of
(135) In this manner, the function of the layer of the level 1 in the component layer structure of
(136) Meanwhile, in the group 2 of audio, NrRbst, HiRbst, MCD, SV1, and SV2 are listed as audio components in respective Representation elements. NrRbst and HiRbst represent audio components having robustness. In other words, NrRbst is an abbreviation of Normal Robustness, and means an audio component having normal robustness. Further, HiRbst is an abbreviation of High Robustness, and means an audio component having high robustness.
(137) Further, MCD is an abbreviation of Multi-channel Dev, and represents a multi-channel audio component. Further, in SV1 and SV2, SV is an abbreviation of subview and represents an audio component for subview.
(138) Under the AdaptationSet element of id=21, two Representation elements are described, an audio component having normal robustness and an audio component having high robustness are listed, and one audio component is adaptively selected, for example, depending on the environmental condition of a network and the like, in the layer of the level 3 (PickOneComponent). In addition, the audio component is for the main view.
(139) Under the AdaptationSet element of id=22, only one Representation element is described, and one multi-channel audio component is typically selected. The audio component selected in this way corresponds to the ElementaryComponent. In addition, the audio component is for the main view.
(140) Under the AdaptationSet element of id=23, only one Representation element is described, and one audio component for subview 1 is typically selected. Similarly, under the AdaptationSet element of id=24, one audio component for subview 2 is typically selected. The audio components for subview selected in this way correspond to the ElementaryComponent.
(141) Here, in the MPD of
(142) In this manner, the function of the layer of the level 1 in the component layer structure of
(143) Thus, in the Operation example 3 of
4. Second Embodiment
(1) Implementation Method of Function of Layer of Each Level
(144) Also in a second embodiment, in a case of providing a broadcast service by using the MPEG-DASH method, it is possible to use elements which have already been defined in the standard specification of the MPD, in order to satisfy the service requirement of the component layer structure of
(1-1) Function of Layer of the Level 3
(145) The function of the layer of the level 3 of
(1-2) Function of Layer of the Level 2
(146) The function of the layer of the level 2 of
(147) As the dependency description attribute, a new element is defined by using the EssentialProperty element (
(148) Here, a case of using the EssentialProperty element as the dependency description attribute will be described as an example. As illustrated in
(149) For example, urn:ATSC is defined as a name space authority, and a Uri attribute value of urn:ATSC:mixingGroup is defined under that authority. Thus, a group of AdaptationSet elements for audio components, each having an EssentialProperty element whose schemeIdUri attribute is set to this Uri attribute value, can be defined to mean that the audio components are mixed and output.
(150) In this case, the value attribute can define, for example, a value identifying the group of audio components to be mixed, the relative volume of the audio component with respect to the entire volume after the audio components are mixed, and the like. For example, the relative volume is designated as a fractional value in the range of 0 to 1, where the entire volume is set to 1.
(151)
(152) In
(153) Further, in
(154) Thus, the audio component corresponding to the AdaptationSet element of id=1 and the audio component corresponding to the AdaptationSet element of id=2, which belong to a mixing group of 23, are mixed and played. Further, in the mixing playback, the relative volume of the audio component corresponding to the AdaptationSet element of id=1 is 0.37, and the relative volume of the audio component corresponding to the AdaptationSet element of id=2 is 0.25.
(155) In this manner, it is possible to implement the function of the layer of the level 2 by grouping a plurality of AdaptationSet elements by the schemeIdUri attribute and the value attribute of the EssentialProperty element.
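The mixing-group example above (mixing group 23, relative volumes 0.37 and 0.25) can be sketched as follows. The exact encoding of the value attribute is not reproduced in this text, so the space-separated form "groupId relativeVolume" used below is purely an assumption, as is the function name mixing_groups. The MPD XML namespace is omitted.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

MIXING_SCHEME = "urn:ATSC:mixingGroup"

# ASSUMPTION: value is encoded as "<group id> <relative volume>".
PERIOD_XML = """
<Period>
  <AdaptationSet id="1">
    <EssentialProperty schemeIdUri="urn:ATSC:mixingGroup" value="23 0.37"/>
  </AdaptationSet>
  <AdaptationSet id="2">
    <EssentialProperty schemeIdUri="urn:ATSC:mixingGroup" value="23 0.25"/>
  </AdaptationSet>
</Period>
"""


def mixing_groups(period):
    """Map each mixing group id to {AdaptationSet id: relative volume}."""
    groups = defaultdict(dict)
    for aset in period.findall("AdaptationSet"):
        for prop in aset.findall("EssentialProperty"):
            if prop.get("schemeIdUri") != MIXING_SCHEME:
                continue
            group_id, volume_text = prop.get("value").split()
            volume = float(volume_text)
            # The text states the relative volume lies in the range 0 to 1.
            if not 0.0 <= volume <= 1.0:
                raise ValueError(f"relative volume out of range: {volume}")
            groups[group_id][aset.get("id")] = volume
    return dict(groups)


period = ET.fromstring(PERIOD_XML)
print(mixing_groups(period))  # {'23': {'1': 0.37, '2': 0.25}}
```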
(156) In
(157) In order to implement the grouping of the layer of the level 2, the layer of the CompositeComponent is mapped to the EssentialProperty element disposed in the AdaptationSet element of the MPD. Here, the schemeIdUri attribute and the value attribute of the EssentialProperty element designate the pair of AdaptationSet elements as the set of components that are simultaneously played.
(158) In the example of
(159) Further, the relationship represented as 3 in
(160) Under the other AdaptationSet element, only one Representation element is disposed, and one component is disposed corresponding thereto. In this case, the relationship represented as 2 in
(161) In other words, suppose that the Uri attribute value urn:ATSC:mixingGroup, for example, is defined for the schemeIdUri attribute of the EssentialProperty element disposed in the AdaptationSet element of the MPD. If urn:ATSC:mixingGroup is designated as the value of the schemeIdUri attribute in each AdaptationSet element, and the same group ID is designated as the value of the value attribute, the audio components under the AdaptationSet elements are mixed and played. Thus, since the component selected in the layer of the PickOneComponent of the level 3 and the ElementaryComponent are grouped in the layer of the CompositeComponent, the function of the layer of the level 2 is implemented.
(1-3) Function of Layer of the Level 1
(162) The function of the layer of the level 1 of
(163)
(164) In
(165) Further, since the AdaptationSet element of id=12, the AdaptationSet element of id=74, and the AdaptationSet element of id=385 which are designated as group=2 have the same attribute value, only one component is played among components listed in the Representation elements under the AdaptationSet elements.
(166) In this manner, it is possible to implement the function of the layer of the level 1 by grouping a plurality of AdaptationSet elements by the group attribute of the AdaptationSet element.
(167) In
(168) In order to implement the grouping of the layer of the level 1, the layer of the PickOneComponent of the level 1 is mapped to the group attribute of the AdaptationSet element of the MPD. Here, the same value of the group attribute is designated in the AdaptationSet elements belonging to the same group, and thus the group is shared.
(169) In
(170) In other words, suppose that the Uri attribute value urn:ATSC:mixingGroup, for example, is defined for the schemeIdUri attribute of the EssentialProperty element disposed in the AdaptationSet element of the MPD. If urn:ATSC:mixingGroup is designated as the value of the schemeIdUri attribute in each AdaptationSet element, and the same group ID is designated as the value of the value attribute, the audio components under the AdaptationSet elements are mixed and played.
(171) Thus, since in the layer of the CompositeComponent represented as 5 of a dotted line in
(172) Further, in the example of
(173) Further, in the example of
(174) In other words, the second embodiment differs from the first embodiment in that the function of the layer of the level 2 is implemented not by using the contains-attribute of the Subset element, but by using the EssentialProperty element disposed in the AdaptationSet element. It is therefore possible to group the AdaptationSet elements associated by the schemeIdUri attribute and the value attribute of the EssentialProperty element, by the group attribute of the AdaptationSet element.
(175) In other words, in the second embodiment, it is possible to represent the relationship represented as 5 of a dotted line in
(176) In addition, the example of using the EssentialProperty element as the dependency description attribute has been described above, but without being limited thereto, for example, other dependency description attributes such as the SupplementalProperty element may be used.
(2) Specific Operation Example
(177) The specific operation examples of the second embodiment will be described with reference to
(2-1) Operation Example 4
(178)
(179) Here, in the Operation example 4 of
(180) In
(181) Two Representation elements are described under the first AdaptationSet element, Dialogs (Dlg) of a high bit rate and a low bit rate are listed as audio components having different bit rates, and one audio component is adaptively selected, for example, depending on the environmental condition of a network and the like, in the layer of the level 3 (PickOneComponent).
(182) Further, urn:...:mixing is designated as the schemeIdUri attribute of the EssentialProperty element and 1 is designated as the value attribute in the first AdaptationSet element. In other words, the component selected in the layer of the level 3 belongs to a mixing group 1 in which group ID (mxgrpid) of 1 is designated.
(183) Only one Representation element is described under the second AdaptationSet element, Effect (Efct) is listed as an audio component, and one audio component is typically selected. The audio component selected in this manner corresponds to the ElementaryComponent.
(184) Further, urn:...:mixing is designated as the schemeIdUri attribute of the EssentialProperty element and 1 is designated as the value attribute in the second AdaptationSet element. In other words, the component corresponding to the ElementaryComponent belongs to a mixing group 1 in which group ID (mxgrpid) of 1 is designated.
(185) Two Representation elements are described under the third AdaptationSet element, two pieces of music of a high bit rate and a low bit rate are listed as audio components having different bit rates, and one audio component is adaptively selected, for example, depending on the environmental condition of a network and the like, in the layer of the level 3 (PickOneComponent).
(186) Further, urn:...:mixing is designated as the schemeIdUri attribute of the EssentialProperty element and 1 is designated as the value attribute in the third AdaptationSet element. In other words, the component selected in the layer of the level 3 belongs to a mixing group 1 in which group ID (mxgrpid) of 1 is designated.
(187) In this manner, the function of the layer of the level 3 in the component layer structure of
(188) Further, since urn:...:mixing is designated as the schemeIdUri attribute of the EssentialProperty element and 1 is designated as the value attribute in the three AdaptationSet elements, the components belong to the same mixing group 1. In this manner, since a plurality of AdaptationSet elements are grouped by the schemeIdUri attribute and the value attribute of the EssentialProperty element, the function of the layer of the level 2 in the component layer structure of
(189) Thus, in the Operation example 4 of
(2-2) Operation Example 5
(190)
(191) In
(192) Further, in the Operation example 5 of
(193) In
(194) Two Representation elements are described under the first AdaptationSet element, Stereos (STR) of a high bit rate and a low bit rate are listed as audio components having different bit rates, and one audio component is adaptively selected, for example, depending on the environmental condition of a network and the like, in the layer of the level 3 (PickOneComponent).
(195) Further, group=1 is designated as the group attribute in the first AdaptationSet element. In other words, the component selected in the layer of the level 3 belongs to the group 1.
(196) Two Representation elements are described under the second AdaptationSet element, two pieces of music of a high bit rate and a low bit rate are listed as audio components having different bit rates, and one audio component is adaptively selected, for example, depending on the environmental condition of a network and the like, in the layer of the level 3 (PickOneComponent).
(197) Further, urn:...:mixing is designated as the schemeIdUri attribute of the EssentialProperty element and 1 is designated as the value attribute in the second AdaptationSet element. In other words, the component selected in the layer of the level 3 belongs to a mixing group 1 in which group ID (mxgrpid) of 1 is designated. Further, since group=1 is designated as the group attribute in the second AdaptationSet element, the component selected in the layer of the level 3 also belongs to the group 1 of the level 1.
(198) Only one Representation element is described under the third AdaptationSet element, Dialog (Dlg) is listed as an audio component, and one audio component is typically selected. The audio component selected in this manner corresponds to the ElementaryComponent.
(199) Further, urn:...:mixing is designated as the schemeIdUri attribute of the EssentialProperty element and 1 is designated as the value attribute in the third AdaptationSet element. In other words, the component corresponding to the ElementaryComponent belongs to a mixing group 1 in which group ID (mxgrpid) of 1 is designated. Further, since group=1 is designated as the group attribute in the third AdaptationSet element, the component corresponding to the ElementaryComponent also belongs to the group 1 of the level 1.
(200) Only one Representation element is described under the fourth AdaptationSet element, Effect (Efct) is listed as an audio component, and one audio component is typically selected. The audio component selected in this manner corresponds to the ElementaryComponent.
(201) Further, urn:...:mixing is designated as the schemeIdUri attribute of the EssentialProperty element and 1 is designated as the value attribute in the fourth AdaptationSet element. In other words, the component corresponding to the ElementaryComponent belongs to a mixing group 1 in which group ID (mxgrpid) of 1 is designated. Further, since group=1 is designated as the group attribute in the fourth AdaptationSet element, the component corresponding to the ElementaryComponent also belongs to the group 1 of the level 1.
(202) In this manner, the function of the layer of the level 3 in the component layer structure of
(203) Further, since urn:...:mixing is designated as the schemeIdUri attribute of the EssentialProperty element and 1 is designated as the value attribute in the second to fourth AdaptationSet elements, among the four AdaptationSet elements, the components belong to the same mixing group 1. In this manner, since a plurality of AdaptationSet elements are grouped by the schemeIdUri attribute and the value attribute of the EssentialProperty element, the function of the layer of the level 2 in the component layer structure of
(204) Further, since group=1 is designated as the group attribute in all four AdaptationSet elements, the components belong to the same group of the level 1. In this manner, the function of the layer of the level 1 in the component layer structure of
(205) Thus, in the Operation example 5 of
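The three-level handling walked through for Operation example 5 can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the receiver logic of the specification: the AdaptationSet class, the pick_one and build_candidates functions, the bit rate values, and the rule "highest bit rate that fits the available bandwidth" are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class AdaptationSet:
    name: str
    bitrates: list                       # one entry per Representation (hypothetical values, bit/s)
    group: int                           # group attribute (level 1)
    mixing_group: Optional[str] = None   # value attribute of the EssentialProperty (level 2)


def pick_one(aset, available_bps):
    """Level 3 (PickOneComponent): highest bit rate that fits, else the lowest one."""
    fitting = [b for b in aset.bitrates if b <= available_bps]
    return max(fitting) if fitting else min(aset.bitrates)


def build_candidates(asets, available_bps):
    """Apply level 3, then level 2; return the level-1 candidates per group."""
    candidates = {}   # group -> list of candidates, each a list of (name, bit rate)
    mixes = {}        # (group, mixing group) -> shared candidate list
    for a in asets:
        chosen = (a.name, pick_one(a, available_bps))
        per_group = candidates.setdefault(a.group, [])
        if a.mixing_group is None:
            per_group.append([chosen])          # played alone
        else:
            key = (a.group, a.mixing_group)
            if key not in mixes:
                mixes[key] = []
                per_group.append(mixes[key])
            mixes[key].append(chosen)           # mixed with the others in the group
    return candidates


example5 = [
    AdaptationSet("Stereo", [256_000, 128_000], group=1),
    AdaptationSet("Music", [128_000, 64_000], group=1, mixing_group="1"),
    AdaptationSet("Dialog", [64_000], group=1, mixing_group="1"),
    AdaptationSet("Effect", [64_000], group=1, mixing_group="1"),
]
print(build_candidates(example5, available_bps=200_000))
```

For group 1 the sketch yields two level-1 candidates: the stereo component on its own, or the mix of music, dialog, and effect. Choosing between those two candidates is the level-1 selection and is left out of the sketch.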
5. Third Embodiment
(1) Example of Other Use of Subset Element
(206) In the first embodiment, the example in which Subset elements are used in the MPD in order to satisfy the function of the layer of the level 2 in the component layer structure of
(2) Operation Example 6
(207)
(208) In the MPD of
(209) In the group 1 of the video, base, ext, SV1, and SV2 are listed as video components in respective Representation elements. Here, base represents a video component corresponding to a basic video signal that can be played alone, and ext represents a video component corresponding to a video signal for expansion. Further, in SV1 and SV2, SV represents a subview which is an auxiliary area of a main view which is a main display area.
(210) Under the AdaptationSet element of id=11, only one Representation element is described, and one video component corresponding to the basic video signal that can be played alone is typically selected. The video component selected in this way corresponds to the ElementaryComponent. In addition, the video component is for the main view.
(211) urn:...:SVC is designated as the attribute value of the schemeIdUri attribute of the EssentialProperty element in the AdaptationSet element of id=12. Here, in the Operation example 6 in
(212) Accordingly, four Representation elements under the AdaptationSet element of id=12 have dependency of a Base-Enhance relationship. In other words, under the AdaptationSet element of id=12, four Representation elements are described in which one video component corresponding to the basic video signal and three video components corresponding to the video signals for expansion are listed, and the four Representation elements have dependency of a Base-Enhance relationship. Accordingly, the video components corresponding to the basic video signal and the video signal for expansion are selected, for example, depending on the environmental condition of a network and the like, in the layer of the level 3 (PickOneComponent). In addition, the video component is for the main view.
(213) In this manner, in the second embodiment, the example in which EssentialProperty elements are used in the MPD in order to satisfy the function of the layer of the level 2 in the component layer structure of
(214) Under the AdaptationSet element of id=13, only one Representation element is described, and one video component for a subview 1 is typically selected. The video component selected in this way corresponds to the ElementaryComponent.
(215) Under the AdaptationSet element of id=14, three Representation elements are described, three video components for a subview 2 are listed, and one video component is adaptively selected, for example, depending on the environmental condition of a network and the like, in the layer of the level 3 (PickOneComponent).
(216) In this manner, the function of the layer of the level 3 in the component layer structure of
(217) Here, in the MPD of
(218) In this manner, the function of the layer of the level 1 in the component layer structure of
(219) Meanwhile, in a group 2 of audio, NrRbst, HiRbst, MCD, SV1, and SV2 are listed as audio components in respective Representation elements. NrRbst means an audio component having normal robustness. Further, HiRbst means an audio component having high robustness. MCD represents a multi-channel audio component. Further, in SV1 and SV2, SV represents an audio component for subview.
(220) Under the AdaptationSet element of id=21, two Representation elements are described, an audio component having normal robustness and an audio component having high robustness are listed, and one audio component is adaptively selected, for example, depending on the environmental condition of a network and the like, in the layer of the level 3 (PickOneComponent). In addition, the audio component is for the main view.
(221) Under the AdaptationSet element of id=22, only one Representation element is described, and one multi-channel audio component is typically selected. The audio component selected in this way corresponds to the ElementaryComponent. In addition, the audio component is for the main view.
(222) Under the AdaptationSet element of id=23, only one Representation element is described, and one audio component for subview 1 is typically selected. Similarly, under the AdaptationSet element of id=24, one audio component for subview 2 is typically selected. The audio components for subview selected in this way correspond to the ElementaryComponent.
(223) Here, in the MPD of
(224) In this manner, the function of the layer of the level 1 in the component layer structure of
(225) Further, in the MPD of
(226) Specifically, 11 21 is designated in the contains-attribute of the first Subset element, and this indicates that the AdaptationSet element of id=11 and the AdaptationSet element of id=21 are a pair of the AdaptationSet elements to be simultaneously presented. In other words, a video component (V-base) and an audio component (A-NrRbst or A-HiRbst) are components for a main view to be simultaneously played.
(227) Further, 11 22 is designated in the contains-attribute of the second Subset element, and this indicates that the AdaptationSet element of id=11 and the AdaptationSet element of id=22 are a pair of the AdaptationSet elements to be simultaneously presented. In other words, a video component (V-base) and an audio component (A-MCD) are components for a main view to be simultaneously played.
(228) Further, 12 21 is designated in the contains-attribute of the third Subset element, and this indicates that the AdaptationSet element of id=12 and the AdaptationSet element of id=21 are a pair of the AdaptationSet elements to be simultaneously presented. In other words, video components (V-base and V-ext) and an audio component (A-NrRbst or A-HiRbst) are components for a main view to be simultaneously played.
(229) Further, 12 22 is designated in the contains-attribute of the fourth Subset element, and this indicates that the AdaptationSet element of id=12 and the AdaptationSet element of id=22 are a pair of the AdaptationSet elements to be simultaneously presented. In other words, video components (V-base and V-ext) and an audio component (A-MCD) are components for a main view to be simultaneously played.
(230) Further, 13 23 is designated in the contains-attribute of the fifth Subset element, and this indicates that the AdaptationSet element of id=13 and the AdaptationSet element of id=23 are a pair of the AdaptationSet elements to be simultaneously presented. In other words, a video component (V-SV1) and an audio component (A-SV1) are components for a subview 1 to be simultaneously played.
(231) Further, 14 24 is designated in the contains-attribute of the sixth Subset element, and this indicates that the AdaptationSet element of id=14 and the AdaptationSet element of id=24 are a pair of the AdaptationSet elements to be simultaneously presented. In other words, a video component (V-SV2) and an audio component (A-SV2) are components for a subview 2 to be simultaneously played.
(232) Hitherto, in the Operation example 6 of
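The pairing expressed by the contains-attribute of the Subset elements in Operation example 6 can be read out with a short Python sketch. The MPD fragment below is hypothetical and reduced to the attributes discussed above (ids 11 to 14 for video in group 1, ids 21 to 24 for audio in group 2); the function names simultaneous_sets and partners are illustrative assumptions.

```python
import xml.etree.ElementTree as ET

NS = {"mpd": "urn:mpeg:dash:schema:mpd:2011"}

# Hypothetical, heavily reduced MPD fragment for Operation example 6.
SAMPLE_MPD = """<MPD xmlns="urn:mpeg:dash:schema:mpd:2011">
  <Period>
    <AdaptationSet id="11" group="1"/>
    <AdaptationSet id="12" group="1"/>
    <AdaptationSet id="13" group="1"/>
    <AdaptationSet id="14" group="1"/>
    <AdaptationSet id="21" group="2"/>
    <AdaptationSet id="22" group="2"/>
    <AdaptationSet id="23" group="2"/>
    <AdaptationSet id="24" group="2"/>
    <Subset contains="11 21"/>
    <Subset contains="11 22"/>
    <Subset contains="12 21"/>
    <Subset contains="12 22"/>
    <Subset contains="13 23"/>
    <Subset contains="14 24"/>
  </Period>
</MPD>"""


def simultaneous_sets(mpd_xml):
    """Return one tuple of AdaptationSet ids per Subset element, in document order."""
    root = ET.fromstring(mpd_xml)
    return [tuple(s.get("contains", "").split())
            for s in root.iterfind(".//mpd:Subset", NS)]


def partners(pairs, aset_id):
    """Ids that may be presented together with the given AdaptationSet id."""
    return sorted({other for pair in pairs if aset_id in pair
                   for other in pair if other != aset_id})


pairs = simultaneous_sets(SAMPLE_MPD)
print(pairs)
print(partners(pairs, "11"))
```

With this fragment, the video AdaptationSet of id=11 may be paired with the audio AdaptationSet of id=21 or id=22, matching the first two Subset elements described above.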
6. System Configuration
(233) (Configuration Example of Broadcast Communication System)
(234)
(235) As illustrated in
(236) The channel streamer 10 supplies metadata of content including various types of components such as a video, audio, and subtitles, to the MPD providing device 20. Further, the channel streamer 10 supplies stream data of content including various types of components such as a video, audio, and subtitles, to the DASH segmenter and streamer 30.
(237) Here, for example, in a service that provides the content, the video component and the audio component having different bit rates such as 1 Mbps, 5 Mbps, 10 Mbps, and 20 Mbps are prepared in order to implement the adaptive streaming delivery, as components constituting the content. The channel streamer 10 provides the components as the stream data, to the DASH segmenter and streamer 30.
(238) The MPD providing device 20 generates an MPD based on the metadata of content provided from the channel streamer 10 and segment metadata provided from the DASH segmenter and streamer 30. The generated MPD is transmitted by using broadcasting or communication.
(239) In other words, when the MPD providing device 20 transmits the MPD in a digital broadcast signal, for example, the MPD file is periodically multicast in a file delivery over unidirectional transport (FLUTE) session. Further, when the MPD providing device 20 transmits the MPD through a communication network, the MPD file is unicast to the client device 50 through the network 90, in response to, for example, a request for the MPD from the client device 50.
(240) The DASH segmenter and streamer 30 generates segment data based on the stream data of content provided from the channel streamer 10. The generated segment data is transmitted by using broadcasting or communication.
(241) In other words, when the DASH segmenter and streamer 30 transmits the segment data in a digital broadcast signal, for example, the segment data is periodically multicast in the FLUTE session. Further, when the DASH segmenter and streamer 30 transmits the segment data through a communication network, the segment data is unicast to the client device 50 through the network 90, in response to, for example, a request for the segment data from the client device 50.
(242) The client device 50 receives the MPD file that is multicast or unicast by the MPD providing device 20. Further, the client device 50 selects a best component among a plurality of component candidates constituting content, based on the MPD file. The client device 50 receives the segment data that is multicast or unicast by the DASH segmenter and streamer 30, based on the result of selection of the component. Then, the client device 50 plays the content by restoring the stream data of content from the received segment data.
(243) The broadcast communication system 1 is configured as described above. Next, the detailed configuration of each device constituting the broadcast communication system 1 of
(244) (Configuration Example of Device on Transmission Side)
(245)
(246) As illustrated in
(247) The channel streamer 10 is configured with a content management unit 101, a content accumulation unit 102, and a communication unit 103. The content management unit 101 manages the content accumulated in the content accumulation unit 102.
(248) The communication unit 103 supplies the metadata of content accumulated in the content accumulation unit 102 to the MPD providing device 20, in response to control from the content management unit 101. Further, the communication unit 103 supplies the stream data of content accumulated in the content accumulation unit 102 to the DASH segmenter and streamer 30, in response to control from the content management unit 101.
(249) The MPD providing device 20 is configured with an MPD generation unit 201, a communication unit 202, and a transmission unit 203.
(250) The communication unit 202 receives the metadata of content provided from the channel streamer 10 and the segment metadata provided from the DASH segmenter and streamer 30, and supplies them to the MPD generation unit 201. The MPD generation unit 201 generates an MPD file, based on the metadata of content and the segment metadata supplied from the communication unit 202, and supplies the MPD file to the communication unit 202 or the transmission unit 203.
(251) The communication unit 202 transmits the MPD file supplied from the MPD generation unit 201, to the client device 50 through the network 90, in response to the request for the MPD from the client device 50. Further, the transmission unit 203 modulates the MPD file supplied from the MPD generation unit 201, and transmits a digital broadcast signal through an antenna 204. In addition, at this time, the MPD file is transmitted in, for example, the FLUTE session.
(252) The DASH segmenter and streamer 30 is configured with a segment data generation unit 301, a segment metadata generation unit 302, a communication unit 303, and a transmission unit 304.
(253) The communication unit 303 receives the stream data of content provided from the channel streamer 10, and supplies the stream data of content to the segment data generation unit 301. The segment data generation unit 301 generates segment data, based on the stream data of content supplied from the communication unit 303, and supplies the segment data to the communication unit 303 or the transmission unit 304.
(254) The communication unit 303 transmits the segment data supplied from the segment data generation unit 301, to the client device 50 through the network 90, in response to the request for the segment data from the client device 50. Further, the transmission unit 304 modulates the segment data supplied from the segment data generation unit 301, and transmits a digital broadcast signal through an antenna 305. In addition, at this time, the segment data is transmitted in, for example, a FLUTE session.
(255) Further, the segment data generation unit 301 supplies the generated segment data to the segment metadata generation unit 302. The segment metadata generation unit 302 generates segment metadata, based on the segment data supplied from the segment data generation unit 301, and supplies the segment metadata to the communication unit 303. The communication unit 303 supplies the segment metadata supplied from the segment metadata generation unit 302 to the MPD providing device 20.
(256) In addition, in
(257) (Configuration Example of Device on Reception Side)
(258)
(259) As illustrated in
(260) The control unit 501 controls the operation of each unit of the client device 50.
(261) The reception unit 502 receives the digital broadcast signal transmitted from the device on the transmission side through an antenna 507, in response to control from the control unit 501. The reception unit 502 demodulates the digital broadcast signal, and supplies the obtained data to the control unit 501. The control unit 501 performs various processes on the data supplied from the reception unit 502.
(262) The display 503 displays an image according to the stream data of content in response to control from the control unit 501. The speaker 504 outputs audio according to the stream data of content in response to control from the control unit 501.
(263) The recording unit 505 is configured with, for example, a hard disk, and records the stream data of content in response to the control of the control unit 501. The communication unit 506 exchanges various types of data with the device on the transmission side through the network 90, in response to the control of the control unit 501.
(264) The control unit 501 is configured with an MPD acquisition unit 551, a component selection unit 552, a segment data acquisition unit 553, and a segment data playback unit 554.
(265) The MPD acquisition unit 551 controls the reception unit 502 or the communication unit 506 so as to acquire the MPD file transmitted in broadcasting or communication, and supplies the MPD file to the component selection unit 552. The component selection unit 552 selects a best component among a plurality of component candidates, based on the MPD file supplied from the MPD acquisition unit 551, and supplies the result of the selection to the segment data acquisition unit 553.
(266) The segment data acquisition unit 553 controls the reception unit 502 or the communication unit 506, based on the result of the selection from the component selection unit 552 so as to acquire the segment data transmitted in broadcasting or communication, and supplies the segment data to the segment data playback unit 554. The segment data playback unit 554 supplies the video stream data and the audio stream data which are obtained from the segment data from the segment data acquisition unit 553 to the display 503 and the speaker 504, respectively. Thus, an image of content is displayed on the display 503, and audio synchronized with the video is output from the speaker 504.
(267) In addition, the configuration of the client device 50 of
7. Flow of Specific Process of Each Apparatus
(268) As described above, the device on the transmission side may unicast the MPD file and the segment data in response to a request from the client device 50 on the reception side, or may periodically multicast them regardless of any request from the client device 50. Thus, the cases of multicasting and of unicasting the MPD file and the segment data are described separately below.
(269) <Flow of Process During Multicast Delivery>
(270) First, the flow of a specific process in each device during the multicast delivery will be described with reference to the flowchart of
(271) The channel streamer 10 performs the process of steps S101 to S102. Specifically, in step S101, the content management unit 101 acquires metadata of content from the content accumulation unit 102, and supplies the metadata to the MPD providing device 20 through the communication unit 103. Further, in step S102, the content management unit 101 acquires stream data of content from the content accumulation unit 102, and supplies the stream data to the DASH segmenter and streamer 30 through the communication unit 103.
(272) The MPD providing device 20 performs the process of steps S201 to S202. Specifically, the MPD providing device 20 acquires the metadata of content provided from the channel streamer 10, and segment metadata provided from the DASH segmenter and streamer 30 (steps S101, and S303). In step S201, the MPD generation unit 201 generates an MPD based on the metadata of content and the segment metadata.
(273) In step S202, the transmission unit 203 transmits a digital broadcast signal corresponding to the MPD generated in the process of step S201 through the antenna 204. Here, the MPD file is periodically multicast in a FLUTE session.
(274) The DASH segmenter and streamer 30 performs the process of steps S301 to S304. Specifically, the DASH segmenter and streamer 30 acquires stream data of content provided from the channel streamer 10 (step S102). In step S301, the segment data generation unit 301 generates segment data, based on the stream data of content. Further, in step S302, the segment metadata generation unit 302 generates segment metadata, based on the segment data generated in the process of step S301.
(275) In step S303, the communication unit 303 supplies the segment metadata generated in the process of step S302 to the MPD providing device 20. Further, in step S304, the transmission unit 304 transmits a digital broadcast signal corresponding to the segment data generated in the process of step S301 through the antenna 305. Here, the segment data is periodically multicast in a FLUTE session.
(276) The client device 50 performs the process of steps S501 to S504. Specifically, in step S501, the reception unit 502 receives the digital broadcast signal corresponding to the MPD which is multicast from the MPD providing device 20 through the antenna 507. Here, the MPD acquisition unit 551 acquires the MPD file which is obtained by demodulating the digital broadcast signal and transmitted in the FLUTE session.
(277) In step S502, the component selection unit 552 selects the best component among a plurality of component candidates, based on the MPD acquired in the process of step S501. Here, the component has the component layer structure of
(278) In step S503, the reception unit 502 receives the digital broadcast signal corresponding to the segment data which is multicast from the DASH segmenter and streamer 30 through the antenna 507. Here, the segment data acquisition unit 553 acquires the segment data which is multicast from the DASH segmenter and streamer 30, and transmitted in the FLUTE session, based on the result of the selection process of step S502.
(279) In step S504, the segment data playback unit 554 restores the stream data of content from the segment data which is acquired in the process of step S503, and supplies the video stream data and the audio stream data to the display 503 and the speaker 504, respectively. Thus, an image of content is displayed on the display 503, and audio synchronized with the video is output from the speaker 504.
(280) Hitherto, the flow of a specific process in each device during the multicast delivery has been described.
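The data flow on the transmission side (steps S101 to S102, S301 to S303, and S201) can be summarized with a small Python sketch. Everything in it is a simplifying assumption made for illustration: segments are fixed-size byte slices, the segment metadata is a list of index and size records, the MPD is stood in for by a plain dictionary rather than an XML document, and the function names are hypothetical. Modulation and the FLUTE session (steps S202 and S304) are omitted.

```python
def generate_segments(stream_data, segment_size):
    """Step S301 (sketch): split the stream data into fixed-size segments."""
    return [stream_data[i:i + segment_size]
            for i in range(0, len(stream_data), segment_size)]


def generate_segment_metadata(segments):
    """Step S302 (sketch): describe each segment by its index and size."""
    return [{"index": i, "size": len(s)} for i, s in enumerate(segments)]


def generate_mpd(content_metadata, segment_metadata):
    """Step S201 (sketch): combine content metadata and segment metadata.

    A dictionary is used here as a stand-in for the MPD file.
    """
    return {"title": content_metadata["title"], "segments": segment_metadata}


# Steps S101 and S102: the channel streamer supplies metadata and stream data.
content_metadata = {"title": "example content"}   # hypothetical metadata
stream_data = b"abcdefghij"                       # hypothetical stream data

segments = generate_segments(stream_data, segment_size=4)
segment_metadata = generate_segment_metadata(segments)    # supplied in step S303
mpd = generate_mpd(content_metadata, segment_metadata)
print(segments)
print(mpd)
```

The sketch makes the ordering constraint visible: the MPD can only be generated after the segmenter has produced the segment metadata, which is why the MPD providing device 20 waits for both inputs.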
(281) <Flow of Process During Unicast Delivery>
(282) Next, the flow of a specific process in each device during the unicast delivery will be described with reference to the flowchart of
(283) The details of the process of steps S151 to S152 which are performed by the channel streamer 10 are the same as the process of steps S101 to S102 of
(284) The MPD providing device 20 performs the process of steps S251 and S252. Specifically, the MPD providing device 20 acquires the metadata of content provided from the channel streamer 10 and the segment metadata provided from the DASH segmenter and streamer 30 (steps S151, and S353). In step S251, the MPD generation unit 201 generates the MPD file, based on the metadata of content and the segment metadata.
(285) In step S252, when receiving a request for the MPD from the client device 50, the communication unit 202 transmits the MPD file generated in step S251 to the client device 50 through the network 90, in response to the request for the MPD.
(286) The DASH segmenter and streamer 30 performs the process of steps S351 to S354. Specifically, the DASH segmenter and streamer 30 acquires the stream data of content provided from the channel streamer 10 (step S152). In step S351, the segment data generation unit 301 generates segment data, based on the stream data of content. Further, in step S352, the segment metadata generation unit 302 generates segment metadata, based on the segment data generated in the process of step S351.
(287) In step S353, the communication unit 303 provides the segment metadata generated in the process of step S352 to the MPD providing device 20. Further, in step S354, when receiving a request for the segment data from the client device 50, the communication unit 303 transmits the segment data generated in the process of step S351 to the client device 50 through the network 90, in response to the request for the segment data.
(288) The client device 50 performs the process of steps S551 to S556. Specifically, in step S551, the communication unit 506 accesses the MPD providing device 20 through the network 90, in response to control from the MPD acquisition unit 551 so as to make a request for the MPD. In step S552, the communication unit 506 receives the MPD file that is unicast from the MPD providing device 20 through the network 90, in response to control from the MPD acquisition unit 551.
(289) In step S553, the component selection unit 552 selects a best component among a plurality of component candidates, based on the MPD file acquired in the process of step S552. Here, although the component has the component layer structure in
(290) In step S554, the communication unit 506 accesses the DASH segmenter and streamer 30 through the network 90, in response to control from the segment data acquisition unit 553 so as to make a request for the segment data. In step S555, the communication unit 506 receives the segment data that is unicast from the DASH segmenter and streamer 30 through the network 90, in response to control from the segment data acquisition unit 553.
(291) In step S556, the segment data playback unit 554 restores the stream data of content from the segment data which is acquired in the process of step S555, and supplies the video stream data and the audio stream data to the display 503 and the speaker 504, respectively. Thus, an image of content is displayed on the display 503, and audio synchronized with the video is output from the speaker 504.
(292) Hitherto, the flow of a specific process in each device during the unicast delivery has been described.
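The client-side sequence of steps S551 to S556 can be sketched in Python as follows. The sketch avoids real networking: an in-memory dictionary (SERVER) stands in for the MPD providing device 20 and the DASH segmenter and streamer 30, and the MPD is represented by a dictionary instead of an XML file. The paths, component ids, segment contents, and the bandwidth-based selection rule are hypothetical; only the 1 Mbps and 5 Mbps bit rates echo the example values given earlier.

```python
# Hypothetical in-memory stand-in for the devices on the transmission side.
SERVER = {
    "/mpd": {
        "components": [
            {"id": "v-low", "bandwidth": 1_000_000,
             "segments": ["/v-low/1", "/v-low/2"]},
            {"id": "v-high", "bandwidth": 5_000_000,
             "segments": ["/v-high/1", "/v-high/2"]},
        ]
    },
    "/v-low/1": b"lowA", "/v-low/2": b"lowB",
    "/v-high/1": b"highA", "/v-high/2": b"highB",
}


def request(path):
    """Stand-in for a unicast request through the network 90."""
    return SERVER[path]


def select_component(mpd, available_bps):
    """Step S553 (sketch): highest bit rate that fits, else the lowest one."""
    components = mpd["components"]
    fitting = [c for c in components if c["bandwidth"] <= available_bps]
    if fitting:
        return max(fitting, key=lambda c: c["bandwidth"])
    return min(components, key=lambda c: c["bandwidth"])


def play(available_bps):
    mpd = request("/mpd")                                    # steps S551 and S552
    component = select_component(mpd, available_bps)         # step S553
    segments = [request(p) for p in component["segments"]]   # steps S554 and S555
    return b"".join(segments)                                # step S556: restore the stream


print(play(2_000_000))
print(play(10_000_000))
```

The selection rule shown here covers only an adaptive choice by bit rate; a receiver following the component layer structure would also apply the grouping and group selection of the other levels before requesting segments.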
8. Configuration of Computer
(293) The series of processes described above can be performed by hardware, or software. When the series of processes are performed by software, programs constituting the software are installed in a computer.
(294) In a computer 900, a central processing unit (CPU) 901, a read only memory (ROM) 902, and a random access memory (RAM) 903 are connected to each other through a bus 904. Further, an input and output interface 905 is connected to the bus 904. An input unit 906, an output unit 907, a recording unit 908, a communication unit 909, and a drive 910 are connected to the input and output interface 905.
(295) The input unit 906 includes a keyboard, a mouse, a microphone, and the like. The output unit 907 includes a display, a speaker, and the like. The recording unit 908 includes a hard disk, a nonvolatile memory, and the like. The communication unit 909 includes a network interface, and the like. The drive 910 drives a removable media 911 such as a magnetic disk, an optical disk, a magneto-optical disk, and a semiconductor memory.
(296) In the computer 900 configured as described above, a series of processes described above is performed by the CPU 901 loading a program stored in the ROM 902 or the recording unit 908 on the RAM 903 and executing the program, through the input and output interface 905 and the bus 904.
(297) The program that the computer 900 (CPU 901) executes can be provided by being recorded, for example, on the removable media 911 as package media or the like. Further, the program can be provided through a wired or wireless transmission medium such as a local area network, the Internet, and digital satellite broadcasting.
(298) In the computer 900, the program can be installed in the recording unit 908, by mounting the removable media 911 in the drive 910 through the input and output interface 905. Further, the program can be received by the communication unit 909 and installed in the recording unit 908, through a wired or wireless transmission medium. Alternatively, the program can be installed in advance in the ROM 902 or the recording unit 908.
(299) Here, in this specification, the processes that the computer performs according to the program need not be performed chronologically in the order described in the flowcharts. In other words, the processes that the computer performs according to the program include processes performed in parallel or individually (for example, parallel processing or object-based processing). Further, the program may be processed by a single computer (processor), or may be processed in a distributed manner by a plurality of computers.
(300) In addition, the embodiments of the present technology are not limited to the embodiments described above, and various modifications are possible within a scope that does not depart from the spirit of the present technology.
(301) In addition, the present technology can have the following configurations.
(302) (1)
(303) A reception apparatus including
(304) circuitry configured to
(305) receive a digital broadcast signal; and
(306) based on a media presentation description (MPD), which conforms to the moving picture expert group-dynamic adaptive streaming over HTTP (MPEG-DASH) standard and is transmitted in the digital broadcast signal or via a communication network,
(307) perform at least one of selection or grouping of components according to a level of a component hierarchy associated with the components, and
(308) control playback of content of at least one of the components.
(309) (2)
(310) The reception apparatus according to (1),
(311) wherein the component hierarchy includes a first level, a second level, and a third level,
(312) wherein the third level is for adaptively selecting one of the components,
(313) wherein the second level is for grouping the component that is selected in the third level and components that are not a target of the third level into one group component, and
(314) wherein the first level is for selecting one of the group component grouped in the second level, the component selected in the third level, and the components that are not targets of the second level and the third level.
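The three levels described in (2) can be illustrated with a minimal sketch. It is not the implementation of the embodiments: the `Component` type, the function names, the component names, and the bitrate-based selection rule are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Component:
    # Hypothetical model of a component; the fields are assumptions.
    name: str
    bitrate_kbps: int


def pick_adaptive(candidates, available_kbps):
    """Third level: adaptively select one component.

    Assumed rule: the highest bitrate that fits the available bandwidth,
    falling back to the lowest bitrate when nothing fits.
    """
    fitting = [c for c in candidates if c.bitrate_kbps <= available_kbps]
    if fitting:
        return max(fitting, key=lambda c: c.bitrate_kbps)
    return min(candidates, key=lambda c: c.bitrate_kbps)


def composite(components):
    """Second level: group several components into one group component."""
    return tuple(components)


def select_one(options, index):
    """First level: select exactly one of the offered options."""
    return options[index]


base_layers = [Component("video-base-low", 1000), Component("video-base-high", 4000)]
enhancement = Component("video-enhancement", 2000)  # not a target of the third level
alternative = Component("video-alt-angle", 3000)    # not a target of levels two and three

picked = pick_adaptive(base_layers, available_kbps=5000)
group = composite([picked, enhancement])
choice = select_one([group, alternative], 0)
```

Here the third level yields one base layer, the second level combines it with the enhancement component, and the first level chooses between that group and the stand-alone alternative.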
(315) (3)
(316) The reception apparatus according to (2),
(317) wherein a function of the third level is implemented by mapping the third level to an AdaptationSet element of the MPD, and mapping Representation elements or SubRepresentation elements listed in the AdaptationSet element to the components that are not the target of the third level,
(318) wherein a function of the second level is implemented by a pair of AdaptationSet elements being designated by a first element disposed at the same level as that of the AdaptationSet element defined in the MPD, and
(319) wherein a function of the first level is implemented by performing grouping by a group attribute of the AdaptationSet element and a group attribute defined in the first element.
(320) (4)
(321) The reception apparatus according to (3),
(322) wherein the first element is a Subset element, and
(323) wherein a set of AdaptationSet elements is designated by a contains-attribute of the Subset element.
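As a hedged illustration of (4), the sketch below reads the contains attribute of each Subset element and resolves it to AdaptationSet id values within the same Period. It assumes the usual MPEG-DASH namespace and a whitespace-separated id list; the sample MPD and the function name `subset_members` are hypothetical.

```python
import xml.etree.ElementTree as ET

NS = {"mpd": "urn:mpeg:dash:schema:mpd:2011"}

# Hypothetical, heavily trimmed MPD used only for this example.
SUBSET_MPD = """<MPD xmlns="urn:mpeg:dash:schema:mpd:2011">
  <Period>
    <AdaptationSet id="1" group="1" mimeType="video/mp4"/>
    <AdaptationSet id="2" group="1" mimeType="video/mp4"/>
    <AdaptationSet id="3" group="2" mimeType="audio/mp4"/>
    <Subset contains="1 2"/>
  </Period>
</MPD>"""


def subset_members(mpd_xml):
    """Return, for each Subset element, the AdaptationSet ids it designates."""
    root = ET.fromstring(mpd_xml)
    result = []
    for period in root.findall("mpd:Period", NS):
        known_ids = {a.get("id") for a in period.findall("mpd:AdaptationSet", NS)}
        for subset in period.findall("mpd:Subset", NS):
            ids = subset.get("contains", "").split()
            # Keep only ids that refer to an AdaptationSet in this Period.
            result.append([i for i in ids if i in known_ids])
    return result


members = subset_members(SUBSET_MPD)
```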
(324) (5)
(325) The reception apparatus according to (3) or (4),
(326) wherein grouping is performed for each category of the component, in the first level.
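The per-category grouping in (5) can be sketched as follows, assuming that each category (for example, video or audio) is given its own value of the group attribute of the AdaptationSet element. The sample MPD, the function names, and the "take the first entry" selection rule are hypothetical.

```python
import xml.etree.ElementTree as ET

NS = {"mpd": "urn:mpeg:dash:schema:mpd:2011"}

# Hypothetical MPD: group 1 is assumed to be video, group 2 audio.
GROUPED_MPD = """<MPD xmlns="urn:mpeg:dash:schema:mpd:2011">
  <Period>
    <AdaptationSet id="1" group="1" mimeType="video/mp4"/>
    <AdaptationSet id="2" group="1" mimeType="video/mp4"/>
    <AdaptationSet id="3" group="2" mimeType="audio/mp4"/>
    <AdaptationSet id="4" group="2" mimeType="audio/mp4"/>
  </Period>
</MPD>"""


def sets_by_group(mpd_xml):
    """Collect AdaptationSet ids under each value of the group attribute."""
    root = ET.fromstring(mpd_xml)
    groups = {}
    for aset in root.iterfind(".//mpd:AdaptationSet", NS):
        group = aset.get("group")
        if group is None:
            continue  # sets without a group attribute are skipped in this sketch
        groups.setdefault(group, []).append(aset.get("id"))
    return groups


def select_one_per_group(groups):
    """First-level selection: one AdaptationSet per group (here simply the first)."""
    return {group: ids[0] for group, ids in groups.items()}


groups = sets_by_group(GROUPED_MPD)
selection = select_one_per_group(groups)
```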
(327) (6)
(328) The reception apparatus according to (2),
(329) wherein a function of the third level is implemented by mapping the third level to the AdaptationSet element of the MPD, and mapping Representation elements or SubRepresentation elements listed in the AdaptationSet element to the components that are not the target of the third level,
(330) wherein a function of the second level is implemented by grouping a plurality of AdaptationSet elements by a second element that defines a relationship between the AdaptationSet elements of the MPD, a relationship between the Representation elements, or a relationship between the SubRepresentation elements, and
(331) wherein a function of the first level is implemented by performing grouping by a group attribute of the AdaptationSet element.
(332) (7)
(333) The reception apparatus according to (6),
(334) wherein the second element is an EssentialProperty element, and
(335) wherein a set of AdaptationSet elements is designated by a schemeIdUri attribute and a value attribute of the EssentialProperty element.
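A minimal sketch of the designation in (7): AdaptationSet elements that carry an EssentialProperty element with the same schemeIdUri and the same value are collected into one set. The scheme URI `urn:example:adaptationset-group`, the value `g1`, and the function name are placeholders invented for this example and are not taken from the specification.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

NS = {"mpd": "urn:mpeg:dash:schema:mpd:2011"}
GROUP_SCHEME = "urn:example:adaptationset-group"  # hypothetical scheme URI

# Hypothetical, heavily trimmed MPD used only for this example.
PROPERTY_MPD = """<MPD xmlns="urn:mpeg:dash:schema:mpd:2011">
  <Period>
    <AdaptationSet id="1">
      <EssentialProperty schemeIdUri="urn:example:adaptationset-group" value="g1"/>
    </AdaptationSet>
    <AdaptationSet id="2">
      <EssentialProperty schemeIdUri="urn:example:adaptationset-group" value="g1"/>
    </AdaptationSet>
    <AdaptationSet id="3"/>
  </Period>
</MPD>"""


def group_by_property(mpd_xml, scheme=GROUP_SCHEME):
    """Map each EssentialProperty value to the AdaptationSet ids that carry it."""
    root = ET.fromstring(mpd_xml)
    groups = defaultdict(list)
    for aset in root.iterfind(".//mpd:AdaptationSet", NS):
        for prop in aset.findall("mpd:EssentialProperty", NS):
            if prop.get("schemeIdUri") == scheme:
                groups[prop.get("value")].append(aset.get("id"))
    return dict(groups)


property_groups = group_by_property(PROPERTY_MPD)
```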
(336) (8)
(337) The reception apparatus according to (6) or (7),
(338) wherein grouping is performed for each category of the component, in the first level.
(339) (9)
(340) The reception apparatus according to any one of (1) to (8),
(341) wherein the at least one of the components is transmitted in the digital broadcast signal or via the communication network.
(342) (10)
(343) A reception method of a reception apparatus, the method including receiving a digital broadcast signal; and
(344) based on a media presentation description (MPD), which conforms to the moving picture expert group-dynamic adaptive streaming over HTTP (MPEG-DASH) standard and is transmitted in the digital broadcast signal or via a communication network, performing, by circuitry of the reception apparatus, at least one of selection or grouping of components according to a level of a component hierarchy associated with the components; and
(345) controlling, by the circuitry, playback of content of at least one of the components.
(346) (11)
(347) A transmission apparatus including
(348) circuitry configured to
(349) generate a media presentation description (MPD) that conforms to the moving picture expert group-dynamic adaptive streaming over HTTP (MPEG-DASH) standard, the MPD including information for performing at least one of selection or grouping of components according to a level of a component hierarchy associated with the components and controlling playback of content of at least one of the components; and
(350) transmit the generated MPD in a digital broadcast signal or via a communication network.
(351) (12)
(352) A transmission method of a transmission apparatus, the method including
(353) generating, by circuitry of the transmission apparatus, a media presentation description (MPD) that conforms to the moving picture expert group-dynamic adaptive streaming over HTTP (MPEG-DASH) standard, the MPD including information for performing at least one of selection or grouping of components according to a level of a component hierarchy associated with the components and controlling playback of content of at least one of the components; and
(354) transmitting, by the circuitry, the generated MPD in a digital broadcast signal or via a communication network.
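The generating step in (12) might look like the sketch below, which builds a skeletal MPD containing AdaptationSet elements with group attributes and Subset elements with contains attributes. The function `build_mpd` and its parameters are hypothetical, and the output omits the attributes and child elements that a complete, schema-valid MPD would require; transmission is not shown.

```python
import xml.etree.ElementTree as ET

MPD_NS = "urn:mpeg:dash:schema:mpd:2011"


def build_mpd(adaptation_sets, subsets):
    """Serialize a skeletal MPD.

    adaptation_sets: iterable of (id, group) pairs.
    subsets: iterable of id lists, one list per Subset element.
    """
    ET.register_namespace("", MPD_NS)  # emit MPD_NS as the default namespace
    mpd = ET.Element(f"{{{MPD_NS}}}MPD")
    period = ET.SubElement(mpd, f"{{{MPD_NS}}}Period")
    for set_id, group in adaptation_sets:
        ET.SubElement(
            period,
            f"{{{MPD_NS}}}AdaptationSet",
            {"id": str(set_id), "group": str(group)},
        )
    for ids in subsets:
        ET.SubElement(
            period,
            f"{{{MPD_NS}}}Subset",
            {"contains": " ".join(str(i) for i in ids)},
        )
    return ET.tostring(mpd, encoding="unicode")


mpd_text = build_mpd([(1, 1), (2, 1), (3, 2)], [[1, 2]])
```

The resulting string could then be handed to whatever broadcast or network transmission path the apparatus uses.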
(355) (13)
(356) The reception apparatus according to any one of (1) to (9),
(357) wherein the MPD includes a first element for designating a pair of AdaptationSet elements, in a range outside of the component hierarchy.
(358) (14)
(359) The reception apparatus according to (13),
(360) wherein the first element is a Subset element, and
(361) wherein a set of AdaptationSet elements is designated by a contains-attribute of the Subset element.
(362) (15)
(363) The reception apparatus according to (13) or (14),
(364) wherein the MPD includes a second element for defining a relationship between the AdaptationSet elements of the MPD, a relationship between the Representation elements, or a relationship between the SubRepresentation elements, and
(365) wherein a relationship between Representation element groups under the AdaptationSet element is designated by the second element.
(366) (16)
(367) The reception apparatus according to (15),
(368) wherein the second element is an EssentialProperty element, and
(369) wherein a relationship between the Representation element groups under the AdaptationSet element is designated by the schemeIdUri attribute of the EssentialProperty element.
(370) (17)
(371) The reception apparatus according to any one of (13) to (16),
(372) wherein the component hierarchy includes a first level, a second level, and a third level,
(373) wherein the third level is for adaptively selecting one of the components,
(374) wherein the second level is for grouping the component that is selected in the third level and components that are not a target of the third level into one group component, and
(375) wherein the first level is for selecting one component among the group component grouped in the second level, the component selected in the third level, and the components that are not targets of the second level and the third level.
(376) (18)
(377) The reception method according to (10),
(378) wherein the MPD includes a first element for designating a pair of AdaptationSet elements, in a range outside of the component hierarchy.
(379) (19)
(380) The transmission apparatus according to (11), wherein the circuitry is further configured to generate the MPD including a first element for designating a pair of AdaptationSet elements, in a range outside of the component hierarchy.
(381) (20)
(382) The transmission method according to (12), wherein the step of generating the MPD further includes generating the MPD including a first element for designating a pair of AdaptationSet elements, in a range outside of the component hierarchy.
(383) It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and alterations may occur depending on design requirements and other factors insofar as they are within the scope of the appended claims or the equivalents thereof.
REFERENCE SIGNS LIST
(384) 1 Broadcast communication system
10 Channel streamer
20 MPD providing device
30 DASH segmenter and streamer
50 Client device
90 Network
101 Content management unit
102 Content accumulation unit
103 Communication unit
201 MPD generation unit
202 Communication unit
203 Transmission unit
301 Segment data generation unit
302 Segment metadata generation unit
303 Communication unit
304 Transmission unit
501 Control unit
502 Reception unit
503 Display
504 Speaker
505 Recording unit
506 Communication unit
551 MPD acquisition unit
552 Component selection unit
553 Segment data acquisition unit
554 Segment data playback unit
900 Computer
901 CPU