Patent classifications
H04N2013/0085
Method and apparatus for determining disparity
A disparity determination method and apparatus are provided. The disparity determination method includes receiving first signals of an event from a first sensor disposed at a first location and second signals of the event from a second sensor disposed at a second location that is different than the first location, and extracting a movement direction of the event, based on at least one among the first signals and the second signals. The disparity determination method further includes determining a disparity between the first sensor and the second sensor, based on the movement direction, a difference between times at which the event is sensed by corresponding pixels in the first sensor, and a difference between times at which the event is sensed by corresponding pixels in the first sensor and the second sensor.
Method and apparatus for buffer management in cloud based virtual reality services
Provided is a method for creating a virtual reality content, storing the virtual reality content in a transmission buffer, and after that, managing the transmission buffer. A server creates the virtual reality content based on user's motion information, stores the virtual reality content in the transmission buffer and is allowed to modify the virtual reality content stored in the transmission buffer based on subsequently received user's motion information, so that the most recent user's motion information can be appropriately reflected in the virtual reality content. It is possible to provide a more immersive virtual reality service.
Apparatus and method for referring to motion status of image capture device to generate stereo image pair to auto-stereoscopic display for stereo preview
A stereo preview apparatus has an auto-stereoscopic display, an input interface, a motion detection circuit, and a visual transition circuit. The input interface receives at least an input stereo image pair including a left-view image and a right-view image generated from an image capture device. The motion detection circuit evaluates a motion status of the image capture device. The visual transition circuit generates an output stereo image pair based on the input stereo image pair, and outputs the output stereo image pair to the auto-stereoscopic display for stereo preview, wherein the visual transition circuit refers to the evaluated motion status to configure adjustment made to the input stereo image pair when generating the output stereo image pair.
Method for assessing objective quality of stereoscopic video based on reduced time-domain weighting
A method for assessing an objective quality of a stereoscopic video based on reduced time-domain weighting, which considers a time domain perception redundant characteristic of human eyes during a video perception, includes steps of: through a motion intensity mean value and a motion intensity variance of an undistorted stereoscopic video and the motion intensity mean value and the motion intensity variance of each frame group, determining a motion intensity level of each frame group of the undistorted stereoscopic video; for the frame groups having different motion intensity levels, selecting undistorted reduced stereoscopic images through different frame extracting strategies with different densities; measuring a quality of a simultaneous distorted reduced stereoscopic image relative to the undistorted reduced stereoscopic image; through weighting each quality of the simultaneous distorted reduced stereoscopic image relative to the undistorted reduced stereoscopic image, obtaining a quality of a distorted stereoscopic video relative to the undistorted stereoscopic video.
VIRTUAL REALITY ENVIRONMENT BASED MANIPULATION OF MULTILAYERED MULTI-VIEW INTERACTIVE DIGITAL MEDIA REPRESENTATIONS
Various embodiments of the present disclosure relate generally to systems and methods for generating multi-view interactive digital media representations in a virtual reality environment. According to particular embodiments, a plurality of images is fused into a first content model and a first context model, both of which include multi-view interactive digital media representations of objects. Next, a virtual reality environment is generated using the first content model and the first context model. The virtual reality environment includes a first layer and a second layer. The user can navigate through and within the virtual reality environment to switch between multiple viewpoints of the content model via corresponding physical movements. The first layer includes the first content model and the second layer includes a second content model and wherein selection of the first layer provides access to the second layer with the second content model.
Methods and systems for assigning locations to devices
A location identification system analyzes information received corresponding to a device detected in a room of a patient. On detecting a location identification of the device, the system assigns the device to the location corresponding to the location identification. In embodiments, the system retrieves patient and care team information for the location. The location and patient and care team information may be communicated to a central video monitoring system.
METHOD AND APPARATUS OF ENCODING/DECODING IMAGE DATA BASED ON TREE STRUCTURE-BASED BLOCK DIVISION
Disclosed are methods and apparatuses for image data encoding/decoding. A method of decoding an image includes receiving a bitstream in which the image is encoded; obtaining index information for specifying a block division type of a current block in the image; and determining the block division type of the current block from a candidate group pre-defined in the decoding apparatus. The candidate group includes a plurality of candidate division types, including at least one of a non-division, a first quad-division, a second quad-division, a binary-division or a triple-division. The method also includes dividing the current block into a plurality of sub-blocks; and decoding each of the sub-blocks with reference to syntax information obtained from the bitstream.
Methods for automatic registration of 3D image data
A method for automatic registration of 3D image data, captured by a 3D image capture system having an RGB camera and a depth camera, includes capturing 2D image data with the RGB camera at a first pose; capturing depth data with the depth camera at the first pose; performing an initial registration of the RGB camera to the depth camera; capturing 2D image data with the RGB camera at a second pose; capturing depth data at the second pose; and calculating an updated registration of the RGB camera to the depth camera.
APPARATUS AND METHOD FOR TRANSMITTING AND RECEIVING ADDITIONAL INFORMATION FOR 3D IMAGE
A method and apparatus for generating additional information used to reconstruct an additional image through steps of: generating information for movement compensation based on the original right image of the stereoscopic image and the previous frame of the right image; and generating first additional information for reconstructing the right image to a high resolution based on the original right image and the information for movement compensation are provided.
Three-dimensional hand tracking using depth sequences
In the field of Human-computer interaction (HCI), i.e., the study of the interfaces between people (i.e., users) and computers, understanding the intentions and desires of how the user wishes to interact with the computer is a very important problem. The ability to understand human gestures, and, in particular, hand gestures, as they relate to HCI, is a very important aspect in understanding the intentions and desires of the user in a wide variety of applications. In this disclosure, a novel system and method for three-dimensional hand tracking using depth sequences is described. Some of the major contributions of the hand tracking system described herein include: 1.) a robust hand detector that is invariant to scene background changes; 2.) a bi-directional tracking algorithm that prevents detected hands from always drifting closer to the front of the scene (i.e., forward along the z-axis of the scene); and 3.) various hand verification heuristics.