H04N5/268

MEDICAL VIDEO PROCESSING SYSTEM AND ENCODER
20220408118 · 2022-12-22 · ·

Provided is a medical video processing system capable of moderating changes in image quality of medical video resulted from encoding, and, an encoder used for the medical video system. A medical video system 1000 has a monitor group 300 and an encoder 400 that accept medical video input from a switches 100 through separate transmission paths, and the encoder 400 subjects the input medical video to encoding as well as image quality adjustment.

Intelligent Multi-Camera Switching with Machine Learning
20220408029 · 2022-12-22 ·

Multiple cameras in a conference room, each pointed in a different direction. At least a primary camera includes a microphone array to perform sound source localization (SSL). The SSL is used in combination with a video image to identify the speaker from among multiple individuals that appear in the video image. Neural network or machine learning processing is performed on the primary camera video of the identified speaker to determine the facial pose of speaker. The locations of the other cameras with respect to the primary camera have been determined. Using those locations and the facial pose, the camera with the best frontal view of the speaker is determined. That camera is set as the designated camera to provide video for transmission to the far end.

Intelligent Multi-Camera Switching with Machine Learning
20220408029 · 2022-12-22 ·

Multiple cameras in a conference room, each pointed in a different direction. At least a primary camera includes a microphone array to perform sound source localization (SSL). The SSL is used in combination with a video image to identify the speaker from among multiple individuals that appear in the video image. Neural network or machine learning processing is performed on the primary camera video of the identified speaker to determine the facial pose of speaker. The locations of the other cameras with respect to the primary camera have been determined. Using those locations and the facial pose, the camera with the best frontal view of the speaker is determined. That camera is set as the designated camera to provide video for transmission to the far end.

Matching Active Speaker Pose Between Two Cameras
20220408015 · 2022-12-22 ·

Described are multiple cameras in a conference room, each pointed in a different direction. A primary camera includes a microphone array to perform sound source localization (SSL). The SSL is used in combination with a video image to identify the speaker from among multiple individuals that appear in the video image. Pose information of the speaker is developed. Pose information of each individual identified in each other camera is developed. The speaker pose information is compared to the pose information of the individuals from the other cameras. The best match for each other camera is selected as the speaker in that camera. The speaker views of each camera are compared to determine the speaker view with the most frontal view of the speaker. That camera is selected to provide the video for provision to the far end.

Matching Active Speaker Pose Between Two Cameras
20220408015 · 2022-12-22 ·

Described are multiple cameras in a conference room, each pointed in a different direction. A primary camera includes a microphone array to perform sound source localization (SSL). The SSL is used in combination with a video image to identify the speaker from among multiple individuals that appear in the video image. Pose information of the speaker is developed. Pose information of each individual identified in each other camera is developed. The speaker pose information is compared to the pose information of the individuals from the other cameras. The best match for each other camera is selected as the speaker in that camera. The speaker views of each camera are compared to determine the speaker view with the most frontal view of the speaker. That camera is selected to provide the video for provision to the far end.

Electronic device and method for controlling electronic device

An electronic device and a controlling method thereof are provided. The electronic device includes a memory, a first camera including a first image sensor and at least one processor, and the at least one processor obtains a plurality of image frames by photographing surroundings of the electronic device through the first camera, sets a region of interest (ROI) on the plurality of image frames, obtains a motion identification map corresponding to each of the plurality of image frames and select at least one image frame from among the plurality of image frames based on the obtained motion identification map, identifies whether there is a motion of an object on the ROI set on the selected at least one image frame, and performs a Super Slow Motion (SSM) function through the first camera based on a result of the identification.

Electronic device and method for controlling electronic device

An electronic device and a controlling method thereof are provided. The electronic device includes a memory, a first camera including a first image sensor and at least one processor, and the at least one processor obtains a plurality of image frames by photographing surroundings of the electronic device through the first camera, sets a region of interest (ROI) on the plurality of image frames, obtains a motion identification map corresponding to each of the plurality of image frames and select at least one image frame from among the plurality of image frames based on the obtained motion identification map, identifies whether there is a motion of an object on the ROI set on the selected at least one image frame, and performs a Super Slow Motion (SSM) function through the first camera based on a result of the identification.

INTELLIGENT MULTI-CAMERA SWITCHING WITH MACHINE LEARNING

Multiple cameras in a conference room, each pointed in a different direction and including a microphone array to perform sound source localization (SSL). The SSL is used in combination with the video image to identify the speaker from among multiple individuals that appear in the video image. Neural network or machine learning processing is performed on the identified speaker to determine the quality of the front or facial view of the speaker. The best view of the speaker's face from the various cameras is selected to be provided to the far end. If no view is satisfactory, a default view is selected and that is provided to the far end. The use of the SSL allows selection of the proper individual from a group of individuals in the conference room, so that only the speaker's head is analyzed for the best facial view and then framed for transmission.

INTELLIGENT MULTI-CAMERA SWITCHING WITH MACHINE LEARNING

Multiple cameras in a conference room, each pointed in a different direction and including a microphone array to perform sound source localization (SSL). The SSL is used in combination with the video image to identify the speaker from among multiple individuals that appear in the video image. Neural network or machine learning processing is performed on the identified speaker to determine the quality of the front or facial view of the speaker. The best view of the speaker's face from the various cameras is selected to be provided to the far end. If no view is satisfactory, a default view is selected and that is provided to the far end. The use of the SSL allows selection of the proper individual from a group of individuals in the conference room, so that only the speaker's head is analyzed for the best facial view and then framed for transmission.

Image processing device, and image processing method
11528429 · 2022-12-13 · ·

An image processing device includes an evaluation unit configured to evaluate whether a first region of a captured image satisfies a quality condition, and a composition frame setting unit configured to set a different composition frame in the captured image in accordance with an evaluation result of the first region.