Automatic identification of medically relevant video elements
10706544 · 2020-07-07
Assignee
Inventors
- Christoph Hiltl (Tuttlingen, DE)
- Heinz-Werner Stiller (Tuttlingen, DE)
- Nader Hassanzadeh (Tuttlingen, DE)
- Mélissa Wiemuth (Tuttlingen, DE)
- Frank Stock (Tuttlingen, DE)
- Lars Mündermann (Tuttlingen, DE)
- Bernd Münzer (Tuttlingen, DE)
CPC classification
G06V10/25
PHYSICS
International classification
G06T7/246
PHYSICS
Abstract
Apparatus for an automatic identification of medically relevant video elements, the apparatus including a data input, configured to receive a data stream of image slices, wherein the data stream of image slices represents a temporal course of a view of image slices defined by a masking strip of video images from a video which has been recorded during a medical surgery on a patient, an analysis apparatus configured to analyze the data stream of image slices via an analysis including at least one predefined analysis step for the presence of at least one sought-for feature and to output a result of the presence, and a processing device configured to output a start mark which indicates a correspondence between the presence and a position in the data stream of image slices if the result indicates the presence of the sought-for feature. Also, a corresponding method is disclosed.
Claims
1. An apparatus for an automatic identification of medically relevant video elements from a video, the apparatus comprising: a data input, configured to receive a data stream of image slices, wherein the data stream of image slices represents a temporal course of a view of image slices defined by a masking strip of video images from the video which has been recorded during a medical surgery on a patient, an analysis apparatus configured to analyze the data stream of image slices via an analysis comprising at least one predefined analysis step for a presence of at least one sought-for feature and to output a result of the presence, and a processing device configured to output a start mark which indicates a correspondence between the presence and a position in the data stream of image slices if the result indicates the presence of the sought-for feature.
2. The apparatus of claim 1, wherein the analysis determines a color saturation in at least one image slice from the data stream of image slices and indicates the presence of the sought-for feature if the color saturation falls below a threshold of the color saturation, wherein the sought-for feature is a smoke emission.
3. The apparatus of claim 1, wherein the analysis performs a transformation of color information in at least one image slice from the data stream of image slices into the L*a*b* color space and indicates whether color values of pixels in the at least one image slice exceed a threshold in an a-channel of the L*a*b* color space, wherein the sought-for feature is the presence of a metallic instrument.
4. The apparatus of claim 3, wherein the analysis determines in at least two image slices, where the color values of the pixels exceed the threshold in the a-channel, the respective position of the pixels and calculates from a comparison of the positions of the pixels a movement of the pixels, wherein the sought-for feature is a movement of the metallic instrument.
5. The apparatus of claim 1, wherein the analysis determines a hue in at least one image slice from the data stream of image slices and indicates whether the hue exceeds a threshold of a red component, wherein the sought-for feature is a recording within a body of the patient.
6. The apparatus of claim 1, wherein the analysis determines a luminance in at least one image slice from the data stream of image slices and indicates whether the luminance exceeds a threshold, wherein the sought-for feature is a recording outside of a housing.
7. The apparatus of claim 1, wherein the analysis determines a hue in at least one image slice from the data stream of image slices and indicates whether the hue exceeds a threshold of a red component, wherein the sought-for feature is a bleeding.
8. The apparatus of claim 1, wherein the analysis calculates by comparing the positions of corresponding pixels in at least two image slices from the data stream of image slices a movement of pixels, wherein the sought-for feature is a movement of a recording device that recorded the video.
9. The apparatus of claim 1, wherein the analysis evaluates at least one image slice from the data stream of image slices regarding a degree of sharpness and indicates whether the degree of sharpness falls below a threshold, wherein the sought-for feature is a blurring.
10. The apparatus of claim 3, wherein, upon presence of the metallic instrument and a blurring, the sought-for feature is that a recording device which recorded the video is located in a tubular instrument.
11. The apparatus of claim 1, wherein the analysis generates a target image slice from a target image using the masking strip and compares at least one image slice from the data stream of image slices with the target image slice, wherein the sought-for feature is a match or a high similarity between the target image slice and the at least one image slice.
12. The apparatus of claim 1, further comprising a generation device for image slices configured to receive the video, to mask the video images except for the masking strip and to output resulting image slices as the data stream of image slices, in particular to the data input.
13. The apparatus according to claim 12, wherein the generation device for image slices is configured to compress the video images before masking and/or to compress the resulting image slices, in particular using a discrete cosine transformation.
14. The apparatus according to claim 13, wherein the compression is applied to blocks of pixels of the video image, wherein the blocks, in particular, have a size of 8×8 pixels.
15. The apparatus of claim 1, wherein the data input is in communication with the analysis apparatus, and the processing device is in electrical communication with the analysis apparatus and a recording device.
16. A method for an automatic identification of medically relevant video elements, the method comprising the steps of: receiving, via a data input, a data stream of image slices, wherein the data stream of image slices represents a temporal course of a view of image slices defined by a masking strip of video images from a video which has been recorded during a medical surgery on a patient, analyzing, via an analysis comprising at least one predefined analysis step, the data stream of image slices for a presence of a sought-for feature, outputting a result of the presence, and outputting a start mark which indicates a correspondence between the presence and a position in the data stream of image slices, if the result indicates the presence of the sought-for feature.
17. A non-transitory computer-readable information storage media having stored thereon instructions, that when executed by one or more processors, cause to be performed a method for an automatic identification of medically relevant video elements comprising: receiving, via a data input, a data stream of image slices, wherein the data stream of image slices represents a temporal course of a view of image slices defined by a masking strip of video images from a video which has been recorded during a medical surgery on a patient, analyzing, via an analysis comprising at least one predefined analysis step, the data stream of image slices for a presence of a sought-for feature, outputting a result of the presence, and outputting a start mark which indicates a correspondence between the presence and a position in the data stream of image slices, if the result indicates the presence of the sought-for feature.
18. The media of claim 17, wherein the analyzing determines a color saturation in at least one image slice from the data stream of image slices and indicates the presence of the sought-for feature if the color saturation falls below a threshold of the color saturation, wherein the sought-for feature is a smoke emission.
19. The media of claim 17, wherein the analyzing performs a transformation of color information in at least one image slice from the data stream of image slices into the L*a*b* color space and indicates whether color values of pixels in the at least one image slice exceed a threshold in an a-channel of the L*a*b* color space, wherein the sought-for feature is the presence of a metallic instrument.
20. The media of claim 17, wherein the analyzing generates a target image slice from a target image using the masking strip and compares at least one image slice from the data stream of image slices with the target image slice, wherein the sought-for feature is a match or a high similarity between the target image slice and the at least one image slice.
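As a rough illustration of the color-based analysis steps in claims 2, 5 and 7, the following Python sketch flags low color saturation (smoke emission) and a dominant red component (a recording within the body, or a bleeding) in an RGB image slice. The array layout, the function names and the threshold values are assumptions chosen for illustration; they are not taken from the patent.

```python
import numpy as np

def saturation(slice_rgb):
    # Per-pixel HSV-style saturation: (max - min) / max, 0 where max == 0.
    mx = slice_rgb.max(axis=-1).astype(float)
    mn = slice_rgb.min(axis=-1).astype(float)
    return np.divide(mx - mn, mx, out=np.zeros_like(mx), where=mx > 0)

def detect_smoke(slice_rgb, threshold=0.15):
    # Claim 2: smoke desaturates the image, so the feature is indicated
    # when the mean saturation falls below the threshold (assumed value).
    return saturation(slice_rgb).mean() < threshold

def detect_inside_body(slice_rgb, red_threshold=0.5):
    # Claims 5/7: a dominant red component suggests a recording within
    # the body of the patient (or a bleeding); threshold is assumed.
    rgb = slice_rgb.astype(float)
    red_share = rgb[..., 0] / np.maximum(rgb.sum(axis=-1), 1.0)
    return red_share.mean() > red_threshold
```

A desaturated grey slice triggers the smoke detector but not the red-hue detector; a strongly red slice does the opposite. A production system would of course tune the thresholds per claim and per imaging setup.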
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
(2) Embodiments of the disclosure are shown in the figures and are explained in the following description.
DESCRIPTION OF PREFERRED EMBODIMENTS
(15) An analysis apparatus 20 is configured to analyze the data stream 14 of image slices via an analysis comprising at least one predefined analysis step for the presence of at least one sought-for feature and to output a result E of the presence.
(16) The apparatus 10 further comprises a processing device 22 which is configured to output a start mark SM which indicates a correspondence between the presence and a position in the data stream 14 of image slices if the result E indicates the presence of the sought-for feature. In this embodiment the processing device 22 is further configured to output an end mark EM which indicates a correspondence between a missing presence of the sought-for feature and a position in the data stream 14 of image slices, if the result E indicates the missing presence of the sought-for feature after a start mark SM has been output.
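The start-mark/end-mark behaviour of the processing device 22 described in paragraphs (15) and (16) amounts to a small two-state machine: emit a start mark SM when the sought-for feature appears and an end mark EM when it disappears again. The following sketch is a minimal illustration, assuming the analysis emits one (position, present) pair per image slice; all names are illustrative, not from the patent.

```python
def mark_segments(results):
    # results: iterable of (position, present) pairs, as a stand-in for
    # the result E of the analysis apparatus 20.
    # Emits ("SM", pos) when the feature becomes present and ("EM", pos)
    # when it is no longer present after a start mark has been output.
    marks = []
    active = False
    for pos, present in results:
        if present and not active:
            marks.append(("SM", pos))
            active = True
        elif not present and active:
            marks.append(("EM", pos))
            active = False
    return marks
```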
(17) The apparatus 10 further comprises a generation device 24 for image slices which is configured to receive the video 18, mask the video images 16 except for the masking strip and to output resulting image slices 26 as the data stream 14 of image slices, in particular to the data input 12.
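A minimal sketch of what the generation device 24 does, assuming for illustration that the masking strip is a horizontal band of rows and that each video image is an RGB NumPy array; the strip parameterisation and function names are assumptions, not taken from the patent.

```python
import numpy as np

def extract_slice(frame, strip_rows):
    # Keep only the rows covered by the masking strip; the rest of the
    # video image 16 is masked out. strip_rows = (start, stop) is an
    # assumed parameterisation of the strip.
    start, stop = strip_rows
    return frame[start:stop, :, :]

def slice_stream(frames, strip_rows):
    # Generation device 24: turn the video 18 into the data stream 14
    # of image slices 26, one slice per video image.
    return [extract_slice(f, strip_rows) for f in frames]
```

Processing only a narrow strip per frame is what keeps the downstream analysis cheap enough to run over an entire surgery recording.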
(36) The video is shown at 51. As a result of a post-processing of the video data and a compressed visualization of the video 18, a data stream 14 of image slices is shown at 52. Phases of the surgery which may have been marked by the user are shown at 53.
(37) Medically relevant video elements are indicated at 53a and 53b. An evaluation is shown at 54, wherein according to an exemplary embodiment the following colors are used: green as an indication of a good quality of the images, orange as an indication of a bad quality (e.g. blurred, too dark, shaky), blue as an indication of a recording outside the body of a patient, dark blue as an indication of a recording using fluorescence, and black as an indication that no evaluation was performed. Time-related information, especially the length of the video or a zoom factor, is shown at 55. The present point in time of playing the video is shown at 56.
(38) Using the multi-functional pointer 57, it is possible to jump to a certain point in time. The preview window 58 shows the corresponding frame from the respective position within the video. In addition to the automatic identification of medically relevant video elements, further relevant video elements as well as text messages, speech recordings and drawings can be manually added using a context menu of the multi-functional pointer 57. The context menu may comprise specific functions like delete POI, delete SOI, modify start mark of POI, modify start mark/end mark of SOI, add description and export to playlist. Exporting to playlist means that manually marked segments or automatically determined segments may be concatenated and then exported. Such exported data represents only a part of the total data that is available. Upon request, only this data is shown in the circular representation.
(39) To control the video, functions like start, stop, pause, fast advance, fast rewind, jump to next POI, jump to previous POI may be provided at 59 when a control element, in particular a mouse pointer, enters this area. The speed of the fast advance or fast rewind may be configured.
(40) When the video 18 is played, it is shown at 51, and the representations 52, 53, 54 as well as the indicators 53a, 53b rotate counter-clockwise at a speed which corresponds to the zoomed length of the video 18. The present point in time is shown at the 12 o'clock position at 56. For other exemplary embodiments, the representations 52, 53, 54 and indicators 53a, 53b may be static while 56 rotates in a clockwise direction.
(41) Using a control element, in particular the wheel of a mouse, the zoom factor may be changed at 55. If the mouse wheel is moved forward, the zoom factor is increased and only the corresponding part of the video is shown at 51. The representation of the data at 52, 53, 54, 53a and 53b is adjusted correspondingly, along with the speed of the circular representations. If the mouse wheel is rotated backwards, the zoom factor is reduced and said adjustments are performed correspondingly.