G06V30/166

ELECTRONIC DEVICE, METHOD, AND NON-TRANSITORY COMPUTER READABLE STORAGE MEDIUM FOR RESTORING LOW-RESOLUTION IMAGE BY USING IMAGE RESTORATION MODEL FOR EXTRACTING GLOBAL CONTEXT INFORMATION
20250348978 · 2025-11-13 ·

According to an embodiment, an electronic device receives a request to restore a second input image with a first resolution representing a specified portion of a first input image to an output image with a second resolution exceeding the first resolution. The electronic device, based on the received request, executes an image restoration model including a first encoder for extracting first feature information from the first input image, a second encoder for extracting second feature information from the second input image, and a decoder for generating the output image with the second resolution based on multi head cross attention between the first feature information and the second feature information. The electronic device provides the output image with the second resolution obtained based on the execution of the image restoration model, as a response to the request.

Recognition system for recognizing multiple inputs of gestures, handwriting symbols and virtual keys on touch screen
12554347 · 2026-02-17 · ·

A recognition system for recognizing multiple inputs of gestures, handwriting symbols and virtual keys on a touch screen includes a touch IC serves to convert a plurality of touch signals of the touch screen to a touch data frame. A processor set is connected to the touch IC and serves to perform a touch data processing on the touch data frame. The touch data processing is performed by using a processing directly executed by an OS (Operating System) and a processing of AI (artificial intelligence) recognizing. An AI recognition module is connected to the processor set. The AI recognition module is used for recognizing multiple key inputs, operation gestures and handwriting symbols. The key inputs and handwriting symbols are corrected by a grammar correction and a symbol correction respectively. The touch screen serves to display a virtual keyboard.

Connecting vision and language using fourier transform
12633146 · 2026-05-19 · ·

A method for text-image integration is provided. The method may include receiving a question related to pairable data comprising text data and image data. Embeddings are generated from the text tokens and image encodings. Embeddings are generated from the text tokens and image encodings. The embeddings include text embeddings and image embeddings. A spectral conversion of the text embeddings and the image embeddings is performed to generate spectral data. The spectral data is processed to extract text-image features. The text-image features are processed to generate inferred answers to the question.

Connecting vision and language using fourier transform
12633146 · 2026-05-19 · ·

A method for text-image integration is provided. The method may include receiving a question related to pairable data comprising text data and image data. Embeddings are generated from the text tokens and image encodings. Embeddings are generated from the text tokens and image encodings. The embeddings include text embeddings and image embeddings. A spectral conversion of the text embeddings and the image embeddings is performed to generate spectral data. The spectral data is processed to extract text-image features. The text-image features are processed to generate inferred answers to the question.