Patent classifications
G06V20/62
Assigning case identifiers to video streams
A process mining system performs process mining using visual logs generated from video streams of worker devices. Specifically, for a given worker device, the process mining system obtains a series of images capturing a screen of a worker device while the worker device processes one or more tasks related to an operation process. The process mining system determines activity labels for a plurality of images. An activity label for an image may indicate an activity performed on the worker device when the image was captured. The activity label is determined by extracting information from pixels of the image and inferring the activity of the worker device from the extracted information. The process mining system generates event logs from the visual logs of worker devices and uses the event logs for process mining.
Systems and methods for determining a visual appearance quality of an exterior signage area of a vehicle
Systems and methods for exterior signage evaluation are disclosed herein. An example method includes receiving images of an exterior signage area of an exterior surface of a first vehicle, the images being obtained by the first vehicle, a second vehicle camera or an infrastructure camera, determining current environmental conditions around the first vehicle, processing the images of the exterior signage area using the current environmental conditions, wherein processing includes comparing an expected appearance of the exterior signage area with an actual appearance of the exterior signage area to determine a visual appearance quality of the exterior signage area, and presenting a message on a display that includes the visual appearance quality.
Systems and methods for determining a visual appearance quality of an exterior signage area of a vehicle
Systems and methods for exterior signage evaluation are disclosed herein. An example method includes receiving images of an exterior signage area of an exterior surface of a first vehicle, the images being obtained by the first vehicle, a second vehicle camera or an infrastructure camera, determining current environmental conditions around the first vehicle, processing the images of the exterior signage area using the current environmental conditions, wherein processing includes comparing an expected appearance of the exterior signage area with an actual appearance of the exterior signage area to determine a visual appearance quality of the exterior signage area, and presenting a message on a display that includes the visual appearance quality.
Order post to enable parallelized order taking using artificial intelligence engine(s)
In some aspects, a computing device receives a scan of a code displayed on an order post located near a restaurant, determines that the code is associated with the restaurant, and automatically opens a software application and navigates the software application to an ordering page associated with the restaurant. The computing device initiates receiving, via the software application, input associated with an order, sends the input to a machine learning based software agent executing on a server, receives a predicted response to the input, provides the predicted response as audio output and/or displays the predicted response on the touchscreen display device. After the order is complete, the computing device sends order data associated with the order to the restaurant. After receiving an indication from the restaurant that the order is ready, the computing device indicates that the order is ready to be picked up.
Order post to enable parallelized order taking using artificial intelligence engine(s)
In some aspects, a computing device receives a scan of a code displayed on an order post located near a restaurant, determines that the code is associated with the restaurant, and automatically opens a software application and navigates the software application to an ordering page associated with the restaurant. The computing device initiates receiving, via the software application, input associated with an order, sends the input to a machine learning based software agent executing on a server, receives a predicted response to the input, provides the predicted response as audio output and/or displays the predicted response on the touchscreen display device. After the order is complete, the computing device sends order data associated with the order to the restaurant. After receiving an indication from the restaurant that the order is ready, the computing device indicates that the order is ready to be picked up.
Method and system for reducing manual review of license plate images for assessing toll charges
A tolling system is operable to reduce the number of manual reviews of a toll point images needed to process toll fee charges by separately reporting from both toll points and mobile device in vehicles running a tolling application program the lane and crossing time when traversing a toll point. A tolling service can match records produced by the toll points with records providing by the mobile device when the toll point cannot immediately determine the identity of the toll customer passing through the toll point.
Method and system for reducing manual review of license plate images for assessing toll charges
A tolling system is operable to reduce the number of manual reviews of a toll point images needed to process toll fee charges by separately reporting from both toll points and mobile device in vehicles running a tolling application program the lane and crossing time when traversing a toll point. A tolling service can match records produced by the toll points with records providing by the mobile device when the toll point cannot immediately determine the identity of the toll customer passing through the toll point.
SYSTEMS AND METHODS FOR DETECTING TEXT IN IMAGES
In some embodiments, apparatuses and methods are provided herein useful to detecting text in images. In some embodiments, a system for detecting text in images comprises a database configured to store images and a control circuit configured to retrieve an image, generate, based on the image, a collection of augmented images, detect characters in each of the augmented images, generate bounding boxes for the characters in each of augmented images, recognize the characters in each of the augmented images, select, based on the recognition of the characters in each of the augmented images, candidate characters, wherein the candidate characters are selected based on consistency of the recognition of the characters in each of the augmented images, detect, for the image, a color associated with the characters, and store, in the database, the image, the candidate characters, and the color associated with the characters.
SYSTEMS AND METHODS FOR DETECTING TEXT IN IMAGES
In some embodiments, apparatuses and methods are provided herein useful to detecting text in images. In some embodiments, a system for detecting text in images comprises a database configured to store images and a control circuit configured to retrieve an image, generate, based on the image, a collection of augmented images, detect characters in each of the augmented images, generate bounding boxes for the characters in each of augmented images, recognize the characters in each of the augmented images, select, based on the recognition of the characters in each of the augmented images, candidate characters, wherein the candidate characters are selected based on consistency of the recognition of the characters in each of the augmented images, detect, for the image, a color associated with the characters, and store, in the database, the image, the candidate characters, and the color associated with the characters.
SYSTEMS AND METHODS OF MEDIA PROCESSING
Media processing systems and techniques are described. A media processing system receives image data that represents an environment captured by an image sensor. The media processing system receives an indication of an object in the environment that is represented in the image data. The media processing system divides the image data into regions, including a first region and a second region. The object is represented in one of the plurality of regions. The media processing system modifies the image data to obscure the first region without obscuring the second region based on the object being represented in the one of the plurality of regions. The media processing system outputs the image data after modifying the image data. In some examples, the object is depicted in the first region and not the second region. In some examples, the object is depicted in the second region and not the first region.