Autonomous systems human controller simulation
11148671 · 2021-10-19
Assignee
Inventors
Cpc classification
B60W50/14
PERFORMING OPERATIONS; TRANSPORTING
G06F3/017
PHYSICS
G06F3/011
PHYSICS
B60W50/0098
PERFORMING OPERATIONS; TRANSPORTING
B60W40/08
PERFORMING OPERATIONS; TRANSPORTING
B60W60/00
PERFORMING OPERATIONS; TRANSPORTING
B60K35/00
PERFORMING OPERATIONS; TRANSPORTING
International classification
B60W40/08
PERFORMING OPERATIONS; TRANSPORTING
B60W50/00
PERFORMING OPERATIONS; TRANSPORTING
B60K35/00
PERFORMING OPERATIONS; TRANSPORTING
Abstract
Dynamic online and real-time virtual humans exhibit behaviors associated with inputs and outputs of an autonomous control system. To foster the awareness and trust, the virtual humans exhibit situational awareness via apparent (e.g., rendered) behaviors based on inputs such as direct system measures, functions of the measures, estimates of state based on measures, and estimates of state based on a priori constraints. The virtual humans also exhibit situational control via apparent behaviors associated with outputs such as direct control of devices, functions of control, actions based on high-level goals, and the optional use of virtual versions of conventional physical controls. A dynamic virtual human who continually exhibits awareness of the system state and relevant contextual circumstances, along with the ability to directly control the system, is used to reduce negative feelings associated with the system such as uncertainty, concern, stress, or anxiety on the part of real humans.
Claims
1. A computer-implemented method of conveying to a vehicle non-occupant their spatial detection by a trust-sensitive autonomous vehicle control system operating the vehicle, a computer processor generating attentive gesture images through a vehicle-mounted display device viewable by the non-occupant, the method comprising the steps of: providing a computer comprising the computer processor accessing non-transitory computer-readable memory, the memory storing computer-readable instructions that, when executed by the processor, cause the processor to evaluate input data and logically modify the output of the display device in response; receiving, by one or more sensors communicatively coupled to the processor, relative location data of the non-occupant used by the trust-sensitive autonomous control system to make real-world automation decisions; and generating a plurality of attentive gesture images through the display device viewable to the non-occupant wherein the attentive gesture images change in response to the location of the non-occupant relative to the vehicle wherein the attentive gestures images are selected from the group consisting of head orientation, pupil orientation, limb orientation, and facial expression whereby the attentive gestures track the location of the non-occupant relative to the vehicle.
2. The method of claim 1, wherein the attentive gesture images include turning a head of a virtual human driver in the spatial direction of the non-occupant.
3. The method of claim 1, wherein the attentive gesture images include a graphically rendered virtual human driver actions include graphically rendered rotation of vehicle steering wheel.
4. The method of claim 1, wherein the attentive gesture images include a graphically rendered virtual human driver actions include graphically rendered vehicle pedal engagement.
5. The method of claim 1, wherein the attentive gesture images include a virtual human driver positionally rendered over the controls of the existent vehicle.
6. The method of claim 1, wherein the display is a two-dimensional, flat panel display integral to the vehicle.
7. The method of claim 1, wherein the display is a heads-up, projected display integral to the vehicle.
8. The method of claim 1, wherein the display is a transparent, organic light-emitting diode display integral to the vehicle.
9. The method of claim 1, wherein the attentive gesture images are displayed through an embedded organic light emitting diode screen within a window of the vehicle.
10. The method of claim 1, wherein the attentive gesture images are displayed through a heads-up display projected onto a window of the vehicle.
11. A computer-implemented method of conveying to a vehicle non-occupant their spatial detection by a trust-sensitive autonomous vehicle control system operating the vehicle, a computer processor generating attentive gesture images through an augmented reality display device worn by the non-occupant, the method comprising the steps of: providing a computer comprising the computer processor accessing non-transitory computer-readable memory, the memory storing computer-readable instructions that, when executed by the processor, cause the processor to evaluate input data and logically modify the output of the display device in response; receiving, by one or more sensors communicatively coupled to the processor, relative location data of the non-occupant used by the trust-sensitive autonomous control system to make real-world automation decisions; and generating a plurality of attentive gesture images and transmitting the images to the augmented reality headset wherein the attentive gesture images change in response to the location of the non-occupant relative to the vehicle and the attentive gestures track the location of the non-occupant relative to the vehicle.
12. The method of claim 11, wherein the attentive gesture images of the non-occupant's spatial location detected by the vehicle are presented through the non-occupant's augmented reality headset as an object external to the current spatial location of the vehicle.
13. The method of claim 12, wherein the object is an anthropomorphic form.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) For a fuller understanding of the invention, reference should be made to the following detailed description, taken in connection with the accompanying drawings, in which:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
(19)
(20)
(21)
(22)
(23)
(24)
(25)
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT
(26) The present invention introduces the use of a dynamic online and real-time virtual humans, whose behaviors are associated with inputs and outputs of an autonomous control system, to facilitate real human awareness of and trust in the autonomous control system. To foster the awareness and trust, the virtual humans themselves exhibit situational awareness via apparent (e.g., rendered) behaviors based on inputs such as direct system measures, functions of the measures, estimates of state based on measures, and estimates of state based on a priori constraints.
(27) The virtual humans also exhibit situational control via apparent behaviors associated with outputs such as direct control of devices (including control system feedback), functions of control (e.g., abstractions of control), actions based on high-level goals, and the optional use of virtual versions of conventional physical controls. A dynamic virtual human who continually exhibits awareness of the system state and relevant contextual circumstances, along with the ability to directly control the system, is used to reduce negative feelings associated with the system such as uncertainty, concern, stress, or anxiety on the part of real humans.
(28) Unlike the prior art, the virtual human is based directly on the inputs and outputs of the autonomous control system, where “inputs and outputs” includes direct measures/estimates and controls, derived measures/estimates and controls, and associated a priori constraints. This contrasts with the use of a virtual human simply for more natural conversational interfaces. The primary advantage here is that the real humans witness a virtual human who appears to be both situationally aware and reliably in control of the system, in a way that corresponds directly to the real system inputs (e.g., sensor measurements) and real system outputs (e.g., control of a vehicle).
(29) One possible embodiment involves a separate subsystem that receives select input and output signals from the primary autonomous control system, including direct measures/outputs, estimates, functions of measures and estimates, predictions of events, high-level control, etc. The system would then maintain an ongoing dynamic (time-varying) representation of the virtual human, who would always (as a default) exhibit some “idle” behavior approximating that of a real human. For example, they would continually move their body in small amounts, occasionally look around for no apparent reason, blink, sniff, etc. This sedentary behavior would be smoothly transitioned into direct input or output behaviors, e.g., a head turning towards an emergency vehicle, and the hands appearing to turn the steering wheel as the autonomous system pulls the car off the road.
(30) An embodiment may employ what a person of ordinary skill would recognize as some form of Augmented Reality (AR), e.g., the virtual human and virtual controls could be displayed via a head-worn stereo display, a fixed auto-stereo display, or a monocular display that is configured to apparently position the virtual human in the desire location. Some versions of such AR display paradigms might require a form of six-degree-of-freedom (6 DOF) tracking of a display or the user's head with respect to the car, to render the virtual human in the proper place, shape, etc. etc. The disclosed techniques could also be employed without AR; for example, the aware and acting virtual human could be shown using a conventional 2D display mounted in a vehicle dashboard or center console, on a conventional computer monitor, or on the display of a tablet computer or a smartphone.
(31) The appearance of the virtual human need not be limited to just the users of the system. For example, in the case of an autonomous car, pedestrians or bystanders (real humans who are NOT in the vehicle) could also see the virtual humans if they (the pedestrians or bystanders) themselves have AR capabilities, e.g., see an apparently aware and in control “human” parking the car. This could increase the confidence of vulnerable nearby individuals (e.g., pedestrians) that the system is safe.
(32) Situational Awareness:
(33) A virtual human exhibiting situational awareness can be illustrated using the example of an autonomous automobile. System input signals will normally reveal or predict a critical emergent event such as another vehicle crossing dangerously in front of the autonomous vehicle, an objecting falling off a truck immediately in front of the vehicle, and a child on a bike (or an animal) darting in front of the vehicle. In each of these examples, a continuous combination of autonomous system inputs and outputs would be involved in actually controlling the vehicle. The very same inputs and outputs could, at the same time, be used to render a virtual human who appears to be aware and has the situation under control, e.g., the virtual human could turn their head toward the unexpected object, verbalize awareness, and appear to be acting to control the vehicle (per the actual outputs). Another example involves an autonomous vehicle changing lanes.
(34) Before an autonomous system would initiate a lane change, the autonomous vehicle's system sensors would be used to scan the space around the car, looking for (ensuring the absence of) objects. In this case, the virtual human would turn their head, appearing to check for clearance before changing lanes. The virtual human's hands would appear on the real (or a virtual) steering wheel, and they would appear to turn the wheel (via the output signals affecting actual turning) simultaneously as they continue to check for clearance (via the real input signals associated with sideways-looking sensors).
(35) It is important to convey sufficient situational awareness and control even in the absence of specific events, i.e., when the system inputs do not indicate any immediate or predicted danger. In such circumstances, to reflect the continuous ongoing monitoring of the input signals, the virtual human could periodically turn its head as if scanning mirrors (rear, side), checking traffic on sides/behind, etc. The behavior should appear to be reliable but random (implemented via a pseudorandom sequence) so that the real human perceives it as being real, thoughtful, and reliable.
(36) In response to an approaching emergency vehicle, the virtual human should turn its head toward the emergency vehicle where it is sensed, verbalize awareness, and if the control system decides to pull off the road, the virtual human should (per the system inputs and outputs) appear to simultaneously check for clearance while turning the wheel (similar to the lane change example above) while also appearing to apply the brakes.
(37) The virtual human need not remain inside the car—indeed, the virtual human should move to a pose that conveys the best representation of human awareness correlated with the system inputs (e.g., sensors). For example, when parallel parking, the virtual human could teleport or step out of the car, position herself on the sidewalk near the front of the car, and gesture to indicate the clearance is OK (or “stop”) corresponding to what the system senses and does.
(38) During a traffic jam, the virtual human could walk, run, or fly ahead (outside of the car) to “see what's happening” and then report back to the inhabitants. Or, the virtual human could deploy a virtual (or real) drone to fly on ahead and could appear to look at a tablet display providing information to the virtual human.
(39) In systems where there is no apparent direct physical control, e.g., an autonomous system to make and operationalize stock buying/selling decisions, or a system for autonomous medical diagnosis, in response to system input, the virtual human could appear to consult corresponding sources (inputs) such as a newspaper, a television, a stock “ticker” feed, a medical journal, etc. The virtual human could even appear to consult other virtual humans—a behavior that can further increase confidence in the apparent decisions.
(40) Reliable Control:
(41) A virtual human exhibiting reliable control of an autonomous system can also be illustrated using the example of an autonomous automobile. For example, as indicated above, the virtual human could be rendered such that their hands are gripping the real (or a virtual) steering wheel. As the system commands the electro-mechanical/hydraulic actuators to turn the vehicle, the virtual human should be correspondingly rendered as if affecting the turning in a conventional (not-autonomous) vehicle. If the autonomous system determines that the horn should be sounded (a “honk”), the virtual human's hand should follow the “honk” output—moving to and appearing to press the real (or virtual) horn button. As indicated above, in the case of an approaching emergency vehicle, when the autonomous system determines it needs to apply the brakes, and the output commands are sent to the braking subsystem, the virtual human's foot should move to and appear to press on the real (or virtual) brake pedal. Similarly, if the autonomous system sends an output signal to engage a turn signal, the virtual human should appear to be the one initiating the action.
(42) Reliable control goes beyond driving (controlling the vehicle rate and direction) to controls, including interior amenities such as climate control, entertainment, mobile telephone use, etc. For example, if a system input indicates a user request to “play jazz” or “lower the temperature” or “call my mother,” the virtual human could move their head to indicate listening, and then when the autonomous system creates an output signal to play the music (or adjust the climate control or use the phone) the virtual human would be moved in ways that correspond to a real human assisting with those activities, in synchrony with the output signals.
(43) The actions of the virtual human might be accompanied by temporary virtual objects apparently associated with actions, e.g., a virtual climate control panel could appear when the virtual human is going to change the temperature, and a virtual radio when changing the station. If the virtual human is seated in front of real physical controls, system outputs should be programmed to move the virtual human toward, on, and interacting with those controls. If there are no real physical controls in front of the virtual, e.g., the virtual human is in the passenger seat, or the car is very advance (with no apparent conventional controls), the system could optionally render conventional virtual controls, which the virtual human would then interact with in response to system output commands.
(44) In systems where there is no apparent direct physical control, e.g., an autonomous system to make and operationalize stock buying/selling decisions, or a system for autonomous medical diagnosis, in response to system output, the virtual human could appear to select options (outputs) corresponding to the actual outputs, e.g., push virtual buttons, type virtual commands, or even instruct another virtual human to carry out the stock order or to convey the medical diagnosis.
(45) Variations
(46) As indicated above, the position and apparent perspective of the virtual human could be varied to match the current circumstances associated with the automated control system inputs or outputs. For example, as indicated above, the virtual human could be positioned in the “driver's seat” in front of real controls, or in a seat where there are no controls, but virtual controls could (optionally) be rendered. Note that while the inventors do not envision a virtual human superimposed on top of a real human, there might be circumstances where that would be beneficial. As indicated above with the parallel parking example, there are circumstances where it would be natural and beneficial to have the virtual human appear outside the vehicle in the way a real human would, if they were helping to park.
(47) The demographic and other characteristics of the virtual human could be varied to better suit the real human users of the autonomous system (e.g., autonomous car, stock market system, medical system). For example, the size of the virtual human could be varied anywhere within a range of normal human stature, but also to allow more “superhuman” behavior such as a miniature virtual human appearing on the dashboard, the hood of the car, etc. The gender, race, apparent ethnicity, and culturally-specific behaviors of the virtual human could be varied. The number of virtual humans could be varied—two or more virtual humans appearing to collaborate, or consult can convey more certainty to a real human user.
(48) Turning to
(49) The contents of video monitor 30 are shown in
(50) As vehicle 10 navigates, control virtual human 70 not only conveys behaviors responsive to sensor data, it displays operational activity. For example, as vehicle 10 moves within a lane, control virtual human 70 turns the steering wheel in synchronization. For a lane change, control virtual human 70 operates a turn signal.
(51)
(52)
(53)
(54)
(55) In
(56)
(57) In an alternative embodiment of the invention, vehicle 210 wirelessly transmits to an augmented reality display worn by cyclist 245, rendered indicia acknowledging vehicle 210 has registered the presence of cyclist 245. For example, the augmented reality display (e.g., MICROSOFT HOLOLENS) displays a green “checkmark” over the real-time spatial location of vehicle 210 with respect to cyclist 245. Thus, cyclist 245 receives visual assurance that vehicle 210 is “aware” of the presence of cyclist 245. This reduces the cognitive load on cyclist 245, who would otherwise exercise greater monitoring that vehicle 210 “sees” cyclist 245. In this manner, whether vehicle 210 is autonomous, semi-autonomous, or manually operated, sensors in vehicle 210 do more than prevent accidental impact into cyclist 245. A trust loop is established that reduces the uncertainty and/or anxiety of cyclist 245 with respect to vehicle 210.
(58) A similar embodiment is shown in
(59) In
(60)
(61) The inventive concept of the present invention is transposable to various autonomous and semi-autonomous systems. One such application is in medical monitoring and automation. In
(62) In an alternative embodiment of
(63) An embodiment and application of the invention has utility for security applications. In
(64) In an alternative embodiment shown in
(65) Computer and Software Technology
(66) The present invention may be embodied on various platforms. The following provides an antecedent basis for the information technology that may be utilized to enable the invention.
(67) Embodiments of the present invention may be implemented in hardware, firmware, software, or any combination thereof. Embodiments of the present invention may also be implemented as instructions stored on a machine-readable medium, which may be read and executed by one or more processors. A machine-readable medium may include any mechanism for storing or transmitting information in a form readable by a machine (e.g., a computing device). For example, a machine-readable medium may include read-only memory (ROM); random access memory (RAM); magnetic disk storage media; optical storage media; flash memory devices; electrical, optical, acoustical or other forms of propagated signals (e.g., carrier waves, infrared signals, digital signals, etc.), and others.
(68) Further, firmware, software, routines, instructions may be described herein as performing certain actions. However, it should be appreciated that such descriptions are merely for convenience and that such actions, in fact, result from computing devices, processors, controllers, or other devices executing the firmware, software, routines, instructions, etc.
(69) The machine-readable medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer-readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer-readable storage medium may be any non-transitory, tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. Storage and services may be on-premise or remotely, such as in the “cloud” through vendors operating under the brands, MICROSOFT AZURE, AMAZON WEB SERVICES, RACKSPACE, and KAMATERA.
(70) A machine-readable signal medium may include a propagated data signal with machine-readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A machine-readable signal medium may be any machine-readable medium that is not a computer-readable storage medium, and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. However, as indicated above, due to circuit statutory subject matter restrictions, claims to this invention as a software product are those embodied in a non-transitory software medium such as a computer hard drive, flash-RAM, optical disk, or the like.
(71) Program code embodied on a machine-readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, radio frequency, etc., or any suitable combination of the foregoing. Machine-readable program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object-oriented programming language such as Java, C#, C++, Visual Basic or the like and conventional procedural programming languages, such as the “C” programming language or similar programming languages. Additional languages may include scripting languages such as PYTHON, LUA, and PERL.
(72) For communications, particularly to non-occupants, a number of wireless communication protocols may be used. For example, vehicle-to-vehicle (V2V) communication enables vehicles to wirelessly exchange information about their speed, location, and heading. V2V communication allows vehicles to broadcast and receive omni-directional messages, creating a 360-degree “awareness” of other vehicles in proximity. Vehicles, pedestrians, traffic control systems, cyclists and others equipped with appropriate hardware and software use the messages from surrounding vehicles to determine navigational paths and hazards as they develop. The technology employs visual, tactile, and audible alerts—or, a combination of these alerts—to warn occupants and non-occupants outside the vehicle. While initially planned as an ad hoc network, connectivity infrastructure, redundancy and quality of service features permit centralized storage and processing. Alternatively, a hybrid environment of on-premise (vehicular ad hoc networks, VANETS) networks may be enhanced or supplemented by centralized system through protocols under the brands or designations BLUETOOTH, 3G, 4G, 5G, WIFI, satellite or other wireless modalities.
(73) In yet another embodiment of the invention, the virtual human is a “companion” to a specific individual. An individual grants to companion access to the individual's calendar, electronic communications and the like. Accordingly, the companion's knowledge, appearance and settings roam with the individual. The companion is thus personalized to the individual. For example, where four individuals are all simultaneous occupants of a vehicle and wear personal display devices (e.g., MICROSOFT HOLOLENS equipment), unique companions appears to each individual but the companions all receive and interact with vehicular and external data to convey attentive observation and action in automated systems. In an embodiment, each additional individual with a personal display device sees the virtual human companions (whether common/shared or unique) from their individual personal perspectives. In other words, the view of the companion is configured for where they are sitting in the vehicle.
(74) In yet another embodiment of the invention, multiple virtual humans coexisting in the rendered simulated environment. For example, one virtual human is rendered as a “driver” of a vehicle and a second virtual human is rendered as the vehicle “navigator” giving attention towards navigation or roadside hazards. The plurality of virtual humans may be made to interact with each other further enhancing the reality of the simulated rendering.
(75) In yet another embodiment of the invention, a virtual human may be rendered in a first vehicle making motions (e.g., waving, nodding, gesturing, etc. . . . ) toward a second vehicle. These gestures may be communicatively isolated as each vehicle is operating automated systems independently. Alternatively, with V2V communications (or through a centralized, remote network point), the virtual human in the first vehicle is representing an automation or V2V decision to let the second vehicle merge ahead of it. Accordingly, virtual human rendering may be shared with individuals outside the immediate network or domain of the computer processor operating the virtual human.
(76) Aspects of the present invention are described below with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by machine-readable program instructions.
(77) Glossary of Claim Terms
(78) Augmented Reality means technologies that superimposes a computer-generated image on a user's view of the real world, thus providing a composite view.
(79) Avatar means an icon or figure representing a particular person in a computer simulation.
(80) Black Box means any complex array of equipment, typically electronic, wherein the contents and controls are considered enigmatic to the user.
(81) Engagement Set means parameter values in the situational data causing the visual and/or audible graphic rendering of a virtual human in gestures and movements that convey to an observing existent human that the autonomous control system is engaged in action that changes the previous autonomous control system conditions.
(82) Observant Set means parameter values in the situational data causing visual and/or audible graphic rendering of a virtual human in gestures and movements that convey to an observing existent human that the autonomous control system is aware and cognitive of situational data likely to cause the autonomous control system to take imminent action or change the action it is currently taking.
(83) Existent Human means a non-virtual, living human directly or indirectly subject to the trust-sensitive autonomous control system's performance.
(84) Existent Vehicle means a non-virtual, real-world vehicle subject to an autonomous control system's operation.
(85) Haptic means perception and manipulation of objects using the senses of touch and proprioception.
(86) Head Mounted Display (HMD) is a digital display device worn on the head or integrated into a helmet. An HMD may present a completely virtual reality environment or may also reflect projected images wherein a user may see through it in augmented reality environments. Some commercially available HMDs include those sold under the brands OCULUS RIFT and MICROSOFT HOLOLENS.
(87) Mixed Reality means the combination of virtual and real worlds to generate new environments and visualizations wherein physical and digital objects co-exist and interact in real-time.
(88) Situational Data means sensor and control data the trust-sensitive autonomous control systems uses and/or generates in its control capacity.
(89) Tactile means of or connected with the sense of touch.
(90) UV mapping means the 3D modeling process of projecting a 2D image to a 3D model's surface for texture mapping.
(91) Virtual Environment means the audio, visual, tactile, and other sensory features of a computer-generated simulation.
(92) Virtual Human means a computer-generated, 2-dimensional, or 3-dimensional anthropomorphic character that conveys gestures, poses, interactions, and even audio consistent with a fully attentive, competent manual operator of a system that is otherwise autonomous or semi-autonomous with respect to an existent human observing the virtual human.
(93) Virtual Reality means a computer-generated simulation of a three-dimensional image or environment that can be interacted with in a seemingly real or physical way by a person using special electronic equipment, such as a helmet with a screen inside or gloves fitted with sensors
(94) Virtual Retinal Display (VRD) means a display system that draws a raster display directly onto the retina of an eye. The recipient sees what appears to be a conventional display floating in space in front of them. This is also known as a retinal scan display (RSD) or retinal projector (RP).
(95) The advantages set forth above, and those made apparent from the foregoing description, are efficiently attained. Since certain changes may be made in the above construction without departing from the scope of the invention, it is intended that all matters contained in the foregoing description or shown in the accompanying drawings shall be interpreted as illustrative and not in a limiting sense.