Published on 29.01.24 in Vol 1, No 1 (2024): Jan-Dec
Original Paper
What is Diminished Virtuality? A Directional and Layer-Based Taxonomy for the Reality-Virtuality Continuum
ABSTRACT
The concept of the reality-virtuality (RV) continuum was introduced by Paul Milgram and Fumio Kishino in 1994. It describes a spectrum that ranges from a purely physical reality (the real world) to a purely virtual reality (a completely computer-generated environment), with various degrees of mixed reality in between. This continuum is “realized” by different types of displays to encompass different levels of immersion and interaction, allowing for the classification of different types of environments and experiences. What is often overlooked in this concept is the act of diminishing real objects (or persons, animals, etc) from reality (a diminution), rather than augmenting it (an augmentation). Hence, in this contribution, we propose an update or modification of the RV continuum in which the diminished reality aspect is more prominent. We hope this will help users, especially those who are new to the field, gain a better understanding of the entire extended reality (XR) topic, as well as assist in the decision-making for the hardware (devices) and software or algorithms needed for new diminished reality applications. In addition, we propose a more sophisticated, directional and layer-based taxonomy for the RV continuum that we believe goes beyond mediated and multimediated realities. Finally, we raise the question of whether the RV continuum truly ends on one side with physical reality.
JMIR XR Spatial Comput 2024;1:e52904
doi:10.2196/52904
Introduction
The reality-virtuality (RV) continuum is a concept introduced by Paul Milgram and Fumio Kishino [
] in 1994. It describes a spectrum that ranges from a purely physical reality (the real world) to a purely virtual reality (VR; a completely computer-generated environment), with various degrees of mixed reality (MR) in between. This continuum is “realized” by different types of displays [ ] to encompass different levels of immersion and interaction, allowing for the classification of different types of environments and experiences. The RV continuum helps us understand the varying levels of immersion and interactivity that technology can provide. As technology advances, the boundaries between these immersion levels can become more fluid, and new hybrid experiences can emerge. The continuum is particularly relevant in fields such as VR, augmented reality (AR), and MR, where researchers and developers aim to create more compelling and natural experiences that bridge the gap between the physical and virtual worlds. We used ChatGPT (OpenAI) [ ] to gauge the current state of the RV continuum. According to ChatGPT, the continuum is often divided into several main categories (note that we adapted the ChatGPT results and enhanced them with concrete examples, where necessary; [ ]). The original ChatGPT transcript is shown in [ ].
Diminished Reality
What is often overlooked in this concept is the act of diminishing real objects (or persons, animals, etc) from reality, rather than augmenting the reality with virtual things [
, ]. An introduction to the topic can be found in Cheng et al [ ]. One reason for this is that diminishing something from reality generally requires a sophisticated understanding of the real scene or environment to make the diminution convincing. In AR, the real world is simply overwritten with a virtual object. In diminished reality (DR), however, the real-world part that is augmented or diminished needs to fit seamlessly into the reality around it. In addition, this should all be performed in real time while a user is walking around the real world, and an algorithm has to do the following (note that the first 3 items are part of the Extent of World Knowledge axis of the taxonomy by Milgram and Kishino [ ]; a structural sketch follows the list):

- Detect and track the real object that has to be removed or diminished;
- Perform geometric modeling of the scene and objects to be added or subtracted (preexisting or captured once or in real time);
- Apply the lighting model of the scene to objects added or to part of the revealed scene when something is removed (preexisting or captured once or in real time); and then
- Combine all the previous points together as the scene description for the rendering algorithm.
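As a rough illustration of how these 4 steps interlock, the following sketch wires them into a single per-frame pipeline. All class and method names, as well as the placeholder implementations (a fixed bounding box, a mean-color background fill, a global brightness gain), are hypothetical stand-ins for real tracking, reconstruction, and relighting components, not an existing library API:

```python
import numpy as np

class DiminishedRealityPipeline:
    """Illustrative per-frame DR pipeline; each stage is a crude stand-in
    for a real detection/tracking, reconstruction, and relighting module."""

    def diminish(self, frame: np.ndarray) -> np.ndarray:
        # 1. Detect and track the real object that has to be removed.
        mask = self.detect_and_track(frame)
        # 2. Geometric model of the scene behind the object (preexisting or captured).
        background = self.model_background(frame, mask)
        # 3. Apply the scene's lighting model to the revealed background.
        relit = self.apply_lighting(background, frame)
        # 4. Combine everything into the final rendered frame.
        out = frame.copy()
        out[mask] = relit[mask]
        return out

    def detect_and_track(self, frame: np.ndarray) -> np.ndarray:
        mask = np.zeros(frame.shape[:2], dtype=bool)
        mask[100:200, 150:300] = True  # placeholder detection (fixed box)
        return mask

    def model_background(self, frame: np.ndarray, mask: np.ndarray) -> np.ndarray:
        # Crude stand-in: fill with the mean scene color; a real system would
        # use a reconstructed geometric/texture model of the occluded area.
        background = np.empty_like(frame)
        background[:] = frame.mean(axis=(0, 1)).astype(frame.dtype)
        return background

    def apply_lighting(self, background: np.ndarray, frame: np.ndarray) -> np.ndarray:
        # Global brightness match as a stand-in for a full lighting model.
        gain = frame.mean() / max(background.mean(), 1e-6)
        return np.clip(background * gain, 0, 255).astype(frame.dtype)

# Example call on a dummy frame; a real system would run this on every camera frame.
frame = np.random.randint(0, 255, (480, 640, 3), dtype=np.uint8)
result = DiminishedRealityPipeline().diminish(frame)
```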
All of this has to be done not only in real time but also with very high precision. The inserted virtual object has to fit seamlessly into the real scene and make sense within it; even minor discrepancies appear as glitches and are noticed immediately by the user, as we recently observed in a DR user study [
]. In fact, we think that diminution and augmentation require fundamentally different technologies. In our opinion, an augmentation may be needed to alter reality at a certain position relative to other (real) objects (eg, displaying a patient’s tumor as an AR hologram on the patient in front of you, at its real position, such as for needle guidance [ ]), but no seamless, semantic fitting is necessary. As soon as a virtual object needs to fit into the scene semantically, we consider this to require diminution. Hence, for augmentation, you only need a volume rendering process with some basic options, such as position, size, and transparency. For diminution, however, fundamentally different additional technologies are needed: the scene has to be analyzed and understood, and a meaningful replacement has to be generated and inserted as an AR hologram. An example would be glasses that are removed from a person standing in front of you.
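To make this distinction concrete: in the simplest case, an augmentation of this kind reduces to alpha-blending a pre-rendered overlay into the camera frame at a given position, size, and transparency, without any semantic understanding of the scene. The following is a minimal sketch of that compositing step (the frame, hologram, position, and alpha values are arbitrary illustration values, not taken from an actual system):

```python
import numpy as np

def augment(frame: np.ndarray, overlay: np.ndarray, x: int, y: int,
            alpha: float = 0.6) -> np.ndarray:
    """Alpha-blend a pre-rendered overlay (eg, a tumor hologram slice)
    onto the camera frame at pixel position (x, y)."""
    out = frame.astype(np.float32)
    h, w = overlay.shape[:2]
    roi = out[y:y + h, x:x + w]
    out[y:y + h, x:x + w] = (1 - alpha) * roi + alpha * overlay.astype(np.float32)
    return np.clip(out, 0, 255).astype(frame.dtype)

# Stand-in inputs: a black camera frame and a plain white "hologram" patch.
frame = np.zeros((480, 640, 3), dtype=np.uint8)
hologram = np.full((64, 64, 3), 255, dtype=np.uint8)
augmented = augment(frame, hologram, x=300, y=200)
```

A diminution cannot stop at this point: the removed region would additionally have to be filled with content inferred from the surrounding scene, as sketched later for the face anonymization example.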
In summary, the user has to get the impression that the real, diminished object does not exist in reality at all [ ]. Besides sophisticated algorithms, this requires a considerable amount of computing power. Fortunately, there has been tremendous progress in both areas in recent years, with deep learning–based approaches and GPUs that can run these kinds of algorithms, even in real time. As a result, DR has already found its way into some applications [ ], such as virtual furniture removal for redecorating purposes (eg, IKEA Kreativ [ ]). Other possible applications for DR include the following:

- Privacy enhancing: In a live video feed, certain objects or information can be blurred or removed in real time to protect sensitive or private data (illustrated by the code sketch after this list).
- Training and education: DR can be used to remove distractions in a learning environment or highlight specific items to focus on.
- Therapeutic applications: For someone with a phobia of spiders, a DR system could recognize spiders in the person’s field of view and diminish or replace them with less threatening images to reduce anxiety. Additionally, sensory overload, which is common in autism, could be mitigated with a DR system that diminishes overstimulating elements of the environment.
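As a concrete illustration of the privacy-enhancing use case, the following sketch blurs detected faces in a live webcam feed using OpenCV’s stock Haar cascade face detector. Blurring a bounding box is, of course, only the crude end of DR (closer to the “black bar” idea discussed later than to a semantic removal), and the camera index and detector parameters are assumptions:

```python
import cv2

# Stock OpenCV face detector; a production DR system would use a far more
# robust detector/tracker and a semantic fill instead of a simple blur.
cascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_frontalface_default.xml")

cap = cv2.VideoCapture(0)  # assumed default webcam
while True:
    ok, frame = cap.read()
    if not ok:
        break
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    for (x, y, w, h) in cascade.detectMultiScale(gray, 1.3, 5):
        # Diminish the sensitive region by replacing it with a heavy blur.
        frame[y:y + h, x:x + w] = cv2.GaussianBlur(
            frame[y:y + h, x:x + w], (51, 51), 0)
    cv2.imshow("privacy-enhanced feed", frame)
    if cv2.waitKey(1) & 0xFF == ord("q"):
        break
cap.release()
cv2.destroyAllWindows()
```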
Directional and Layer-Based Taxonomy
For all of these aforementioned reasons, we think that DR needs to be more prominent on the RV continuum, as shown in
[ ], without delving deeper into the broad topics of mediated reality [ ] or even multimediated reality [ ]. This will not only assist in the decision-making for the hardware (devices) and software needed for new DR applications but also help unfamiliar users gain a better understanding of the entire extended reality (XR) topic (note that we are addressing this revision of the continuum purely from an application or user point of view [POV], not from the POV of an MR researcher or engineer).

An example application for DR could be the real-time anonymization of a face via XR. There is a huge difference between a device that detects the eye region and simply inpaints a black bar over the eyes (without considering the surrounding facial area) and one that inpaints the eyes with different but meaningful ones that fit perfectly to the surrounding facial area. The black bar approach can probably be performed on a current smartphone, whereas the second approach needs much more sophisticated hardware and computing power, with an integrated GPU that can run a trained deep inpainting neural network in real time (note that a user with an XR headset would generally move around, which also changes the POV on the face to be anonymized, so the inpainting algorithm also has to be executed continuously in real time).

In this context, we also think that the upcoming Apple Vision Pro will push the limits in DR, because it is a video-see-through device that can enable DR to reach its full potential [ ]. In fact, the Digital Crown hardware of the Apple Vision Pro, which also exists for the Apple Watch, should enable us to walk seamlessly along the whole RV continuum (back and forth) and bring medical DR applications to reality, which are currently almost nonexistent [ ]. A potential example of the photo-editing capabilities of newer cell phones as a diminution operation is shown in [ ]. In this medical example, DR enables a skin tumor to be virtually removed from a patient’s face before surgery.
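To make the black bar versus inpainting distinction above concrete, the sketch below contrasts the two anonymization strategies with classical OpenCV tools: the black bar variant merely overwrites pixels, whereas even a simple, non-learned inpainting call already has to propagate the surrounding facial texture into the removed region. A real DR system would replace cv2.inpaint with a trained deep inpainting network running continuously per frame; the input file name and eye-region box are placeholder values, not detector output:

```python
import cv2
import numpy as np

def black_bar(face, eye_box):
    """Naive anonymization: overwrite the eye region with a black bar."""
    x, y, w, h = eye_box
    out = face.copy()
    cv2.rectangle(out, (x, y), (x + w, y + h), (0, 0, 0), -1)  # filled rectangle
    return out

def diminish_eyes(face, eye_box):
    """DR-style anonymization: remove the eye region and fill it from the
    surrounding skin texture (classical Telea inpainting as a stand-in for
    a deep inpainting network)."""
    x, y, w, h = eye_box
    mask = np.zeros(face.shape[:2], dtype=np.uint8)
    mask[y:y + h, x:x + w] = 255
    return cv2.inpaint(face, mask, 5, cv2.INPAINT_TELEA)

face = cv2.imread("face.jpg")    # hypothetical input image
eye_box = (80, 60, 160, 40)      # placeholder eye-region box (x, y, w, h)
if face is not None:
    barred = black_bar(face, eye_box)
    diminished = diminish_eyes(face, eye_box)
```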