7.- Discussion
We have presented a real-time system that combines a vision module that estimates the user‘s head pose with a PLOD module that optimizes image rendering based on perceptual parameters. The system was implemented on a fairly modest PC using off the shelf components and it was able to improve the frame rate significantly compared to rendering the same terrain at full resolution. Subject tests were performed to assess the benefits of using uncertainty estimates in conjunction with other parameters. Our results indicated that uncertainty estimates help in making optimizations more seamless to the user. An approach for calculating orientation uncertainty was presented and employed as part of the vision module. However, the jitter in the uncertainty calculations prevented us from achieving the same level of performance compared to using fixed parameters. More details about this work can be found in [23]. Future work includes further investigation of these issues as well as estimating eye-gaze more accurately.
Some recommendations to improve the results of the tracker are using a Gaussian filter on the image before calculating the gradients. A dynamic window size can be implemented to help deal with fast head movement. The optimal value can be computed from the head‘s velocity using previous frames.
The goals were attained but not with 100 percent success. Implementing the recommendations will make the tracker more robust and suitable for its use in the Human Activities Recognition project.
Video
Click here to download a demo video of the system. The green crosshair in the video is the intersection of the head orientation with the screen. The mesh is initially rendered with textures and then as a wireframe to show how the triangles are modified in real time. At the begining the renderring module is not using any optimizations. At 0’44” a static uncertainty measurement is used and one can see that the high resolution region expands. At 0’58” the system is using variable uncertainty and the oscillations become clearly visible.
Acknowledgment
This work was supported by NASA under grant #NCC5-583.
References
- J. H. Clark. “Hierarchical Geometric Models for Visible Surface Algorithms”. Communications of the ACM, vol. 17(2), pages 547-554, 1976.
- T. Oshima, H. Yamamoto, and H. Tamura. “Gaze-Directed Adaptive Rendering for Interacting with Virtual Space”. Proceedings of 1996 IEEE Virtual Reality Annual International Symposium, pages 103-110, 1996.
- David Luebke et al. “Perceptually Driven Simplification Using Gaze-Directed Rendering”. University of Virginia Technical Report CS-2000-04.
- David Luebke et al. Level of Detail for 3D Graphics. Morgan Kaufmann Publishers, 1st ed., 2003.
- David Luebke. “A Developer‘s Survey of Polygonal Simplification Algorithms”. IEEE Computer Graphics &Applications, May 2001.
- Stephen Junkins and Allen Hux.“ Subdividing Reality”. Intel Architecture Labs White Paper, 2000.
- J. Warren and S. Schaefer. “A factored approach to subdivision surfaces”. Computer Graphics and Applications, IEEE, pages 74-81, 2004.
- David H. Eberly. 3D Game Engine Design. Morgan Kaufmann Publishers, 2001.
- J. L. Mannos and D. J. Sakrison. “The Effects of a Visual Fidelity Criterion on the Encoding of Images”. IEEE Transactions on Information Theory, vol. 20(4), pages 525-535, 1974.
- M. Reddy. “Perceptually Modulated Level of Detail for Virtual Environments”. PhD. Thesis. CST-134-97. University of Edinburgh, Edinburgh, Scotland, 1997.
- Andrew T. Duchowski. “Acuity-Matching Resolution Degradation Through Wavelet Coefficient Scaling”. IEEE Transactions in Image Processing, vol. 9(8), pages 1437-1440, August 2000.
- Wilson S. Geisler and Jeffrey S. Perry, “Real-time Simulation of Arbitrary Visual Fields”. ACM Symposium on Eye Tracking Research & Applications, 2002.
- Jeffrey S. Perry and Wilson S. Geisler. “Gaze-contingent real-time simulation of arbitrary visual fields”. Proceedings of SPIE: Human Vision and Electronic Imaging, San Jose, CA, 2002.
- Qiang Ji and Robert M. Haralick. “Error Propagation for Computer Vision Performance Characterization”. International Conference on Imaging Science, Systems, and Technology, Las Vegas, June, 1999.
- Don Murray and James J. Little. “Patchlets: Representing Stereo Vision Data with Surface Elements”. Workshop on the Applications of Computer Vision (WACV), 2005.
- David A. Forsyth and Jean Ponce. Computer Vision A Modern Approach. Prentice Hall, 1st ed., 2003.
- Hunter Murphy and Andrew T. Duchowski. “Gaze-Contingent Level Of Detail Rendering”. EuroGraphics Conference, September 2001.
- Nathaniel Williams et al. “Perceptually guided simplification of lit, textured meshes”. Proceedings of the 2003 Symposium on Interactive 3D graphics, Monterrey, CA, 2003.
- K. Arun et al. “Least-squares fitting of two 3-D point sets”, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 9(5), pages 698-700, 1987.
- Robyn Owens. Lecture notes, http://homepages.inf.ed.ac.uk/rbf/CVonline/LOCAL_COPIES/OWENS/LECT11/node5.html
- “VirtualgloveBox”, http://biovis.arc.nasa.gov/vislab/vgx.htm
- “Effective Human-Computer Interaction in Virtual Environments”, http://www.cse.unr.edu/CVL/current_proj.php
- Javier Martinez, “Rendering Optimizations Guided by Head-Pose Estimates and Their Uncertainty”, M.S. Thesis, Dept of Computer Science and Engineering, University of Nevada, Reno, August 2005.