ViBot | Erasmus Mundus Master | Computer Vision and Robotics
 
 
Erasmus Mundus Master
Master in Computer Vision & Robotics                
 
   Home    Students    Universities    About ViBot    ViBot Day    Contact    Pictures    Acknowledgement    ROBOTS Robot Network - Open in new window    Game
 
 
Rojas Mario Alejandro Rojas Mario Alejandro
marioarojasq@gmail.com   Rojas Mario Alejandro - CV   Rojas Mario Alejandro - Webpage
Project:
Acoustic representation of images for the visually impaired: the human bat Thesis Rojas Mario Alejandro - Thesis
 
Supervisor:
Dr Andrew Harvey
 
Host University:
Heriot-Watt University
 
Personal introduction top

Hola Todos!!!!

I am from Colombia’s capital, Bogotá and despise all the unfavorable press we get the country has a lot of really nice things to offer: all the weathers, good food, lots of fruits, the best coffee in the world, beautiful sightseeing and wonderful people.

I graduated as an Electronic Engineer and a have worked in process control and automation for some time, but as I did my undergraduate final project in quality control via machine vision, I became interested in this field and found the VIBOT master, which has been one great life and academic experience.

I am interested in acquiring job experience in this field but also in pursuing further academic formation, specially in the field of medical imaging and/or human perception but also in the robot localization and navigation, so if you are looking for good “fella” I’ll be happy to join you and put some tropical warm in. Check my CV to know with more detail about me.

 
Project Description and Objectives top

Current approaches for exploiting the ability of the auditory system as a mean to transmit information address the issue of audio as a source of information only to the point of using it as an auxiliary channel or restrict it to environments based on computer graphical interfaces, which limit the complete capability for conveying information about the surroundings. The current project is aimed to tackle this from a different perspective.

The overall objective of this project is to propose, implement and evaluate a system that explores the possibility of supplying auditory information that resembles the one provided by the visual system (in case of its absence), making it feasible for a visually impaired person to navigate in a real environment such as a room.

 
Scientific Approach top

In order to do so, the relevant aspects of the auditory system with respect to perception are reviewed, the binaural interaural sound perception and the spatialized sound perception models are described and implemented in order to determine the validity and the pertinence of the models within the context of the project. To take advantage of the auditory system, the nature of sounds and the models for information mapping namely earcons and auditory icons, were also reviewed, guidelines were drawn from the studies that have been done on the subjects and sets of sounds were used.

With a certain understanding of the auditory models and the sounds which would transmit the information, a system that could take advantage of the auditory system is proposed where by combining a basic video processing block that extracts lateral coordinates from visual markers in a scene, transforming them into azimuth angles and using the model of the Head Related Transfer Function to convolve the modeled monophonic sound source signal, a spatialized version of the created earcons is presented to the user.

 
Results top

The system and its different parts are tested in two separate stages with the purpose of assessing the proposed blocks, sounds, schemes and delays, and findings confirming the limitations of the HRTF and the challenges of the sound sources design as well as the formulation of subject based tests were made.

The HRTF model and its main feature the ``out of the head perception of localization'' was confirmed as well as the front/back confusion consequence of the non-individualized function, was observed.
Earcons were found to be better as a coding scheme for spatialized sounds.
Small delays between sounds when several sources are presented improve significantly the ability to localize individual sources.
Overall the problem tackled seems to have a good or at least, interesting possibilities of developing new concepts for aiding tools for visually impaired people.

 
Acknowledgements top
To my classmates who helped me with the tests; to Gonzalo Muyo for his help in solving technical issues and Prof Harvey for his questions towards generating new approaches.
 
References top

[1] Walker A. and Brewster S.A. Spatial audio in small display. In Personal Technologies, volume 4, pages 144 - 154. 2000.
[2] Durand R. Begault. Head-up auditory display research at nasa ames research center. In Human Factors and Ergonomics Society 39th annual meeting, 1995.
[3] Durand R. Begault. 3-D Sound for Virtual Reality and Multimedia. NASA, April 2000.
[4] Gaver & Bly Buxton. Auditory Icons, chapter 6 Auditory Icons, page 17. Buxton, Gaver & Bly, 2002.
[5] Alan Kan Dennis Lin Andr van Schaik Keir Smith Craig Jin, Teewon Tan and Mathew McGinity. Real-time head-tracked 3d audio with unlimited simultaneous sounds. In ICAD 05 Eleventh Meeting of the International Conference on Auditory Display, July 2005.
[6] Hesham Fouad. Scheduling Algorithms for Real-Time Sound Generation in Virtual Environments. PhD thesis, School of Engineering and Applied Sciences, september 1997.
[7] Juhu Marila Gaetan Lorho and Jarno Hiipakka. Feasibility of multiple non-speech sound presentation using headphones. International Conference on Auditory Display, page 6, 2001.
[8] Bill Gardner and Keith Martin. Hrtf measurements of a kemar dummy-head microphone. Technical report, MIT, 1994.
[9] William W. Gaver. Sound support for collabration. In Proceedings of the Second European Conference on Computer-Supported Collaborative Work, 1991.
[10] Denise A. Sumikawa Meera M. Blattner and Robert M. Greenberg. Earcons and icons:Their structure and design principles. In Lawrence Erlbaum Associates Inc, editor, Human-Computer Interaction, volume 4, page 34. 1989.
[11] Elizabeth D. Mynatt. Auditory presentation of graphical user interfaces.
[12] Elizabeth D. Mynatt. Designing with auditory icons: How well do we identify auditory
cues? In Conference on Human Factors in Computer Systems, 1994.
[13] Rod Jard Paholio. 3d sound synthesis for headphones and spatial audio. DSP Final
Research Paper, 2005.
[14] Myra P. Bussemakers Paul M. Lemmens and Abraham de Haan. E_ects of auditory icons
and earcons on visual categorization: The bigger picture. International Conference on
Auditory Display, page 9, 2001.
[15] Chirs Schmandt and Atty Mullins. Audiostreamer: Exploiting simultaneity for listening.
In CHI 95. Addison Wesley, 1995.
[16] Peter C. Wrigth Stephen A. Brewster and Alistair D. Edwards. An evaluation of earcons
for use in auditory human-computer interfaces. In Conference on Human Factors in Com-
puting Systems, 1993.
[17] Veli-Pekka Raty Stephen Brewster and Atte Kortekangs. Earcons as a method of providing
navigational cues in a menu hierarchy. In Cunningham R.J. Sasse M. A. and Winder R.,
editors, 11th British Computer Society HCI Conference, pages 167 { 183, August 1996.
[18] Inc. The MathWorks. Image acquisition toolbox. Retrieved May 5, 2008, from
http://www.mathworks.com/products/imaq/, 2008.
[19] DeLiang Wang. Computational Auditory Scene Analysis, chapter 5. Binaural Sound Lo-
calization, page 34. John Wiley & Sons, Inc., 2005.

 
Links top

MSc Thesis Project documents:
 
Contact top

Mario A. Rojas Q.         marioarojasq@gmail.com

Prof Andrew Harvey     A.R.harvey@hw.ac.uk

 
 
 
Home | Students | Projects | Universities | About ViBot | ViBot Day | Contact | Pictures | Acknowledgement | ROBOTS Robot Network - Open in new window | Game

Official webpage: http://www.vibot.org