navigation-assistant

About

My goal with this project was to create a novel navigational aid that offers a large amount of information about the user’s surroundings, while also being simple to learn to use and understand. This innovative device offers both reliable object detection and an intuitive feedback mechanism to communicate the information to the user. The device is able to process images from two, front-facing cameras in order to both detect objects and their distances from the user, and output this information to the user through a series of vibration motors in a glove. Ultimately, the device will give blind or visually impared people (BIV) a more complete picture of their environment to help them travel more safely and efficiently.

Materials

2 720p/1080p cameras
Raspberry Pi
Arduino Uno
12 Vibration Motors

Software Packages

OpenCV
TensorFlow
NumPy

Image Processing

Calibrate cameras for distortion by finding some specific points of which we already know the relative positions (e.g. square corners in the chess board). We know the coordinates of these points in real world space and we know the coordinates in the image, so we can solve for the distortion coefficients.

If we know the distance between two cameras (B) and the focal length of camera (f), then the depth of a point in a scene is inversely proportional to the difference in distance of corresponding image points (x and x') and their camera centers. Ultimately, with this information, we can derive the depth of all pixels in an image.

Setup object detection and train on indoor objects that BIV's use as landmarks or have frequent accidents (i.e. lamps, chairs, hand-rails, etc.) by following Google's tutorial on using MobileNet SSdv3 (https://github.com/tensorflow/models)
Combine object detection and depth map in order to relay vibrational information to glove

Left image indicates real world picture with highlighted areas indicating portions of the image that are physically close to the user and bounding boxes of common objects recognized by the object detection model. Center image represents the depth map of the left image where the brighter the pixel, the closer it is to the user. Right image is the vibrational feedback sent to vibrational motors where the colors represent intensity of vibration: red is high, yellow is medium, and green is none.

Conclusion

Currently no other device is able to capture a combination of direction, depth, and object recognition. By running the input from the 2 stereo cameras through visual processing software, we are able to identify common indoor objects as well as relative direction and distance away from a user. Practically, this means if a user was to walk into a room, they would be able to not only identify a hanging lamp 6 feet to their left for example, but also receive feedback that classifies an object in front of them like a door.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
example_calibration_images		example_calibration_images
README.md		README.md
depth_map.ipynb		depth_map.ipynb
indoor_object_detection.ipynb		indoor_object_detection.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

example_calibration_images

example_calibration_images

README.md

README.md

depth_map.ipynb

depth_map.ipynb

indoor_object_detection.ipynb

indoor_object_detection.ipynb

Repository files navigation

navigation-assistant

About

Materials

Software Packages

Image Processing

Conclusion

About

Releases

Packages

Languages

reza-mohideen/navigation-assistant

Folders and files

Latest commit

History

Repository files navigation

navigation-assistant

About

Materials

Software Packages

Image Processing

Conclusion

About

Resources

Stars

Watchers

Forks

Languages