Eliminating Global Positions within Convolutional Neural Networks
- Presenter
- Jacob Rottenberg
- Campus
- UMass Amherst
- Sponsor
- Weibo Gong, Department of Electrical and Computer Engineering, UMass Amherst
- Schedule
- Session 2, 11:30 AM - 12:15 PM [Schedule by Time][Poster Grid for Time/Location]
- Location
- Poster Board A9, Campus Center Auditorium, Row 1 (A1-A20) [Poster Location Map]
- Abstract
- Computer Vision has not seen the major success that other forms of AI and deep learning models have. This is in part due to the approach that such models employ where they utilize the entire image and global position information within the image. This deviates from nature and the founding ideology of the neural network. The purpose of this research project is to propose a new type of deep learning layer that is able to extract essential information about an image. This ability should allow the recognition and transformation of objects to be much easier. To achieve this, the image is broken down into patches so that global coordinate information is not used. Then using these small patches, their invariant information is processed such that when reconstructing the image and mapping it to the new image, the patches’ location can be tracked. I was able to make this framework in Python using different mainstream libraries like Tensorflow, Keras, Numpy and Scipy. The overall conclusion of this project is that processing images using relative coordinates rather than global ones is viable given enough computing resources and training data. The preliminary results show that invariant information is learned and is transferred via the constructed mapping matrix. These results represent a significant process change in the world of image and video processing. Utilizing the relative coordinate technique may allow for quick processing of image and future prediction within a live-feed video. This has implications for self-driving cars and active predictions in other industries.
- Keywords
- Artificial Intelligence, Convolutional Neural Network, Computer Vision, Deep Learning, Relative Image Coordinates
- Research Area
- Artificial Intelligence
SIMILAR ABSTRACTS (BY KEYWORD)
Research Area |
Presenter |
Title |
Keywords |
Algorithms, Combinatorics, Graph Theory, and Game Theory |
Sathiya Narayanan, Anirudh |
|
Artificial Intelligence
|
Artificial Intelligence |
Russell, Adam |
|
Artificial Intelligence
|
Computer Science |
Thornton, Peter David |
|
Artificial Intelligence
|
Artificial Intelligence |
CHIN, JONATHAN A. |
|
Artificial Intelligence
|
Communication and Media Studies |
Turner, Erica Kay |
|
Artificial Intelligence
|