TY - GEN
T1 - TVAnet
T2 - 34th SIBGRAPI Conference on Graphics, Patterns and Images, SIBGRAPI 2021
AU - Flores-Benites, Victor
AU - Mugruza-Vassallo, Carlos A.
AU - Mora-Colque, Rensso
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - End-to-end methods facilitate the development of self-driving models by employing a single network that learns the human driving style from examples. However, these models face problems of distributional shift problem, causal confusion, and high variance. To address these problems we propose two techniques. First, we propose the priority sampling algorithm, which biases the training sampling towards unknown observations for the model. Priority sampling employs a trade-off strategy that incentivizes the training algorithm to explore the whole dataset. Our results show uniform training on the dataset, as well as improved performance. As a second approach, we propose a model based on the theory of visual attention, called TVAnet, by which selecting relevant visual information to build an optimal environment representation. TVAnet employs two visual information selection mechanisms: spatial and feature-based attention. Spatial attention selects regions with visual encoding similar to contextual encoding, while feature-based attention selects features disentangled with useful information for routine driving. Furthermore, we encourage the model to recognize new sources of visual information by adding a bottom-up input. Results in the CoRL-2017 dataset show that our spatial attention mechanism recognizes regions relevant to the driving task. TVAnet builds disentangled features with low mutual dependence. Furthermore, our model is interpretable, facilitating the intelligent vehicle behavior. Finally, we report performance improvements over traditional end-to-end models.
AB - End-to-end methods facilitate the development of self-driving models by employing a single network that learns the human driving style from examples. However, these models face problems of distributional shift problem, causal confusion, and high variance. To address these problems we propose two techniques. First, we propose the priority sampling algorithm, which biases the training sampling towards unknown observations for the model. Priority sampling employs a trade-off strategy that incentivizes the training algorithm to explore the whole dataset. Our results show uniform training on the dataset, as well as improved performance. As a second approach, we propose a model based on the theory of visual attention, called TVAnet, by which selecting relevant visual information to build an optimal environment representation. TVAnet employs two visual information selection mechanisms: spatial and feature-based attention. Spatial attention selects regions with visual encoding similar to contextual encoding, while feature-based attention selects features disentangled with useful information for routine driving. Furthermore, we encourage the model to recognize new sources of visual information by adding a bottom-up input. Results in the CoRL-2017 dataset show that our spatial attention mechanism recognizes regions relevant to the driving task. TVAnet builds disentangled features with low mutual dependence. Furthermore, our model is interpretable, facilitating the intelligent vehicle behavior. Finally, we report performance improvements over traditional end-to-end models.
KW - feature based attention
KW - self driving
KW - spatial attention
KW - visual attention
UR - http://www.scopus.com/inward/record.url?scp=85124197349&partnerID=8YFLogxK
U2 - 10.1109/SIBGRAPI54419.2021.00043
DO - 10.1109/SIBGRAPI54419.2021.00043
M3 - Conference contribution
AN - SCOPUS:85124197349
T3 - Proceedings - 2021 34th SIBGRAPI Conference on Graphics, Patterns and Images, SIBGRAPI 2021
SP - 263
EP - 270
BT - Proceedings - 2021 34th SIBGRAPI Conference on Graphics, Patterns and Images, SIBGRAPI 2021
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 18 October 2021 through 22 October 2021
ER -