TY - CHAP
T1 - Crime alert! crime typification in news based on text mining
AU - Alatrista-Salas, Hugo
AU - Morzán-Samamé, Juandiego
AU - Nunez-del-Prado, Miguel
N1 - Publisher Copyright:
© Springer Nature Switzerland AG 2020.
PY - 2020
Y1 - 2020
N2 - In this paper we detailed a multinomial classification-based methodology that combines different algorithms (SVM and MLP) with document representations (Tf Idf vectorization and Doc2vec embedding) and: (i) can distinguish between crime-related news and not-crime related news and; (ii) allows the assignment of each crime-related news to its corresponding crime type. With a F1-score of 84% achieved by the MLP with Doc2vec approach, it can be concluded that it is possible to answer the question of how the crimes are committed (what types of crime are perpetrated) and, in this way, offer a thermometer to citizens about criminal activity in a given territory, as reported by news articles.
AB - In this paper we detailed a multinomial classification-based methodology that combines different algorithms (SVM and MLP) with document representations (Tf Idf vectorization and Doc2vec embedding) and: (i) can distinguish between crime-related news and not-crime related news and; (ii) allows the assignment of each crime-related news to its corresponding crime type. With a F1-score of 84% achieved by the MLP with Doc2vec approach, it can be concluded that it is possible to answer the question of how the crimes are committed (what types of crime are perpetrated) and, in this way, offer a thermometer to citizens about criminal activity in a given territory, as reported by news articles.
KW - Classification
KW - Crime analysis
KW - Text mining
KW - Word vectorization and embeddings
UR - http://www.scopus.com/inward/record.url?scp=85062905236&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-12388-8_50
DO - 10.1007/978-3-030-12388-8_50
M3 - Chapter
AN - SCOPUS:85062905236
T3 - Lecture Notes in Networks and Systems
SP - 725
EP - 741
BT - Lecture Notes in Networks and Systems
PB - Springer
ER -