Documents Retrieval for Qualitative Research: Gender Discrimination Analysis

Hugo Alatrista-Salas, Pilar Hidalgo-Leon, Miguel Nunez-Del-Prado

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

3 Citas (Scopus)

Resumen

Gender discrimination is an act of exclusion or differential treatment towards a person due to its sex. This phenomenon has been studied in qualitative research by seeking to analyze and to describe the reality and context of discrimination. Qualitative researchers use a collection of documents such as surveys, interviews among another source. These large full textual documents tend to be unstructured from a Data Science point of view. These data are often complex and tend to show similar information between documents. Nevertheless, the process of selecting relevant information is manual, generating difficulties in categorizing and analyzing relevant piece of information, such as victim's surveys. The main reason in this processing is the use of tools to simplify the task of information selection and to perform it efficiently. This article proposes two methods based on the TF-IDF measure to search documents in a corpus. Our findings show that other methods such as, LSA (Latent Semantics Analysis) and LDA (Latent Dirichlet Allocation) consume a lot of memory, and have a low effectiveness extracting meaningful words than relying on TD-IDF only. The information processed in this case is about testimonies of gender discrimination in university students in Peru.

Idioma originalInglés
Título de la publicación alojada2018 IEEE Latin American Conference on Computational Intelligence, LA-CCI 2018
EditorialInstitute of Electrical and Electronics Engineers Inc.
ISBN (versión digital)9781538646250
DOI
EstadoPublicada - 23 ene. 2019
Publicado de forma externa
Evento2018 IEEE Latin American Conference on Computational Intelligence, LA-CCI 2018 - Gudalajara, México
Duración: 6 nov. 20189 nov. 2018

Serie de la publicación

Nombre2018 IEEE Latin American Conference on Computational Intelligence, LA-CCI 2018

Conferencia

Conferencia2018 IEEE Latin American Conference on Computational Intelligence, LA-CCI 2018
País/TerritorioMéxico
CiudadGudalajara
Período6/11/189/11/18

Huella

Profundice en los temas de investigación de 'Documents Retrieval for Qualitative Research: Gender Discrimination Analysis'. En conjunto forman una huella única.

Citar esto