Organización automática del conocimiento: la geografía en la Wikipedia

  1. Carlos G. Figuerola 1
  2. Angel Zazo Rodríguez 1
  3. José Luis Alonso Berrocal
  1. 1 Universidad de Salamanca
    Universidad de Salamanca

    Salamanca, España

    ROR https://ror.org/02f40zc51

Journal:
Scire: Representación y organización del conocimiento

ISSN: 1135-3716

Year of publication: 2019

Volume: 25

Issue: 2

Pages: 13-21

Type: Article

DOI: 10.54886/SCIRE.V25I2.4642 (open access)

Abstract

Information technologies are driving an unprecedented growth in information, which raises the problem of how to organize it. Because this information is digital, its organization can be approached through automated procedures. Network analysis techniques, in turn, are a powerful tool for modelling a wide range of phenomena and then processing them automatically. In this paper we describe the application of network analysis techniques to model and process a large collection of documents, namely the articles of Wikipedia. The subsequent application of community detection algorithms groups the articles on the basis of their hyperlinks and their thematic affinity. Having applied these techniques, this work focuses on the geographical relationships among the articles, on their network communities, and on the connections between them.

Funding information

The research reported in this paper is being funded by European Union FEDER funds through the National Programme of the Plan for Scientific Research, Development and Technological Innovation (R&D&I) of the Ministerio de Economía y Competitividad (CSO2013-49278-EXP) and through the State Programme for Knowledge Generation and Scientific and Technological Strengthening of the R&D&I System (PGC2018-093755-B-I00).

Bibliographic References

  • Blondel, V. D.; Guillaume, J. L.; Lambiotte, R.; Lefebvre, E. (2008). Fast unfolding of communities in large networks. // Journal of Statistical Mechanics: Theory and Experiment. 10:2008. http://arxiv.org/pdf/0803.0476.pdf (23/03/2019).
  • Bohlin, L.; Edler, D.; Lancichinetti, A.; Rosvall, M. (2014). Community detection and visualization of networks with the map equation framework. // Measuring Scholarly Impact. 3-34.
  • Brandes, U.; Kenis, P.; Lerner, J.; van Raaij, D. (2009). Network Analysis of collaboration structure in Wikipedia. // Proceedings of the 18th international conference on World Wide Web. New York. 731-740. DOI: 10.1145/1526709.1526808
  • Chernov, S.; Iofciu, T.; Nejdl, W.; Zhou, X. (2006). Extracting Semantic Relationships between Wikipedia Categories. // SemWiki'06. Budva, Montenegro. DOI=10.1.1.73.5507
  • Dalton, J.; Dietz, L. (2012). Bi-directional Linkability From Wikipedia to Documents and Back Again: UMass at TREC 2012. // Text Retrieval Conference 2012. Knowledge Base Acceleration Track. http://trec.nist.gov/pubs/trec21/papers/umass_CIRR.kba.final.pdf (23/03/2019).
  • Edler, D.; Rosvall, M. (2015). The infomap software package. http://www.mapequation.org/code.html (23/03/2019).
  • Gabrilovich, E.; Markovitch, S. (2007). Computing semantic relatedness using Wikipedia-based explicit semantic analysis. // Proceedings of the 20th international joint conference on Artificial intelligence. Hyderabad, India: AAAI Press. 1606-1611.
  • Lancichinetti, A.; Fortunato, S. (2009). Community detection algorithms: A comparative analysis. // Physical Review E. 80:5. http://arxiv.org/pdf/0908.1062v2.pdf (23/03/2019).
  • Lee, C.; Cunningham, P. (2014). Community detection: Effective evaluation on large social networks. // Journal of Complex Networks. 2:1 19–37. http://comnet.oxfordjournals.org/content/2/1/19.full.pdf+html (23/03/2019).
  • Okoli, C.; Mehdi, M.; Nielsen, F. A.; Lanamaki, A. (2012). The People’s Encyclopedia Under the Gaze of the Sages: A Systematic Review of Scholarly Research on Wikipedia. https://ssrn.com/abstract=2021326, http://dx.doi.org/10.2139/ssrn.2021326 (23/03/2019).
  • Okoli, C.; Mehdi, M.; Mesgari, M.; Nielsen, F. A.; Lanamaki, A. (2014). Wikipedia in the eyes of its beholders: A systematic review of scholarly research on Wikipedia readers and readership. // Journal of the Association for Information Science and Technology. 65:12, 2381-2403.
  • O'Sullivan, D. (2016). Wikipedia: a new community of practice? London: Routledge. https://doi.org/10.4324/9781315547183 (23/03/2019).
  • Palmero Aprosio, A.; Giuliano, C.; Lavelli, A. (2013). Automatic Mapping of Wikipedia Templates for Fast Deployment of Localised DBpedia Datasets. // Proceedings of the 13th International Conference on Knowledge Management and Knowledge Technologies (i-Know '13). New York: ACM. DOI: https://doi.org/10.1145/2494188.2494196 (23/03/2019).
  • Plantié, M.; Crampes, M. (2013). Survey on social community detection. // Social media retrieval, 65–85. http://hal.archivesouvertes.fr/docs/00/80/42/34/PDF/Survey-on-Social-Community-Detection-V2.pdf (23/03/2019)
  • Rosvall, M.; Axelsson, D.; Bergstrom, C. (2009). The map equation. // European Physical Journal Special Topics. 178 13–23.
  • Salton, G.; McGill, M.J. (1983). Introduction to Modern Information Retrieval. New York, NY: McGraw-Hill.
  • Scott, J. (2013). Social network analysis. Thousand Oaks, CA, US: Sage Publications, Inc.
  • Shachaf, P.; Hara, N. (2010). Beyond vandalism: Wikipedia trolls. // Journal of Information Science. 36:3. 357-370.
  • Weale, T. (2006). Utilizing Wikipedia categories for document classification. ftp://ftp.cse.ohio-state.edu/pub/tech-report/2008/TR14.pdf (23/03/2019).
  • Zachte, E. (2019). Wikimedia Traffic Analysis Report – Page Edits per Wikipedia Language – Breakdown. https://stats.wikimedia.org/wikimedia/squids/SquidReportPageViewsPerLanguageBreakdown.htm (23/03/2019).
  • Zazo, A. F.; Figuerola, C. G.; Alonso Berrocal, J. L. (2015). Edición de contenidos en un entorno colaborativo: el caso de la Wikipedia en español. // Scire: representación y organización del conocimiento, 21:2 57-67.
  • Zlatić, V.; Božičević, M.; Štefančić, H.; Domazet, M. (2006). Wikipedias: collaborative web-based encyclopedias as complex networks. // Physical Review E, 74:1. DOI: 10.1103/PhysRevE.74.016115.