Discovering Conflicts of Interest across Heterogeneous Data Sources with ConnectionLens - Laboratoire d'informatique de l'X (LIX)
Conference paper. Year: 2021

Discovering Conflicts of Interest across Heterogeneous Data Sources with ConnectionLens

Oana Balalau
Mhd Yamen Haddad
Stéphane Horel
Théo Bouganim
Ioana Manolescu
Helena Galhardas

Abstract

Investigative journalism (IJ) requires combining highly heterogeneous digital datasets from a wide variety of sources. We have developed ConnectionLens, a system that integrates such sources into a single heterogeneous graph and lets users query the graph using keywords. The first iteration of the system [7] followed a mediator architecture, which severely constrained its query scalability. We therefore fully re-engineered the system, moving it to a warehouse architecture and replacing its core components (information extraction, data querying, and interactive interfaces), which allowed us to handle use cases orders of magnitude larger than the previous platform could. Working in a consortium of computer scientists and investigative journalists, we propose to demonstrate ConnectionLens' capability to integrate arbitrary heterogeneous datasets and query them flexibly by means of keywords. Among several scenarios, our main focus will be a real-world journalistic use case about situations that may lead to conflicts of interest between biomedical experts and various organizations, such as corporations and lobbies. The demonstration will showcase the end-to-end data analysis pipeline and illustrate each system component, as well as the parameters governing graph creation and querying.
Main file
main.pdf (1.1 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-03337765, version 1 (08-09-2021)

Cite

Angelos Christos Anadiotis, Oana Balalau, Francesco Chimienti, Mhd Yamen Haddad, Stéphane Horel, et al. Discovering Conflicts of Interest across Heterogeneous Data Sources with ConnectionLens. ACM International Conference on Information and Knowledge Management (CIKM 2021), Nov 2021, Online, Australia. ⟨10.1145/3459637.3481982⟩. ⟨hal-03337765⟩