Joint Sequence Complexity Analysis: Application to Social Networks Information Flow

Dimitrios Milioris; Philippe Jacquet

doi:10.1002/bltj.21647

Article Dans Une Revue Bell Labs Technical Journal Année : 2014

Joint Sequence Complexity Analysis: Application to Social Networks Information Flow

(1, 2, 3, 4) , (1, 2)

1
2
3
4

Dimitrios Milioris

Fonction : Auteur
PersonId : 948332

Alcatel-Lucent Bell Labs France [Nozay]

Laboratory of Information, Network and Communication Sciences

École polytechnique

High PERformance COMmunications

Philippe Jacquet

Fonction : Auteur
PersonId : 747926
IdHAL : philippe-jacquet

Alcatel-Lucent Bell Labs France [Nozay]

Laboratory of Information, Network and Communication Sciences

Résumé

In this paper we study joint sequence complexity and its applications for finding similarities between sequences up to the discrimination of sources. The mathematical concept of the complexity of a sequence is defined as the number of distinct subsequences of it. Sequences containing many common parts have a higher joint complexity. The analysis of a sequence in subcomponents is done by suffix trees, which is a simple, fast, and low complexity method to store and recall them from the memory, especially for short sequences. Joint complexity is used for evaluating the similarity between sequences generated by different Markov sources. Markov models well describe the generation of natural text, and their performance can be predicted via linear algebra, combinatorics, and asymptotic analysis. We exploit datasets from different natural languages, for both short and long sequences, with very promising results. The goal is to perform automated online sequence analysis on information streams, e.g., on social networks such as Twitter.

Dimitrios Milioris : Connectez-vous pour contacter le contributeur

https://polytechnique.hal.science/hal-00907364

Soumis le : jeudi 21 novembre 2013-10:48:47

Dernière modification le : mardi 28 février 2023-15:36:23

Dates et versions

hal-00907364 , version 1 (21-11-2013)

Identifiants

HAL Id : hal-00907364 , version 1
DOI : 10.1002/bltj.21647

Citer

Dimitrios Milioris, Philippe Jacquet. Joint Sequence Complexity Analysis: Application to Social Networks Information Flow. Bell Labs Technical Journal, 2014, 18 (4), pp.75-88. ⟨10.1002/bltj.21647⟩. ⟨hal-00907364⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

X INSTITUT-TELECOM UPMC INRIA INRIA2 SORBONNE-UNIVERSITE SU-SCIENCES

403 Consultations

0 Téléchargements

Joint Sequence Complexity Analysis: Application to Social Networks Information Flow

Résumé

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager