Comparison of functional connectivity metrics using an unsupervised approach: a source resting-state EEG study

The study of inter-regional synchronization between brain regions represents an important challenge in neuroimaging. Electroencephalography, given the high temporal resolution, allows the investigation of brain activity, connectivity, and network organization in time and frequency domains. Here, some of the most common metrics used to estimate the strength of functional interaction between pairs of brain regions are compared using source reconstructed time-series from resting-state high-density electroencephalography. Results show that the investigated metrics, on the basis of their connectivity profiles, may be naturally grouped into two main clusters. In particular, this finding shows that metrics which tend to limit the effects of volume conduction/signal leakage, although based on different properties of the original signals, may be partitioned into a specific homogeneous cluster, whilst the metrics which do not correct for these effects form a separate cluster. Moreover, this effect is even clearer when the analysis is replicated at scalp level. In conclusion, although within each cluster different metrics may still capture specific connectivity profiles, this study provides evidence that the result of an arbitrary choice of metric that either does or does not correct for volume conduction and signal leakage is more relevant.

Keywords

EEG

connectivity

resting-state

clustering

source

classification

1 Introduction

It is clearly accepted that neuronal oscillations and their synchronization, as measured between different brain regions, are fundamental for normal brain function. This conclusion has triggered an enormous interest in the development of quantitative techniques aimed to evaluate neuronal synchrony in electrophysiological data, namely electroencephalography (EEG) and magnetoencephalography (MEG). Analysis of the current literature, clearly shows that a large number of metrics have been proposed to quantify inter-regional synchronization. These metrics may roughly be separated into two main categories: those that estimate functional connectivity (FC) and those that estimate effective connectivity (EC). FC is defined as statistical interdependence of neuronal activity recorded from different brain regions. In contrast, EC refers explicitly to the influence that one region exerts over another [1]. Although metrics of EC allow estimation of causal interactions, thus providing information on the direction of interactions, FC metrics are still widely used to estimate whole-brain inter-regional synchronization patterns. The aim of the present paper is to compare a set of commonly used FC metrics, each one able to distinguish (and be influenced by) different aspects of signal interaction [2], so as to understand if different and specific metrics may be grouped on the basis of particular properties of their connectivity profiles. Indeed, each FC metric is based on detection of specific characteristics of the original signals, thus may present inherent advantages and disadvantages. For an exhaustive review of the different FC metrics proposed so far, refer to the recent survey by Kida et al. [2]. Nevertheless, it is not easy to identify the reasons that have motivated authors to choose one specific metric over others. Consequently, in general, it may be difficult to understand how related findings depend on the arbitrary choice of any particular metric. Additionally, it is also difficult to understand the reasons that have motivated the development of tens of new quantitative methods without proper comparison with previously proposed methods. Finally, as pointed out in a recent review [3], different research groups may employ different implementations of FC metrics, an issue that may further hinder the interpretation of results. For all these reasons, in this study, by use of an unsupervised approach, a set of FC metrics are compared that have commonly been used for source reconstructed resting-state EEG connectivity analysis, including: amplitude envelope correlation (AEC) [4], phase lag index (PLI) [5], imaginary component of coherency (iCOH) [6], phase locking value (PLV) [7], and spectral coherence (COH) [8]. It is hypothesized that distinct techniques, although based on different mechanisms of interaction and differently influenced by diverse sources of noise, may still be grouped together on the basis of common properties. The same analysis was reproduced for scalp level data as, to date, several EEG connectivity studies have still implemented this approach.

2 Material and Methods

2.1 Dataset and preprocessing

Freely available [9] and widely used [10-12] eyes-closed resting-state EEG recordings (64 channels, 109 subjects) were used in this study. Raw EEG signals (one minute long) were re-referenced to a common average reference and band-pass filtered to the alpha band (8-13 Hz). Source-reconstructed time-series were obtained by using whitened and a depth-weighted linear L2 minimum norm estimate (wMNE) [13] and projected onto 68 regions of interest as defined by the Desikan-Killiany atlas [14]. Finally, each preprocessed trace was organized into five non-overlapping epochs of 12 seconds.

2.2 Functional connectivity metrics

To perform an exhaustive comparison, it was decided to include some of the more common metrics used to estimate functional interaction between brain regions. AEC [4] is a measure of amplitude coupling based on linear correlations of band-pass filtered envelopes performed after a symmetric orthogonalisation procedure to remove zero-lag correlations. PLI [5] is a measure that ignores zero-lag phase differences and quantifies the asymmetry of the distribution of phase differences between time series. iCOH [6] is given by the cross spectrum divided by the product of the two power spectra and is not affected by linear mixing of an uncorrelated source. The PLV [7] is a measure that quantifies the consistency of phase differences, but is affected by zero-lag. Finally, the COH [8] is given by the squared correlation coefficient that evaluates the consistency of relative amplitude and phase between pairs of signals (in each frequency band) and is strongly affected by the volume conduction effect. In particular, the set of FC metrics investigated in this study includes both metrics that do (AEC, PLI, and iCOH) and do not (PLV and COH) correct for the effects due to volume conduction and signal leakage. A summary of the main properties of the selected FC metrics is presented in Table 0. Despite the importance of their different properties, the effects potentially caused by arbitrary choice of the FC metric are still overlooked, if not ignored. All analysis was performed on five non-overlapping epochs of 12 seconds. The application of each connectivity metric generates a symmetric square (68 $\times$ 68) matrix where each value gives the strength of interaction between paired regions (diagonal values set to zero). From each matrix a connectivity profile, including all the pair-wise connectivity values, was successively extracted as the upper triangular matrix. All analyses were replicated at scalp level (without performing a source-reconstruction procedure). In the latter case the connectivity matrix was composed of 64 $\times$ 64 elements each representing a single EEG channel.

doi.org/10.31083/j.jin.2018.04.0415.t0001

Table 1 Properties of FC metrics

FC metric	Influenced by	Signal leakage correction
AEC	Amplitude and phase	YES
PLI	phase	YES
iCOH	amplitude and phase	YES
PLV	phase	NO
COH	amplitude and phase	NO

2.3 Cluster analysis

The aim was to study natural clusters without any ‘a priori’ assumptions, including any knowledge concerning cluster number. Thus, an unsupervised approach was used to reveal the existence of different groups and to understand if the grouping underlay any common properties among the different FC metrics. One of the goals of the clustering was to gain insight into the structure of the data, with no ‘a priori’ knowledge about either data distribution or group organization. The clustering procedure was based on a $k$ -means approach, using the $k$ -means $+ +$ algorithm for centroid initialization and squared Euclidean distance. Five replicates were employed to search for lower local minima. Since using the wrong number of clusters can lead to meaningless results, a silhouette analysis [15], which can be employed to study the separation distance between the clusters, was used to define the optimal cluster number. Silhouette analysis validates consistency within cluster data. It relates how well each object lies within its cluster by comparing the similarity between an object and its own cluster (cohesion) versus the similarity between an object and other clusters (separation). A high silhouette value indicates that the objects are well matched to their own cluster and poorly matched to neighbouring clusters. In this way, the silhouette values allow a more appropriate choice of the number of clusters ( $M$ ) for the problem at hand. The clustering process is unsupervised, in that it can be applied in the absence of knowledge about the true classes. In the given problem, however, the true classes can be chosen and attributed to the patterns. For example, a class can be related to the fact that a metric either corrects or does not correct for the effects induced by volume conduction and signal leakage, or that a metric is based on mechanisms of phase or amplitude interaction, and so on. Using a hypothetical ground truth allows the assessment of clustering quality. This step is necessary as the silhouette analysis provides an indication of the number of clusters, even if there is no appropriate clustering for the given problem.

To assess the clustering quality on the basis of the discovered common properties between different FC metrics, the purity evaluation measure was used. With $M$ a set of clusters, $D$ a set of classes and $N$ the number of data points, the purity measure may be expressed as:

\[Purity= \frac{1}{N} \sum\limits_{m \in M}\mathop{\max}\limits_{d\in D} \left| m \cap d \right|\]

For each cluster, the purity value takes into account the number of data points from the most common class in the cluster. So, a value of the purity measure near unity indicates that the clustering obtained individuates a group according to the ground truth of the problem. If there is no correct clustering, a low purity value is obtained and the number of clusters given by the silhouette analysis is meaningless.

3 Results and Discussion

The mean global patterns of connectivity from the source analysis, obtained with the different FC metrics, averaged over all 109 subjects and the five epochs, are depicted in Fig. 2.3. Given that the connectivity matrices are shown only to disclose the global patterns of brain region interactions, the values are intentionally not set on the same scale.

As represented in Fig. 1 (left panel), the silhouette analysis for source-based connectivity showed the optimal number of clusters was $K =$ 2, with a mean silhouette value of 0.69. At the next step, with $K =$ 3, the corresponding mean silhouette value was lower (0.60), thus the silhouette analysis gives a clear indication of the required number of clusters. To test the hypothesis that such FC metrics may indeed be grouped on the basis of their response to the effects induced by volume conduction and signal leakage, the corresponding purity measure was evaluated. This allowed clustering quality to be assessed. Consequently, if it was assumed the two clusters were organized on the basis of this property, namely AEC, PLI, and iCOH in one cluster and PLV and COH in the second cluster, a purity value of 99.45% was obtained. Alternative hypotheses, such as assuming the discriminant property was signal phase, not signal amplitude, lead to much smaller purity values. This suggested that clustering may be primarily organized on the basis of how the FC metrics respond to this effect. By replicating the same analysis at scalp level, where the effects due to volume conduction and signal leakage should be still more evident, a mean silhouette value of 0.88 for K $=$ 2 and a purity value of 99.63% was obtained (see Fig. 1 right panel). As already reported for the source analysis, increasing the number of clusters resulted in a decrease in the silhouette value (with K $=$ 3, the mean silhouette value was 0.68).

Fig. 1.

The mean global patterns of connectivity for each of the five FC metrics for source-reconstructed analysis. Brain regions are organized as front (left) to back (right) for the left and the right hemisphere, respectively. Connectivity values are intentionally not given at the same scale.

This shows FC metrics that limit the effects induced by volume conduction and signal leakage (namely AEC, PLI, and iCOH) may be grouped together to give a high clustering quality and easy discrimination from other metrics (PLV and COH), which tend to generate spurious connectivity values. In particular, the effect of this property on the consequent group separation is greater than that induced by the other properties, as phase or amplitude interaction mechanisms should characterize the different FC metric. This result suggests, as also visually depicted in Fig. 2.3, that spurious connectivity may in some way overwrite and superimpose the real functional interactions. Furthermore, this finding is strengthened by the scalp level analysis, where the effects of volume conduction and signal leakage should be even more evident. In this latter case, as shown in Fig. 1, the separation into two clusters is still more marked.

Figure.2.

Mean silhouette values for $K$ varying from 2 to 6 for the source-reconstructed analysis (left panel) and scalp analysis (right panel).

4 Conclusion

In conclusion, this study confirms and suggests that caution should be used when interpreting the results from connectivity analysis, especially when metrics that do not limit the effects induced by volume conduction and signal leakage are implemented.

Acknowledgments

We thank our colleagues at the University of Cagliari for fruitful discussions.

Conflict of Interest

Authors declare no conflict of interest.

References

[1]

Friston

( 2011) Functional and effective connectivity: a review. Brain Connectivility 1(1), 13-36. 2791f9a58079b4907d383e7deaebf386

http%3A%2F%2Fcercor.oxfordjournals.org%2Fexternal-ref%3Faccess_num%3D10.1002%2Fhbm.460020107%26amp%3Blink_type%3DDOI

| Google Scholar | PubMed | Crossref

[2]

Kida

, Tanaka

, Kakigi

( 2016) Multi-dimensional dynamics of human electromagnetic brain activity. Frontiers in Human Neuroscience 9(365), 713. 10.3389/fnhum.2015.00713

26834608

e03e938bb8e745a97417252cb72f41ba

http%3A%2F%2Feuropepmc.org%2Fabstract%2FMED%2F26834608

http://europepmc.org/abstract/MED/26834608

| Google Scholar PubMed | Crossref

[3]

Bastos

, Schoffelen

( 2016) A tutorial review of functional connectivity analysis methods and their interpretational pitfalls. Frontiers in Systems Neuroscience 9(Pt 2), 175. 10.3389/fnsys.2015.00175

26778976

f285beee1914c2c48c176be50aa07e9d

http%3A%2F%2Fpubmedcentralcanada.ca%2Fpmcc%2Farticles%2FPMC4705224%2F

http://pubmedcentralcanada.ca/pmcc/articles/PMC4705224/

| Google Scholar PubMed | Crossref

[4]

Bruns

, Eckhorn

, Jokeit

, Ebner

( 2000) Amplitude envelope correlation detects coupling among incoherent brain signals. Neuroreport 11(7), 1509-1514. 10.1097/00001756-200005150-00028

10841367

df17a4eff0ffffd22756e45333bd6e05

http%3A%2F%2Feuropepmc.org%2Fabstract%2FMED%2F10841367

http://med.wanfangdata.com.cn/Paper/Detail/PeriodicalPaper_PM10841367

| Google Scholar PubMed | Crossref

[5]

Stam

, Nolte

, Daffertshofer

( 2007) Phase lag index: assessment of functional connectivity from multi channel EEG and MEG with diminished bias from common sources. Human Brain Mapping 28(11), 1178-1193. 10.1002/hbm.20346

17266107

9783ed3026f9be53609334a94d502c82

http%3A%2F%2Fonlinelibrary.wiley.com%2Fdoi%2F10.1002%2Fhbm.20346%2Ffull

http://onlinelibrary.wiley.com/doi/10.1002/hbm.20346/full

| Google Scholar PubMed | Crossref

[6]

Nolte

, Bai

, Wheaton

, Mari

, Vorbach

( 2004) Identifying true brain interaction from EEG data using the imaginary part of coherency. Clinical Neurophysiology 115(10), 2292-2307. 10.1016/j.clinph.2004.04.029

15351371

9f30db63b72f287031c7689ee604ed33

http%3A%2F%2Fwww.sciencedirect.com%2Fscience%2Farticle%2Fpii%2FS1388245704001993

http://www.sciencedirect.com/science/article/pii/S1388245704001993

| Google Scholar PubMed | Crossref

[7]

Lachaux

, Rodriguez

, Martinerie

, Varela

( 1999) Measuring phase synchrony in brain signals. Human Brain Mapping 8(4), 194-208. 71bfe7b65b209667c936ff31b8d7e706

http%3A%2F%2Fwww.tandfonline.com%2Fservlet%2Flinkout%3Fsuffix%3DCIT0002%26amp%3Bdbid%3D8%26amp%3Bdoi%3D10.1080%252F15368370802714148%26amp%3Bkey%3D10619414

| Google Scholar | PubMed | Crossref

[8]

Mandel

, Wolf

( 1976) Spectral coherence and the concept of cross-spectral purity. Journal of the Optical Society of America 66(6), 529-535. 10.1364/JOSA.66.000529

34826ffd70f0f2ad5abbd8cb7c32160d

http%3A%2F%2Fwww.opticsinfobase.org%2Fabstract.cfm%3Fid%3D56147

http://www.opticsinfobase.org/abstract.cfm?id=56147

| Google Scholar PubMed | Crossref

[9]

Schalk

, McFarland

, Hinterberger

, Birbaumer

, Wolpaw

( 2004) BCI2000: a general-purpose brain-computer interface (BCI) system. IEEE Transactions on Biomedical Engineering 51(6), 1034-1043. 7054cc375f7f8ecc5e2f0e659765e94f

http%3A%2F%2Fdoi.ieeecomputersociety.org%2Fresolve%3Fref_id%3Ddoi%3A10.1109%2FTBME.2004.827072%26amp%3Brfr_id%3Dmags%2Fmu%2F2005%2F02%2Fmmu2005020070.htm

| Google Scholar PubMed | Crossref

[10]

La Rocca

, Campisi

, Vegso

, Cserti

, Kozmann

, Babiloni

, DE Vico Fallani

( 2014) Human brain distinctiveness based on EEG spectral coherence connectivity. IEEE Transactions on Biomedical Engineering 61(9), 2406-2412. 10.1109/TBME.2014.2317881

24759981

c7414aaa-bd32-4d7a-bec3-deb6bf9ed97f

WOS:000346226100005

463ae0c898cf68cfdaa1a46a1ee854d0

http%3A%2F%2Fieeexplore.ieee.org%2Fxpls%2Ficp.jsp%3Farnumber%3D6799189

http://med.wanfangdata.com.cn/Paper/Detail/PeriodicalPaper_PM24759981

| Google Scholar PubMed | Crossref

[11]

Fraschini

, Hillebrand

, Demuru

, Didaci

, Marcialis

( 2015) An EEG-based biometric system using eigenvector centrality in resting state brain networks. IEEE Signal Processing Letters 22(6), 666-670. 10.1109/LSP.2014.2367091

4fb7a8f47d7956ba6321275188d8208a

http%3A%2F%2Fieeexplore.ieee.org%2Fdocument%2F6945793%2F

http://ieeexplore.ieee.org/document/6945793/

| Google Scholar PubMed | Crossref

[12]

Demuru

, Fara

, Fraschini

( 2013) Brain network analysis of EEG functional connectivity during imagery hand movements. Journal of Integrative Neuroscience 12(04), 441-447. 10.1142/S021963521350026X

24372064

e5b458d7b85433c9006196ceb895b5a7

http%3A%2F%2Fwww.ncbi.nlm.nih.gov%2Fpubmed%2F24372064

http://www.ncbi.nlm.nih.gov/pubmed/24372064

| Google Scholar PubMed | Crossref

[13]

Hämäläinen

( 1984) Interpreting measured magnetic fields in the brain: Estimates of current distributions, technical report TKK-F-A559, Helsinki University of Technology.

| Google Scholar PubMed | Crossref

[14]

Desikan

, Ségonne

, Fischl

, Quinn

, Dickerson

, Blacker

, Buckner

, Dale

, Maguire

, Hyman

, Albert

, Killiany

( 2006) An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage 31(3), 968-980. ee7000f3c17b21cbf2469604208e1d61

http%3A%2F%2Fwww.tandfonline.com%2Fservlet%2Flinkout%3Fsuffix%3DCIT0016%26amp%3Bdbid%3D8%26amp%3Bdoi%3D10.1080%252F23273798.2017.1395466%26amp%3Bkey%3D16530430

| Google Scholar PubMed | Crossref

[15]

Rousseeuw

( 1987) Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics 20, 53-65. 10.1016/0377-0427(87)90125-7

1c7050fed083151d17b6429bf955e6c5

http%3A%2F%2Fdl.acm.org%2Fcitation.cfm%3Fid%3D38772

http://www.sciencedirect.com/science/article/pii/037704278790-1257

| Google Scholar PubMed | Crossref

J. Integr. Neurosci. Print ISSN 0219-6352 Electronic ISSN 1757-448X